<div dir="ltr">Hi Renato,<br><div class="gmail_extra"><br><br><div class="gmail_quote">On Tue, Jul 23, 2013 at 5:08 PM, Renato Golin <span dir="ltr">&lt;<a href="mailto:renato.golin@linaro.org" target="_blank">renato.golin@linaro.org</a>&gt;</span> wrote:<br>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div>On 24 July 2013 00:11, Marcelo Sousa <span dir="ltr">&lt;<a href="mailto:marceloabsousa@gmail.com" target="_blank">marceloabsousa@gmail.com</a>&gt;</span> wrote:<br>


</div><div class="gmail_extra"><div class="gmail_quote"><div>

<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div>First of all, I want to understand if you&#39;re stating that analyzing the Linux Kernel through LLVM IR is not a good approach or analyzing LLVM IR in general is no good. </div>

</div></div></div></blockquote><div><br></div></div><div>The latter.</div></div></div></div></blockquote><div><br></div><div>I dont&#39; agree and in fact it feels to me that there is almost a new field in software verification over LLVM IR. Just to start, there are several examples of bounded model checkers, symbolic execution procedures that operate over LLVM IR. Are you somehow implying that this analysis is no good?</div>


<div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr">

<div class="gmail_extra">

<div class="gmail_quote"><div>I do not understand what you mean by &quot;the AST is gone on the IR level&quot;. I can argue that in a naive compilation the IR code is structurally to the original C code, in fact, the SSA transformation is &quot;functionalizing&quot;. <br>


</div></div></div></div></blockquote><div><br></div></div><div>That&#39;s not entirely true. All the IR needs to do is to &quot;behave&quot; like the original code, but most of the semantics is gone for good. </div></div>


</div></div></blockquote><div><br></div><div>You need to define &quot;behave&quot; to talk about semantics. Are you saying that bisimulation is not related with semantics?</div>

<div><br></div><div>Lets go through your examples and please do enumerate all cases because I&#39;m very interested in bundling them.</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">


<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div><br></div>

<div>Examples:</div><div> - Sign information is on instructions, not types</div></div></div></div></blockquote><div>This is not a semantic problem, because you can refine the LLVM IR type system. In fact, that is something I am doing.</div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Structures are flattened out, or cast to arrays</div></div></div></div></blockquote><div>The structural information is still there otherwise it&#39;s not a sound compilation. I need to see real examples of C code where the original structures cannot be retrieved anymore. Moreover, a sound flattening (which I still need to understand in the particular case of LLVM) is probably a more reliable model for certain types of verification where it is important to know that there is a flatenning happening. </div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Multiple structures are commoned up if they&#39;re structurally equivalent</div></div></div></div></blockquote><div>Not sure what the problem is here. If LLVM IR can identify that two structs are structurally equivalent for free, that seems good for me. Are you saying that it will give wrong type annotations to values that have the same structural type? If that&#39;s the case, it&#39;s clearly something that should/can be fixed.</div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Function calls are changed (ByVal, ABI specific, architecture specific)</div></div></div></div></blockquote><div>Not sure why is this a problem also for verification. In fact, again I get a much more reliable execution model.</div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Intrinsics are expanded into IR code</div></div></div></div></blockquote><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">


<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div> - Perfectly valid code is shrunk into intrinsics (optimizing, legalizing, atomic ops, pragmas) </div></div></div></div></blockquote><div>Looking at some IR of the kernel I&#39;m not sure if there is heavy use of Intrinsics. Nevertheless, this is one criticism for LLVM IR, that it&#39;s not suppose to have any intrinsics at all but be able to generate appropriate optimized assembly code through patterns. Note that I&#39;m being a somewhat purist here, i.e. LLVM is supposed to be Virtual Machine. There are almost more intrinsics than instructions, and that&#39;s not good for any perspective.  </div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Vectorization can explode the amount of IR beyond recognition</div></div></div></div></blockquote><div>Are you referring to vectorization as a compiler transformation? Please elaborate and if possible provide an example of this. Nevertheless, I doubt that this happens in the particular case of the Linux Kernel. </div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Inlining shuffles code and will hide any analysis you do on the inlinee</div></div></div></div></blockquote><div>I fail to see this point if your analysis is sound, i.e. your analysis results of the inlinee when analyzed independently are more general than the ones when you analyze the function with the inlining. This may affect precision of the inlinee independently but the analysis is still precise in the inlining. </div>

<div>For example, in a type-based analysis this can result that the inlinee would have a polymorphic type and the function that has the body of the inlinee is just a specialization.   </div>

<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - PHI nodes, basic blocks for loops and switches can be recognized, but they don&#39;t mean the same things anymore</div></div></div></div></blockquote><div>So what do they mean? Sure there is a small-step operational semantics of phi instruction which does not mean that there is a loop, but if you can identify a pattern that is a loop. However, note that i&#39;m not claiming that there is a bijection between a particular C program and the LLVM IR code but I do claim a bisimulation relation with C + Arch and LLVM IR + Arch&#39;, where Arch&#39; is a subset of Arch, and they really complete the *real/concrete* execution model with the information about the architecture. </div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - Target-specific lowering does destroy the IR, making it impossible to be portable, sometimes even to sub-architectures</div></div></div></div></blockquote><div>Can you get me some concrete examples of this? Even if that&#39;s the case, this is useful for software verification that considers architectural information. This is exactly what I&#39;m trying to do with the Linux Kernel at the moment: i&#39;m just trying to show that for x86 certain behaviours that are arch-specific due to inline assembly instructions do not occur. </div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div> - C++ gets flattened out to C in the Clang AST, and then to target-specific in IR, it&#39;s virtually impossible to attach any meaning to it</div></div></div></div></blockquote><div>If that&#39;s the case, what is the argument to claim the &quot;capability of representing ‘all’ high-level languages cleanly&quot;? I agree with you than it&#39;s an hard task to have a back-end to C++ but semantically is not &quot;virtually impossible&quot;.</div>

<div class="gmail_extra" style="font-family:arial,sans-serif;font-size:13px">

 - SCEV, DCE, CSE and undefined behaviour will *also* change IR beyond recognition</div><div class="gmail_extra" style="font-family:arial,sans-serif;font-size:13px">I don&#39;t recognize SCEV, DCE, CSE. About undefined behaviour, are you talking about *undef*? We should really have a discussion just about the semantics of *undef*, or write a paper about &quot;Defining undef&quot;. Again, I agree that is possible that transformations *can* possibly change IR which makes lifting to a similar C program w.r.t the original very hard, but *sound* transformations will not change its behaviour - semantic preservation. </div>

<div class="gmail_extra" style="font-family:arial,sans-serif;font-size:13px">Moreover, note that by performing analysis at the IR level you are not only able to identify potential real bugs in your &quot;high-level&quot; program but you also are doing compiler certification. Understanding the origin of the error might be hard but I find the analysis particularly valuable.</div>


<div><span style="font-family:arial,sans-serif;font-size:13px"> - IR has no knowledge of the pre-compiler syntax</span></div><div><span style="font-family:arial,sans-serif;font-size:13px">I&#39;m not sure of why this is relevant for verification. Can you provide an example?</span></div>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div><br></div><div>I could go on forever, but you get the idea. IR is NOT a representation of your code. At best, it&#39;s a high-level assembly, and that&#39;s what it was designed for, and for that, it does a pretty good job.</div>


</div></div></div></blockquote><div><br></div><div>I need to be further convinced that an Intermediate Representation is NOT a Representation of the code.</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">


<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div>

<div><br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra">


<div class="gmail_quote"><div><div><span style="color:rgb(34,34,34)">Not entirely sure what you mean in this paragraph. I believe that the sort of information that you loose if because LLVM IR has design faults, not necessarily because of transformation to the LLVM IR. Perhaps you can elaborate on what sort of information you loose besides the annoying implicit unsignedness that is relevant for verification and the fact that it may be harder to identify higher-abstraction constructs like for-loops.</span></div>


</div></div></div></div></blockquote><div><br></div></div><div>Not design flaws, design. It wasn&#39;t designed to represent any particular language and it won&#39;t be a 1-to-1 match to any of them.</div><div>

<div><br></div><div><br>

</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div><div><span style="color:rgb(34,34,34)">Can you provide a reference to this work? At this point, I&#39;m really not sure what you mean by &quot;this kind of experiments&quot;. </span></div>

</div></div></div></div></blockquote><div><br></div></div><div>Unfortunately not, it was an internal pet project. Basically, the idea was to add pass flags to LLC via a Python script that would discard the bad results (can&#39;t compile, can&#39;t run) and cross the good results, optimizing for performance. It was a simple experiment, but shown with great clarity the interaction between the most used (and most powerful) passes.</div>


<div>

<div><br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra">


<div class="gmail_quote"><div><div><span style="color:rgb(34,34,34)">Surely I can apply certain levels of analysis (intra-procedural, inter-procedural and even inter-modular) to verify components of the kernel. The hard problem is how to verify several components in a concurrent setting.</span></div>


</div></div></div></div></blockquote><div><br></div></div><div>Not to mention the sheer size of the kernel, that will make any small task a pain.</div></div></div></div></blockquote><div>I have a lightweight type analysis that I believe is scalable to verify the kernel in a couple of minutes. I can verify modules within a second.</div>

<div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">


<div><div><br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">

<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div><div><span style="color:rgb(34,34,34)">Another question: Is one of the goals of the google summer project to apply the clang-analyzer to several versions of the kernel or just the latest one?</span></div>


</div></div></div></div></blockquote><div></div></div></div><br></div><div class="gmail_extra">I don&#39;t know. But AFAIK, the Clang Analyser is not done in IR, but in the Clang AST (where it should be, because of the problems I mentioned). So, it&#39;s quite possible that you don&#39;t need to generate IR at all. </div>


<div class="gmail_extra"><br></div><div class="gmail_extra">Any semantic analysis must be done at the Clang&#39;s AST, which is quite rich and powerful. IR is most suitable for language/target-independent transformations.</div>


</div></blockquote><div><br></div><div>I don&#39;t know the details of Clang&#39;s AST but I need to be convinced. In fact, I know of examples where an analysis at the LLVM IR can catch errors (semantics errors of the original program) that currently clang-analyse does not.</div>

<div><br></div><div>Thanks for this discussion.</div><div><br></div><div>Regards,</div><div>Marcelo</div>

<div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr">

<div class="gmail_extra"><br></div><div class="gmail_extra">Hope that clears up a bit... ;)</div><div class="gmail_extra"><br></div><div class="gmail_extra">cheers,</div><div class="gmail_extra">--renato</div></div>

<br>_______________________________________________<br>

LLVMLinux mailing list<br>

<a href="mailto:LLVMLinux@lists.linuxfoundation.org" target="_blank">LLVMLinux@lists.linuxfoundation.org</a><br>

<a href="https://lists.linuxfoundation.org/mailman/listinfo/llvmlinux" target="_blank">https://lists.linuxfoundation.org/mailman/listinfo/llvmlinux</a><br></blockquote></div><br></div></div>