The GEP formats when generating IR.

user214 · May 30, 2021, 7:13am

Hi everyone,

I am learning llvm IR from official documents, especially the GEP instruction.

Now I have a problem about the GEP instructions generated by llvm.

As the example in https://llvm.org/docs/LangRef.html#getelementptr-instruction say, for the function "foo" in the following code example:

struct RT {
  char A;
  int B[10][20];
  char C;
};
struct ST {
  int X;
  double Y;
  struct RT Z;
};

int *foo(struct ST *s) {
  return &s[1].Z.B[5][13];
}

the IR could be:

define i32* @foo(%struct.ST* %s) nounwind uwtable readnone optsize ssp {
entry:
  %arrayidx = getelementptr inbounds %struct.ST, %struct.ST* %s, i64 1, i32 2, i32 1, i64 5, i64 13
  ret i32* %arrayidx
}

or

define i32* @foo(%struct.ST* %s) {
  %t1 = getelementptr %struct.ST, %struct.ST* %s, i32 1                        ; yields %struct.ST*:%t1
  %t2 = getelementptr %struct.ST, %struct.ST* %t1, i32 0, i32 2                ; yields %struct.RT*:%t2
  %t3 = getelementptr %struct.RT, %struct.RT* %t2, i32 0, i32 1                ; yields [10 x [20 x i32]]*:%t3
  %t4 = getelementptr [10 x [20 x i32]], [10 x [20 x i32]]* %t3, i32 0, i32 5  ; yields [20 x i32]*:%t4
  %t5 = getelementptr [20 x i32], [20 x i32]* %t4, i32 0, i32 13               ; yields i32*:%t5
  ret i32* %t5
}

I wonder when the llvm will generate the former one, and when it will generate the later one?

Thank you very much!

Sincerely,
Shulin

Stefanos_Baziotis · May 31, 2021, 8:50pm

Hi Shulin,

Sorry for the late reply. The quick answer is: The first one will be generated when optimizations are on and the second one when not. What follows is peeling off the levels of detail in case you want to know more.

I wonder when the llvm will generate the former one, and when it will generate the later one?

“llvm” is a very vague term here. We should define what high-level component we’re talking about.

I guess you mean Clang, i.e., the C/C++ front-end, whose job we can assume ends with the generation of LLVM IR (then comes the “middle-end” which does a bunch of, let’s say, target-independent optimizations in this LLVM IR
and then comes the back-end which does a bunch more target-dependent optimizations and finally generates assembly).

Clang generally generates the simplest IR possible. So, Clang will generate the second version because it is simple and it goes something like this. This &s[1].Z.B[5][13]; is actually broken into many small expressions in the Abstract Syntax Tree (AST). Every dereference is basically one expression. When Clang generates code, for every sub-expression generates a getelementptr (GEP), which is used as input to the next expression and that’s about it. So, Clang doesn’t go one step further
to understand all these small sub-expressions as one big expression and create only one GEP.

Now, when optimizations are turned on, all these GEPs are folded into one [1].

You can even go one step further and try to find which transformation does the job, by using opt, the tool which helps experimenting with optimizations. You literally just copy and paste the LLVM IR you got from Clang again into Godbolt,
add -O1 and add the argument -print-changed=quiet [2]. This will show you the LLVM IR only after the passes that succeeded to do some change. You can see that the real work was done by Instruction Combining.
You can run it on its own to see the effect [3].

Finally, you can even go one step further and see where in the code of the pass this happens. That would require downloading LLVM, inspecting the debug output and setting a bunch of breakpoints
but after doing all that, you could find this [4].

Hope this helps!

Best,
Stefanos

[1] https://godbolt.org/z/51cs6656o
[2] https://godbolt.org/z/T1v4MfrGW
[3] https://godbolt.org/z/sTehE7eo1
[4] https://github.com/llvm/llvm-project/blob/6b9524a05bab21c6b0ba4fe025864cb613605b99/llvm/lib/Transforms/InstCombine/InstructionCombining.cpp#L2031

Στις Κυρ, 30 Μαΐ 2021 στις 10:13 π.μ., ο/η 周书林 via llvm-dev <llvm-dev@lists.llvm.org> έγραψε:

user214 · June 1, 2021, 12:47pm

Hi Stefanos,

Thank you very much!

With your detailed explanation about the difference, and the examples you provided, I have a more clearer view about the procedure of how Clang front-end and transformer Passes deal with the source code into IR.

Thank you again!

Best regards,
Shulin

Stefanos Baziotis <stefanos.baziotis@gmail.com> 于2021年6月1日周二上午4:50写道：

Stefanos_Baziotis · June 1, 2021, 1:34pm

Hi Shulin,

No problem, glad it helped! Let us know if you have more questions.

Best,
Stefanos

Στις Τρί, 1 Ιουν 2021 στις 3:48 μ.μ., ο/η 周书林 <zhoushulin1992@gmail.com> έγραψε:

Topic		Replies	Views
the generation of getelementptr instruction LLVM Dev List Archives	2	81	July 22, 2010
[RFC] Replacing getelementptr with ptradd IR & Optimizations	38	7238	November 18, 2024
More Qs about llvm IR to access struct fields LLVM Dev List Archives	3	125	April 21, 2014
RFC: GEP as canonical form for pointer addressing LLVM Dev List Archives	18	109	February 25, 2014
Opaque pointers and i8 GEPs LLVM Dev List Archives	5	148	September 6, 2021

The GEP formats when generating IR.

Related topics