Help Needed: Pattern Matching Across Basic Blocks in AArch64 LLVM

sopyb · July 31, 2024, 11:30am

I’m currently dealing with an optimization issue involving the fdiv and fsqrt instructions in the context of AArch64. The core of the problem lies in the separation of these instructions into different basic blocks due to a conditional statement at the end of the function, which would require me to pattern match across basic blocks.

From this GitHub issue, the example illustrates how the division (fdiv) is placed into a different block from the square root operation (fsqrt), making it challenging to match and optimize them together.

Here’s the relevant code:

double res, res2, tmp;
void foo (double a, double b, int c, int d) {
  tmp = 1.0 / __builtin_sqrt (a); // fdiv & fsqrt
  res = tmp * tmp;

  if (d)
    res2 = a * tmp; // fdiv
}

Has anyone encountered a similar issue or have suggestions on how to handle pattern matching across basic blocks? Any insights or guidance would be greatly appreciated!

tschuett · July 31, 2024, 12:09pm

Can you give us a bit more details? Are you working on LLVM-IR, the SelectionDag, GlobalISel, or MIR?

sopyb · July 31, 2024, 1:09pm

I am an Outreachy intern, and my previous work has involved the SelectionDAG, where I wrote a custom lowering for another optimization. I am not tied to a specific area, as my internship description is “Improve AArch64 performance,” and this issue is on the list of potential tasks I could tackle.

From my research, it appears that the SelectionDAG may not be suitable for addressing the current optimization issue, so I am currently exploring GlobalISel.

At this point I am not yet familiar with the codebase, so I am navigating through the documentation to figure out what I am looking for.

tschuett · July 31, 2024, 2:47pm

SelectionDag has limitations for cross basic block optimizations. GlobalIsel works at function-scope. If you are not tied to AArch64 assembler, you could also try LLVM-IR. It has more tools and analyses.

Topic		Replies	Views
[AArch64][SVE] Floating Point Code Gen LLVM Dev List Archives	2	127	June 22, 2020
Pattern transformation between scalar and vector on IR. LLVM Dev List Archives	3	104	September 26, 2016
[AArch64] Target-specific loop idiom recognition IR & Optimizations	11	949	August 18, 2023
Pattern Matching in LLVM IR & Optimizations llvm	4	864	April 23, 2023
[arm, aarch64] Alignment checking in interleaved access pass LLVM Dev List Archives	7	126	October 14, 2016

Help Needed: Pattern Matching Across Basic Blocks in AArch64 LLVM

Related topics