Couple of general questions about PGO

barakgla · July 25, 2023, 3:03pm

Hi all,

I started to look into PGO for a project and a few questions came to mind.

Why is the recommendation to use IR-PGO (-fprofile-generate) instead of FE-PGO (-fprofile-instr-generate)?
I understand that FE-PGO is based on AST instrumentation while IR-PGO is based on CFG instrumentation, so IR-PGO performs fewer counter updates. It therefore seems reasonable to use IR-PGO to reduce the runtime of the instrumented program. But is there also a difference in the performance of the final optimized binary? Can IR-PGO do a better job of optimizing the code than FE-PGO, and if so, why?
I saw that there is a discussion about deprecating FE-PGO (Status of IR vs. frontend PGO).
Do you know what its current status is?
Why deprecate FE-PGO in the first place if it is needed for code coverage?
It seems to me that IR-PGO is less resilient to compiler changes than FE-PGO, because it depends on the CFG, whereas the AST seems more stable.
So if the compiler is used by internal users who update the compiler version frequently, it might be more reasonable for them to use FE-PGO rather than IR-PGO.
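
To make the "fewer counter updates" point concrete, here is a minimal C++ sketch of the general idea behind CFG-based counter placement: for a simple if/else, flow conservation lets two counters recover the counts of all four blocks. This is only an illustration under my own assumptions, not what either Clang mode actually emits; the names `FullCounts`, `SparseCounts`, `classify`, `reconstruct`, and `check_reconstruction` are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// One counter per basic block of an if/else diamond (naive placement).
struct FullCounts {
  std::uint64_t entry = 0, then_blk = 0, else_blk = 0, exit_blk = 0;
};

// Only two counters; the remaining counts are derived afterwards.
struct SparseCounts {
  std::uint64_t entry = 0, then_blk = 0;
};

// Hypothetical instrumented function: updates both counter sets so the
// two placements can be compared on the same run.
int classify(int x, FullCounts& full, SparseCounts& sparse) {
  ++full.entry;
  ++sparse.entry;
  int result;
  if (x % 3 == 0) {
    ++full.then_blk;
    ++sparse.then_blk;
    result = 1;
  } else {
    ++full.else_blk;  // no sparse counter on this path
    result = 0;
  }
  ++full.exit_blk;    // no sparse counter here either
  return result;
}

// Flow conservation: every entry goes through exactly one arm and then
// reaches the exit, so else = entry - then and exit = entry.
FullCounts reconstruct(const SparseCounts& s) {
  FullCounts f;
  f.entry = s.entry;
  f.then_blk = s.then_blk;
  f.else_blk = s.entry - s.then_blk;
  f.exit_blk = s.entry;
  return f;
}

// Runs the function `runs` times and checks that the derived counts
// match the directly measured ones.
bool check_reconstruction(int runs) {
  FullCounts full;
  SparseCounts sparse;
  for (int i = 0; i < runs; ++i) classify(i, full, sparse);
  FullCounts derived = reconstruct(sparse);
  assert(derived.entry == full.entry);
  return derived.then_blk == full.then_blk &&
         derived.else_blk == full.else_blk &&
         derived.exit_blk == full.exit_blk;
}
```

In this sketch the sparse placement does two counter updates per call on the "then" path and one on the "else" path, versus three for the naive placement, yet yields the same profile.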

Thanks a lot

ellishg · July 26, 2023, 9:14pm

Thanks for linking the discussion on deprecating FE-PGO; I hadn’t read it before. There were some claims that IRPGO profiles lead to better performance (Status of IR vs. frontend PGO (fprofile-generate vs fprofile-instr-generate) - #3 by rnk), but it would be nice to see a more thorough investigation. Some comments also pointed out that IRPGO has more features than FE-PGO. Here’s a list off the top of my head:

- Value profiling
- Lightweight Instrumentation
  - [InstrProfiling] Lightweight Instrumentation
- Temporal Profiling
  - RFC: Temporal Profiling Extension for IRPGO

rnk · July 27, 2023, 10:53pm

I am not the best expert here, but I think the simple reason is that, to do PGO well, you want to instrument and apply branch weights after inlining to make sure your counters and branch weights are as context sensitive as possible. Conditional branches in inlined functions may do radically different things in different inlined call sites, and you get worse results if you can only apply weights to the original branch generated by the frontend.
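
As a rough illustration of the context-sensitivity point, the following C++ sketch simulates one branch that is reached from two call sites. It tracks a single counter pair shared by the function (roughly what a profile attached to the original, pre-inlining branch would see) and one pair per call site (roughly what per-copy counters after inlining would see). The names `BranchCount`, `Profile`, `saturate`, and `run_workload` are hypothetical, and the workload is made up for the example; this is not how LLVM stores profiles.

```cpp
#include <cassert>
#include <cstdint>

// Taken / not-taken counts for one conditional branch.
struct BranchCount {
  std::uint64_t taken = 0, not_taken = 0;
};

struct Profile {
  BranchCount shared;  // one counter pair for the function's branch
  BranchCount site_a;  // counters for the copy inlined at call site A
  BranchCount site_b;  // counters for the copy inlined at call site B
};

// Hypothetical callee with a single branch; it records the outcome in
// both the shared counters and the per-call-site counters.
int saturate(int v, int limit, BranchCount& shared, BranchCount& per_site) {
  bool over = v > limit;
  (over ? shared.taken : shared.not_taken)++;
  (over ? per_site.taken : per_site.not_taken)++;
  return over ? limit : v;
}

Profile run_workload() {
  Profile p;
  for (int i = 0; i < 1000; ++i) {
    saturate(i % 100, 1000, p.shared, p.site_a);   // site A: never over the limit
    saturate(5000 + i, 1000, p.shared, p.site_b);  // site B: always over the limit
  }
  // The shared counters are just the sum of the per-site counters.
  assert(p.shared.taken == p.site_a.taken + p.site_b.taken);
  assert(p.shared.not_taken == p.site_a.not_taken + p.site_b.not_taken);
  return p;
}
```

With this workload the shared counters end up at 1000 taken and 1000 not taken, which looks like an unpredictable 50/50 branch, while the per-site counters show that the branch is never taken at site A and always taken at site B.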

The secondary reason is that IR PGO gets more investment. All the features you mentioned are good examples of investment. This is what Google uses and where it applies its effort. The lightweight instrumentation mode is a contribution from Meta.

Anyone can put together a more thorough investigation comparing the two PGO modes, but it’s quite a bit of work. We shared the results we got at the time on Chromium (11% vs 17%), but didn’t take the investigation further.
