LLVM AutoFDO status

Dehao_Chen · October 9, 2015, 10:08pm

With recent bug fixes and performance tuning, AutoFDO@llvm has reached a usable state. To evaluate performance, we built clang itself three ways, with -O3, with -fprofile-use, and with -fprofile-sample-use, and measured the speed of each resulting compiler.

clang built with -fprofile-use is ~20% faster than clang built with -O3
clang built with -fprofile-sample-use is ~10% faster than clang built with -O3

On clang, AutoFDO delivers about 50% of the FDO speedup (~10% versus ~20%). The gap is mainly due to inaccurate or lost debug info, which is what the profile is keyed on. I am still tuning the performance to close the gap.
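
For reference, the 50% figure is simply the ratio of the two measured gains. A minimal sketch of that arithmetic follows; the function name is made up for illustration, and the inputs are the approximate numbers quoted above rather than exact measurements.

```python
def fraction_of_fdo_speedup(fdo_gain, autofdo_gain):
    """Share of the instrumentation-based FDO gain that AutoFDO recovers."""
    if fdo_gain <= 0:
        raise ValueError("fdo_gain must be positive")
    return autofdo_gain / fdo_gain


# Approximate gains over the -O3 build, taken from the numbers above.
fdo_gain = 0.20      # clang built with -fprofile-use
autofdo_gain = 0.10  # clang built with -fprofile-sample-use

ratio = fraction_of_fdo_speedup(fdo_gain, autofdo_gain)
print(f"AutoFDO recovers about {ratio:.0%} of the FDO speedup")
```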

In the meantime, we encourage you to try it out. Bug reports and fixes are always welcome. For more information about how to generate an AutoFDO profile, please refer to https://github.com/google/autofdo

Cheers,
Dehao

echristo · October 10, 2015, 7:09am

Hi Dehao,

Do you have any specific bugs for “inaccurate/lost debug info”? I haven’t seen anything and I’m curious what you might be running into.

Thanks.

-eric

echristo · October 10, 2015, 7:10am

That said, this is great news! I’m ecstatic to see that the sample based FDO is doing well on llvm.

-eric

davidxl · October 10, 2015, 4:53pm

> Hi Dehao,
>
> Do you have any specific bugs for "inaccurate/lost debug info"? I haven't
> seen anything and I'm curious what you might be running into.

The lost info is mostly due to optimizations (examples include code
introduced by the optimizer, such as from strength reduction,
runtime condition checks, etc.); it is not that anything is wrong
with the base debug info generation.
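
To illustrate the point, here is a toy sketch of how samples get lost when optimizer-introduced code has no usable source location. It is an assumption-laden model, not the actual logic of LLVM or of the autofdo tools: the addresses, the `debug_locations` table, and the `attribute_samples` helper are all hypothetical.

```python
from collections import Counter

# Hypothetical address -> (function, line offset) table standing in for
# debug line info. None means the instruction has no source location.
debug_locations = {
    0x10: ("foo", 1),
    0x14: None,  # e.g. code introduced by strength reduction
    0x18: None,  # e.g. a runtime condition check added by the optimizer
    0x1C: ("foo", 3),
}

# Hypothetical sampled instruction addresses from a profiling run.
samples = [0x10, 0x14, 0x14, 0x18, 0x1C, 0x1C, 0x1C]


def attribute_samples(samples, debug_locations):
    """Map sampled addresses to source locations; count the ones that cannot be mapped."""
    profile = Counter()
    lost = 0
    for addr in samples:
        loc = debug_locations.get(addr)
        if loc is None:
            lost += 1  # no source location, so the sample has nowhere to go
        else:
            profile[loc] += 1
    return profile, lost


profile, lost = attribute_samples(samples, debug_locations)
print(dict(profile))
print(f"{lost} of {len(samples)} samples could not be attributed")
```

In this toy run the samples that land on the optimizer-introduced instructions never reach the source-level profile, even though the debug info for the original code is fine.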

David

echristo · October 10, 2015, 5:05pm

>> Hi Dehao,
>>
>> Do you have any specific bugs for “inaccurate/lost debug info”? I haven’t
>> seen anything and I’m curious what you might be running into.
>
> Those lost info are mostly due to optimizations (examples include code
> introduced by the optimizer, such as those from strength reduction,
> runtime condition check etc) – not that the base debug info
> generation has anything wrong…

Aha! Excellent. That's good to hear, and I both look forward (and don’t look forward) to those bugs.

-eric
