Speedup of clang build with PGO and BOLT on AArch64

peterwaller-arm · March 11, 2025, 1:30pm

Hi! I’d like to share some performance results on AArch64 using PGO and BOLT. The attached graph shows the speedup, measured as the elapsed time of a parallel build of bin/clang (version 19.1.6, built with clang-19.1.6). In each case, instrumented profiles were used, with a build of libLLVMSupport as the training data.

The graph highlights the impact of various PGO techniques on AArch64. The Clang/BOLT version under test is main @ 2025-01-18, close to LLVM 20.
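For anyone who wants to compare their own timings in the same way, here is a minimal sketch of how a speedup figure can be derived from two elapsed build times. The function names and the example numbers are hypothetical placeholders, not the measurements behind the graph, and the exact formula used for the graph is an assumption here (baseline time divided by optimized time).

```python
def speedup(baseline_seconds: float, optimized_seconds: float) -> float:
    """Return how many times faster the optimized compiler finishes the build.

    Assumes speedup is defined as baseline elapsed time / optimized elapsed time.
    """
    if baseline_seconds <= 0 or optimized_seconds <= 0:
        raise ValueError("elapsed times must be positive")
    return baseline_seconds / optimized_seconds


def percent_time_saved(baseline_seconds: float, optimized_seconds: float) -> float:
    """Return the reduction in elapsed time as a percentage of the baseline."""
    if baseline_seconds <= 0 or optimized_seconds <= 0:
        raise ValueError("elapsed times must be positive")
    return 100.0 * (baseline_seconds - optimized_seconds) / baseline_seconds


if __name__ == "__main__":
    # Placeholder timings in seconds, chosen only to illustrate the arithmetic.
    baseline = 100.0   # hypothetical: build time with a non-PGO compiler
    optimized = 80.0   # hypothetical: build time with a PGO/BOLT compiler
    print(f"speedup: {speedup(baseline, optimized):.2f}x")
    print(f"time saved: {percent_time_saved(baseline, optimized):.1f}%")
```

Note that the two numbers are not interchangeable: a 1.25x speedup corresponds to 20% less elapsed time, so it is worth stating which one a graph reports.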
