[analyzer][RFC] `-ftime-trace` time scopes for steps and entry points

necto · January 31, 2025, 12:50pm

Hello,
We’ve started performance investigations into slow-to-analyze translation units. One tool for quickly finding the problematic part, or even for diagnosing a known inefficiency, is the time tracing already implemented in LLVM (`-ftime-trace`). See how it is used for the Clang frontend here.

Yet the facility is useless for the Clang Static Analyzer at the moment, because it relies on instrumentation of the source code and the analyzer has none. I have implemented some high-level instrumentation delineating:

- analysis entry points,
- “analysis steps”: the work items processed by the analyzer,
- individual checker callbacks invoked as part of some steps,
- the issue refutation stage.

This is a starter list, and I expect to extend it in the future as we investigate particular cases to cover more and more blind spots.
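
To illustrate the general shape of such instrumentation, here is a minimal, self-contained sketch of an RAII time scope that records an event only when tracing is enabled. It is not the code from the patch and not LLVM's API (LLVM's own facility is `llvm::TimeTraceScope` in `llvm/Support/TimeProfiler.h`). The names `Profiler`, `TraceEvent`, `TimeScope`, `analyzeEntryPoint`, and `runCheckerCallback` are hypothetical and exist only for this example.

```cpp
#include <chrono>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a time-trace profiler; not LLVM's actual API.
struct TraceEvent {
  std::string Name;      // kind of scope, e.g. "entry point"
  std::string Detail;    // what was processed, e.g. a function name
  long long DurationUs;  // wall-clock duration in microseconds
};

struct Profiler {
  bool Enabled = false;            // tracing is off by default
  std::vector<TraceEvent> Events;  // completed scopes, in order of completion
};

// RAII scope: records one event that covers the scope's own lifetime.
class TimeScope {
  using Clock = std::chrono::steady_clock;
  Profiler &Prof;
  bool Active;  // snapshot of Prof.Enabled taken at construction
  std::string Name, Detail;
  Clock::time_point Start;

public:
  TimeScope(Profiler &P, const char *ScopeName, const char *ScopeDetail)
      : Prof(P), Active(P.Enabled) {
    if (!Active)
      return;  // disabled: no string copies and no clock read
    Name = ScopeName;
    Detail = ScopeDetail;
    Start = Clock::now();
  }

  ~TimeScope() {
    if (!Active)
      return;
    auto Us = std::chrono::duration_cast<std::chrono::microseconds>(
                  Clock::now() - Start)
                  .count();
    Prof.Events.push_back(
        {std::move(Name), std::move(Detail), static_cast<long long>(Us)});
  }

  TimeScope(const TimeScope &) = delete;
  TimeScope &operator=(const TimeScope &) = delete;
};

// Hypothetical work items mirroring the nesting described above:
// a checker callback runs inside the analysis of an entry point.
void runCheckerCallback(Profiler &P) {
  TimeScope S(P, "checker callback", "SomeChecker");
  // ... the checker's work would happen here ...
}

void analyzeEntryPoint(Profiler &P, const char *Function) {
  TimeScope S(P, "entry point", Function);
  runCheckerCallback(P);
}
```

In this sketch a disabled scope costs one flag check on construction and one on destruction, and nested scopes finish innermost first, so the checker callback event is recorded before the entry point event that contains it.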

The instrumentation has proved very helpful when investigating the analysis time of some slow TUs, and I think it would be a handy tool for many CSA developers.

We would like to upstream it to both give back to the community and to reduce our maintenance burden: the instrumentation, as you might expect, touches quite a few places in the code.

In my experience, running the instrumented analyzer with time-tracing disabled added no significant overhead. I’ve run the analyzer in the configuration we use for our product to analyze 200+ open source projects of various sizes. The analysis time is normally quite variable in our CI – about 5-15% of variation in normal conditions. The instrumented version (with tracing disabled, the default) did not show any performance difference outside of the noise levels.

Here are a couple of examples of the insights such traces enable:

Here you can spot that a single entry point takes 8.2s to analyze (which is rather long) because ExprEngine::removeDead takes 6 seconds cumulatively.

[Screenshot of the time trace for this entry point]
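
The "cumulative" figure above is the sum of the durations of all scopes that share a name. A minimal sketch of that aggregation follows, assuming the trace has already been parsed into a flat list of events. The `Event` struct and `cumulativeByName` function are hypothetical names for this example and are not part of LLVM or of the patch.

```cpp
#include <map>
#include <string>
#include <vector>

// Hypothetical, simplified view of one completed trace event.
struct Event {
  std::string Name;      // scope name, e.g. "ExprEngine::removeDead"
  long long DurationUs;  // duration in microseconds
};

// Sum the durations of all events that share a name.
std::map<std::string, long long>
cumulativeByName(const std::vector<Event> &Events) {
  std::map<std::string, long long> Totals;
  for (const Event &E : Events)
    Totals[E.Name] += E.DurationUs;  // operator[] starts missing keys at 0
  return Totals;
}
```

Note that a plain sum like this counts time twice when scopes with the same name nest inside each other, so it is only meaningful for non-recursive scopes.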

… Another example was forbidden by discourse as a new-user limitation …

necto · January 31, 2025, 1:04pm

Here is the second example that didn’t fit into the initial post:

Here you can observe that a single analysis step processing a PostStmt InitListExpr takes almost 2 minutes, and all that time is spent initializing a huge array:

[Screenshot of the time trace for this analysis step]

Xazax-hun · January 31, 2025, 1:13pm

Nice! I would love to see this upstream with some documentation on how to use it.

DonatNagyE · January 31, 2025, 7:23pm

I agree that this is a really promising tool that could help the whole community in ensuring better performance. Thanks for developing and releasing it!

necto · February 3, 2025, 2:40pm

I’m happy that this work is welcome. Feel free to comment on the technical details in the pull request.
