SNAP Performance analysis, more detailed than the presentation

sscalpone · April 11, 2022, 4:20pm

There’s a thought about fir to keep things at a high-level for as long as possible. I don’t know how well that axiom is followed today, but it’s a worthy goal.

For lowering sum, are you thinking of calling specialized sum1d and sum2d routines? Or changing the existing sum (et al) to special case 1d and 2d cases?

Perhaps the runtime can be compiled to bitcode & supplied to llvm for llvm-driven inlining?

So many options, which to choose! Will you publish a short design spec before you get too far along with the coding?

Topic		Replies	Views
Status of Flang's Optimization Flang	11	1563	December 4, 2023
RFC: How to inline Fortran inrinsics Flang	44	1756	August 4, 2022
Performance analysis for TSVC Flang	13	1009	October 3, 2024
food for optimizer developers Clang Frontend	7	109	August 11, 2010
[RFC] Inline hlfir.copy_in for trivial types Flang	29	495	May 28, 2025

SNAP Performance analysis, more detailed than the presentation

Related topics