[RFC] Enabling the HLFIR lowering by default

szakharin · August 15, 2023, 12:00am

Hello,

I would like to discuss enabling HLFIR lowering by default in Flang soon. High level MLIR operations for Flang were proposed by @jeanPerier in [RFC] Add higher level operations in FIR and ⚙ D134285 [flang][RFC] Adding higher level FIR ops to ease expression lowering. Accompanied by a new FIR value type (hlfir.expr), the new representation (HLFIR) allowed simplifying existing lowering PFT to MLIR, and also helped supporting Fortran features that were hard to support with the direct PFT-to-FIR lowering.

For example, CPU2006/465.tonto and CPU2017/621.wrf_s,628.pop2_s benchmarks are now successfully compiling with the HLFIR lowering, while hitting unimplemeted TODOs with the FIR lowering. Some polymorphic types cases are also supported only with the HLFIR lowering. In our internal testing we see about 600 new tests passing with the HLFIR lowering, while the amount of tests that work with the FIR lowering and do not work with the HLFIR lowering is 21.

On the performance side, the HLFIR lowering also provides benefits such as allocating much smaller temporary arrays (e.g. in Polyhedron/test_fpu2 and CPU2017/527.cam), and applying more complex post-lowering “conflict” analysis for LHS and RHS parts of the array assignments (e.g. see @tblah’s work in ⚙ D157107 [flang][hlfir] add an optimized bufferization pass).

With all the work that has been put into the HLFIR lowering, I think we are almost ready to make the HLFIR lowering the default, but I would like to set several short-term goals that we have to achieve first. Please let me know if you want to add something in the list or something looks off to you.

C1: Keep Flang lowering covered with LIT tests as much as possible without delaying HLFIR enabling too much.

The HLFIR lowering is covered by LIT tests in flang/test/Lower/HLFIR. While there are fewer tests than for the FIR lowering (in flang/test/Lower), I think we may switch to the HLFIR lowering without migrating all the tests. The complete migration from flang/test/Lower to flang/test/Lower/HLFIR may be done after the switch. The quality of LIT testing will degrade in the menatime, but not much, given that the HLFIR lowering has decent amount of LIT tests as well.
I would like to adjust flang/test/Lower/OpenACC tests to work with the HLFIR lowering (e.g. as done in ⚙ D157560 [flang][hlfir] Propagate acc.declare from hlfir.declare to fir.declare. for a single test). I consider this a blocker for switching to the HLFIR lowering, and I will be working on this.
I suppose, teams actively working on OpenMP will want to do the same for flang/test/Lower/OpenMP tests, so that the testing coverage does not degrade too much. I would consider this a blocker as well, but I have no resources to work on that. I would greatly appreciate help with this.

C2: Keep end-to-end testing in “good shape”:

As already mentioned, SPEC CPU 2000, 2006 and 2017 benchmarks compile and run successfully (w/o OpenMP) with the HLFIR lowering. We just need to keep it the same until we switch to the HLFIR lowering.
Some tests from llvm-test-suite only work with the HLFIR lowering (https://github.com/llvm/llvm-test-suite/blob/main/Fortran/gfortran/regression/EnabledFilesHLFIR.cmake), but some tests do not work either hitting a TODO or showing incorrect behavior in runtime (https://github.com/llvm/llvm-test-suite/blob/main/Fortran/gfortran/regression/DisabledFilesHLFIR.cmake). I would say that the disabled tests might be resolved after the switch to the HLFIR lowering, so we can just keep llvm-test-suite lists untouched. If you think that some of the disabled tests must be fixed before the switch, please let me know and I will work on them.
I do not have information about the SPEC CPU benchmarks or other tests that use OpenMP and won’t work with the HLFIR lowering. If you think we need to add some of them as blockers for the switch, please let me know.
- Many thanks to @kiranchandramohan who has already started fixing bugs with HLFIR lowering for OpenMP!

P1: Keep performance on par with the FIR lowering (preliminary performance data (multiple tabs): HLFIR-perf.xlsx - Google Sheets).

I would like to keep SPEC CPU2006 and CPU2017 ref data set 1 copy performance on par with the FIR lowering, meaning we either achieve the same performance with the HLFIR lowering right away, or have exact explanation of the performance regressions not exceeding 25% and agree on the plans for fixing each of them. @tblah is pretty close on resolving some part of the huge regression of 548.exchange2 (other benchmarks will be improved as well). After that we should be able to do additional analysis and define our next steps. Please let me know if switching to the HLFIR lowering with pending performance regressions is not acceptable for your team.
I would like to have Polyhedron performance issues addressed the same way as SPEC CPU above (i.e. regressions of more than 25% regressions are not acceptable). Basically, we have to resolve inefficiencies in temporary arrays creation and array assignments. These issues will affect majority of Fortran programs, so switching to the HLFIR lowering with slowing down the rest of the world does not sound right.

These are all the checkboxes that I want to check before we switch. Please share your thoughts. Thank you!

Slava

kiranchandramohan · August 15, 2023, 11:46am

Thanks @szakharin for sharing this RFC. Switch to HLFIR is exciting! Thanks to you, @tblah and @jeanPerier and others (@cabreraam, Jacob, …) who participated in the HLFIR work.

From our side the blockers are:
1.
We would like Spec2017 rate benchmarks to not degrade overall with the switch. At the moment, we see the following (with -Ofast -flto -mcpu=native) compared to FIR lowering as per results collected by @tblah. I think once the 554.roms benchmark is fixed, we are good to go.

Benchmark	Slowdown	Comment
503.bwaves	1.00
521.wrf	1.16
527.cam4	0.71
548.exchange2	1.18
549.fotonik3d	1.01
554.roms	1.56	Loop-versioning pass is not working with HLFIR lowering
Geometric Mean	1.07

BLAS/LAPACK should continue to work. We are in the process of checking where we are with this.

kiranchandramohan · August 15, 2023, 12:03pm

For OpenMP support we would like:

all OpenMP 1.1 features to continue to work after the switch. Currently there are issues with reduction, atomic, threadprivate support for common blocks.
No regressions with gfortran-testsuite, NPB, spec-2017 speed, spec-omp-2012, SNAP
Reasonable LIT test coverage.
Elementary GPU offloading to continue to work.

tblah · August 15, 2023, 1:57pm

Thanks for sharing these Kiran. My numbers were measured with ⚙ D157107 [flang][hlfir] add an optimized bufferization pass applied.

szakharin · August 16, 2023, 12:56am

Thank you for sharing your requirements, @kiranchandramohan! If you do not mind, I will add them into the list in the description, so that we have all the pieces together.

Can you please clarify the requirement for “no regressions with gfortran-testsuite”? Is it a hard one or you will be okay with switching to HLFIR lowering having the failures from https://github.com/llvm/llvm-test-suite/blob/main/Fortran/gfortran/regression/DisabledFilesHLFIR.cmake?

kiranchandramohan · August 16, 2023, 9:22am

+1

I was talking about the OpenMP tests here. These should be enabled. There are only a few of them now and the second list below is already empty.
OpenMP Appendix : https://github.com/llvm/llvm-test-suite/blob/main/Fortran/gfortran/regression/gomp/appendix-a/DisabledFilesHLFIR.cmake
OpenMP : https://github.com/llvm/llvm-test-suite/blob/main/Fortran/gfortran/regression/gomp/DisabledFilesHLFIR.cmake

jeanPerier · November 9, 2023, 11:09am

Thank you very much @szakharin, @kiranchandramohan, @tblah and all the others who contributed to making flang ready for the switch!

Since all tasks in Issues · llvm/llvm-project · GitHub are closed it’s time to do it.

Here is the transition plan I propose:

Step 1: The Switch (now)

Patch 1: add a hidden -flang-deprecated-no-hlfir option to flang-new so that external users can revert to the current lowering if they face issues until any potential HLFIR regression is fixed. It will also be handy to transition remaining FIR tests that do not test HLFIR outputs.
Patch 2: update all these lowering tests to use bbc -hlfir=false and flang-deprecated-no-hlfir.
Patch 3: THE SWITCH. Set HLFIR lowering to be default in flang-new and bbc.

Step 2: The Wait and Clean (aiming for a duration of one LLVM release cycle).

Remove the -flang-experimental-hlfir (now redundant).
Have at least one LLVM release where HLFIR is enabled by default and -flang-deprecated-no-hlfir is accessible and in the meantime, remove all the no-hlfir version LIT tests that have already ported to HLFIR and transition the remaining no-hlfir LIT tests.
Debug any HLFIR regression reported by users

Step 3: The Removal (when step 2 is over)

Remove the -flang-deprecated-no-hlfir option in flang-new and -hlfir option in bbc.
Remove all usages of ConvertExpr.h interface inside Bridge.cpp and the rest of lowering.
Remove ConvertExpr.h/ ConvertExpr.cpp files and any helpers specific to it.

tblah · November 9, 2023, 11:13am

Thanks @jeanPerier. Those steps look good to me. Give me a shout if you’d like me to pick up any of those patches

jeanPerier · November 9, 2023, 4:19pm

Thanks @tblah, I think I should be fine with the patches of step 1.

The first one is here: [flang][driver] add -flang-deprecated-no-hlfir hidden option by jeanPerier · Pull Request #71820 · llvm/llvm-project · GitHub
Since Phabricator stack reviews are gone, the rest will come when this is merged (I do not want to create reviews in private forks).

jeanPerier · November 15, 2023, 2:15pm

Step 1 is done: HLFIR is now ON by default

github.com/llvm/llvm-project

[flang] Switch lowering to use the HLFIR step by default

llvm:main ← jeanPerier:enable-hlfir-patch-3

opened 08:48AM - 13 Nov 23 UTC

jeanPerier

+19 -17

Patch 3/3 of the transition step 1 described in https://discourse.llvm.org/t/rf…c-enabling-the-hlfir-lowering-by-default/72778/7 This patch changes bbc and flang-new driver to use HLFIR lowering by default. `-hlfir=false` can be used with bbc and `-flang-deprecated-no-hlfir` with flang-new to get the previous default lowering behavior, but these options will only be available for a limited period of time. If any user needs these options to workaround bugs, they should open an issue against flang in llvm github repo so that the regression can be fixed in HLFIR.

Is anyone familiar with the LLVM release schedule? It looks like the next big release will be LLVM 18. How can we check that this change will make it in?

tblah · November 15, 2023, 2:22pm

There was more information in a talk from eurollvm this year https://www.youtube.com/watch?v=onaNb2U1Od8&list=PL_R5A0lGi1AD-bqRaY61l5Q-EozbfyLZr&index=1&pp=iAQB

I believe LLVM 18 won’t branch until January How To Release LLVM To The Public — LLVM 18.0.0git documentation

jeanPerier · November 15, 2023, 3:03pm

Thanks, that gives some idea of the time frame for step 2 then. We should keep the old lowering until the branching at least, and then remove it.

h-vetinari · November 19, 2023, 6:05am

Now that HLFIR is on by default, what interactions (if any) does this have with the rename of flang to flang-new? Last public discussion on this that I’m aware of made it sound like the switch to HLFIR by default would be the last checkbox for the switching the name:

(I guess one could nitpick that I’m equating “finish refactoring” with “switch on by default”; OTOH, once that happens, any further refactoring arguably just becomes regular development to progress the code-base.)

Topic		Replies	Views
[RFC] Add higher level operations in FIR Flang	1	697	November 8, 2022
Performance analysis for TSVC Flang	13	752	October 3, 2024
MLIR LLVM-IR dialect -- status and lowering questions Flang	8	152	December 10, 2019
RFC: Merging FIR Flang	4	116	February 27, 2020
Upstreaming Lowering of Parse Tree/PFT to FIR Flang	0	117	May 11, 2020

[RFC] Enabling the HLFIR lowering by default

Related Topics