OpenMP lowering from PFT to FIR

What is the motivation behind how OpenMP constructs are lowered to FIR? For those constructs that contain executable code (like loops or critical), we seem to create some skeleton in Lower/OpenMP.cpp, then fill it out in Lower/Bridge.cpp, e.g.

Why not have the OpenMPConstruct be lowered entirely (edit: inside of genOMPConstruct), instead of doing it in parts? Was there a reason for that, or did it just happen this way?

Because the Lower/OpenMP.cpp file mainly deals with the creation of OpenMP operations from the OpenMP dialect. Constructs that have a region need to use the FIR lowering to lower the Fortran code inside those regions.

Right, but why doesn't the lowering code in OpenMP.cpp use FIR lowering from Bridge? The OMP and non-OMP code is intertwined, so it would make sense to lower the body of a block from an OpenMP construct that contains that block.

I'm not sure I get what you mean here. The Fortran code contained in an OpenMP construct is lowered by the Bridge. It is possible that the code in Lower/OpenMP.cpp produces some FIR operations but that's OpenMP specific. The OpenACC lowering has the same design.

Right, but why doesn't the lowering code in OpenMP.cpp use FIR lowering from Bridge?

It does and it is done in the bridge itself.

What I mean is that instead of doing

  void genFIR(const Fortran::parser::OpenMPConstruct &omp) {
    mlir::OpBuilder::InsertPoint insertPt = builder->saveInsertionPoint();
    localSymbols.pushScope();
    genOpenMPConstruct(*this, bridge.getSemanticsContext(), getEval(), omp);
    [...]
    for (Fortran::lower::pft::Evaluation &e : curEval->getNestedEvaluations())
      genFIR(e);
    [...]
  }

have genOpenMPConstruct generate the FIR for the nested evaluations.

That would mean all the genFIR functions need to be made visible, which is not the case now.

Visible in OpenMP.cpp (and other dialect-specific files), yes.

That would help avoid the saving/restoring/resetting of the insertion point. It would also provide a natural context for lowering nested constructs (specifically in target, which requires special handling of host symbols, for example).

That would help avoid the saving/restoring/resetting of the insertion point.

You would do that anyway, because you need to move the builder inside the region when you generate an operation with a region, and then put the builder after the op once the region has been generated.

Some of it is unavoidable, but some cases aren't:

  // Reset the insert point to before the terminator.
  resetBeforeTerminator(firOpBuilder, storeOp, block);

Edit: At least it would look less like a state machine, and more like local save/restores:

   x = get(...)
   do_something()
   set(x)

For this particular case, I think this is needed because privatization is done during lowering and it has no representation in the dialect. In OpenACC we don't have such stuff.

Anyway, I don't have a strong opinion on this, but if you want to change this you probably need to open an RFC and get feedback from the people contributing to the OpenMP lowering.


Basically, if we create anything inside the region then we will have to set/reset the insertion point. For example, I think omp.target operations are implemented as isolated from above with block arguments. The block arguments are the new bindings for symbols and have hlfir.declares generated for them in the region. If our representation for privatisation is to use block arguments in the region of privatisation, they will have the same issue I think. For privatisation specifically, we will move to some higher level representation for tasks.

@kparzysz I don't know whether you are facing a specific issue or whether this was just a question for information.

I want to change it to do what I described above: for each construct, generate the whole thing recursively, instead of doing it one part at a time. I think it would make the code much clearer, but I wanted to see if there are any compelling reasons for the current approach.

I'm actually working on dealing with firstprivate inside of target, and the privatization inside the DataSharingProcessor implementation doesn't handle it right (at least in my simple testcase), specifically due to the "isolation from above".

One reason is to keep the lowerings separate and to disallow OpenMP code generation from generating Fortran constructs.

What is your proposal here?
→ Move the genOmp functions to FirConverter?
→ Derive from FirConverter to create an OpenMPFIRConverter, and create the right converter based on whether the -fopenmp flag is set?
→ Or something else?

Will recursive generation fix the issue fully for you? Or will you have to make large scale changes to the DataSharingProcessor? Since the 'isolated from above' situation leads to the usage of block-arguments, you could also consider representing this in the dialect. Trivial types (scalars, static-arrays) can be alloca'd later while interfacing with the OpenMPIRBuilder, others will need more information which can be based on the OpenACC recipe generation or wrappers to createHostAssociateVarClone and copyHostAssociateVar procedures.

  omp.target private(%a -> %a_pvt) {
  ^bb0(%a_pvt):
  }

I've thought of the second approach, i.e. a converter subclass, or perhaps some other mechanism to allow combining additional converters in the future.

I don't have a specific proposal yet; my first goal here is to understand why the code is written this way, to make sure I'm not missing something that is not evident in the code. In the first sentence I quoted above you wrote that disallowing OpenMP lowering from generating Fortran constructs was intentional. What was the motivation for this?

The way I see generating code for omp target is something like this (roughly):

  genOpenMPTarget(OpenMPConstruct &omp) {
    localSymbols.pushScope(IsolatedFromAbove);  // indicate that it's isolated
    process-directives();
    genOp<omp::TargetOp>(...)
    genFIR(eval);                               // generate the contents recursively
    localSymbols.popScope();
  }

If we overload symbol handling (lookup, cloning, etc.) in the extended converter, we could generate appropriate code on demand, instead of having to pre-collect lists of symbols. The handling would then do the right thing for the context (i.e. generating symbol clones across target boundary, vs. on the same host).

The (meta) benefit of doing lowering "recursively" is that the code organization follows the structure of the IR being lowered.

That follows from the intention to keep the OpenMP lowering separate from the Fortran lowering. If we think about the other implementation options, which are lumping all OpenMP codegen into FIRConverter, or having further derived classes that need to be selected, I think the current organization is a good choice.

I will tag @schweitz to check whether the reasons are different.

Ok, what are your reasons for this?

We have FirOpBuilder that extends mlir::OpBuilder. At the same time we're keeping the building of FIR separate from building the OpenMP extensions to it. It's the same principle in both cases.

FWIW (I'm not super involved), you should keep in mind OpenMP might not be only used by Flang/FIR in the future. Whatever that implies.

True, but we're lowering a post-processed Fortran parse tree into [HL]FIR. The OpenMP lowering still relies on functions/types from both, so it's not "pure" in that sense.

What I want is to call the FIR builder from the OMP lowering code. The FIR generation would still be encapsulated in the FirOpBuilder, it would just be called from OpenMP.cpp.

The dialect must be generic to support more than just FIR/HLFIR, but the lowering in this case is specific to flang.

FWIW, we have no issue with handling firstprivate for OpenACC with the current lowering, and this is mainly because we have a representation of it in the dialect, so the heavy lifting is not done during lowering. So you might want to explore this path too.