[RFC] Landing MDL in LLVM CodeGen

akorobeynikov · February 2, 2024, 3:37am

I think I can say that, at first glance, MDL could allow a better and more precise description of some non-VLIW downstream ISAs. The scope is some pipeline hazards, extra interdependencies, etc. These might currently be solved via the existing scheduler infrastructure, but the solutions are far from being sane.

SundeepKushwaha · February 2, 2024, 4:25am

I will need more time to digest all the details of the MDL proposal, but I was curious whether the MDL framework is “feature-equivalent” to TD. For example, we extensively use relational mappings to describe instructions and their predicate/new/tmp forms in Hexagon. Are those features also supported in MDL? You mentioned you have tested on x86 and didn’t find any significant performance regressions and also noticed identical code in many cases. That’s comforting, but have you done similar experiments on other targets?

reidtatge · February 2, 2024, 10:01pm

It’s able to model everything schedules or itineraries can model, with the exception of how we model forwarding. Schedules model it, roughly, on an instruction-by-instruction basis, and Itineraries model it for each itinerary. MDL models forwarding networks as a graph of functional units, which attempts to mirror what the hardware actually does. We found that some of the targets that model forwarding have only modeled part of the forwarding network, so we end up with minor differences when we model the entire network.
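To make the "graph of functional units" idea concrete, here is a minimal sketch in plain Python. It is not MDL syntax and not how the MDL compiler represents anything; the unit names, cycle counts, and helper functions are all invented for illustration. It only shows why a partially modeled network can produce slightly different latencies than a fully modeled one.

```python
# Hypothetical illustration only: plain Python, not MDL syntax.
# Unit names and cycle counts are invented.

# Base result latency (in cycles) for each producing functional unit.
BASE_LATENCY = {"alu0": 2, "alu1": 2, "mul": 4}

# Forwarding network as a directed graph: an edge (producer, consumer)
# means the producer's result can bypass the register file on its way to
# the consumer, saving the given number of cycles.
FORWARDING = {
    ("alu0", "alu0"): 1,
    ("alu0", "alu1"): 1,
    ("mul", "alu0"): 2,
}


def effective_latency(producer, consumer, network=FORWARDING):
    """Latency seen by `consumer` for a value produced on `producer`."""
    saved = network.get((producer, consumer), 0)
    return max(BASE_LATENCY[producer] - saved, 0)


def missing_edges(partial, full):
    """Edges present in the full network but absent from a partial model."""
    return sorted(edge for edge in full if edge not in partial)
```

With a partial model that only lists `("alu0", "alu0")`, a `mul` to `alu0` dependence would be scheduled with 4 cycles instead of 2, which is the kind of minor difference described above.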

So with the exception of forwarding network modeling, MDL provides a superset of what TD can express wrt microarchitecture. The whole purpose of this integration (where we translate every schedule and itinerary for every target into equivalent MDL) was to prove to ourselves that the MDL was able to handle everything in TD. (It does.)

Yes. Note that we don’t change instructions at all, just their modeled behaviors. So instructions that get rewritten are handled properly. FWIW, the current integration translates the Hexagon TD files into MDL, and passes all but a very few lit tests (27 tests “fail” out of 1280 when using MDL). The “failures” are very minor (hand-checked for correctness) schedule differences, or incidental debug output differences. (Not to say there aren’t bugs…). Note that some of this rewriting is unnecessary in an MDL-based back-end - the instruction’s description can directly deal with things like predication.

Actually, per the lit tests, we’ve seen very few cases where it generates different code, which was expected since we haven’t changed any of the algorithms in the schedulers. But some of the heuristics in the MI schedulers (in particular) very occasionally are perturbed by the order of scheduling alternatives provided by TD vs MDL.

Beyond running lit tests: no, not to date. I wish there was a way to do this.

A note about Hexagon: some of the initial problems we had with Hexagon involved target-specific back-end code that dealt with things that TD couldn’t model. A good example of that is “tryAllocateResourcesForConstExt”, which is called by the Hexagon Packetizer to deal with instructions that use extra issue slots for “constant” operands. This is something MDL can handle trivially. But since in this situation we’re working with a strict translation of TD, and TD couldn’t express it, the MDL infrastructure couldn’t deal with it directly (so we had to hack the function, with an appropriate comment).
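As a rough sketch of the kind of constraint being described (an instruction whose constant operand consumes an extra issue slot), consider the toy packet model below. It is plain Python, not Hexagon back-end code and not MDL; the class names, the `needs_const_ext` flag, and the 4-slot capacity are assumptions made up for this example.

```python
# Hypothetical sketch: names, fields and the slot count are invented.
from dataclasses import dataclass, field


@dataclass
class Instr:
    name: str
    needs_const_ext: bool = False  # constant operand occupies one extra slot

    @property
    def slots_needed(self):
        return 2 if self.needs_const_ext else 1


@dataclass
class Packet:
    capacity: int = 4  # assumed placeholder for the number of issue slots
    instrs: list = field(default_factory=list)

    @property
    def slots_used(self):
        return sum(i.slots_needed for i in self.instrs)

    def try_add(self, instr):
        """Add `instr` if its slots (including any extender) still fit."""
        if self.slots_used + instr.slots_needed > self.capacity:
            return False
        self.instrs.append(instr)
        return True
```

The point is only that once the extra slot is part of the instruction's resource description, the packetizer needs no special-case hook to account for it.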

Let me know if this doesn’t answer your questions!

reidtatge · February 2, 2024, 10:10pm

FWIW, I’ve removed the PR for now. I’ve clearly misjudged the degree of concern about external dependencies, and consequently the discussion has gotten derailed. I apologize for that.

I’ll resubmit in a few weeks after I’ve replaced the ANTLR parser generator with a conventional recursive descent parser, and perhaps a little retooling to make the development flow a bit cleaner.

In the meantime, I’m happy to discuss any questions people have about the work.

-Reid

danilaml · February 5, 2024, 3:59pm

Is MDL expected to stabilize in the near future? If there are no expected notable changes to its grammar, the value of (re-)generating parser via ANTLR drops somewhat (we can still keep the ANTLR-consumable grammar description somewhere as the “source of truth”/reference parser).

reidtatge · February 5, 2024, 5:34pm

Well, every language evolves, and I expect this one to evolve too. Regardless, I’m just writing a RD parser.

reidtatge · February 29, 2024, 1:32am

I’ve written and integrated a recursive descent parser for the MDL compiler, and replaced/removed the ANTLR-based parser. There are now no ANTLR/Java build dependencies in the MDL repo (GitHub - MPACT-ORG/llvm-project at all).

Please take a look.
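For readers unfamiliar with the technique, a recursive descent parser uses one function per grammar rule, with each function consuming tokens and calling the functions for its sub-rules. The sketch below is a generic Python illustration over a toy arithmetic grammar; it has nothing to do with the MDL grammar or the parser in the repo, and every name in it is made up.

```python
import re

# Toy grammar (not MDL):
#   expr   := term (('+' | '-') term)*
#   term   := factor (('*' | '/') factor)*
#   factor := NUMBER | '(' expr ')'
TOKEN = re.compile(r"\s*(\d+|[-+*/()])")


def tokenize(text):
    tokens, pos = [], 0
    text = text.rstrip()
    while pos < len(text):
        m = TOKEN.match(text, pos)
        if not m:
            raise SyntaxError(f"unexpected character at offset {pos}")
        tokens.append(m.group(1))
        pos = m.end()
    return tokens


class Parser:
    def __init__(self, tokens):
        self.tokens = tokens
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def take(self, expected=None):
        tok = self.peek()
        if tok is None or (expected is not None and tok != expected):
            raise SyntaxError(f"expected {expected or 'a token'}, got {tok!r}")
        self.pos += 1
        return tok

    def expr(self):
        value = self.term()
        while self.peek() in ("+", "-"):
            op = self.take()
            rhs = self.term()
            value = value + rhs if op == "+" else value - rhs
        return value

    def term(self):
        value = self.factor()
        while self.peek() in ("*", "/"):
            op = self.take()
            rhs = self.factor()
            # Integer division keeps the toy evaluator in ints.
            value = value * rhs if op == "*" else value // rhs
        return value

    def factor(self):
        tok = self.take()
        if tok == "(":
            value = self.expr()
            self.take(")")
            return value
        if tok.isdigit():
            return int(tok)
        raise SyntaxError(f"unexpected token {tok!r}")


def parse(text):
    parser = Parser(tokenize(text))
    value = parser.expr()
    if parser.peek() is not None:
        raise SyntaxError(f"trailing input at token {parser.peek()!r}")
    return value
```

Calling `parse("(2 + 3) * 4")` walks `expr`, `term`, and `factor` in turn. A hand-written parser of this shape needs no generator tool at build time, which is the dependency concern raised earlier in the thread.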

reidtatge · April 4, 2024, 8:00pm

There were a number of questions earlier in the thread about why we created a new language. I’ve updated the introductory part of the language spec (llvm-project/llvm/docs/Mdl/MDLSpec.md at all · MPACT-ORG/llvm-project · GitHub) to provide some of the motivation for that decision. If you’re interested in the topic, please take a look.
