[Beginner] Understanding Tablegen language

rightrotate · July 13, 2020, 6:11pm

Hi,
I am new to LLVM and I find TableGen language really cryptic. The reference manual to the language is not helpful either. I can look at the existing .td file and reverse engineer but I am looking for a detailed manual. Specifically, I have below questions:

What is a basic syntax for writing a dag? From the lang ref manual I can see that its something like operator followed by ArgList which is enclosed in parentheses. Where does predicate fit in this picture? I don’t see any mention of predicates in lang ref manual. A DAG should have an operator, one or more return value and a bunch of arguments. Each of them would have a type. I am not sure how that maps to syntax provided by TableGen language. In TargetSelectionDAG.td I see (vt SDNode) in definition of ImmLeaf. Does that mean vt is return type of SDNode?
Entity followed after “(” is always need to be an operator? or it can be ValueType or something else?
What are keywords like “ins”, “outs” and “ops”? They are not mentioned in lang ref manual either.
What is a “node” keyword?
How are PatFrags used? I see some .td files I see, like X86InstrFMA.td, PatFrag MemFrag is passed as argument to multiclass and then used along with addr:$src3 in it. I really don’t understand what this means. Does this mean that whatever comes after PatFrag “object” is substituted as Args in PatFrag? e.g. TargetSelectionDAG defines

def not : PatFrag<(ops node:$in), (xor node:$in, -1)>; how do you visualize this?

Thanks.

Praveen
BTech Student, VIT.

tlively · July 13, 2020, 7:30pm

Part of the problem is that ISel patterns are like their own DSL inside the TableGen DSL, so keywords like “ins”, “outs”, and “ops” aren’t keywords at the TableGen level, but rather at the level of the ISel system implemented with TableGen. Copying existing patterns and reading the comments in Target.td and TargetSelectionDAG.td are the best ways I know of learning how this works. I haven’t seen a separate guide, although it would be very cool if one existed.

Concretely, a PatFrag is essentially just a macro for patterns. In your example, (ops node:$in) says that this pattern fragment takes a single argument called $in, which can be any other dag or dag operation. The right hand side (xor node:$in, -1) is what the pattern fragment expands to wherever it is used. So if I write a pattern that includes (not <some stuff>), that will expand to (xor <some stuff>, -1). “ops” here is just a marker operation (in the tablegen language sense, not in the instruction selection sense) to introduce the operands for the pattern fragment. If ISel were its own DSL rather than implemented on top of TableGen, this would probably have a more straightforward syntax (but then we’d have another separate DSL to deal with).

MattPD · July 14, 2020, 8:04am

FWIW, there are also some third-party resources that may be of help:

"Lessons in TableGen"
FOSDEM 2019; Nicolai Hähnle

Slides: https://archive.fosdem.org/2019/schedule/event/llvm_tablegen/attachments/slides/3304/export/events/attachments/llvm_tablegen/slides/3304/tablegen.pdf

Series:
- What has TableGen ever done for us?: Tagebuch eines Interplanetaren Botschafters: TableGen #1: What has TableGen ever done for us?
- Functional Programming: Tagebuch eines Interplanetaren Botschafters: TableGen #2: Functional Programming
- Bits: Tagebuch eines Interplanetaren Botschafters: TableGen #3: Bits
- Resolving variables: Tagebuch eines Interplanetaren Botschafters: TableGen #4: Resolving variables
- DAGs: Tagebuch eines Interplanetaren Botschafters: TableGen #5: DAGs

Some of the parts of TableGen used in SelectionDAG are in the backend docs (e.g., the keywords OP asked about):
https://llvm.org/docs/WritingAnLLVMBackend.html#instruction-set
& Writing an LLVM Backend — LLVM 17.0.0git documentation (has a simple example of `PatFrag` for `store`).

There are a few examples of simple .td files an LLVM backend in the following:

LLVM backend development by example (RISC-V)
2018 LLVM Developers’ Meeting; Alex Bradbury

2014 - Building an LLVM Backend - LLVM Developer's Meeting
https://llvm.org/devmtg/2014-10/#tutorial1

http://web.archive.org/http://llvm.org/devmtg/2014-10/Videos/Building%20an%20LLVM%20backend-720.mov
http://llvm.org/devmtg/2014-10/#tutorial1

llvm-leg: LEG Example Backend
LEG Example Backend: a simple example LLVM backend for an ARM-like architecture: 'LEG'.

Best,
Matt

rightrotate · July 14, 2020, 8:43am

Thanks Matt and Thomas. I will go through them.

rightrotate · July 15, 2020, 5:33pm

Is there a backend to Tablegen which can dump a map of pattern-to-matched to instruction-to-be-generated?

–help doesn’t seem to indicate anything like that.

arsenm · July 15, 2020, 5:44pm

If you run tablgen with no arguments, it produces the fully expanded tablegen. You can directly view what ends up getting interpreted there

-Matt

tlively · July 15, 2020, 5:57pm

Adding -debug to a -gen-dag-isel run can also print useful information about the parsed patterns.

madhur13490 · July 15, 2020, 6:03pm

I use --print-records and then search for opcode name or pattern name and then look for “PatternToMatch” key. It is close to the map you’re looking for.

rightrotate · July 15, 2020, 6:05pm

Thanks. -print-records is useful in addition to other tips.

Topic		Replies	Views
TableGen pattern for negated operand LLVM Dev List Archives	2	149	May 15, 2012
What does the set in the dag of the td file in tablegen refer to, (set a, b), where is the set defined? Beginners riscv , llvm	8	543	August 26, 2023
How to understand `PatFrags` in backend codegen? MLIR llvm	15	124	December 13, 2024
Isel DAG documentation? LLVM Dev List Archives	7	114	March 11, 2014
What does the "set" and "add" keyword mean in Tablegen Lauguage? Beginners llvm	2	229	January 14, 2023

[Beginner] Understanding Tablegen language

Related topics