TableGen: Defining an instruction with a shared operand and multiple register classes

egorshamshura · July 14, 2025, 10:38am

Hello everyone! I’m implementing a custom target in LLVM’s TableGen and ran into two related issues when defining my instructions:

1. I have instructions where the destination and source are the same register (e.g. an increment). How do I tell TableGen that $dst and $src are actually the same operand, rather than two independent register operands?
2. Some of my instructions take a single register index, but that index refers to both a general-purpose register (GPR) and a floating-point register (FPR), something like (ins GPR:$src, FPR:$src).

Any advice or examples on how to fold $dst and $src into one physical operand, and on how to let a single index use two register classes, would be greatly appreciated!

s-barannikov · July 14, 2025, 12:15pm

1. Use let Constraints = "$dst = $src" on the instruction.
2. Constraints of this kind (one index naming registers in two classes) are poorly modeled in LLVM. The only built-in way is to define a superregister containing both the GPR and the FPR as subregisters. See RegisterTuples.
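
A minimal sketch of both suggestions, assuming a made-up target namespace `My` with two GPRs and two FPRs. Every register, class and instruction name here is hypothetical; only `Constraints`, `SubRegIndex`, `RegisterTuples` and `RegisterClass` come from `Target.td`. It is meant to parse with `llvm-tblgen -print-records`, and it says nothing about how the shared index would be encoded or printed.

```tablegen
include "llvm/Target/Target.td"

// Hypothetical registers, named only for this sketch.
class MyReg<string n> : Register<n> { let Namespace = "My"; }
def R0 : MyReg<"r0">;
def R1 : MyReg<"r1">;
def F0 : MyReg<"f0">;
def F1 : MyReg<"f1">;

def GPR : RegisterClass<"My", [i32], 32, (add R0, R1)>;
def FPR : RegisterClass<"My", [f32], 32, (add F0, F1)>;

// 1. Tied operands: the register allocator must give $dst and $src the
//    same physical register.
def INC : Instruction {
  let Namespace = "My";
  let OutOperandList = (outs GPR:$dst);
  let InOperandList = (ins GPR:$src);
  let AsmString = "inc $dst";
  let Constraints = "$dst = $src";
}

// 2. One operand that stands for a GPR and the FPR with the same index.
def sub_gpr : SubRegIndex<32>;      // 32 bits at offset 0
def sub_fpr : SubRegIndex<32, 32>;  // 32 bits at offset 32

// The two lists are zipped element-wise, giving tuples R0_F0 and R1_F1.
def GPRFPRTuples : RegisterTuples<[sub_gpr, sub_fpr],
                                  [(add R0, R1), (add F0, F1)]>;

// An untyped class has no value type to derive its size from, so set it.
def GPRFPR : RegisterClass<"My", [untyped], 32, (add GPRFPRTuples)> {
  let Size = 64;
}

def USEBOTH : Instruction {
  let Namespace = "My";
  let OutOperandList = (outs);
  let InOperandList = (ins GPRFPR:$src);
  let AsmString = "useboth $src";
}
```

With this layout, `USEBOTH` has a single register operand, and the two halves can be reached through `sub_gpr` and `sub_fpr` where needed.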

egorshamshura · July 14, 2025, 1:25pm

Thank you very much for your reply! I have one more question. Am I right in assuming that multiple output operands are fully supported? If so, how can I create a pattern that returns multiple registers if it is not an intrinsic?

s-barannikov · July 14, 2025, 1:36pm

When it comes to writing patterns, this is fully supported only for patterns embedded into instruction definitions (let Pattern = ...). Top-level patterns (def Pat : ...) have very limited support: they require that the source and destination results match in order and belong to the same node/instruction.

Example:

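def MyInstr : Instruction {
  let OutOperandList = (outs RC:$dst1, RC:$dst2);
  let InOperandList = (ins RC:$src1, RC:$src2);
  // The set lists results in the order the node produces them:
  // $dst2 first, then $dst1.
  let Pattern = [(set RC:$dst2, RC:$dst1, (divrem $src1, $src2))];
}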

where divrem has two results in reverse order compared to your instruction.

If you need more than one pattern or you don’t like embedded patterns you will have to write custom selection code in C++.
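
For contrast, here is a sketch of the top-level form under the restriction described above: both results come from one node and appear in the same order as the instruction's outputs. It uses LLVM's `sdivrem` node (quotient first, then remainder) in place of the placeholder `divrem`. The target namespace, registers and the `MYDIVREM` instruction are hypothetical, and the snippet has only been reasoned about as something that parses with `llvm-tblgen -print-records`, not run through the DAG ISel emitter.

```tablegen
include "llvm/Target/Target.td"

// Hypothetical registers and class, named only for this sketch.
class MyReg<string n> : Register<n> { let Namespace = "My"; }
def R0 : MyReg<"r0">;
def R1 : MyReg<"r1">;
def GPR : RegisterClass<"My", [i32], 32, (add R0, R1)>;

// Hypothetical instruction: first result is the quotient, second the
// remainder, matching the result order of sdivrem.
def MYDIVREM : Instruction {
  let Namespace = "My";
  let OutOperandList = (outs GPR:$quot, GPR:$rem);
  let InOperandList = (ins GPR:$a, GPR:$b);
  let AsmString = "divrem $quot, $rem, $a, $b";
}

// Top-level pattern: source and destination results line up one to one.
def : Pat<(sdivrem GPR:$a, GPR:$b), (MYDIVREM GPR:$a, GPR:$b)>;
```

If the instruction produced the remainder first, the results would no longer match in order, and the embedded `let Pattern = [(set ...)]` form would be the one to reach for.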

egorshamshura · July 14, 2025, 2:08pm

I’m sorry, but I do not understand how to create a pattern for a DAG with two or more different output branches. It is not obvious how to handle all of these nodes in general:

    $src1   $src2
        \   /
         add
        /   \
      abs   abs
       |     |
    $dst1   neg
             |
           $dst2

let Pattern = [(set GPR:$dst1, GPR:$dst2, (???))];

s-barannikov · July 14, 2025, 2:38pm

In this case $dst1 and $dst2 belong to different nodes (abs and neg). It is not possible to write a pattern for this example.

egorshamshura · July 14, 2025, 2:44pm

Is it true that a DAG always has one output node? And if an instruction's semantics look like the diagram above, how should such an instruction be described?

s-barannikov · July 14, 2025, 4:59pm

Yes, they have one root node.

I’m not sure I understand the question. Instruction semantics is opaque to LLVM. It only knows about inputs/outputs and properties like hasSideEffects etc.
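
To illustrate that point, a sketch of an instruction described only by its operands and a few property bits, with no selection pattern at all. The namespace, registers and the `ADDABS` name are hypothetical; `hasSideEffects`, `mayLoad` and `mayStore` are fields of the `Instruction` class in `Target.td`.

```tablegen
include "llvm/Target/Target.td"

// Hypothetical registers and class, named only for this sketch.
class MyReg<string n> : Register<n> { let Namespace = "My"; }
def R0 : MyReg<"r0">;
def R1 : MyReg<"r1">;
def GPR : RegisterClass<"My", [i32], 32, (add R0, R1)>;

// Two outputs, two inputs. What the instruction computes is not stated
// anywhere in this record.
def ADDABS : Instruction {
  let Namespace = "My";
  let OutOperandList = (outs GPR:$dst1, GPR:$dst2);
  let InOperandList = (ins GPR:$src1, GPR:$src2);
  let AsmString = "addabs $dst1, $dst2, $src1, $src2";
  // Empty pattern: TableGen-generated selection never picks this by itself.
  let Pattern = [];
  // Properties the optimizer and scheduler are allowed to rely on.
  let hasSideEffects = 0;
  let mayLoad = 0;
  let mayStore = 0;
}
```

An instruction defined this way has to be emitted by hand, for example from custom selection code in C++.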

egorshamshura · July 15, 2025, 8:17am

Thanks a lot! I now understand it a little better. However, I thought that it was possible to create a pattern with multiple root (output) nodes.

Hexa · July 24, 2025, 4:05pm

To achieve such behavior, you have to define a new SDNode, I believe.
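
A rough sketch of what that could look like on the TableGen side, assuming a hypothetical target-specific opcode `MyISD::ADD_ABS_NABS` that produces both values from one node. All names are made up for illustration. The C++ code that creates this node (for example in custom lowering or a DAG combine) is not shown and would still have to be written.

```tablegen
include "llvm/Target/Target.td"

// Hypothetical registers and class, named only for this sketch.
class MyReg<string n> : Register<n> { let Namespace = "My"; }
def R0 : MyReg<"r0">;
def R1 : MyReg<"r1">;
def GPR : RegisterClass<"My", [i32], 32, (add R0, R1)>;

// Two results and two operands, all of the same integer type.
def SDT_MyAddAbs : SDTypeProfile<2, 2, [
  SDTCisInt<0>, SDTCisSameAs<0, 1>, SDTCisSameAs<0, 2>, SDTCisSameAs<0, 3>
]>;

// Hypothetical node: result 0 is abs(a + b), result 1 is neg(abs(a + b)).
def my_addabs : SDNode<"MyISD::ADD_ABS_NABS", SDT_MyAddAbs>;

// Both results now belong to a single node, so an embedded pattern with
// two set destinations can describe the match.
def ADDABS : Instruction {
  let Namespace = "My";
  let OutOperandList = (outs GPR:$dst1, GPR:$dst2);
  let InOperandList = (ins GPR:$src1, GPR:$src2);
  let AsmString = "addabs $dst1, $dst2, $src1, $src2";
  let Pattern = [(set GPR:$dst1, GPR:$dst2,
                      (my_addabs GPR:$src1, GPR:$src2))];
}
```

The idea is to move the multi-root matching out of TableGen: C++ recognizes the add/abs/neg shape and replaces it with one two-result node, and the pattern only has to match that node.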

Hexa · August 21, 2025, 11:41am

Alternatively, you can use complex patterns (ComplexPattern) and implement the C++ function that does the matching. I recommend “LLVM Code Generation” by Quentin Colombet. I just started it, and it has cleared up a lot of my confusion about this.
