[RFC] Merging 1:1 and 1:N Dialect Conversions

Context

There are currently two incompatible dialect conversion drivers in MLIR: the “default” driver and the 1:N dialect conversion driver. The 1:N driver was added some time ago to address limitations in the default driver; e.g., the default driver does not support replacing an op that produces one SSA value with multiple SSA values. The 1:N driver is used mainly in downstream projects.

This RFC proposes:

  1. Extending the default driver with full 1:N conversion support.
  2. Deleting the 1:N driver.

The purpose of this change is to remove the duplicate dialect conversion infrastructure. Another goal is to clean up the API (in particular, remove argument materializations) before continuing with the One-Shot Dialect Conversion effort.

1:N Conversions in the Default Driver

The default driver has limited support for 1:N conversions: the type converter API (convertType and materializations) supports them, and there is an API to replace a block argument with multiple replacement values (during applySignatureConversion). However, the remaining infrastructure (in particular the ConversionValueMapping and the ConversionPattern API) does not. To support 1:N replacements of block arguments, the default driver uses “argument materializations” as a workaround: these convert the N replacement values back into a single SSA value that can be stored in the mapping and passed to adaptors.

Implementation Outline

The following changes are required to add 1:N conversion support to the default driver:

  1. Minor extensions to the TypeConverter API. The argument materializations carry additional information that must be passed to target materializations. For details, see [mlir][Transforms] Dialect conversion: add `originalType` param to materializations by matthias-springer · Pull Request #112128 · llvm/llvm-project · GitHub.
  2. Minor extension to the OpAdaptor C++ code generation, so that a 1:1 adaptor can be constructed from a 1:N adaptor. For details, see [mlir][tblgen] Add additional constructor to Adaptor class by matthias-springer · Pull Request #112144 · llvm/llvm-project · GitHub.
  3. ConversionValueMapping: Store Value → ValueRange mappings instead of Value → Value mappings. This is necessary because an SSA value can now be replaced with multiple SSA values.
  4. Provide an additional 1:N ConversionPattern API, where each adaptor method now returns a ValueRange instead of a Value.
  5. Provide an additional replaceOp API that supports 1:N replacements.
  6. Remove argument materializations entirely. They are no longer needed.
  7. Remove other workarounds that were needed because of 1:N limitations. A typical workaround is manually packing/unpacking replacement values in an unrealized_conversion_cast. (E.g., there’s one in the sparse compiler.)

ConversionValueMapping Data Structure

The ConversionValueMapping keeps track of replacements of SSA values that have not been materialized in IR yet. Until now, this has essentially been an IRMapping. To support 1:N mappings, a data structure such as the following is needed:

struct ConversionValueMapping {
  /// Replacement mapping: Value -> ValueRange
  DenseMap<Value, SmallVector<Value, 1>> mapping;

  /// Materializations: ValueRange -> [ValueRange]
  DenseMap<SmallVector<Value, 1>,
           SmallVector<SmallVector<Value, 1>>,
           SmallVectorMapInfo>
      materializations;
};

The materializations map stores unresolved materializations. In the current implementation, it is folded into the IRMapping, which I believe is incorrect in some cases, because we cannot distinguish between actual replacements and type conversions of replaced values. Materializations can occur in the following shapes:

  • 1:1: This is the typical case in a 1:1 dialect conversion.
  • 1:N: A target materialization could be a 1:N conversion. (1 type is legalized to N types.)
  • N:1: A source materialization could convert a legalized type range (N types) back to a single type.
  • N:M: E.g., a single value of type t was replaced with N legalized types, and a subsequent pattern has a different type converter that legalizes t to M different types. In that case, an N:M conversion is inserted.

Furthermore, there can be multiple materializations for the same type/type range. That’s why the key of the map is a ValueRange and the value of the map is a list of ValueRanges.

Note: To optimize for the more common case of 1:1 conversions, all ValueRanges are actually SmallVector<Value, 1>.

Note: With the One-Shot Dialect Conversion driver, the ConversionValueMapping will be deleted. However, I cannot do all of that in one step. I have to break things down into smaller refactoring steps to make it possible for downstream users to adapt to the changes step-by-step.

ConversionPattern API

To support N replacement values, the adaptor must now return ValueRange instead of Value. The auto-generated adaptors already allow for custom value types (via C++ templates). I propose to add a second matchAndRewrite overload that supports 1:N replacements via their adaptors.

class ConversionPattern {
 public:
  // Current 1:1 API. Will remain unchanged.
  virtual LogicalResult
  matchAndRewrite(Operation *op, ArrayRef<Value> operands,
                  ConversionPatternRewriter &rewriter) const;

  // New 1:N API.
  virtual LogicalResult
  matchAndRewrite(Operation *op, ArrayRef<ArrayRef<Value>> operands,
                  ConversionPatternRewriter &rewriter) const {
    // By default, dispatches to 1:1 API.
    return matchAndRewrite(op, getOneToOneAdaptorOperands(operands), rewriter);
  }

 private:
  // Unpack 1:1 replacements. Fail if 1:N replacements are found.
  SmallVector<Value> getOneToOneAdaptorOperands(
      ArrayRef<ArrayRef<Value>> operands) const {
    SmallVector<Value> oneToOneOperands;
    oneToOneOperands.reserve(operands.size());
    for (ArrayRef<Value> operand : operands) {
      if (operand.size() != 1)
        llvm::report_fatal_error("pattern '" + getDebugName() +
                                 "' does not support 1:N conversion");
      oneToOneOperands.push_back(operand.front());
    }
    return oneToOneOperands;
  }
};

The dialect conversion driver always invokes the 1:N matchAndRewrite entry point of patterns. That’s because it fills the adaptor with mappings from the ConversionValueMapping, which are stored as 1:N mappings.

If an existing 1:1 pattern is run in a 1:1 conversion, nothing changes. However, if the type converter returns a 1:N mapping and the pattern does not implement the 1:N entry point, a fatal error is triggered.

(Similar changes for OpConversionPattern etc. See prototype for details.)

ConversionPatternRewriter API

The rewriter API remains mostly unchanged. Just one new function is needed:

void ConversionPatternRewriter::replaceOpWithMultiple(
    Operation *op, ArrayRef<ValueRange> newValues);

I tried overloading replaceOp but ran into various cases where existing code no longer compiles due to ambiguous overload resolution.

Changes for Downstream Users

If you are a user of the 1:N dialect conversion driver, you have to update your patterns. (The 1:N driver does not use ConversionPattern.) This can be done gradually: the 1:N driver can stay around for a while after 1:N support has been added to the default driver.

If you are using argument materializations, you have to incorporate the functionality into target materializations. That’s because argument materializations will be removed.

If you are using a type converter that has 1:N type conversions, you are exercising a code path in the dialect conversion that is incomplete (example). You may have to update some existing 1:1 conversion patterns to 1:N conversion patterns, as those will start receiving 1:N replacements.

Status

I am working on a prototype here: [mlir][draft] Support 1:N dialect conversion by matthias-springer · Pull Request #112141 · llvm/llvm-project · GitHub. It is still in a very rough state (not ready for review yet) and not all tests are passing. In particular, the MemRef → LLVM conversions do not work yet.

Over the next few weeks, I’d like to gather feedback about the design described above. I hope to have a working and polished prototype some time after the LLVM conference.

@ingomueller-net @zero9178 @ftynse @mehdi_amini @River707 @jpienaar


+1 on the proposal.

Also, we’re looking into N:1 conversions (for pattern matching and lowering to a single op/intrinsic/function call), and it would be nice if we could do this as part of the default mechanism instead of creating our own.

We do the matching, then we replace a single op (the hero op), then after replacement, we remove all the others in the list, but that’s messy because of producer/user mapping.

It would be nice if a single function would be able to detect all producers and users and replaceAllUsesWith for each value.

void ConversionPatternRewriter::replaceOpsWithMultiple(
    ArrayRef<Operation *> ops, ArrayRef<ValueRange> newValues);

Or even just have the one method and call the internal implementation when either ops or values is only one:

void ConversionPatternRewriter::replaceOpsWith(
    ArrayRef<Operation *> ops, ArrayRef<ValueRange> newValues);

Thank you. We’ve used various workarounds for dealing with 1:N issues and it will be good to unroll these. Thanks for taking the time to think through an implementation plan for how these changes will be made. There is a certain amount of archaeology involved in uncovering and adapting to these things and this helps a lot when planning for that.

Thanks @matthias-springer. Any cleanup here is great! Would it be possible to stage the changes to reduce downstream disruption, i.e., allow downstream projects to try the change out early to surface issues and untested code paths? Just notifying folks here would be good, along with an expected timeline of when changes are going to land.

I’m going to break it up into smaller changes as much as possible. E.g., the type converter and most ConversionPattern changes can be done separately and should not affect existing code, as they just add functionality that won’t be exercised until the “main” PR. There’s likely going to be one larger PR that switches the ConversionValueMapping and removes the argument materializations; that PR cannot really be broken up. I’m going to post it here when it’s ready for review, so that you can give it a try.

This SGTM, and I welcome the unification.

No objection to cleaning up this beast.

@MaheshRavishankar I’m going to list all PRs in this post and will edit it to add new PRs as I upload them.

Out for Review

Merged

Thanks @matthias-springer !

This migration is complete. The “default” dialect conversion driver now supports 1:N replacements. All 1:N conversion-based passes in MLIR (except for the test cases of the 1:N conversion framework itself) have been migrated to the default dialect conversion driver. A few more minor cleanups may follow.

The 1:N conversion driver is now marked as deprecated. I plan to delete it from the MLIR code base in April 2025. Please migrate your passes by then, or let me know if you need more time.

As a next step, I will continue looking at removing the rollback functionality from the dialect conversion framework.

A big shout out to @zero9178 for many in-depth code reviews and design discussions!
