Error when lowering affine to gpu (affine.load is not recognized)

Here is my MLIR file for onnx.add. It has already been lowered to the affine dialect.

module attributes {llvm.data_layout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128", llvm.target_triple = "x86_64-unknown-linux-gnu"} {
  func @main_graph(%arg0: memref<3x2xf32>, %arg1: memref<3x2xf32>) -> memref<3x2xf32> attributes {input_names = ["X1", "X2"], output_names = ["Y"]} {
    %0 = memref.alloc() {alignment = 16 : i64} : memref<3x2xf32>
    affine.for %arg2 = 0 to 3 {
      affine.for %arg3 = 0 to 2 {
        %1 = affine.load %arg0[%arg2, %arg3] : memref<3x2xf32>
        %2 = affine.load %arg1[%arg2, %arg3] : memref<3x2xf32>
        %3 = arith.addf %1, %2 : f32
        affine.store %3, %0[%arg2, %arg3] : memref<3x2xf32>
      }
    }
    return %0 : memref<3x2xf32>
  }
  "krnl.entry_point"() {func = @main_graph, numInputs = 2 : i32, numOutputs = 1 : i32, signature = "[    { \22type\22 : \22f32\22 , \22dims\22 : [3 , 2] , \22name\22 : \22X1\22 }\0A ,    { \22type\22 : \22f32\22 , \22dims\22 : [3 , 2] , \22name\22 : \22X2\22 }\0A\0A]\00@[   { \22type\22 : \22f32\22 , \22dims\22 : [3 , 2] , \22name\22 : \22Y\22 }\0A\0A]\00"} : () -> ()
}

I want to lower it to the gpu dialect, so I wrote the code below:

void addAffineToGPUPasses(mlir::PassManager &pm) {
  pm.addNestedPass<FuncOp>(mlir::createAffineForToGPUPass()); 

  // pm.addPass(mlir::createGpuKernelOutliningPass());
  // pm.addPass(mlir::createGpuToLLVMConversionPass());
}

But here is the error I get:

error: 'affine.load' op index must be a dimension or symbol identifier

Can anyone help me with it?


Affine memory operations require their subscript operands to be valid affine dimension or symbol identifiers (see 'affine' Dialect - MLIR). Unlike affine.for induction variables, GPU thread index values do not qualify, so the affine.load/affine.store subscripts become invalid once the loop is mapped to gpu.launch, hence the error.

You need to convert affine memory operations to memref operations first, and then convert the loops separately. This may require more code than just setting up a pass pipeline. Alternatively, you can lower the affine operations to a mix of memref+scf, and then map that to GPU.
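For the second route, a pipeline along the following lines should work. This is an untested sketch: the pass-creation functions (createAffineParallelizePass, createLowerAffinePass, createGpuMapParallelLoopsPass, createParallelLoopToGpuPass) are the upstream MLIR ones, and the exact header paths and FuncOp spelling may differ depending on your MLIR version.

#include "mlir/Conversion/AffineToStandard/AffineToStandard.h"
#include "mlir/Conversion/SCFToGPU/SCFToGPUPass.h"
#include "mlir/Dialect/Affine/Passes.h"
#include "mlir/Dialect/GPU/Passes.h"
#include "mlir/Pass/PassManager.h"

void addAffineToGPUPasses(mlir::PassManager &pm) {
  // affine.for -> affine.parallel where the loops are provably parallel.
  pm.addNestedPass<mlir::FuncOp>(mlir::createAffineParallelizePass());
  // affine.parallel / affine.load / affine.store -> scf.parallel + memref ops.
  pm.addNestedPass<mlir::FuncOp>(mlir::createLowerAffinePass());
  // Attach a GPU block/thread mapping to each scf.parallel loop nest.
  pm.addNestedPass<mlir::FuncOp>(mlir::createGpuMapParallelLoopsPass());
  // Mapped scf.parallel -> gpu.launch.
  pm.addNestedPass<mlir::FuncOp>(mlir::createParallelLoopToGpuPass());
  // Outline each gpu.launch body into a gpu.func inside a gpu.module.
  pm.addPass(mlir::createGpuKernelOutliningPass());
}

From there, the gpu-to-LLVM conversion you already have commented out can take over. If the parallelization pass cannot prove a loop parallel, it will stay as scf.for and you will need to handle it separately.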


Thank you very much. I'm trying now.