Sparse Tensors in MLIR

Thank you so much ! I successfully get the right result ! I’ll keep playing with the sparse dialect :wink: Very interesting

1 Like

I have observed that, in the SparseGPUCodegen.cpp, the kernel definition appears to be automatically generated from the linalg computations (as a fallback).

I intend to modify this to instead make calls into CuSparse (or any other sparse-tensor optimal CUDA library).

This would necessitate modifying the GPU MLIR dialect to represent calls and types of this CUDA library as MLIR types and operations.

However, I am uncertain about how to implement these changes from the MLIR dialect down to the GPU executable binary. Which transformations or other dialects do I need to modify?

What do you think is the estimated effort required to carry out these changes? Are there any existing works that I can leverage?

In general, do you believe this project is a worthwhile endeavor?

Thank you for your time and consideration.

Hi Arun,

Note that there is already a lowering to library calls (mainly cuSPARSE, but any external sparse library in principle). The MLIR sparsifier has two ways of generating GPU code, which we call “codegen” and “libgen”. Both paths can be found in SparseGPUCodegen.cpp:

“codegen”: populateSparseGPUCodegenPatterns
“libgen”: populateSparseGPULibgenPatterns

Both paths are there for experimentation: the direct codegen path that converts outermost parallel loops to kernel methods, and the libgen path that converts common operation (SpMV, SpMM, spGEMM, 2:4 SpMM, SDDMM) into GPU specific ops that are lowered into library calls later. You can find that in e.g. GPUToLLVMConversion.cpp with wrappers in CudaRuntimeWrappers.cpp.

You can enable the libgen path as follows (for any nonzero value, the codegen path is taken):

--sparse-gpu-codegen="num-threads=0"

You can find more information and examples in this prior GPU posting.

Overall, I think the libgen path performs much better, since codegen never took off. So you are of course welcome to improve either the codegen or libgen path (or both), but please don’t add libgen to codegen, that would break the underlying design.