Searching for a opensource MLIR end-to-end Nvidia GPU lowering example project

pal-stdr · November 29, 2023, 4:51pm

Hello

I am searching for an end-to-end opensource (from github or, gitlab) project/example repo, in which it is demonstrated:

Take simple C or Python code as input for a sample operation (e.g. vector add). Or even take just MLIR (i.e. .mlir file) as input
Then progressively lowered using different features of GPU dialect to exploit Nvidia GPU HW resources.
Finally the cubin generation.

I would be really grateful, if someone shares repository like this.

There are Tensorflow or Pytorch implementations available. But I want something more “simple” to understand, how a custom end-to-end GPU implementation (Nvidia or even AMDGPU) works through MLIR compilation pipeline.

Also I share another context, from which I am searching for such thing. Let’s see the following code example (shared from here)

// example.cu ()
__global__ void fill(int n, float a, float *x) {
  int i = threadIdx.x + blockIdx.x * blockDim.x;
  if (i < n)
    x[i] = a;
}
int main(void) {
  int n = 1 << 20;
  float *x;
  cudaMallocManaged(&x, n * sizeof(float));
  fill<<<(n + 255) / 256, 256>>>(n, 2.0f, x);
  cudaDeviceSynchronize();
  cudaFree(x);
}

All the information I can find from MLIR doc, focuses on MLIR features only for GPU kernel (e.g. fill kernel in example code). And some information on GPU offloading such as, how can I call this kernel from host. But it would be really nice, if I could see an example GPU offloading repository showing how can I use MLIR for managing memory buffers similar to cudaMalloc, number of block or threads from the host, etc.

In simple words, I am searching for something, from which I can have a compact idea on, how can I built an end-to-end pipeline.

Thanks in advance!

mehdi_amini · November 29, 2023, 7:47pm

What about our integration tests? Like this one for example: https://github.com/llvm/llvm-project/blob/main/mlir/test/Integration/GPU/CUDA/printf.mlir

pal-stdr · November 30, 2023, 8:42am

Wonderful!!!
Thanks a lot!
Just bothering you little more with some related question

Should I use the latest MLIR/LLVM release (e.g. now 17.0.6) to get all the new features for MLIR + GPU dialect?
Could you please provide some expert suggestions for newbies like me? Such as, things to consider, things to-do/not-to-do while developing this GPU compilation pipeline?

Thanks again

mehdi_amini · November 30, 2023, 9:30am

Probably the current main branch, depending what you’re looking for (Hopper support is steadily progressing for example, Sparse compilation as well).

Sorry: this is a bit vague of a question for me to answer right now
Maybe others will chime in?

Topic		Replies	Views
Any example project/code for mlir nvidia GPU offloader inside llvm-project? Or can I find it in any other place? Beginners mlir	0	123	January 18, 2024
How to build a mlir gpu-codegen project？ MLIR gpu , llvm	7	1066	March 7, 2023
GPU code generation status: NVidia, OpenCL MLIR	6	3332	October 23, 2020
MLIR GPU execution path - non-JIT trial MLIR	5	1258	June 14, 2021
GSoC 2012 Proposal: Automatic GPGPU code generation for llvm LLVM Dev List Archives	11	110	April 4, 2012

Searching for a opensource MLIR end-to-end Nvidia GPU lowering example project

Related topics