Topics tagged gpu

Topic	Category	Tags	Replies	Views	Activity
Emissary APIs, a general purpose framework for GPU-initiated host execution of native host APIs	OpenMP	gpu	0	31	December 18, 2025
Question about gpu.subgroup_mma_load_matrix and store ops	MLIR	gpu, mlir	5	105	October 31, 2025
Using function level analysis inside a module pass	LLVM Project	gpu, llvm	0	53	October 24, 2025
[GPU] Why does `gpu.launch_func` only accept a single async dependency?	MLIR	gpu, mlir	4	167	October 14, 2025
DNN model lowering from Linalg dialect to GPU target	MLIR	gpu	1	90	October 12, 2025
Run linalg.matmul on gpu	MLIR	gpu, llvm, mlir	4	534	October 9, 2025
Relationship between gpu.subgroup_mma_compute and spirv.KHR.CooperativeMatrixMulAdd in MLIR	MLIR	gpu, mlir, spirv	1	61	October 9, 2025
[CFP][LLVM-DEV'25] LLVM/Offload Workshop	Announcements	cuda, hip, gpu, llvm, openmp	1	259	October 8, 2025
[RFC] Cleaning the GPU dialect	MLIR	gpu	52	1451	October 6, 2025
Mlir-opt crashing with gpu.launch operation	MLIR	gpu, mlir	2	63	September 30, 2025
About the input of DL training compiler	Tensor Compiler	gpu, mlir	0	76	September 15, 2025
CUDA_ERROR_INVALID_VALUE when converting tiled scf.parallel to GPU kernels	MLIR	cuda, gpu	0	51	September 9, 2025
[RFC] LLVM policy for top level directories and language runtimes	LLVM Project	gpu	19	775	September 8, 2025
[RFC] Add memory scope to GPU barrier	MLIR	gpu, mlir	26	873	August 28, 2025
[RFC][SPIR-V] Way to represent float8 in LLVM IR	Code Generation	gpu, spirv	9	374	August 24, 2025
[RFC] No-loop mode for OpenMP GPU kernels	Flang	gpu, openmp	15	445	August 21, 2025
OpenMP Offload Fortran Tests Pass with Flang-new	OpenMP	gpu, llvm	7	428	July 31, 2025
Gpu.memcpy has problems with lowering when processing strided memref	MLIR	gpu	1	71	July 31, 2025
Can shared memory be used at the affine level?	MLIR	gpu	2	87	July 17, 2025
Issues Compiling With Offloading Support	OpenMP	gpu	8	343	July 14, 2025
How to disable section merging?	LLD	gpu, riscv, clang, llvm	14	264	July 9, 2025
How can I save the callgraph in a separate section .callgraph in an ELF file?	LLD	core, gpu, riscv, clang, llvm	0	58	July 2, 2025
Modular and our MAX Platform team is looking for Compiler and LLDB engineers	Job Postings	gpu, llvm	0	256	July 1, 2025
How to set tile sizes for affine-loop-tile pass in MLIR C++ API?	MLIR	gpu, mlir	2	93	June 26, 2025
How do you measure GPU kernel execution time in MLIR?	MLIR	gpu, mlir	6	252	June 25, 2025
How to pass arguments to the mlir file	MLIR	gpu, llvm, mlir	1	115	June 19, 2025
GPU Offloading Docker Image	OpenMP	gpu, clang, llvm	16	310	June 2, 2025
[mlir][vector distribution] WarpOpScfForOp fails when scf.for has results that are unused	MLIR	gpu, mlir	2	128	May 28, 2025
Converting CUDA program to mlir (gpu, linalg etc.)	MLIR	cuda, gpu, llvm	9	274	May 21, 2025
[RFC] Add GPU operations to permute data in 2 loaded mma_matrix	MLIR	gpu, mlir	5	242	May 19, 2025