I heard that the context for this has to do with XLA: the types are needed not so much in LLVM itself as in MLIR’s LLVM dialect, to distinguish between the different 8-bit float formats when lowering to target-specific matrix-multiply ops.
I’d suggest that, instead, you define MLIR ops that wrap the lower-level float8 operations and lower first to nvgpu.our_matmul_thing : ... e5m2, and then to nvvm.our_matmul_intrinsic_e5m2 : i8, by defining a type conversion from e5m2 and e4m3 to i8 during the *ToLLVM process.
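A rough sketch of what that two-step lowering could look like in MLIR IR. The op names here are the placeholder names from above, not real nvgpu/nvvm ops, and the builtin fp8 type spellings (f8E5M2, f8E4M3FN) and shapes are assumptions that may differ by MLIR version:

```mlir
// Hypothetical higher-level op: the fp8 format is carried in the type.
%d0 = nvgpu.our_matmul_thing %a0, %b0 :
    (vector<4xf8E5M2>, vector<4xf8E5M2>) -> vector<2xf32>

// After the *ToLLVM type conversion maps f8E5M2 (and f8E4M3FN) to i8,
// the format is encoded in the op name instead of the element type,
// so nothing below this level needs to know about fp8 at all.
%d1 = nvvm.our_matmul_intrinsic_e5m2 %a1, %b1 :
    (vector<4xi8>, vector<4xi8>) -> vector<2xf32>
```

The point of the design is that the fp8 distinction only has to survive down to the op-selection step; once the right intrinsic has been chosen, the bits can travel as plain i8 through the LLVM dialect.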