[RFC] Add suport for QuantileQuantizedType in Quant dialect

rafaelubal · August 1, 2024, 2:59pm

Thanks for the mention, Stella, and for the proposal, Zoran.

As a heads-up, we have currently an active PR for a revamp of the quant dialect, focusing on the !quant.uniform type with per-layer and per-axis quantization, a spec refinement with full documentation for the quant ops, lowering support for these ops, and a pass to flatten quant types. This PR involves some file restructuring to mirror other dialects, so it’d be best to get this merged before considering other dialect extensions.

I can’t speak to the relevance of quantile quantization support for the MLIR community at the moment, or specifically to the tradeoff between feature value and maintenance liability, but I’ll be happy to review your work, Zoran, if it moves forward. Your feedback on our PR could be very valuable, given your vision on a possible evolution of the dialect.

Topic		Replies	Views
How to Support F8E4M3FType in Quant Dialect uniformQuantizeType？ MLIR	5	398	January 15, 2024
[RFC] Extending UniformQuantizedType with interface-based support for new storage types in Quant dialect MLIR mlir	8	376	December 15, 2025
[RFC] Improvements in the 'quant' dialect MLIR	12	912	August 2, 2024
Help with Quant dialect MLIR llvm , mlir	5	219	October 29, 2024
MLIR Quantization Roadmap? MLIR	13	2287	March 10, 2020

[RFC] Add suport for QuantileQuantizedType in Quant dialect

Related topics