[RFC] Add suport for QuantileQuantizedType in Quant dialect

Thanks for the mention, Stella, and for the proposal, Zoran.

As a heads-up, we have currently an active PR for a revamp of the quant dialect, focusing on the !quant.uniform type with per-layer and per-axis quantization, a spec refinement with full documentation for the quant ops, lowering support for these ops, and a pass to flatten quant types. This PR involves some file restructuring to mirror other dialects, so it’d be best to get this merged before considering other dialect extensions.

I can’t speak to the relevance of quantile quantization support for the MLIR community at the moment, or specifically to the tradeoff between feature value and maintenance liability, but I’ll be happy to review your work, Zoran, if it moves forward. Your feedback on our PR could be very valuable, given your vision on a possible evolution of the dialect.

1 Like