Sorry for joining this thread late. A few comments/questions.
I do like the proposal to add getScalarBitwidth(). As we may have discussed before, exposing it through an interface would let it work with any type that lives downstream, which would be a big improvement. I’m happy to help if we reach consensus on this.
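To make the interface idea concrete, here is a rough standalone sketch. It is plain C++, not actual MLIR code: `ScalarBitwidthInterface`, `Float4Type`, `DownstreamFloat6Type`, and `packedSizeInBytes` are all hypothetical names made up for illustration, and the dense bit-packing in the helper is an assumption, not something decided in this thread.

```cpp
#include <cstdint>

// Hypothetical stand-in for a type interface; NOT the real MLIR API.
struct ScalarBitwidthInterface {
  virtual ~ScalarBitwidthInterface() = default;
  // Number of bits occupied by one scalar element of this type.
  virtual unsigned getScalarBitwidth() const = 0;
};

// Hypothetical upstream 4-bit float type.
struct Float4Type : ScalarBitwidthInterface {
  unsigned getScalarBitwidth() const override { return 4; }
};

// Hypothetical type defined in a downstream project. Generic code only
// sees the interface, so it needs no knowledge of this type.
struct DownstreamFloat6Type : ScalarBitwidthInterface {
  unsigned getScalarBitwidth() const override { return 6; }
};

// Generic helper: bytes needed to store `numElements` scalars, assuming
// dense bit-packing (an assumption of this sketch), rounded up to a byte.
std::uint64_t packedSizeInBytes(const ScalarBitwidthInterface &type,
                                std::uint64_t numElements) {
  std::uint64_t bits =
      static_cast<std::uint64_t>(type.getScalarBitwidth()) * numElements;
  return (bits + 7) / 8;
}
```

The point is only that `packedSizeInBytes` never has to enumerate the types it supports, so a downstream type opts in just by implementing the interface.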
About allowing fp4/fp6 formats in tensors: I still think this would be useful for high-level representations. We already support sub-byte integer types in tensors, so it doesn’t seem like a new problem.
+1, and this seems orthogonal to the block representation.
Note that there is a recent PR that goes in that direction (I’m not the author, nor have I discussed it much with them, but I think this would be useful):