Lowering of scatter operations

sabauma · May 10, 2023, 7:17pm

There seem to be two representations of scatter operations in the core MLIR repo: tosa.scatter and tensor.scatter.

As far as I can tell, there are no lowerings out of either of these representations.

For tosa.scatter, this was noted on in TOSA to Linalg lowering (tosa.scatter) and still seems to be true. The difficulty there being the inability for any existing Linalg operation to represent a scatter-like operation.

The tensor.scatter operation appears to have been introduced some time after that discussion. tensor.scatter seems like a possible lowering target for tosa.scatter except that it also lacks any lowerings.

My question is, are there any plans to have lowerings for these operations at some point in the future to the core MLIR repo? Does everyone just implement their own lowerings for these ops or is there just little demand for such a lowering?

kuhar · May 11, 2023, 1:43pm

We also have vector.scatter.

bondhugula · May 21, 2023, 12:45pm

Normally, if someone added an op like this to an upstream dialect, the expectation is that there is a lowering path to the LLVM dialect or an imminent plan to support that. Can you check via git blame and tag the author who added it (or look up the commit summary)? That will answer your questions. I personally don’t find it ideal if ops are added to such core dialects without a path or an imminent plan to lower them to the LLVM dialect.

matthias-springer · May 23, 2023, 7:53am

We should add bufferization support for tensor.gather and tensor.scatter (in Tensor/Transforms/BufferizableOpInterfaceImpl.cpp). The implementation does not have to be very efficient, it just has to lower to MemRef in some way, so that it is executable. In fact, tensor.gather will always bufferize to a new memory allocation; that may not be desireable.

An alternative vectorization pass could lower those two ops more efficiently. (Maybe we can already vectorize tensor.gather/tensor.scatter, given that we also have vector.gather/vector.scatter?)

This is a pattern that we have for many other ops: E.g., linalg.generic can be bufferized and lowered to loops (which is somewhat inefficient in most cases). Or it can be vectorized.

medbzkst · October 30, 2023, 4:04am

Just wanted to bring up this issue. Is there any support for bufferization of tensor.scatter / tensor.gather or a trick to work it around?

sabauma · November 1, 2023, 7:02pm

@medbzkst, this may not be very helpful, but we were able to target the tosa.scatter and tosa.gather ops and managed to upstream a lowering for tosa.scatter. That proved sufficient for our use cases.

nicolasvasilache · November 2, 2023, 2:30pm

not at this time

nice, is there something here that can be generalized and used for tensor.scatter too ?

nicolasvasilache · November 2, 2023, 2:38pm

I looked into this but unfortunately, I found the multi-dimensional form of vector.gather to not be powerful enough in practice.

I.e. the current implementation (https://github.com/llvm/llvm-project/blob/23ad865e1090bf5895ad60cc7ed785fff31e7ec0/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td#L1891) the n-D vector of indices indexes only into a 1-D memref.

This means the following:

%0 = vector.gather %base[%c0][%v], %mask, %pass_thru
  : memref<?xf32>, vector<2x16xi32>, vector<2x16xi1>, vector<2x16xf32> into vector<2x16xf32>

is not semantically different from

%tmp = vector.gather %base[%c0][%v], %mask, %pass_thru
  : memref<?xf32>, vector<32xi32>, vector<32xi1>, vector<32xf32> into vector<32xf32>
%0 = vector.shape_cast %tmp : vector<32xf32> to vector<2x16xf32>

It would be much more powerful if indices could specify indexings into n-D memrefs.

xTayEx · March 6, 2025, 7:35am

Just wanted to bring up this issue. Is there any further progress on the bufferization of tensor.scatter? It has been 2 years.

Topic		Replies	Views
TOSA to Linalg lowering (tosa.scatter) MLIR	2	985	December 3, 2021
How to lower tensor.scatter TOSA llvm , mlir	2	130	November 15, 2024
[RFC] Adding Gather, Scatter Ops MLIR	17	1673	August 30, 2022
[RFC] Improving gather codegen for Vector Dialect MLIR	12	420	March 23, 2025
[PSA] Scalable auto-vec in Linalg without masking MLIR	9	546	June 25, 2024

Lowering of scatter operations

Related topics