Hi All -
Regarding the memref dialect
store and related operations I couldn’t tell if the spec requires these to be implemented as if volatile and atomic (
llvm does have
volatile qualifiers) ?
Is this an oversight or work in progess?
more fundamentally the execution model of
llvm was a single thread executing the IR. However for GPUs some operations are
collective and it is not clear how the various builtin dialects specify such requirements?