When emitting an affine.parallel operation, the compiler sometimes knows that the parallel region should use threads that are close together (e.g. to benefit from spatial locality between the threads) or, conversely, threads that are far apart (e.g. to reduce cache sharing between concurrent threads).
The OpenMP specification and the OpenMP dialect both support this via proc_bind (a clause in the spec, an attribute in the dialect). Typical values are “close” and “spread”, corresponding to “nearby” and “apart” in the description above.
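For readers less familiar with the OpenMP side, here is a minimal sketch in C of what the two policies look like at the source level. The function names sum_close and sum_spread are made up for illustration, and the actual thread placement also depends on the runtime and on environment settings such as OMP_PLACES, so treat this as an illustration of the clause rather than a statement about performance.

```c
/* Hypothetical helpers illustrating the OpenMP proc_bind clause.
 * If compiled without OpenMP support, the pragmas are ignored and
 * the loops simply run sequentially with the same result. */

/* Ask the runtime to place the team's threads near the parent thread,
 * which can help when threads benefit from shared caches. */
long sum_close(const long *data, int n) {
    long total = 0;
    #pragma omp parallel for proc_bind(close) reduction(+:total)
    for (int i = 0; i < n; ++i)
        total += data[i];
    return total;
}

/* Ask the runtime to distribute the team's threads across the
 * available places, which can help when threads compete for cache. */
long sum_spread(const long *data, int n) {
    long total = 0;
    #pragma omp parallel for proc_bind(spread) reduction(+:total)
    for (int i = 0; i < n; ++i)
        total += data[i];
    return total;
}
```

Both functions compute the same sum; only the requested binding policy for the parallel region differs.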
Is there an appetite for adding such a thread-affinity concept to the affine dialect's parallel operation?
Thanks for your insights on the matter.
Alexandre