Default value of OMP_PROC_BIND=false

xzhan23 · April 29, 2026, 3:57pm

Hi all,

I would like to initiate a discussion regarding the default thread affinity / binding behavior in the LLVM OpenMP runtime , specifically around the default value of OMP_PROC_BIND=false.

While evaluating OpenMP performance with Flang on modern multi-core linux system supporting affinity binding, we have observed that enabling thread affinity binding (e.g., setting’OMP_PROC_BIND=close’ and ‘OMP_PLACES=cores’) often results in significant speedup – e.g., the SPEComp2017 / ROMS2017_OMP benchmark shows a ~2X speedup.

The Cray compiler uses a default binding policy that distributes threads across cores, automatically capturing the performance advantages mentioned above (see “OMP_PROC_BIND” in https://cpe.ext.hpe.com/docs/latest/cce/man7/intro_openmp.7.html.

Has there been prior discussion on whether a simple topology-aware placement default would be preferable (especially for HPC workloads), compared with the current default of OMP_PROC_BIND=false? It can help users to avoid performance degradation in common cases (such as when the number of requested threads is less than or equal to the number of places) and skip binding for most problematic cases (where binding is more likely to degrade performance).

Topic		Replies	Views
KMP_AFFINITY and PROC_BIND for ARM OpenMP mlir	26	541	August 6, 2024
Behavior of OMP_PROC_BIND=true OpenMP	9	294	July 20, 2020
Does Clang v5.0.0 OpenMP Implementation recognize OMP_PLACES=threads env variable? Clang Frontend	1	125	September 27, 2018
Three patches: affinity cleanup, 8-bit atomics, types cleanup OpenMP	2	119	March 9, 2015
Thread affinity in affine.parallel MLIR	2	176	September 12, 2023

Default value of OMP_PROC_BIND=false

Related topics