instruction 'vote' without '.sync' is not supported on .target sm_70 and higher from PTX ISA version 6.4

Is it a known problem?

int main()
{
int t=-1;

#pragma omp target teams map(t)
{
#pragma omp parallel reduction(+: t)
{
#pragma omp parallel reduction(+: t)
{
t = 1;
}
}
}
}

$ clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda tmp.c
ptxas /tmp/thread_limit-984c26.s, line 1792; error : Instruction ‘vote’ without ‘.sync’ is not supported on .target sm_70 and higher from PTX ISA version 6.4
ptxas fatal : Ptx assembly aborted due to errors
clang-11: error: ptxas command failed with exit code 255 (use -v to see invocation)

$ bin/clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda tmp.c -Xopenmp-target -march=sm_60
ptxas /tmp/thread_limit-5cc9b6.s, line 1792; warning : Instruction ‘vote’ without ‘.sync’ is deprecated since PTX ISA version 6.0 and will be discontinued in a future PTX ISA version

Kelvin

I haven't seen that.

If no one knows a bug ID for this, please open up one.

It seems similar to https://bugs.llvm.org/show_bug.cgi?id=44587
My code also has reduction within each team. The reproducer here is much smaller and easier to work with.

Ye