Bug in clang-cuda overload set filtering

calewis · July 30, 2020, 2:45am

Hello all,

I believe I have found a bug in overload resolution for cuda code. I am currently awaiting permission to post to the bug tracker.

The following code doesn’t compile with newer versions of clang:

template
device host int foo(T *x) {
return 1;
}

device int foo(int *x) {
return 2;
}

host int foo(long *x) {
return 3;
}

device host int bar() {
auto long_val = 1l;
return foo(&long_val);
}

clang++ -O2 -g -x cuda --cuda-gpu-arch=sm_61 -std=c++14 -o main -c main.cpp give me:

error: reference to host function ‘foo’ in host device function
return foo(&long_val);
^
main.cpp:10:14: note: ‘foo’ declared here
host int foo(long *x) {

I believe that the issue is at https://github.com/llvm/llvm-project/blob/8224c5047e9cef2db4b0e31427cdf90a2568a341/clang/lib/Sema/SemaOverload.cpp#L9860

It’s possible that IdentifyCUDAPreference will return CFP_HostDevice for valid overloads, but this code doesn’t erase the wrong side candidates in that case. Then because the wrong side candidate is an exact match, minus its host device attributes, clang picks it as the best overload.

If I rewrite those lines as:

bool ContainsSameSideCandidate =
llvm::any_of(Candidates, [&](OverloadCandidate *Cand) {
// Check viable function only.
if (Cand->Viable && Cand->Function) {
auto MatchType = S.IdentifyCUDAPreference(Caller, Cand->Function);
return MatchType == Sema::CFP_HostDevice ||
MatchType == Sema::CFP_SameSide;
}
return false;
});

My code compiles again. I can submit a bug report once I am approved, but I figured I would post here in the mean time.

-Drew

Topic		Replies	Views
Cannot pass __device__ function as template parameter in CUDA? Using Clang cuda , gpu	3	999	June 28, 2022
How does Clang CUDA handle __host__ __device__ template instantiation with __host__ or __device__ only constructs? Clang Frontend	2	133	October 31, 2018
CUDA Support for clang-tidy clang-tidy cuda	6	730	July 25, 2022
Bug: Device friend function is not seen in a different file Clang Frontend	1	77	June 8, 2020
Heterogeneous target attributes overloading in Clang CUDA (__CUDA_ARCH__ considered harmful) Clang Frontend	1	84	November 7, 2018

Bug in clang-cuda overload set filtering

Related topics