How to let LLVM handle undefined behavior more gracefully?

Weiming_Zhao · March 17, 2016, 1:16am

Hi,

There are cases where LLVM is able to detect some UB but clang is not.

For example,

unsigned int foo(unsigned int x) {
   int ret = 0;
   for(int i = 0; i <= 32; ++i)
     ret += x >> i;
   return ret;
}

When the loop is unrolled, LLVM's InstructionSimplify catches the
shift by 32 (the i == 32 iteration) and folds it to an undef value.
How can we get LLVM to report a warning to help developers
correct the error?
Or should we match GCC's behavior (e.g. x >> 32 returns 0)?

This could also save compiler engineers' effort: users complain that it's
a compiler bug because their code works with GCC or an older version of
LLVM (because the loop was not unrolled there). And it's really hard to
debug such UB in a large code base.

Thanks,
Weiming
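
For comparison, here is a minimal sketch of how the loop above could be written so that every iteration is defined, whatever the optimizer does. The helper name shr_or_zero and the function name foo_defined are hypothetical, and returning 0 for an oversized shift count is just one possible choice (the one the post attributes to GCC), not something the C standard requires.

```c
#include <limits.h>

/* Hypothetical helper (not from the thread): a logical right shift that is
 * defined for every shift count. Counts >= the bit width yield 0. */
static unsigned int shr_or_zero(unsigned int x, unsigned int n) {
    const unsigned int width = sizeof(unsigned int) * CHAR_BIT;
    return n < width ? x >> n : 0u;
}

/* Same loop shape as foo(), still running i up to 32 inclusive, but the
 * out-of-range shift is replaced by an explicit, defined result. */
unsigned int foo_defined(unsigned int x) {
    unsigned int ret = 0;
    for (unsigned int i = 0; i <= 32; ++i)
        ret += shr_or_zero(x, i);
    return ret;
}
```

If the i == 32 iteration was never intended, changing the loop bound to i < 32 would be the simpler fix.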

mehdi_amini · March 17, 2016, 1:29am

Isn't it caught by UBSan?

Weiming_Zhao · March 17, 2016, 4:42am

Yes, I thought of UBSan. But in our case the target program runs on bare metal: it has very tight code-size restrictions and no stderr.
Since LLVM has already detected the undefined behavior during compilation, it should notify users about it.
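
For a target with no stderr, UBSan's trap mode may still be worth a look: instead of calling into a runtime library that prints a report, each failed check executes a trap instruction, so no runtime and no output channel are needed. The sketch below is only an illustration. The file name, the function name sum_shifts and its last parameter are hypothetical, and whether the added checks fit a tight code-size budget would have to be measured.

```c
/* Possible build line (an assumption about the toolchain setup):
 *
 *   clang -O2 -fsanitize=shift -fsanitize-trap=shift -c sum_shifts.c
 *
 * Older Clang releases spelled the trap option
 * -fsanitize-undefined-trap-on-error.
 */

/* Hypothetical variant of foo() with the loop bound as a parameter.
 * Built with the flags above, a call with last >= 32 (on a target with
 * 32-bit unsigned int) is expected to trap at the offending shift
 * instead of silently producing an arbitrary value. */
unsigned int sum_shifts(unsigned int x, unsigned int last) {
    unsigned int ret = 0;
    for (unsigned int i = 0; i <= last; ++i)
        ret += x >> i;   /* shift check is inserted here by -fsanitize=shift */
    return ret;
}
```

Restricting the sanitizer to the shift group keeps the instrumentation limited to shift operations, which may help with code size.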

TNorthover · March 17, 2016, 5:08am

It's not that simple.

First, without debug info all LLVM could realistically say is "there
might be undefined behaviour somewhere in this program". Even with
debug info, that location may or may not be accurate.

Second, there could be legitimate cases for an shl 32 to exist in a
program. As long as it's not actually executed it's fine. The usual
example is a template instantiation: they fairly often have generic
code that would be UB in some cases, but is guarded by checks to never
execute at runtime. There's no realistic way for LLVM to determine
this locally.

Third, we don't want the diagnostics Clang produces to depend on
optimization level.

If you want this kind of diagnostic at compile-time, it's probably a
job for the static analyzer. It doesn't currently catch this case
though, and I don't know enough about its inner workings to say how
feasible that would be.

Cheers.

Tim.
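
The second point above is usually illustrated with C++ templates, but a rough C analogue exists too. In the sketch below (all names are hypothetical), a generic helper guards its shift with a range check. Once the helper is inlined into a caller that passes 32, the optimizer may, depending on pass order, briefly see a shift by 32 in a branch that can never execute. A warning emitted at that point would flag code that is actually correct.

```c
#include <stdint.h>

/* Hypothetical helper: keep the low n bits of x, for n in 0..32. */
static inline uint32_t low_bits(uint32_t x, unsigned n) {
    if (n >= 32)
        return x;  /* guard: the shift below never runs with n >= 32 */
    return x & ((UINT32_C(1) << n) - 1);
}

/* After inlining with n == 32, the shift by 32 exists only in dead code. */
uint32_t low_bits_32(uint32_t x) { return low_bits(x, 32); }

/* An ordinary instantiation where the shift really executes. */
uint32_t low_bits_8(uint32_t x) { return low_bits(x, 8); }
```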

Cacho · March 17, 2016, 5:36am

What the title says, so far it was uploaded for Debian Jessie but not
for Debian unstable (which probably works for testing too).

I was looking at: http://llvm.org/apt/ (which says that it has not been
updated since Feb but still).

Is this still maintained or do you recommend any other source for
getting Debian packages?

Thanks,
C

mehdi_amini · March 17, 2016, 6:03am

I'll add that there is a great read on the LLVM blog on this topic: "What Every C Programmer Should Know About Undefined Behavior #1/3" (The LLVM Project Blog).

Best,

Renato_Golin1 · March 17, 2016, 10:38am

I believe release 3.8.0 doesn't work on Debian unstable/testing
because of the GCC ABI 5.

https://llvm.org/bugs/show_bug.cgi?id=23529

We're working with the distros to make this right, but as of now, I'm
not sure how it'll play out. (I'm not a Debian user myself).

cheers,
--renato
