I spent some time digging into a Valgrind report of uninitialized values in LLVM r287520 built using itself. (One of quite a few such reports that comes up during a "make check".)
I could use another set of eyes on the issue if someone has time.
Here I've refactored the code into a minimal (noinline) function that still triggers the problem. xfunc2() and xfunc3() are also noinline. The problem goes away if either isReg() or isDef() is marked noinline.
void xfuncx(const MachineOperand &MO,
const TargetRegisterInfo *TRI,
BitVector &LivePhysRegs) {
if (MO.isReg() && // <<<<------ problem reported here
MO.isDef()) {
xfunc2(MO, TRI, LivePhysRegs);
} else {
xfunc3(MO, LivePhysRegs);
}
}
The asm is below. Maybe I've been staring too long but I don't see the problem Valgrind is talking about.
The testcase mentioned below builds a MachineOperand that prints like this:
<BB#2>
The bottom word of this MachineOperand now looks like this, with (according to Valgrind) the x's corresponding to uninitialized bits:
xxxx xxxx xxxx 0000 0000 0000 0000 0100
At this point isReg() can be called safely since it looks only at the lower bits. isDef() cannot be called safely because it looks at bit 25. However it is clear that the C++ code (below) never calls isDef() when isReg() returns false, as it does here.
It grabs the low word of the MO and uses a mask to grab bit 25 and also the low 8 bits. Next, it branches on bit 25, which isn't initialized. The code is clever but -- I think -- wrong.
GCC does the right thing here, first branching on the low bits and only then looking at bit 25.
Reading uninitialized memory is safe provided the result isn’t observed.
The isReg() test and the isDef() test are OK because whether bit 25 is set or not is irrelevant if the low bits are not zero (the comparison will be false no matter what value bit 25 holds). So there is only an observable effect of examining bit 25 if the low bits are all zero, satisfying the abstract requirement.
Valgrind can’t know this sadly, so it flags this as a bug. This is something that I would expect MSan to do a better job of by helping the compiler not merge these two tests by showing a MSan check that differentiates them.
Argh, thanks. It's a bummer to have to throw out Valgrind.
Something is messing up Souper when we use it to build LLVM. It could easily be a Souper bug, but in the meantime it makes a good excuse to track down and eradicate UBs in LLVM. I'll go back to UBSan/ASan/MSan.