[PATH] Add sub.ovf/mul.ovf intrinsics

Zoltan_Varga · December 9, 2008, 2:11pm

Hi,

Here is the next iteration of the patch. The only comment not
addressed is this one:

It would be better to implement a target-independent check for
overflow for the "Legal" case (like how SADDO does). Hacker's > Delight
has some hints on how to do this. It's not easy for the signed case,
but is do-able.

It can be lowered to a division + a branch, so it would be
inefficient, plus it would
be a lot of work to implement it correctly (for me at least).

I was subscribed to llvmdev in 'digest' mode, so the reply might not
be correctly
handled by mailers. Sorry about that.

Zoltan

llvm-ovf.diff (37 KB)

Eli_Friedman1 · December 9, 2008, 5:53pm

If you can get the relevant high product (UMUL_LOHI and friends), it's
a relatively straightforward comparison. Otherwise, yes, the general
case is quite tricky; inserting a division here is non-trivial.

There's also the special-case of multiplication by a constant: here,
the computation can be done with a single straightforward comparison.

-Eli

void · December 9, 2008, 8:58pm

Hi,

Here is the next iteration of the patch. The only comment not
addressed is this one:

Thanks! It's looking good.

It would be better to implement a target-independent check for
overflow for the "Legal" case (like how SADDO does). Hacker's > Delight
has some hints on how to do this. It's not easy for the signed case,
but is do-able.

It can be lowered to a division + a branch, so it would be
inefficient, plus it would
be a lot of work to implement it correctly (for me at least).

Okay. It would be tricky. Please put a "FIXME" in there indicating
that it would be nice to have at some point. It's okay to submit this.

-bw

Zoltan_Varga · December 9, 2008, 9:12pm

Hi,

Attached is the final version of the patch, adding the requested
FIXME. If this is ok, can
somebody check it in ?

thanks

Zoltan

llvm-ovf.diff (37.1 KB)

void · December 9, 2008, 10:08pm

Applied. Thanks, Zoltan!

-bw

Zoltan_Varga · December 9, 2008, 11:59pm

Hi,

The add.with.overflow instrinsics don't seem to work with constant
arguments, i.e.
changing the call in add-with-overflow.ll to:
%t = call {i32, i1} @llvm.sadd.with.overflow.i32(i32 0, i32 0)

causes the following exception when running the codegen tests:

llc: DAGCombiner.cpp:646:
void<unnamed>::DAGCombiner::Run(llvm::CombineLevel): Assertion
`N->getValueType(0) == RV.getValueType() && N->getNumValues() == 1 &&
"Type mismatch"' failed.
0 llc 0x00000000010601ef
1 llc 0x00000000010604ec
2 libc.so.6 0x00007f58a754df60
3 libc.so.6 0x00007f58a754ded5 gsignal + 53
4 libc.so.6 0x00007f58a754f3f3 abort + 387
5 libc.so.6 0x00007f58a7546dc9 __assert_fail + 233
6 llc 0x0000000000cd0444
7 llc 0x0000000000cd0575
llvm::SelectionDAG::Combine(llvm::CombineLevel, llvm::AliasAnalysis&,
bool) + 55
8 llc 0x0000000000d5075a
llvm::SelectionDAGISel::CodeGenAndEmitDAG() + 2176
9 llc 0x0000000000d52a6e
llvm::SelectionDAGISel::SelectBasicBlock(llvm::BasicBlock*,
llvm::ilist_iterator<llvm::Instruction>,
llvm::ilist_iterator<llvm::Instruction>) + 642
10 llc 0x0000000000d533e5
llvm::SelectionDAGISel::SelectAllBasicBlocks(llvm::Function&,
llvm::MachineFunction&, llvm::MachineModuleInfo*,
llvm::TargetInstrInfo const&) + 2175
11 llc 0x0000000000d540f6
llvm::SelectionDAGISel::runOnFunction(llvm::Function&) + 778
12 llc 0x0000000000fedd1d
llvm::FPPassManager::runOnFunction(llvm::Function&) + 239
13 llc 0x0000000000fee860
llvm::FunctionPassManagerImpl::run(llvm::Function&) + 116
14 llc 0x0000000000fee9be
llvm::FunctionPassManager::run(llvm::Function&) + 128
15 llc 0x0000000000822a95 main + 2234

Zoltan

Eli_Friedman1 · December 10, 2008, 12:32am

Oh, and a few more possibilities:
1) Hackersdelight.org, for unsigned;
whether this is a good idea depends on the speed of the nlz
implementation. That version uses branches, but of course it can also
be done by ORing together the comparison results.
2) (C && A) | MulOverflow(B,C) | MulOverflow(A,D) | AddOverflow(B*C,
MulHi(B, D), A*D), where A, B are the two halves of Op1, C, D are the
two halves of Op2, and all operations are in the width of the halves.
This could be useful on x86 for 64-bit multiplies; it's not cheap, but
it can't really get much cheaper (besides the fact that Legalize can't
insert branches, which would be a nice improvement here). This is
roughly just adding overflow checks to every step of the normal 64-bit
multiply splitting algorithm.
3) (C && A) || ((C?B:D)*(C?C:A) + (B*D>>(Width/2)) >= 2^(Width/2)),
for unsigned, where A, B are the two halves of Op1, C, D are the two
halves of Op2, Width is the width of the original operands, and
everything is done zero-extended in the original width. Not the
greatest approach, but it's difficult to do much better with just
regular arithmetic and conditionals (besides the divide and compare
approach, which as far as I know isn't feasible here because of the
required chain?).

-Eli

Eli_Friedman1 · December 10, 2008, 12:46am

Just a possible issue from inspection: does this handle 64-bit
operations correctly on x86? Unless I'm missing something, it seems
likely to lead to a mysterious error for such operations.

Also, does DAGCombiner know not to touch operations with a known user
of the non-primary return value? This seems like it could lead to
strange code in some cases, and it seems likely to be the cause of the
crash Zoltan mentioned... would it be better to use target-specific
nodes for the calculation?

-Eli

void · December 10, 2008, 6:26am

It's the DAG combiner that's barfing. I'll look into it.

-bw

Topic		Replies	Views
Legalization code robustness for overflow intrinsics LLVM Dev List Archives	0	71	March 2, 2014
[RFC] Introduce overflow builtins Clang Frontend	34	315	May 25, 2012
[PATH] Add sub.ovf/mul.ovf intrinsics LLVM Dev List Archives	1	72	December 9, 2008
`llvm.$op.with.overflow`, InstCombine and ScalarEvolution LLVM Dev List Archives	8	142	March 27, 2015
RFC: optimizing integer overflow checks LLVM Dev List Archives	1	94	August 25, 2012

[PATH] Add sub.ovf/mul.ovf intrinsics

Related topics