InstCombine wrongful (?) optimization on BinOp with SameOperands

Hi all,
I have been looking at the way LLVM optimizes code before forwarding it to the backend I develop for my company, and while building the following function:
define i32 @test_extract_subreg_func(i32 %x, i32 %y) #0 {
entry:
%conv = zext i32 %x to i64
%conv1 = zext i32 %y to i64
%mul = mul nuw i64 %conv1, %conv
%shr = lshr i64 %mul, 32
%xor = xor i64 %shr, %mul
%conv2 = trunc i64 %xor to i32
ret i32 %conv2
}

I came upon the following optimization (during instcombine):
IC: Visiting: %mul = mul nuw i64 %conv, %conv1
IC: Visiting: %shr = lshr i64 %mul, 32
IC: Visiting: %conv2 = trunc i64 %shr to i32
IC: Visiting: %conv3 = trunc i64 %mul to i32
IC: Visiting: %xor = xor i32 %conv3, %conv2
IC: ADD: %xor6 = xor i64 %mul, %shr
IC: Old = %xor = xor i32 %conv3, %conv2
New = <badref> = trunc i64 %xor6 to i32

which seems to be performed by SDValue DAGCombiner::SimplifyBinOpWithSameOpcodeHands(SDNode *N)

In my backend's architecture truncate is free, but zext is not (and i64 is not a desirable type for xor or any binary operation in general), so I would expect this optimization to be bypassed. However, because of the following condition:
(N0.getOpcode() == ISD::TRUNCATE && (!TLI.isZExtFree(VT, Op0VT) || !TLI.isTruncateFree(Op0VT, VT)))
it is not (isZExtFree returns false for my architecture while isTruncateFree returns true). The comment on the binop simplification says that a binop over truncs should only be optimized if trunc is not free, so I do not understand the point of the !isZExtFree check here.
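For reference, this is roughly how my backend reports these costs; a simplified sketch with a placeholder class name ("MyTargetLowering"), not my real code:

```cpp
// Simplified sketch, placeholder names -- not my actual backend.
// trunc i64 -> i32 is a plain subregister read here, so it is free;
// zext i32 -> i64 costs a real instruction, and the default isZExtFree()
// already returns false, which matches that.
#include "llvm/Target/TargetLowering.h" // LLVM 3.7-era header location

using namespace llvm;

class MyTargetLowering : public TargetLowering {
public:
  explicit MyTargetLowering(const TargetMachine &TM) : TargetLowering(TM) {}

  // Only the i64 -> i32 truncation is free on this target.
  bool isTruncateFree(EVT SrcVT, EVT DstVT) const override {
    return SrcVT == MVT::i64 && DstVT == MVT::i32;
  }
  bool isTruncateFree(Type *SrcTy, Type *DstTy) const override {
    return SrcTy->isIntegerTy(64) && DstTy->isIntegerTy(32);
  }
  // isZExtFree is left at its default (false) for every type pair.
};
```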
Can someone enlighten me about this optimization?

best regards,
Nicolas Brunie

From: "Nicolas Brunie via llvm-dev" <llvm-dev@lists.llvm.org>
To: llvm-dev@lists.llvm.org
Sent: Wednesday, September 30, 2015 1:01:52 AM
Subject: [llvm-dev] InstCombine wrongful (?) optimization on BinOp with SameOperands

Hi all,
I have been looking at the way LLVM optimizes code before forwarding
it to the backend I develop for my company, and while building the following function:
define i32 @test_extract_subreg_func(i32 %x, i32 %y) #0 {
entry:
%conv = zext i32 %x to i64
%conv1 = zext i32 %y to i64
%mul = mul nuw i64 %conv1, %conv
%shr = lshr i64 %mul, 32
%xor = xor i64 %shr, %mul
%conv2 = trunc i64 %xor to i32
ret i32 %conv2
}

I came upon the following optimization (during instcombine):
IC: Visiting: %mul = mul nuw i64 %conv, %conv1
IC: Visiting: %shr = lshr i64 %mul, 32
IC: Visiting: %conv2 = trunc i64 %shr to i32
IC: Visiting: %conv3 = trunc i64 %mul to i32
IC: Visiting: %xor = xor i32 %conv3, %conv2
IC: ADD: %xor6 = xor i64 %mul, %shr
IC: Old = %xor = xor i32 %conv3, %conv2
New = <badref> = trunc i64 %xor6 to i32

which seems to be performed by SDValue
DAGCombiner::SimplifyBinOpWithSameOpcodeHands(SDNode *N)

You might have figured this out by now, but no, InstCombine and DAGCombine are two completely different pieces of code. One is driven by the code in lib/Transforms/InstCombine/* and the other in lib/CodeGen/SelectionDAG/DAGCombiner.cpp. InstCombine's job is to move the IR toward our chosen canonical form, which is designed to simplify operations in a way that exposes further optimization opportunities (as well as being generally beneficial). It does not take target costs into account.

In my backend's architecture truncate is free, but zext is not (and
i64 is not a desirable type for xor or any binary operation in
general),

Why, then, have you listed i64 as a legal type?
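For reference, type legality is declared in the backend's TargetLowering constructor; a minimal sketch with hypothetical names ("MyTargetLowering", "MySubtarget", "MyTarget::GPR32/GPR64RegClass" come from the target's TableGen files) showing the line that makes i64 legal:

```cpp
// Minimal sketch, hypothetical names. A type becomes legal exactly when a
// register class is registered for it here. Dropping the i64 line makes i64
// illegal, so the type legalizer splits i64 operations into i32 pieces
// instead of keeping them as 64-bit nodes.
#include "llvm/Target/TargetLowering.h" // LLVM 3.7-era header location

using namespace llvm;

MyTargetLowering::MyTargetLowering(const TargetMachine &TM,
                                   const MySubtarget &STI)
    : TargetLowering(TM) {
  addRegisterClass(MVT::i32, &MyTarget::GPR32RegClass);
  addRegisterClass(MVT::i64, &MyTarget::GPR64RegClass); // this is what "lists"
                                                        // i64 as a legal type
  computeRegisterProperties(STI.getRegisterInfo());
}
```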

-Hal
