Strange codegen in LLVM for RISCV64

mr_nacho · January 6, 2023, 7:53pm

Hi,

This question is probably of a very elementary nature, but I just can’t figure it out. I am hoping you experts in here will be able to help.

In this Godbolt link Compiler Explorer can someone please tell me why I have the slli and srli intructions in the strange_square function? To avoid those instructions, I have to compute the more cumbersome expression mul(n, n-1) + n, rather than computing mul(n, n) directly, which seems odd to me.

Thanks!

jrtc27 · January 6, 2023, 8:00pm

This is probably an “optimisation” where LLVM sees that n*n is always positive and so “optimises” the sign-extending for the ABI to be a zero-extend, but then the backend doesn’t know the output is positive and so has to explicitly zero-extend. Cc @topperc.

topperc · January 6, 2023, 10:12pm

Yuck.

Looks like the backend inserted a sign_extend before type legalization due to the function return. DAGCombiner saw the mul had the nsw flag so the multiply can’t overflow so the sign bit is assumed to be 0. This caused the sign extend to convert to zero extend. Then the type legalization promoted everything to 64 bits. This dropped the nsw flag so we have no way to reverse the transform.

I could add a DAGCombiner for (zext (mulnsw X, X)) before type legalization to turn it back into (sext (mul X, X)), dropping the nsw flag in the process.

Or I could convince DAGCombine that sext is cheaper than zext and that it shouldn’t do this replacement.

@rotateright @RKSimon @LebedevRI

Topic		Replies	Views
better code for IV LLVM Dev List Archives	3	79	February 28, 2014
global type legalization? LLVM Dev List Archives	12	74	September 15, 2010
X86ISelLowering: Promote 'add nsw' to a wider type LLVM Dev List Archives	5	106	August 8, 2016
SCEVExpander bug? LLVM Dev List Archives	2	89	June 25, 2019
sign and zero extensions question LLVM Dev List Archives	1	105	May 26, 2009

Strange codegen in LLVM for RISCV64

Related Topics