Motivation
When comparing two values that are known to have the same sign, signed and unsigned predicates return the same result, e.g. ult
is the same as slt
.
In such cases, we attempt to canonicalize the icmp to use the unsigned predicate. However, this can cause major optimization regressions in some cases: If there are multiple comparisons involving related values, and only some of them can be converted from signed to unsigned, then later optimizations may have a hard time reasoning about how these comparisons relate to each other, as they now have different signedness.
In InstCombine, we currently suppress the signed ā unsigned conversion in some cases where it is likely other analyses will not be able to re-derive relevant facts. Of course, this is sub-optimal as the heuristic may fail in both directions.
Proposal
I propose to add a new icmp
instruction flag samesign
. If both operands of the icmp
do not have the same sign, then the result is a poison value. (The name is very much up for bikeshedding, Iām really not a fan of samesign
!)
As such, icmp samesign ult i8 %x, %y
is equivalent to icmp samesign slt i8 %x, %y
, with the former variant considered canonical.
Alternatives
Use nneg instead
A minor variant on this proposal would be to support the nneg
flag on icmp
instead.
This is weaker than samesign
, because it does not support two negative values, but has the advantage that the nneg
flag is already used by other instructions. The āboth non-negativeā case is a lot more common than the āboth negativeā case, so this probably captures most of the value of the proposal.
Introduce signedness-independent predicates instead
An alternative solution would be to introduce four new signedness-independent predicates instead: lt
, le
, gt
and ge
. These predicates indicate that the comparison can be performed in either signedness. If the signs of the values differ (or equivalently, if the results of the unsigned and signed predicates differ), the result is a poison value.
To minimize impact on code which doesnāt care, ICmpInst::getPredicate()
would implicitly normalize lt
to ult
and so on, while a separate method would provide the raw predicate.
The advantage I see in this approach is that we can easily pass the signedness-independent predicates around in various analysis tooling. For example we can pass it to isImpliedCond()
and the implementation can then decide itself as what kind of predicate it wants to treat it. Especially for APIs that deal with multiple predicates at the same time, I expect this to be a lot cleaner than passing around an extra flag.
The downside is that this is outside our usual model of poison generating flags, and will require special casing in various places, like āif-definedā instruction comparisons and flag intersection.
While this is the variant I originally wanted to propose, I think that ultimately we can get the best of both worlds by using a normal poison-generating flag in IR, but change the type returned by ICmpInst::getPredicate()
to be a struct that also carries the āsame signā information. That should allow clean handling in analysis code, while using well-established concepts in IR.