Failure to optimize ? operator

The following seemingly identical functions get compiled to quite
different machine code. The first is correctly optimized (the
computation of the variable y is nicely moved into the else branch of
the "if" statement), while the second one is not (the full computation
of y is always done). The output was produced using the demo page on
LLVM's web site (optimization level LTO).
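(The original f1/f2 weren't preserved in this archive; the following is a
hypothetical reconstruction of the kind of pair being described, where the
?: version forces an expensive computation on every path:)

```c
/* Hypothetical reconstruction -- not the original code from the post. */

/* f1: the multiplies happen only on the x <= 0 path. */
int f1(int x) {
    if (x > 0)
        return 0;
    int y = x * x * x * x;  /* expensive computation, done only here */
    return y;
}

/* f2: the multiplies happen unconditionally; the ?: then selects. */
int f2(int x) {
    int y = x * x * x * x;  /* always computed */
    return x > 0 ? 0 : y;
}
```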

Can someone shed some light on this strange behavior?

Thanks,
Brent

The IR you're seeing is the IR we naturally generate for the given
source code, and LLVM doesn't have an optimization to turn your f2
into f1. Which version is better depends on the values passed into
the function, which makes it a bit tricky to write an optimization to
turn one into the other.

-Eli

Hi folks,

I'm interested in building a secure and verifiable operating system. One good way to do it is to build the OS in a high-level language such as Java or C#, since verification is significantly easier.

I have some experience analyzing LLVM bitcode, so I also plan to compile everything to LLVM bitcode in order to analyze and verify my OS.

However, I also plan to write my own runtime, since I don't want to rely on a GC (I'm considering something like region-based memory management).

My question is: what's the status of high-level language support in LLVM?

To the best of my knowledge, (1) there's a Java class file to LLVM bitcode translator in the SVN repository, and (2) the VMKit project claims that it has a good JVM.

I would appreciate it if you could give me some more information. Comments and suggestions are also appreciated.

Thanks!

~Haohui

The question isn't really clear. What features are you looking for?

-Eli

I don't understand your point. Which version is better does NOT
depend on what inputs are passed to the function. The code LLVM
produces for f1 will always take less time to execute than the code
it produces for f2.

for x > 0 => T(f1) < T(f2)
for x <= 0 => T(f1) = T(f2)

where T() is the time to execute the given function.

So always T(f1) <= T(f2).
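(Neo's inequality counts only the arithmetic. It can be checked by
instrumenting hypothetical versions of the two functions -- mul(),
f1_counted, and f2_counted are illustrative names, not from the thread --
to count the multiplies each one actually executes:)

```c
static int mul_count;  /* number of multiplies actually executed */

static int mul(int a, int b) { mul_count++; return a * b; }

/* f1-style: the multiplies sit behind the branch. */
static int f1_counted(int x) {
    if (x > 0)
        return 0;
    return mul(mul(mul(x, x), x), x);  /* x^4, 3 multiplies */
}

/* f2-style: the multiplies run unconditionally, then ?: selects. */
static int f2_counted(int x) {
    int y = mul(mul(mul(x, x), x), x);  /* x^4, 3 multiplies, always */
    return x > 0 ? 0 : y;
}
```

For x = 5, f1_counted executes 0 multiplies while f2_counted executes 3;
for x = -2 both execute 3, matching the T(f1) <= T(f2) claim as long as
the branch itself is free.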

I would call this a missed optimization opportunity. I think it
warrants a bug report.

If I do the same experiment with gcc I get identical code for the two functions:

Apologies for the formatting but you get the point.

Neo

> I don't understand your point. Which version is better does NOT
> depend on what inputs are passed to the function. The code LLVM
> produces for f1 will always take less time to execute than the code
> it produces for f2.
>
> for x > 0 => T(f1) < T(f2)
> for x <= 0 => T(f1) = T(f2)
>
> where T() is the time to execute the given function.
>
> So always T(f1) <= T(f2).

You're not taking branch prediction into account. Given the cost of
the multiplies, it probably doesn't matter so much for your testcase,
but we still need a cost model.

> I would call this a missed optimization opportunity. I think it
> warrants a bug report.

Sure.

-Eli