Trying out Loop Vectorizer

Michael_Lam · December 31, 2012, 7:03pm

Hi all,

I am trying out the new loop vectorizer in LLVM 3.2. I wanted to see the effect of the pass in opt but I have no success. I used LLVM IR generated from C examples in http://blog.llvm.org/2012/12/new-loop-vectorizer.html#more and pass them to opt -S -O3 -vectorize-loops example.ll. However, I do not see vectorized output. What am I doing wrong?

Thanks,
Siu

d0k · December 31, 2012, 7:57pm

I'm not entirely sure why this is the case, the target specific stuff for opt is still very new, but at the moment you have to explicitly set a triple for opt so it can access target-specific bits to estimate the cost of vectorization. Something like "opt -mtriple=x86_64-linux-gnu -S -O3 -vectorize-loops" should work. You can also specify a target cpu with -mcpu (which takes the same arguments as clang -march) to experiment with different SSE levels.

- Ben

Michael_Lam · December 31, 2012, 10:22pm

Setting triple for opt worked. In my case, it is

$ opt -mtriple=x86_64-apple-macosx10.8.0 -S -O3 -vectorize-loops loopvectorize.ll

Thanks,
Siu

Nadav_Rotem1 · December 31, 2012, 11:26pm

I think that this is a good opportunity to discuss this topic. At the moment ‘opt’ does not use the triple that is found in the module in order to initialize the backend and the backend related analysis passes. It relies on the ‘-mtriple’ command line flag. LLC on the other hand uses the host triple. I see the following options:

‘opt’ does not initialize the backend passes unless ‘-mtriple’ is used. (the current option)
‘opt’ grabs the triple from the bit code file and uses it to initialize the backend passes.
‘opt’ gets the default target triple (like llc).

I think that Nick said that he prefers #2.

Nadav

Chandler_Carruth · December 31, 2012, 11:31pm

I'm not entirely sure why this is the case, the target specific stuff for
opt is still very new, but at the moment you have to explicitly set a
triple for opt so it can access target-specific bits to estimate the cost
of vectorization.

I think that this is a good opportunity to discuss this topic. At the
moment 'opt' does not use the triple that is found in the module in order
to initialize the backend and the backend related analysis passes. It
relies on the '-mtriple' command line flag. LLC on the other hand uses the
host triple. I see the following options:

1. 'opt' does not initialize the backend passes unless '-mtriple' is used.
(the current option)
2. 'opt' grabs the triple from the bit code file and uses it to initialize
the backend passes.
3. 'opt' gets the default target triple (like llc).

My only strong opinion is that I dislike #3.

I have a mild preference for encoding more in bitcode and less in
commandline switches, while still allowing commandline switches to override
what is in the bitcode.

Finkel_Hal_J · January 1, 2013, 1:55am

From: "Nadav Rotem" <nrotem@apple.com>
To: "Benjamin Kramer" <benny.kra@gmail.com>
Cc: "llvmdev@cs.uiuc.edu List" <llvmdev@cs.uiuc.edu>
Sent: Monday, December 31, 2012 5:26:18 PM
Subject: Re: [LLVMdev] Trying out Loop Vectorizer

I'm not entirely sure why this is the case, the target specific stuff
for opt is still very new, but at the moment you have to explicitly
set a triple for opt so it can access target-specific bits to
estimate the cost of vectorization.

I think that this is a good opportunity to discuss this topic. At the
moment 'opt' does not use the triple that is found in the module in
order to initialize the backend and the backend related analysis
passes. It relies on the '-mtriple' command line flag. LLC on the
other hand uses the host triple. I see the following options:

1. 'opt' does not initialize the backend passes unless '-mtriple' is
used. (the current option)
2. 'opt' grabs the triple from the bit code file and uses it to
initialize the backend passes.
3. 'opt' gets the default target triple (like llc).

I think that Nick said that he prefers #2.

As do I.

-Hal

Chris_Lattner · January 1, 2013, 4:59am

+1 on both points. Both opt and llc should work this way. To get llc’s current behavior, you should have to pass -mcpu=native or something, explicitly.

-Chris

Nadav_Rotem1 · January 1, 2013, 8:16am

I changed ‘opt’ in r171341 and opened PR14770 for ‘llc’.

Topic		Replies	Views
LLVM Loop Vectorizer is enabled by default??? LLVM Dev List Archives	1	120	June 3, 2013
Decouple LoopVectorizer from O3 LLVM Dev List Archives	9	76	April 15, 2013
[Loop Vectorize] Question on -O3 LLVM Dev List Archives	4	125	July 2, 2013
LLVM Loop Vectorizer LLVM Dev List Archives	42	299	October 6, 2012
LLVM IR vectorized with opt but not through the API LLVM Dev List Archives	3	123	November 5, 2013

Trying out Loop Vectorizer

Related topics