Hi all,
I found some issues during my testing of “loop unrolling” capabilities of LLVM’s opt.
Seems like LLVM generates slower code with -O3 since it wrongly decides to unroll a simple loop.
With option -Os, no loop unrolling, the output looks well.
Thanks