microbenchmark shows 3-6x perf loss w/ LLVM vs GCC at -O3, >1.5x w/ -flto

I was microbenchmarking a code path that profiling showed to take up a large portion of execution time. I narrowed the code down to querying the configuration database and extracted the attached testcase. While checking which pattern performed best, I noticed that clang was dramatically slower than gcc.

option_config_perf.tar.gz (1.08 KB)