Complex arithmetic ignores -ffast-math after clang r219557, serious performance regressions

Richard_Campbell · July 17, 2015, 12:26am

Hal,

SVN now seems to be respecting the -ffast-math flag in the way we desire without Matthijs’ temporary fix. I didn’t see any further traffic about this on the cfe-dev list - was there a discussion elsewhere? Did it get fixed by accident as part of some other change, and we should worry about whether it will come up again?

Richard

rnk · July 20, 2015, 6:53pm

Not that I’m aware of. Did anything happen here?

Finkel_Hal_J · July 22, 2015, 3:01am

Hi Richard,

I think you're seeing a change in the backend (in combination with an older frontend change) -- perhaps the backend change is when we added fast-math flags to fcmp?

Here's what happens:

Given some code like this:
$ cat /tmp/c.c
typedef _Complex double dc;

dc foo(dc a, dc b) {
return a*b;
}

Compiling it produces IR that looks like this:

define { double, double } @foo(double %a.coerce0, double %a.coerce1, double %b.coerce0, double %b.coerce1) #0 {
entry:
...
[perform the fast code]
%isnan_cmp = fcmp fast uno double %mul_r, %mul_r
br i1 %isnan_cmp, label %complex_mul_imag_nan, label %complex_mul_cont, !prof !1

complex_mul_imag_nan: ; preds = %entry
%isnan_cmp1 = fcmp fast uno double %mul_i, %mul_i
br i1 %isnan_cmp1, label %complex_mul_libcall, label %complex_mul_cont, !prof !1

complex_mul_libcall: ; preds = %complex_mul_imag_nan
  %call = call { double, double } @__muldc3(double %a.real, double %a.imag, double %b.real, double %b.imag) #1
  %4 = extractvalue { double, double } %call, 0
  %5 = extractvalue { double, double } %call, 1
  br label %complex_mul_cont

complex_mul_cont: ; preds = %complex_mul_libcall, %complex_mul_imag_nan,
...

So we always do the fast calculation, and then only if we get NaN, do we go back and do the slow calculation. But because of the fast-math flags, the backend can constant fold the relevant comparisons, and eliminate that entire set of branches, leaving on the fast code.

-Hal

Topic		Replies	Views
Complex arithmetic ignores -ffast-math after clang r219557, serious performance regressions Clang Frontend	3	103	July 5, 2015
different output with fast-math flag LLVM Dev List Archives	6	117	August 22, 2018
The priority of -fno-fast-math regarding complex number calculations Clang Frontend	10	360	March 12, 2025
Propogation of fpclass assumptions vis a vis fast-math flags IR & Optimizations	28	708	February 3, 2024
RFC: Consider changing the semantics of 'fast' flag implying all fast-math-flags LLVM Dev List Archives	39	237	November 23, 2016

Complex arithmetic ignores -ffast-math after clang r219557, serious performance regressions

Related topics