FP contraction (FMA) on by default

dslater38 · August 31, 2022, 5:13pm

So I just read that with clang 14 -ffp-contract=on is now the default. This is a horrible idea. The results of FMA instructions are not IEEE compliant as they do not exhibit correct rounding behavior, and should only be turned on explicitly by someone who understands exactly what they’re doing and understands how these instructions affect results.

jyknight · August 31, 2022, 7:06pm

IEEE754 specifies a fusedMultiplyAdd operation, but does not have anything to say about which C syntax represesnts which operations. So, this behavior is neither compliant nor non-compliant w.r.t. that spec. The behavior controlled by -ffp-contract=on is compliant with the C standard, however, which explicitly discusses and blesses this.

Note that (as required by the C standard) contraction is only permitted within a single expression – that is, double fma(double a, double b) { return a * b + 5; } can emit an fusedMultiplyAdd operation, but double ma(double a, double b) { double m = a * b; return m + 5; } cannot.

Additionally, the behavior is controllable in the source code by the standard #pragma STDC FP_CONTRACT {ON|OFF}.

(Contrast with the -ffp-contract=fast flag, which enables non-C-standard-compliant behavior that ignores the #pragma and will create contractions after other optimizations like inlining, and across expressions.)

dslater38 · August 31, 2022, 7:48pm

My issue isn’t with the FMA optimization or the behavior of -ffp-contract=on. My issue is with the fact that now -ffp-contract=on is now on by default with -ffp-model=precise. This results in expressions like return a * b + 5; returning values that are different from what’s expected.
If you want to have FMA optimizations, then you should have to choose them explicitly, either by using -ffp-contract=on or -ffp-contract=fast because you really need to know what you’re doing when you turn these things on.

serge · August 9, 2023, 9:30pm

Here is an unexpected behavior with fma contractions when done in a wild manner:
x1y2 - x2y1.

We found that this is evaluated as fma(x1, y2, -x2*y1) in precise model on apple M1, that is one multiplication and subtraction is done using high precision the second multiplication is rounded. As a result this expression is not zero when x1 == x2 and y1 == y2.

It seems the contraction should not be applied to anything more complicated than a * b + c.

thesamesam · October 27, 2023, 9:44pm

This came up on the GCC side last month, with @fweimer-rh bringing up some of the issues it causes to have -ffp-contract=fast by the default for GCC: Concerns regarding the -ffp-contract=fast default - Florian Weimer.

jyknight · October 30, 2023, 1:29am

I’d concur that GCC should not be using the non-standard “fp-contract=fast” mode by default, and use “on” instead, like clang already does.

Topic		Replies	Views
fp-contract at -O0 Clang Frontend	13	147	February 20, 2020
AllowFPOpFusion vs. SDNodeFlags::hasAllowContract() Code Generation	7	163	September 5, 2024
[RFC] FP Contract = fast? LLVM Dev List Archives	25	154	March 23, 2017
defaults for FP contraction [e.g. fused multiply-add]: suggestion and patch to be slightly more aggressive and to make Clang`s optimization settings closer to having the same meaning as when they are given to GCC [at least for "-O3"] LLVM Dev List Archives	10	122	September 12, 2016
Documentation on -ffp-contract=fast vs pragma contract(off) Clang Frontend	1	121	September 25, 2020

FP contraction (FMA) on by default

Related topics