Vectorization with fast-math on irregular ISA sub-sets

Renato_Golin1 · February 21, 2016, 12:14pm

- ARMv7 NEON ignores the rounding mode set in bits 23:22 of FPSCR and always uses round to nearest.
- ARMv7 NEON ignores the trap enable bits (15:8) in FPSCR and always uses default exception handling.

If I read the manuals correctly, these are not strictly defined on
IEEE 754 to be one way or another, so these don't violate the
standard. The subnormal treatment does.

As with denormal support, the issue at hand is not so much that these differ from IEEE 754 as it is that they differ from the behavior of the scalar (VFP) arithmetic.

This one of the practical consequences, yes, but of no relevance to
this work. Right now, I'm only trying to avoid surprises. If a user
has different results using -ffast-math, it's expected. Without, not
so much.

cheers,
--renato

Topic		Replies	Views
NEON FP flags LLVM Dev List Archives	9	79	April 1, 2016
NEON vector instructions and the fast math IR flags LLVM Dev List Archives	18	98	June 10, 2013
ARM NEON VMUL.f32 issue LLVM Dev List Archives	4	81	March 20, 2013
Implementing the ARM NEON Intrinsics for PowerPC LLVM Dev List Archives	15	70	October 2, 2013
ARM vectorized fp16 support LLVM Dev List Archives	4	84	September 6, 2019

Vectorization with fast-math on irregular ISA sub-sets

Related Topics