[PATCH 2/2] math: Add tan implementation

Aaron_Watry · September 5, 2014, 11:35pm

>
> >> Uses the algorithm:
> >> tan(x) = sin(x) / sqrt(1-sin^2(x))
> >>
> >> An alternative is:
> >> tan(x) = sin(x) / cos(x)
> >>
> >> Which produces more verbose bitcode and longer assembly.
> >
> > this is weird. both EG and SI have both sin and cos instructions. Is the
> > input normalization code so bad that we are better of doing MUL+SUB+SQRT
> > instead?
>
> Those are only useful for native_sin / native_cos. For the standard
> function, they are far from precise enough. The current (float) sin
> implementation should be correct, though native_sin right now is still
> defined to just be the regular sin function instead of the LLVM
> intrinsic

oh I didn't know the hw implementaion was so imprecise. In that case it
makes sense. Although I wonder why it ended up needing twice as many
instructions. it looks to me that sin and cos don't differ in more than
4 operations, so CSE should have eliminated most of it.
either way it's not going to be more efficient than this patch.

LGTM

Thanks for the review. I'm guessing that the CSE isn't recognizing the pattern correctly... And then doing the sqrt, sub, and mul ends up being cheap in comparison to another sin or cos operation

--Aaron

Topic		Replies	Views
[PATCH 0/2] More trig builtins OpenCL	8	80	September 8, 2014
[PATCH 1/4] Implement atan builtin OpenCL	13	85	September 2, 2014
SIMD trigonometry/logarithms? LLVM Dev List Archives	15	74	February 14, 2013
[PATCH 1/2] amdgcn/fmin: fcanonicalize operands OpenCL	2	105	March 8, 2018
[PATCH v2 1/2] tan: Port from amd_builtins OpenCL	2	118	January 19, 2018

[PATCH 2/2] math: Add tan implementation

Related Topics