I think these intrinsics could have default expansions, like the one you gave. Hopefully that would be good enough for CPU.
For “dot” in particular, we should allow integer versions as well.
I think these intrinsics could have default expansions, like the one you gave. Hopefully that would be good enough for CPU.
For “dot” in particular, we should allow integer versions as well.