Intrinsics __readeflags and __writeeflags

Hello all,

I am trying to implement intrinsics __readeflags and __writeeflags reading and writing EFLAGS register on x86.
These intrinsics expand to two instructions popf and push to register for __readeflags and pushf and pop to register for __writeeflags.
These instructions are not connected explicitly so I can’t use patterns in .td file to match intrinsics.

I tried to implement custom expansion making COPY DAG node with copy from EFLAGS to register.
But this solution works only at -O0 level and failed at -O1 and higher: the problem is that Post-RA pseudo instruction expansion pass seems to be called only at -O0.

Another way is to expand intrinsics to DAG nodes for each PUSH, POP, PUSHF and POPF instructions.
This will add 4 new X86ISD types for DAG nodes for these instructions.

What is the proper way to expand these intrinsics?

I don’t know enough about LLVM CodeGen to answer your questions. I’m just curious.

What is the intended level of support for these intrinsics? Are they for reading ALU flags like CF, OF, etc, or for seldom changed control flags like TF and AC? Even DF is typically scratch, and could be used for an -Oz memmove lowering for example.

I don’t think LLVM will ever really support capturing ALU flags from previous ops without “using” the operation. LLVM does have overflow intrinsics though:

This intrinsic seems very ill-defined, apparently it can be freely reordered and does not act like a compiler barrier. [1]
Other than source compatibility, why would one want this intrinsic? What semantics is it supposed to give?

[1] <>

Even more, why can't it just be defined as inline function in some


These intrinsics are introduced for compatibility purposes.
Besides MSVC GCC also supports it in its main trunk; ICC supports it on Windows and is going to support in the next version on Linux.

There have been two questions, neither of which is really answered. The questions are:

- Why does this need to be an LLVM intrinsic, rather than an inline function in a clang header expanding to some inline asm?

- Given that this instruction has such poorly defined semantics that it effectively returns an arbitrary number in any function that does arithmetic, what possible benefit is there in providing it as an LLVM intrinsic?


I don't insist on implementing LLVM intrinsic, clang header with inline asm
should be enough and is much easier to implement.
I am just looking for the best way to introduce this functionality to LLVM.
I understand the problem with arbitrary results in some cases. Again, the
main benefit here is compatibility with other compilers.