MLIR GPU libdevice linking support

Ah you’re right sorry: of course in this direction it works: that is recent LLVM can link in the old bytecode from libDevice. I was thinking about the other direction: using libNVVM to compile the code we generate…

So here it is more related to this work I guess [RFC] Extending MLIR GPU device codegen pipeline ?