Hello People,
I’m trying to understand the compilation/execution flow of a ML program(using TensorFlow). Particularly interested in understanding how TensorFlow graphs are lowered to MLIR using XLA dialect and lowering to a CPU/GPU via LLVM.
It would be great, if someone can give an overview on it. Thanks.