Automatic GPU Code Generation

You can use OpenMP. Clang supports offloading to the Nvidia GPUs using OpenMP directives. The work is still in progress, but mostly everything should work already.

Best regards,
Alexey Bataev