NAS Parallel Benchmark Kernels with Python: A performance and programming effort analysis focusing on GPUs | IEEE Conference Publication | IEEE Xplore