Abstract: Heterogeneous CPU-GPU systems are extensively utilized in high-performance computing. Compute Unified Device Architecture (CUDA) [1] is a model for programming the GPUs. A CUDA program ...