Yuhan Li, and Mingzhuo Yu. “Benchmarking CUDA, CuPy, and Triton Kernel Optimizations for 3D Point Cloud Segmentation: An Empirical Comparison of Latency, Memory Efficiency, and GPU Utilization”. Journal of Advanced Computing Systems 6, no. 5 (May 8, 2026): 21–30. Accessed May 6, 2026. https://scipublication.com/index.php/JACS/article/view/365.