1. Yuhan Li, Mingzhuo Yu. Benchmarking CUDA, CuPy, and Triton Kernel Optimizations for 3D Point Cloud Segmentation: An Empirical Comparison of Latency, Memory Efficiency, and GPU Utilization. JACS. 2026;6(5):21-30. doi:10.69987/JACS.2026.60503