YUHAN LI; MINGZHUO YU. Benchmarking CUDA, CuPy, and Triton Kernel Optimizations for 3D Point Cloud Segmentation: An Empirical Comparison of Latency, Memory Efficiency, and GPU Utilization. Journal of Advanced Computing Systems, [S. l.], v. 6, n. 5, p. 21–30, 2026. DOI: 10.69987/JACS.2026.60503. Available at: https://scipublication.com/index.php/JACS/article/view/365. Accessed: 6 May 2026.