(1)
Yuhan Li; Mingzhuo Yu. Benchmarking CUDA, CuPy, and Triton Kernel Optimizations for 3D Point Cloud Segmentation: An Empirical Comparison of Latency, Memory Efficiency, and GPU Utilization. JACS 2026, 6 (5), 21-30. https://doi.org/10.69987/JACS.2026.60503.