[1]
Lee Ji-su. 2026. GPU Memory Usage Prediction for Generative AI Serving Pipelines with Queue, Latency, and Utilization Signals. Journal of Advanced Computing Systems. 6, 7 (Jul. 2026), 1–15. DOI:https://doi.org/10.69987/JACS.2026.60701.