Mingzhuo Yu, and Zan Li. “An Empirical Comparison of Discrete Video Tokenization Schemes for Video Question Answering and Video Captioning”. Artificial Intelligence and Machine Learning Review 6, no. 2 (April 11, 2025): 27–50. Accessed June 4, 2026. https://scipublication.com/index.php/AIMLR/article/view/377.