Mingzhuo Yu and Zan Li. “An Empirical Comparison of Discrete Video Tokenization Schemes for Video Question Answering and Video Captioning.” Artificial Intelligence and Machine Learning Review, vol. 6, no. 2, Apr. 2025, pp. 27-50, https://doi.org/10.69987/AIMLR.2025.60203.