Mingzhuo Yu, & Zan Li. (2025). An Empirical Comparison of Discrete Video Tokenization Schemes for Video Question Answering and Video Captioning. Artificial Intelligence and Machine Learning Review , 6(2), 27-50. https://doi.org/10.69987/AIMLR.2025.60203