1. Mingzhuo Yu, Zan Li. An Empirical Comparison of Discrete Video Tokenization Schemes for Video Question Answering and Video Captioning. AIMLR. 2025;6(2):27-50. doi:10.69987/AIMLR.2025.60203