Hanqi Zhang. “Counterfactual Learning-to-Rank for Ads: Off-Policy Evaluation on the Open Bandit Dataset”. Journal of Advanced Computing Systems, vol. 5, no. 12, Dec. 2025, pp. 1-11, https://doi.org/10.69987/JACS.2025.51201.