Hanqi Zhang (2025) “Counterfactual Learning-to-Rank for Ads: Off-Policy Evaluation on the Open Bandit Dataset”, Journal of Advanced Computing Systems, 5(12), pp. 1–11. doi:10.69987/JACS.2025.51201.