Hanqi Zhang. “Counterfactual Learning-to-Rank for Ads: Off-Policy Evaluation on the Open Bandit Dataset”. Journal of Advanced Computing Systems 5, no. 12 (December 3, 2025): 1–11. Accessed January 18, 2026. https://scipublication.com/index.php/JACS/article/view/271.