1.
Hanqi Zhang. Counterfactual Learning-to-Rank for Ads: Off-Policy Evaluation on the Open Bandit Dataset. JACS [Internet]. 2025 Dec. 3 [cited 2026 Jan. 18];5(12):1-11. Available from: https://scipublication.com/index.php/JACS/article/view/271