Hanqi Zhang (2026) “Serving-Aware CTR Prediction: Embedding Compression and Interaction Distillation Under Memory and Latency Constraints”, Artificial Intelligence and Machine Learning Review, 7(1), pp. 1–15. doi:10.69987/AIMLR.2026.70101.