1. Hanqi Zhang. Serving-Aware CTR Prediction: Embedding Compression and Interaction Distillation Under Memory and Latency Constraints. AIMLR. 2026;7(1):1-15. doi:10.69987/AIMLR.2026.70101