[1]
Hanqi Zhang, “Serving-Aware CTR Prediction: Embedding Compression and Interaction Distillation Under Memory and Latency Constraints”, AIMLR, vol. 7, no. 1, pp. 1–15, Jan. 2026, doi: 10.69987/AIMLR.2026.70101.