Harry Wilson and Leo Carter (2025) “Reproducible Evidence-Centric Evaluation of Multi-Hop Retrieval-Augmented QA on MuSiQue”, Artificial Intelligence and Machine Learning Review, 6(3), pp. 18–33. doi:10.69987/AIMLR.2025.60302.