Harry Wilson, & Leo Carter. (2025). Reproducible Evidence-Centric Evaluation of Multi-Hop Retrieval-Augmented QA on MuSiQue. Artificial Intelligence and Machine Learning Review , 6(3), 18-33. https://doi.org/10.69987/AIMLR.2025.60302