THOMAS REED; GEORGE MASON. Hallucination Detection and Confidence Calibration for Large Language Model Outputs: Reproducible Experiments on HaluEval. Artificial Intelligence and Machine Learning Review, [S. l.], v. 6, n. 4, p. 1–17, 2025. DOI: 10.69987/AIMLR.2025.60401. Available at: https://scipublication.com/index.php/AIMLR/article/view/321. Accessed: 13 Jun. 2026.