[1]
Daren Zheng et al. 2024. VerifySafe: Toxicity-Safe Agent Responses under Adversarial Prompts with Evidence-Based Self-Verification. Journal of Advanced Computing Systems . 4, 1 (Jan. 2024), 67–82. DOI:https://doi.org/10.69987/JACS.2024.40106.