1. Daren Zheng, Boning Zhang, Julie Geibel. VerifySafe: Toxicity-Safe Agent Responses under Adversarial Prompts with Evidence-Based Self-Verification. JACS. 2024;4(1):67-82. doi:10.69987/JACS.2024.40106