Daren Zheng, Chenyu Li, and Harvey Davidson. “Continual Red-Teaming for In-the-Wild Jailbreaks via Online Guardrail Updates and Guardrail Distillation”. Journal of Advanced Computing Systems 3, no. 2 (February 9, 2023): 35–49. Accessed March 5, 2026. https://scipublication.com/index.php/JACS/article/view/325.