Paper Reading: JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Posted by 雪溯 on 2024-12-10

Abstract
