AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models

Dong Shu, Mingyu Jin, Chong Zhang, Lingyao Li, Zihao Zhou, Yongfeng Zhang