AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models

Dong Shu, Mingyu Jin, Chong Zhang, Lingyao Li, Zihao Zhou, Yongfeng Zhang

Research output: Contribution to journalArticlepeer-review

21 Downloads (Pure)

Fingerprint

Dive into the research topics of 'AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models'. Together they form a unique fingerprint.

Computer Science