Prompt attack and defense

From Rice Wiki

Attack methods

Name Description Paper
GPT Fuzzer Repeatedly mutate attacks to retain effective ones. Outperforms existing methods. 2309.10253v2