Prompt attack and defense
From Rice Wiki
Attack and defense methods
| Name | Type | Description | Paper |
|---|---|---|---|
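| GPT Fuzzer | Attack | Starts from human-written jailbreak templates, repeatedly mutates them, and retains the mutants that prove effective. The paper reports higher attack success rates than human-written templates. | 2309.10253v2 |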
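
The mutate-and-retain loop described for GPT Fuzzer can be sketched roughly as follows. This is a toy illustration, not the paper's implementation: the names `mutate`, `fuzz`, and `is_effective` are hypothetical, the word-level edits stand in for whatever mutation operators the real system uses, and the effectiveness check is a caller-supplied stub rather than a judgment on a real model response.

```python
import random


def mutate(template, rng):
    """Hypothetical mutation operator: apply one random word-level edit."""
    words = template.split()
    if len(words) < 2:
        # Too short to edit meaningfully, so just repeat it.
        return template + " " + template
    op = rng.choice(["swap", "drop", "duplicate"])
    i = rng.randrange(len(words))
    if op == "swap":
        j = rng.randrange(len(words))
        words[i], words[j] = words[j], words[i]
    elif op == "drop":
        del words[i]
    else:
        words.insert(i, words[i])
    return " ".join(words)


def fuzz(seeds, is_effective, rounds=100, seed=0):
    """Grow a pool of templates by keeping only mutants that pass is_effective."""
    rng = random.Random(seed)
    pool = list(seeds)
    for _ in range(rounds):
        parent = rng.choice(pool)      # pick any template seen so far
        child = mutate(parent, rng)    # derive a candidate from it
        if child not in pool and is_effective(child):
            pool.append(child)         # retain effective, novel mutants
    return pool


if __name__ == "__main__":
    # Stub scorer (an assumption): a template "works" if it keeps a keyword.
    result = fuzz(["please answer the question"], lambda t: "please" in t)
    print(len(result), "templates retained")
```

Because retained mutants go back into the pool, later rounds can build on earlier successes, which is the property the table entry summarizes as retaining effective attacks.
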
Datasets
| Name | Type | Description | Paper |
|---|---|---|---|
| TensorTrust | Prompt extraction/hijacking; attack and defense | Gathered from a game. | 2311.01011v1 |
