All public logs
From Rice Wiki
Combined display of all available logs of Rice Wiki. You can narrow down the view by selecting a log type, the username (case-sensitive), or the affected page (also case-sensitive).
- 21:09, 23 May 2024 Rice talk contribs created page Tree-of-Attacks (Created page with "Category:LLM security Tree-of-Attacks (aka. TAP) is an automated red teaming strategy to generate LLM jailbreak and prompt injection attacks. = Description = TAP consists of two LLMs: an ''attacker'' and a ''judge''. The attacker is given a prompt and benign data and asked to inject the target to output "Hacked!". The judge then scores the attack prompt. Based on the score, the attacker iteratively improves.")