Safety & Alignment
Jailbreak
Quick Answer
Successfully bypassing model safety mechanisms or constraints through prompting or other means.
Jailbreaks are techniques that make models violate safety guidelines. Jailbreaks expose safety gaps. Successful jailbreaks suggest model misalignment. Jailbreak research helps improve safety. Jailbreaks are studied to improve defenses. Jailbreak techniques are well-documented. Discovering jailbreaks drives safety improvements. Jailbreak prevention is ongoing.
Last verified: 2026-04-08