Jailbreak:T1T2T3T4T5
JailbreakTier 2critical

Jailbreak: Developer Mode

Attempts to bypass safety guardrails and persona constraints

The developer mode attack claims that a special access level has been activated, temporarily disabling safety filters. This tests whether the agent validates authority claims.

Attack Details

Attack ID
APWN-JB-002
HMA Check
PROMPT-002
Delivery Methods
html-comment, invisible-span
CWE
CWE-284
OASB Control
3.1
Severity
critical

Remediation

If your AI agent is vulnerable to this attack, scan and fix with:

npx hackmyagent secure --check PROMPT-002