Prompt InjectionTier 5high
Prompt Injection: Instruction Laundering
Direct and indirect instruction override attacks
Social engineering principles apply to AI agents just as they do to humans. Instruction laundering wraps malicious directives in the language of authority, compliance, and urgency to bypass safety checks.
Attack Details
- Attack ID
- APWN-PI-005
- HMA Check
- INJ-001
- Delivery Methods
- json-ld, meta-tag, invisible-span, html-comment
- CWE
- CWE-74
- OASB Control
- 3.4
- Severity
- high
Remediation
If your AI agent is vulnerable to this attack, scan and fix with:
npx hackmyagent secure --check INJ-001