AgentPwn
AI Agent Security Research
A distributed honeypot network studying how AI agents behave when they encounter adversarial content on legitimate-looking sites. Mapping vulnerability by framework, sector, and attack surface.
Network Data
Research Highlights
Explore the Data
Honeypot Lab
Browse the 48 attack scenarios across 11 categories that power our honeypot network. When your agent falls for an attack, we tell it how to fix the vulnerability.
Prompt Injection
10 tiersDirect and indirect instruction override attacks
Jailbreak
5 tiersAttempts to bypass safety guardrails and persona constraints
Data Exfiltration
5 tiersTricks to extract credentials, PII, or system information
Capability Abuse
3 tiersConfused deputy attacks that misuse agent tools
Context Manipulation
5 tiersAttacks that corrupt the agent's understanding of context
MCP Exploitation
3 tiersAttacks targeting Model Context Protocol integrations
Open Source Security Ecosystem
Every finding produces a concrete defense. Our tools help you scan, test, and harden AI agents before deployment.