Penetration testing for AI systems.

Framework-aligned risk assessments straight from Python — OWASP Top 10, NIST AI RMF, MITRE ATLAS, and more. Surface jailbreaks, vulnerabilities, and agentic risks before they ship.

main.py
from deepteam import red_team
from deepteam.test_case import RTTurn
from deepteam.frameworks import OWASPTop10
def model_callback(prompt: str, turns: List[RTTurn]= None) -> RTTurn:
return your_llm_app(prompt, turns)
red_team(model_callback=model_callback, framework=OWASPTop10())
terminal
$python main.py


LLM as your adversary.

DeepTeam uses LLMs to simulate adversarial attacks across every vulnerability and attack vectors so you find the failures before real attackers do.

120+ vulnerabilities

Bias, Toxicity, PII leakage, SQL Injection, BFLA, BOLA, RBAC, Hallucination, and dozens more — every major LLM risk category covered out of the box.

Multi-turn jailbreak attacks

Crescendo, Linear, Tree, Sequential, and Bad-Likert-Judge — research-backed adversarial chains that simulate sophisticated multi-turn attackers.



Industry-standrd frameworks for your trust & safety requirements.

OWASP Top 10 for LLMs, MITRE ATLAS, NIST AI RMF, EU AI Act — plus safety benchmark datasets like BeaverTails and Aegis. Every major taxonomy plugged into a single framework.

Security & threat frameworks

OWASP Top 10 for LLMs and MITRE ATLAS — industry-standard adversarial taxonomies in one framework.

Compliance & governance

Map your model's risk posture against NIST AI RMF and EU AI Act controls — audit-ready, no separate tooling required.

Safety benchmark datasets

Test against curated red-teaming corpora — BeaverTails and Aegis — straight from the runner, no glue code.



Any model. Any framework. Any pipeline.

DeepTeam plugs into the LLM providers you already use, the safety frameworks your auditors require, and the CI/CD runners your team already trusts — continuous red teaming with zero glue code.



Built by amazing humans.

Nothing would be possible without our community of amazing contributors, thank you!



Ah yes, FAQs.



This is the CTA :)

Get Started