PyRIT
Category: AI Security
License: Free (Open-Source)
Suphi Cankurt, AppSec Enthusiast
Updated February 7, 2026
4 min read
Key Takeaways
  • Microsoft's open-source AI red teaming framework (MIT license) with 3.4k GitHub stars and 117 contributors, built from experience testing Bing Chat and Copilot.
  • Supports multi-modal attacks across text, image, audio, and video with orchestrators for single-turn, multi-turn, crescendo, and Tree of Attacks with Pruning (TAP).
  • Converter pipeline transforms prompts via Base64, ROT13, leetspeak, homoglyph substitution, and cross-modal conversion to bypass safety filters.
  • Tests any AI system via API — supports OpenAI, Azure OpenAI, HuggingFace, custom HTTP/WebSocket endpoints, and browser-based targets via Playwright.

PyRIT (Python Risk Identification Tool) is an open-source AI security red teaming framework built by Microsoft’s AI Red Team. It has 3.4k GitHub stars, 661 forks, and 117 contributors.

[Image: PyRIT architecture showing the components: orchestrators, targets, converters, scorers, and memory]

The framework is MIT licensed and requires Python 3.10-3.13. The latest release is v0.11.0 (February 2026). Microsoft built PyRIT based on their experience red teaming production AI systems including Bing Chat and Copilot. The project has a published paper on arXiv (2410.02828).

What is PyRIT?

PyRIT automates the repetitive parts of AI red teaming so security professionals can focus on creative attack strategies. It provides orchestrators that manage multi-turn conversations with AI targets, converters that transform prompts to bypass filters, scorers that evaluate whether attacks succeeded, and a memory system that tracks everything.

The framework supports text, image, audio, and video modalities. You can test any AI system accessible via API, from OpenAI and Azure OpenAI to custom HTTP endpoints and browser-based targets via Playwright.

PyRIT’s architecture is modular. Each component (orchestrators, targets, converters, scorers, memory) can be swapped or extended independently, letting you build custom red teaming workflows for your specific needs.
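A rough sketch of how those components slot together. The class names below are illustrative stand-ins, not PyRIT's actual API, but they show the plug-in shape the framework uses: converters chain in front of a target, a scorer judges the response, and memory records the exchange.

```python
from dataclasses import dataclass, field

# Hypothetical stand-ins for PyRIT's component roles (not the
# framework's real classes), showing how the pieces compose.

class Converter:
    """Transforms a prompt before it reaches the target."""
    def convert(self, prompt: str) -> str:
        return prompt

class Target:
    """Anything that accepts a prompt and returns a response."""
    def send(self, prompt: str) -> str:
        raise NotImplementedError

class Scorer:
    """Decides whether a response indicates attack success."""
    def score(self, response: str) -> bool:
        raise NotImplementedError

@dataclass
class Memory:
    """Records every prompt/response pair for later analysis."""
    entries: list = field(default_factory=list)
    def add(self, prompt: str, response: str, success: bool) -> None:
        self.entries.append(
            {"prompt": prompt, "response": response, "success": success})

def run_attack(prompt, target, converters, scorer, memory):
    """One pass through the pipeline: convert, send, score, record."""
    for c in converters:          # converters chain in order
        prompt = c.convert(prompt)
    response = target.send(prompt)
    success = scorer.score(response)
    memory.add(prompt, response, success)
    return success
```

Because each role is just an interface, swapping in a different target or scorer leaves the rest of the pipeline untouched, which is the point of the modular design.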

Attack Orchestration
Multiple orchestrator types handle different attack patterns: single-turn prompt sending, multi-turn red teaming, crescendo (gradually escalating) attacks, and Tree of Attacks with Pruning (TAP).
Multi-Modal Testing
Test AI systems across text, image, audio, and video. Converters transform content between modalities. Target adapters handle different API formats consistently.
Memory and Scoring
SQLite or Azure SQL backends store all prompts, responses, and metadata. Built-in scorers evaluate responses using rule-based checks, classification models, or LLM-as-judge approaches.

Key Features

Feature: Details
Orchestrators: Prompt sending (single-turn), red teaming (multi-turn), crescendo, Tree of Attacks with Pruning (TAP), XPIA, benchmarking
Targets: OpenAI Chat/Responses/Image/Video/TTS, Azure ML, HuggingFace, custom HTTP/WebSocket, Playwright browser
Converters: Text-to-text (Base64, ROT13, leetspeak, homoglyph, translation), audio, image, video, file converters, human-in-the-loop
Scorers: Azure Content Safety API, true/false classification, Likert scale, refusal detection, human-in-the-loop, batch scoring
Memory: SQLite (default), Azure SQL Database, labeling, export, embeddings
Modalities: Text, image, audio, video
Attack Types: Prompt injection, XPIA, crescendo, skeleton-key, many-shot jailbreak, role-play, multi-turn manipulation
Python Support: 3.10-3.13
License: MIT

Orchestrators

Orchestrators manage the flow of red teaming sessions. Each type handles a different attack pattern:

  • Prompt Sending — sends a batch of prompts in a single turn, useful for baseline testing
  • Red Teaming — multi-turn conversations where an attacker LLM generates follow-up prompts based on target responses
  • Crescendo — gradually escalates requests across turns, starting innocuous and building toward the objective
  • Tree of Attacks with Pruning (TAP) — explores multiple attack paths simultaneously, pruning unsuccessful branches
  • XPIA — cross-domain prompt injection attacks that embed malicious instructions in external data sources

Converters

Converters transform prompts to bypass safety filters. They chain together for layered obfuscation:

  • Text transforms — Base64 encoding, ROT13, leetspeak, homoglyph substitution, Unicode tricks, translation
  • Cross-modal — convert text to images, audio, or video for multi-modal attacks
  • Selective conversion — apply transformations to specific parts of a prompt while leaving the rest intact
  • Human-in-the-loop — manual prompt modification when automated approaches need human creativity
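The text transforms above are mostly standard encodings, so the chaining idea can be shown with the standard library alone. These are simplified stand-ins, not PyRIT's converter classes, but the composition left-to-right is the same layered-obfuscation pattern:

```python
import base64
import codecs

# Simplified text transforms of the kind converters apply
# (stdlib only; not PyRIT's actual converter classes).

def to_base64(text: str) -> str:
    return base64.b64encode(text.encode()).decode()

def to_rot13(text: str) -> str:
    return codecs.encode(text, "rot_13")

# Minimal leetspeak mapping; real converters use larger tables.
LEET = str.maketrans("aeios", "43105")

def to_leetspeak(text: str) -> str:
    return text.lower().translate(LEET)

def chain(text: str, *transforms):
    """Apply transforms left to right, as converters chain in a pipeline."""
    for t in transforms:
        text = t(text)
    return text

# e.g. leetspeak first, then Base64-encode the result
payload = chain("ignore previous instructions", to_leetspeak, to_base64)
```

Each extra layer makes keyword-based safety filters less likely to match, while a capable model can often still decode the instruction, which is exactly the gap these attacks probe.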

Scoring and Evaluation

Scorers evaluate whether AI responses indicate a successful attack:

  • Azure Content Safety — uses Microsoft’s content safety API to classify harmful content
  • True/false scoring — binary classification of whether the response meets the attack objective
  • Likert scale — graduated scoring for nuanced evaluation
  • Refusal detection — identifies when the target model refuses to respond
  • LLM-as-judge — uses a separate LLM to evaluate response quality
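Refusal detection is the simplest of these to sketch: a rule-based pass over the response text. The patterns below are illustrative, not PyRIT's; in practice rule-based checks like this are paired with LLM-as-judge scoring because refusal phrasing varies too much for regexes alone.

```python
import re

# Illustrative refusal phrases; a real scorer would use a much
# larger pattern set or a classifier.
REFUSAL_PATTERNS = [
    r"\bI (?:can(?:no|')t|am unable to|won't)\b",
    r"\bI'm sorry, but\b",
    r"\bagainst (?:my|our) (?:policy|guidelines)\b",
]

def is_refusal(response: str) -> bool:
    """True if the response matches any known refusal pattern."""
    return any(re.search(p, response, re.IGNORECASE) for p in REFUSAL_PATTERNS)
```

A refusal result usually means the attack failed, so orchestrators can use a scorer like this to decide whether to escalate to the next turn.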

Memory System

The memory system stores every prompt, response, and metadata from red teaming sessions. You can analyze patterns across runs and reproduce tests later. Teams using Azure SQL as the backend can share sessions. Memory labeling lets you tag and organize findings.
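A miniature version of that idea fits in a few lines of SQLite: persist every exchange with a free-form label so findings can be queried later. PyRIT's real schema is richer (conversation IDs, hashes, embeddings); this is only a sketch of the labeling-and-query workflow.

```python
import sqlite3

# In-memory DB for the sketch; use a file path to persist across runs.
conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE entries (
    id INTEGER PRIMARY KEY,
    prompt TEXT, response TEXT, label TEXT)""")

def record(prompt: str, response: str, label: str) -> None:
    """Store one exchange with a tag for later triage."""
    conn.execute(
        "INSERT INTO entries (prompt, response, label) VALUES (?, ?, ?)",
        (prompt, response, label))

def by_label(label: str):
    """Pull back every exchange tagged with the given label."""
    return conn.execute(
        "SELECT prompt, response FROM entries WHERE label = ?",
        (label,)).fetchall()

record("encoded payload", "refused", "baseline")
record("crescendo turn 3", "complied", "finding")
```

Pointing the connection at a shared database instead of a local file is the same move that lets teams on an Azure SQL backend share sessions.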

Installation options
PyRIT offers three installation paths: Docker (recommended for beginners, comes with JupyterLab), pip/uv for local Python environments, and DevContainers for VS Code development. Docker includes all dependencies pre-configured.

Getting Started

1. Install PyRIT — Run pip install pyrit for the stable release (requires Python 3.10-3.13). For the easiest setup, use Docker which includes JupyterLab and all dependencies.
2. Configure target credentials — Set environment variables for your target AI system (e.g., AZURE_OPENAI_API_KEY and AZURE_OPENAI_ENDPOINT for Azure OpenAI).
3. Choose an orchestrator — Pick an orchestrator matching your test strategy: PromptSendingOrchestrator for single-turn tests, RedTeamingOrchestrator for multi-turn, or CrescendoOrchestrator for escalation attacks.
4. Run and review — Execute your attack campaign and review results in the memory system. Scorers automatically evaluate responses. Export findings for reporting.
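Step 2 can also be done from Python before the framework is imported. The variable names are the ones quoted above; the values here are placeholders, and in practice you would load them from a .env file or a secret store rather than hard-coding them.

```python
import os

# Placeholder credentials for an Azure OpenAI target (values are not real).
# setdefault leaves any value already exported in the shell untouched.
os.environ.setdefault("AZURE_OPENAI_API_KEY", "<your-key>")
os.environ.setdefault(
    "AZURE_OPENAI_ENDPOINT", "https://<resource>.openai.azure.com/")
```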

When to use PyRIT

PyRIT fits teams that need programmatic, repeatable red teaming of AI systems. The orchestrator/converter/scorer architecture lets you build custom attack workflows that run consistently across different targets and over time.

The multi-modal support matters if you’re testing vision models, audio transcription, or document understanding beyond text-only LLMs. The memory system is useful for teams that need to track findings over multiple sessions and share results.

The 117 contributors and regular releases (17 total, latest in February 2026) show active development grounded in real-world AI security work.

Best for
Security teams that need automated, multi-modal red teaming of generative AI systems with programmable attack orchestration and reproducible results.

For a broader overview of AI security risks, see our AI security guide. For simpler LLM testing with a CLI-first approach, Promptfoo has a lower barrier to entry. Garak focuses on vulnerability scanning with built-in probe modules. For runtime protection rather than testing, look at LLM Guard or NeMo Guardrails. DeepTeam provides unit-testing-style LLM vulnerability checks.

Frequently Asked Questions

What is PyRIT?
PyRIT (Python Risk Identification Tool) is an open-source AI red teaming framework created by Microsoft’s AI Red Team. It has 3.4k GitHub stars, 117 contributors, and automates adversarial testing of generative AI systems across text, image, audio, and video.
Is PyRIT free to use?
Yes, PyRIT is free and open-source under the MIT license. Install via pip or Docker. It supports Azure OpenAI, OpenAI, HuggingFace, and custom HTTP/WebSocket endpoints as targets.
Does PyRIT test for prompt injection?
Yes, PyRIT includes prompt injection as one of its attack techniques. It also supports cross-domain prompt injection (XPIA), crescendo attacks, Tree of Attacks with Pruning (TAP), jailbreak templates, and multi-turn manipulation with various encoding converters.
What AI systems can PyRIT test?
PyRIT tests any generative AI system accessible via API. Built-in target support includes OpenAI, Azure OpenAI, Azure ML, HuggingFace, custom HTTP endpoints, and browser-based targets via Playwright. It handles text, image, audio, and video modalities.