For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
ModelsChatRankingsDocs
DocsAPI ReferenceClient SDKsAgent SDKCookbookChangelog
DocsAPI ReferenceClient SDKsAgent SDKCookbookChangelog
  • Get Started
    • Quickstart: Build a Chat App
    • Enterprise Quickstart
    • Free Models Router
  • Working with Coding Agents
    • Automatic Code Review
    • Claude Code
    • Claude Desktop
    • Codex CLI
    • Cursor
    • Hermes Agent
    • Junie CLI
    • MCP Servers
    • OpenClaw 🦞
    • OpenCode
  • Building Agents
    • Add Human-in-the-Loop Controls
    • Build a Long-Horizon Agent
    • Build Your Own Agent TUI
    • Build Your Own Headless Agent
  • Video Generation
    • Choose a Video Generation Model
    • Generate and Download a Video from Text
    • Get Video Results with Webhooks
    • Guide a Video with Reference Images
    • Turn an Image into a Video
    • Use Provider-Specific Video Options
  • Evaluate & Optimize
    • Distillation
    • RAG with Embeddings & Rerank
    • Red Teaming
  • Administration
    • Activity Export
    • API Key Rotation
    • Crypto API
    • Organization Management
    • Usage Accounting
    • User Tracking
LogoLogo
ModelsChatRankingsDocs
On this page
  • Approval Required
  • Legitimate Red Teaming
  • Compatibility with Zero Data Retention
  • Request Approval
Evaluate & Optimize

Red Teaming

Policy for red teaming and adversarial testing on OpenRouter
Was this page helpful?
Previous

Activity Export

Next
Built with

Red teaming (testing AI applications and/or models for prompt injection, jailbreaking, or other adversarial scenarios) is a valuable part of AI safety research, but may also violate model terms-of-service.

Approval Required

Because most model and provider terms of service do not support red-teaming, you cannot red-team models on OpenRouter without prior approval. This includes prompt injection, jailbreaking, or any attempt to get models to behave in ways that violate their terms of service.

Customers who engage in red teaming without approval may be flagged for model or provider terms of service violations. Unapproved red teaming may result in:

  • Provider bans
  • Model access restrictions
  • Platform-wide account bans

Legitimate Red Teaming

OpenRouter supports legitimate AI safety research and red teaming with prior approval. Getting approval allows us to coordinate with providers and prevent your account from being flagged for policy violations.

If you have a legitimate use case for adversarial testing, we encourage you to reach out.

Compatibility with Zero Data Retention

Note that certain types of safety classifiers are run online (while the prompt is in-flight and in-memory). Prompts may therefore be flagged by classifiers even with full ZDR configured. These classifiers operate independently of data retention policies and are fully compatible with Zero Data Retention (ZDR).

Request Approval

To request red teaming approval, email safety@openrouter.ai with the following details:

  • A description of your research or use case
  • The models & providers you intend to test
  • The types of adversarial techniques you plan to use
  • Your expected timeline

Approval generally takes 5 business days. Approval is not guaranteed and is granted at OpenRouter’s discretion based on the details of your use case.