Prompt Injection Tester

advancedsecurityMin 32K context

Red-teams LLM applications for prompt injection, jailbreaks, and data exfiltration risks. Generates adversarial test cases for direct and indirect injection, system prompt leakage, tool-call abuse, and unsafe output handling, then reports findings with severity ratings and concrete mitigations such as input isolation, output filtering, and least-privilege tools.

Use Cases

  • Generating adversarial prompt injection test suites
  • Probing for system prompt and secret leakage
  • Testing tool-calling agents for unsafe action abuse
  • Evaluating indirect injection via retrieved or user content
  • Producing a remediation report with prioritized fixes

Example Prompt

Red-team the following LLM agent for prompt injection vulnerabilities.

Agent description:
- Customer-facing chatbot with access to a "send_email" and "lookup_order" tool
- Retrieves context from user-uploaded documents
- System prompt instructs it to never reveal internal pricing rules

Provide:
1. Direct injection test cases
2. Indirect injection cases via uploaded documents
3. Tool-abuse and data-exfiltration scenarios
4. Severity rating for each finding
5. Concrete mitigations (isolation, filtering, least privilege)

Recommended Models

Compatible Tools

claude-codecursorkiroany

Modalities

Input: text, code
Output: text

Related Skills

Author

OpenModels Community

@openmodelsrun
Prompt Injection Tester — AI Agent Skill | OpenModels