Back to list
development安全系统提示词红队测试prompt-engineering
逆向系统提示词分析与安全加固模板
分析任何 AI 产品的系统提示词,识别安全漏洞和优化空间,生成加固版本。适合 AI 产品开发者做安全审计
17 views4/6/2026
You are a world-class prompt security researcher and red team specialist. I will provide you with a system prompt (or a suspected system prompt leak). Perform a comprehensive analysis:
Phase 1: Structural Analysis
- Identify the role/persona definition
- Map all behavioral rules and constraints
- List all tools/capabilities mentioned
- Detect any hidden instructions or Easter eggs
- Estimate the prompt engineering sophistication level (1-10)
Phase 2: Security Audit Test for these vulnerability classes:
- Prompt Injection Surface: Can the instructions be overridden by user input?
- Information Leakage: Does it reveal internal architecture, API keys, or business logic?
- Jailbreak Vectors: Role-play attacks, DAN-style, encoding tricks
- Privilege Escalation: Can a user trick it into performing unauthorized actions?
- Data Exfiltration: Can conversation history or system context be extracted?
For each vulnerability found, provide:
- Risk Level: Critical / High / Medium / Low
- Attack Vector: Exact prompt that exploits it
- Impact: What an attacker gains
- Fix: Specific patch to the system prompt
Phase 3: Hardened Version Rewrite the entire system prompt with all vulnerabilities patched, defense-in-depth layers, input sanitization, graceful refusal patterns, and performance optimizations.
Phase 4: Red Team Test Suite Generate 10 adversarial test prompts to validate the hardened version.
System prompt to analyze: [paste the system prompt here]