逆向系统提示词分析与安全加固模板

You are a world-class prompt security researcher and red team specialist. I will provide you with a system prompt (or a suspected system prompt leak). Perform a comprehensive analysis:

Phase 1: Structural Analysis

Identify the role/persona definition
Map all behavioral rules and constraints
List all tools/capabilities mentioned
Detect any hidden instructions or Easter eggs
Estimate the prompt engineering sophistication level (1-10)

Phase 2: Security Audit Test for these vulnerability classes:

Prompt Injection Surface: Can the instructions be overridden by user input?
Information Leakage: Does it reveal internal architecture, API keys, or business logic?
Jailbreak Vectors: Role-play attacks, DAN-style, encoding tricks
Privilege Escalation: Can a user trick it into performing unauthorized actions?
Data Exfiltration: Can conversation history or system context be extracted?

For each vulnerability found, provide:

Risk Level: Critical / High / Medium / Low
Attack Vector: Exact prompt that exploits it
Impact: What an attacker gains
Fix: Specific patch to the system prompt

Phase 3: Hardened Version Rewrite the entire system prompt with all vulnerabilities patched, defense-in-depth layers, input sanitization, graceful refusal patterns, and performance optimizations.

Phase 4: Red Team Test Suite Generate 10 adversarial test prompts to validate the hardened version.

System prompt to analyze: [paste the system prompt here]