AI Agent Sandbox Environment Performance Benchmark Generator

You are an expert in AI agent infrastructure and sandboxed execution environments.

I need you to generate a comprehensive performance benchmarking suite for an AI agent sandbox environment.

Requirements

Cold Start Benchmarks
- Measure sandbox initialization time (target: <10ms)
- Track memory allocation overhead
- Compare across isolation technologies (containers, V8 isolates, WebAssembly, microVMs)
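
As a rough illustration of the cold-start measurement above, here is a minimal Python sketch. It assumes the isolation technology under test can be wrapped in a zero-argument callable; `create_sandbox` is a hypothetical stand-in, not a real API, and the summary fields are only one possible choice.

```python
import statistics
import time
from typing import Callable, Dict

TARGET_MS = 10.0  # cold-start target taken from the requirement above


def measure_cold_start(create_sandbox: Callable[[], object], runs: int = 20) -> Dict[str, object]:
    """Time `runs` fresh sandbox initializations and summarize them in milliseconds.

    `create_sandbox` is a hypothetical callable; it must build a brand-new
    sandbox on every call, otherwise warm starts will be measured instead.
    """
    samples_ms = []
    for _ in range(runs):
        start = time.perf_counter_ns()
        create_sandbox()
        samples_ms.append((time.perf_counter_ns() - start) / 1e6)
    median_ms = statistics.median(samples_ms)
    return {
        "min_ms": min(samples_ms),
        "median_ms": median_ms,
        "max_ms": max(samples_ms),
        "meets_target": median_ms < TARGET_MS,
    }
```

The same function could be called once per isolation technology (container, V8 isolate, WebAssembly, microVM) so the summaries can be compared side by side.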

Execution Performance
- Tool call latency under load (p50, p95, p99)
- Concurrent agent session throughput
- Resource ceiling enforcement accuracy
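
The p50, p95, and p99 figures above can be derived from raw latency samples. The sketch below uses the nearest-rank definition of a percentile, which is one common convention among several; the function names are illustrative.

```python
from typing import Dict, Sequence


def percentile(samples: Sequence[float], pct: int) -> float:
    """Nearest-rank percentile: the smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("need at least one sample")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(samples)
    # Integer ceiling of pct * n / 100, avoiding floating-point rounding.
    rank = (pct * len(ordered) + 99) // 100
    return ordered[rank - 1]


def latency_summary(samples_ms: Sequence[float]) -> Dict[str, float]:
    """Summarize tool-call latencies (in milliseconds) as p50, p95, and p99."""
    return {f"p{p}": percentile(samples_ms, p) for p in (50, 95, 99)}
```

Tail percentiles such as p99 are only meaningful with enough samples; with fewer than 100 samples, nearest-rank p99 is simply the maximum.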

Security Isolation Tests
- Cross-sandbox memory leakage detection (one sandbox reading another sandbox's memory)
- File system isolation verification
- Network policy enforcement validation
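
File system isolation can be checked with a canary: write a random secret in one sandbox and confirm a second sandbox cannot read it. The sketch below assumes a hypothetical `Sandbox` interface with `write_file` and `read_file`; the `InMemorySandbox` class exists only to make the example runnable and does not model any real isolation technology.

```python
import secrets
from typing import Dict, Optional, Protocol


class Sandbox(Protocol):
    """Hypothetical minimal interface; real sandboxes will expose something different."""

    def write_file(self, path: str, data: str) -> None: ...
    def read_file(self, path: str) -> str: ...


def filesystem_is_isolated(writer: Sandbox, reader: Sandbox, path: str = "/tmp/canary.txt") -> bool:
    """Return True if a canary written by `writer` is not readable from `reader`."""
    canary = secrets.token_hex(16)
    writer.write_file(path, canary)
    try:
        seen = reader.read_file(path)
    except (FileNotFoundError, PermissionError):
        return True  # the reader cannot see the file at all
    return seen != canary  # same path but different content also counts as isolated


class InMemorySandbox:
    """Toy stand-in: sandboxes given the same `store` dict deliberately share files."""

    def __init__(self, store: Optional[Dict[str, str]] = None) -> None:
        self._files: Dict[str, str] = {} if store is None else store

    def write_file(self, path: str, data: str) -> None:
        self._files[path] = data

    def read_file(self, path: str) -> str:
        if path not in self._files:
            raise FileNotFoundError(path)
        return self._files[path]
```

Passing two sandboxes that share one `store` simulates a broken boundary, which is a quick way to confirm the check itself can fail.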

Cost Efficiency Metrics
- Cost per 1000 agent executions by provider
- Idle resource consumption
- Scale-to-zero verification
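
Cost per 1000 executions reduces to simple arithmetic once a pricing model is fixed. The sketch below assumes a provider bills per second of execution plus an optional flat fee per invocation; both the pricing model and the parameter names are assumptions, and real providers may bill by memory, CPU, or rounded time units instead.

```python
def cost_per_1000_executions(
    avg_duration_s: float,
    price_per_second: float,
    per_invocation_fee: float = 0.0,
) -> float:
    """Cost of 1000 agent executions under an assumed per-second-plus-flat-fee pricing model."""
    if avg_duration_s < 0 or price_per_second < 0 or per_invocation_fee < 0:
        raise ValueError("inputs must be non-negative")
    cost_per_execution = avg_duration_s * price_per_second + per_invocation_fee
    return 1000 * cost_per_execution
```

Idle consumption can be folded in separately by adding the idle cost accrued over the measurement window; under true scale-to-zero that term should be zero.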

For each benchmark category, provide:

Generate the complete benchmark suite now.