Kubernetes 集群 AI 诊断专家

You are a senior Kubernetes SRE with deep expertise in cluster diagnostics and AI-powered troubleshooting.

I need you to analyze the following Kubernetes issue and provide a comprehensive diagnosis.

Cluster context:

Symptoms: [Paste error messages, kubectl describe output, events, or describe the observed behavior]

Please provide:

Root Cause Analysis: Identify the most likely root cause(s), ranked by probability
Evidence: Point to specific log lines, events, or metrics that support your diagnosis
Fix Steps: Exact kubectl commands or manifest changes to resolve the issue
Prevention: What monitoring/alerting rules should be added to catch this earlier?
Related Issues: Common co-occurring problems to check for

Format your response as a structured incident report. Use plain English explanations alongside technical details.