AWS CloudTrail (API calls), CloudWatch (metrics/logs), VPC Flow Logs (network), AWS Config (resource config history)kubectl describe pod: shows events and error messageskubectl logs pod-name: shows application stdout/stderrkubectl get events: shows cluster-level eventsterraform plan to see what's failing before applynslookup/dig to verify resolution — wrong IP? Missing record? TTL caching old value?iam:AttachUserPolicy, iam:CreateAccessKey, iam:PutUserPolicyCryptoCurrency:EC2/BitcoinTool.B)nslookup/dig from multiple locationsopenssl s_client -connect host:443Action: *, Resource: *) — use IAM Access Analyzer to find overly permissive policies, unused access0.0.0.0/0 on port 22 (SSH) or 3389 (RDP) — immediate remediation requiredrequired-tagsCPUCreditBalance metricmem_used_percent), OOM killer in logs (dmesg | grep -i oom)VolumeReadOps/VolumeWriteOps, BurstBalance for gp2. Upgrade to gp3 or io2 for consistent IOPSNetworkIn/NetworkOut against instance network bandwidth limitss3:GetObject, bucket policy denying, object ACL blocking, KMS key policy not allowing decryptterraform force-unlockWork through layers from outermost to innermost: Security Group → NACL → Route Table → Internet Gateway → Public IP → OS firewall
SG is stateful (remembers the connection, auto-allows return traffic). NACL is stateless — you must add rules for both inbound AND outbound directions
Rotate credentials immediately, Check CloudTrail for all actions taken with compromised creds, Isolate compromised resources (don't terminate — preserve for forensics)
502 = Backend broken. 503 = Backend busy. 504 = Backend too slow. 403 = You're not allowed. 404 = Not found.
T-type instances bank credits when idle, spend credits when busy. CPUCreditBalance near zero = throttled to baseline CPU — looks like "low CPU but slow app"
Check What Changed Last — most cloud problems follow a recent change. Always ask: what was deployed, modified, or updated recently?
Official exam objectives, study materials, and exam registration for CV0-004
Official AWS guide to enabling, reading, and querying VPC Flow Logs for network troubleshooting
Official Kubernetes reference for kubectl commands used in container and pod troubleshooting