P95 latency is suddenly four seconds
Fine in staging, dies in production. The cause is almost always one of three things, and almost never the one the team suspects. We instrument the real request path (frontend, API, database, model), find the actual bottleneck, and ship the fix.