As asked
A customer calls saying their integration stopped returning data an hour ago. No errors in their logs. Walk me through exactly how you diagnose this, starting from zero context.
Sample answer outline
Strong answers work outward from the symptom: check the customer's request shape, then the API gateway logs, then the upstream service logs, correlating by trace ID or timestamp. Candidates should mention reproducing the exact request locally or via curl, verifying auth tokens haven't expired, and checking whether a recent deploy or config change correlates with the failure window.
Expect these follow-ups
- How would your approach differ if the customer is in a timezone 8 hours ahead and you can't get them on a call?
- What do you add to the API response to make this kind of debugging easier next time?