As asked
Name the four golden signals from the Google SRE book, explain what each measures, and give me an example of a specific metric for each in a typical web service.
Sample answer outline
Latency (time to serve a request: http_request_duration_seconds), Traffic (demand on the system: http_requests_total rate), Errors (rate of failed requests: http_requests_total where status=5xx), Saturation (how full the system is: cpu_usage_seconds rate vs limit, or queue depth). They were chosen because they are the signals most directly correlated with user experience without being so low-level that they require deep system knowledge to interpret. A strong answer notes that saturation is the hardest to measure because it requires knowing capacity as well as usage.
Expect these follow-ups
- Which golden signal is most likely to predict a problem before users notice it?
- How do the four golden signals map to RED and USE methods?