API Server & Client
Errori 5xx sull’API server
Section titled “Errori 5xx sull’API server”sum(rate(apiserver_request_total{job="apiserver", code=~"5.."}[1m])) by (instance, job)/sum(rate(apiserver_request_total{job="apiserver"}[1m])) by (instance, job) * 100 > 3Errori client-side (4xx/5xx) su rest_client
Section titled “Errori client-side (4xx/5xx) su rest_client”( sum(rate(rest_client_requests_total{code=~"(4|5).."}[1m])) by (instance, job) / sum(rate(rest_client_requests_total[1m])) by (instance, job)) * 100 > 1Latenza API server (p99, escluse WATCH/CONNECT)
Section titled “Latenza API server (p99, escluse WATCH/CONNECT)”histogram_quantile(0.99, sum(rate(apiserver_request_duration_seconds_bucket{verb!~"(?:CONNECT|WATCHLIST|WATCH|PROXY)"}[10m])) without (subresource)) > 1Certificati client in scadenza (entro 7 giorni)
Section titled “Certificati client in scadenza (entro 7 giorni)”apiserver_client_certificate_expiration_seconds_count{job="apiserver"} > 0andhistogram_quantile(0.01, sum by (job, le) (rate(apiserver_client_certificate_expiration_seconds_bucket{job="apiserver"}[5m]))) < 7*24*60*60Certificati client in scadenza (entro 24h — urgente)
Section titled “Certificati client in scadenza (entro 24h — urgente)”apiserver_client_certificate_expiration_seconds_count{job="apiserver"} > 0andhistogram_quantile(0.01, sum by (job, le) (rate(apiserver_client_certificate_expiration_seconds_bucket{job="apiserver"}[5m]))) < 24*60*60Rate di richieste per verbo/risorsa (capire chi genera carico)
Section titled “Rate di richieste per verbo/risorsa (capire chi genera carico)”sum by (verb, resource) (rate(apiserver_request_total{job="apiserver"}[5m]))