You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With #205 merged we have a few new dashboards for the control plane (apiserver, scheduler, proxy, kubelet).
Here are a few TODOs outline for the future:
We should unclutter the names and separate components and workload dashboards more.
We should make sure that components alerts are represented in dashboards. Example: KubeAPIErrorsHigh needs to be visible in the apiserver dashboard. Reuse recording rule.
Reuse more recording rules for control plane dashboards (lots of similar queries across dashboards).
Go metrics about components should probably be separated. Either own dashboard or no need at all? Let's discuss.
Go metrics about components should probably be separated. Either own dashboard or no need at all? Let's discuss.
I find those really useful when you are debugging a control plane failure, OOMs / crashloops etc.
The way I envisioned this is that SREs get a single view where they look for symptoms, like "cpu is going thru the roof and we are getting tons of requests, " type of deal.
With #205 merged we have a few new dashboards for the control plane (apiserver, scheduler, proxy, kubelet).
Here are a few TODOs outline for the future:
/cc @povilasv @brancz
The text was updated successfully, but these errors were encountered: