-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Find CAPI alternatives for our inhibitions #3338
Comments
We can use CAPI conditions and phases (thanks @nprokopic for the suggestion!). In particular, we already have metrics exposed by the capi controller. On a cluster level (probably what we want for the inhibitions:
On the control plane level:
On the worker nodes level:
|
Opened giantswarm/prometheus-rules#1153 to update the inhibition expressions using capi metrics. |
PR has been merged. Note: from the review It might be somewhat useful when using the cluster downscaler if scaling some worker nodepools to 0 causes alerts, but at the moment it is not used. |
Motivation
Part of #3315.
As part of Atlas' efforts to migrate alerting to mimir, they asked us to look into out inhibitions (cluster_upgrading, cluster_creating) as they use metrics coming from vintage components and only work in vintage.
Todo
The text was updated successfully, but these errors were encountered: