Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
split ksm alerts in 2 separate ones #912
split ksm alerts in 2 separate ones #912
Changes from 1 commit
5e8fe73
68a3ac8
d7f5331
b67266c
8abf5e1
7e97dd2
671c4f5
834eb01
4e55467
95030ca
c88a232
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's the difference with KubeSecretMetricMissing and KubeStateMetricsSlow?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also we have a KSMdown inhibition :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
From my own understanding,
KubeStateMetricsSlow
is monitoring the response time from KSM to make sure it doesn't take too long to retrieve metrics whileKubeStateMetricsNotRetrievingMetrics
is making sure that there are actually metrics retrieved by KSMBut maybe we can consider that if KSM is taking too long to retrieve metrics then it means that it's unable to retrieve it and thus we can get rid of
KubeStateMetricsNotRetrievingMetrics
to only rely onKubeStateMetricsSlow
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure about the slow one but is this alert not the same as those
prometheus-rules/helm/prometheus-rules/templates/alerting-rules/kube-state-metrics.rules.yml
Line 31 in 737a5e9
Also maybe we should regroup all KSM related alerts to this file
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure, is
kube_configmap_created
a random metric normally retrieved by KSM that we check to make sure KSM is retrieving metrics in general ?In that case I'd prefer also keeping the new alert because its name is more straightforward about its usage and moreover it indicates that KSM as a whole is not able to retrieve metrics while the
KubeConfigMapCreatedMetricMissing
would indicate that KSM is not able to retrive metrics from a particular instanceThere was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well kube_configmap_created is the metric exposed by KSM when access the apiserver but we can have both if you prefer
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess it would make sense to have both yes