-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add alerts for alloy-metrics #1417
Conversation
…storage.rules.yml
…logging.rules.yaml
…deployment.workload-cluster.rules.yml
…prometheus-operator.rules.yml
…rules/chart.rules.yml
…rules/helm.rules.yml
…s/helm-operations.rules.test.yml
Ops-recipe update https://github.com/giantswarm/giantswarm/pull/32043 |
test/tests/providers/global/platform/atlas/alerting-rules/alloy.rules.test.yml
Show resolved
Hide resolved
cancel_if_cluster_status_creating: "true" | ||
cancel_if_cluster_status_deleting: "true" | ||
## This alert pages if any of the monitoring-agent shard is not running. | ||
- alert: MonitoringAgentShardsNotSatisfied |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's no unit test for this one and its associated inhibition?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I might have forgotten them
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I added alerts for those :)
helm/prometheus-rules/templates/platform/atlas/alerting-rules/monitoring-pipeline.rules.yml
Show resolved
Hide resolved
test/tests/providers/global/platform/atlas/alerting-rules/alloy.rules.test.yml
Outdated
Show resolved
Hide resolved
test/tests/providers/global/platform/atlas/alerting-rules/alloy.rules.test.yml
Outdated
Show resolved
Hide resolved
…y.rules.test.yml Co-authored-by: Hervé Nicol <[email protected]>
…y.rules.test.yml Co-authored-by: Hervé Nicol <[email protected]>
Do you mind reviewing the opsrecipes again @hervenicol |
Before adding a new alerting rule into this repository you should consider creating an SLO rules instead.
SLO helps you both increase the quality of your monitoring and reduce the alert noise.
Towards: giantswarm/roadmap#3590
This PR adds alert for the alloy monitoring component. See Changelog for more context
Checklist
oncall-kaas-cloud
GitHub group).