Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alloy monitoring #3590

Closed
Tracked by #3520
Rotfuks opened this issue Jul 23, 2024 · 6 comments
Closed
Tracked by #3520

Alloy monitoring #3590

Rotfuks opened this issue Jul 23, 2024 · 6 comments
Assignees
Labels
team/atlas Team Atlas

Comments

@Rotfuks
Copy link
Contributor

Rotfuks commented Jul 23, 2024

When one component from alloy fails, we should get an alert.

Related incident: we had an incident where alloy-rules failed to load rules on all installations, and got no alert (https://gigantic.slack.com/archives/C01176DKNP4/p1720628254293449)

Probably Alloy mixins provide all we need:

  • servicemonitors to retrieve data
  • dashboards for easy investigation
  • sensible alerts
@github-project-automation github-project-automation bot moved this to Inbox 📥 in Roadmap Jul 23, 2024
@Rotfuks Rotfuks added the team/atlas Team Atlas label Jul 23, 2024
@QuentinBisson
Copy link

For mixins, please, do it the same way as the mimir and loki mixins and do not import them by hand

@QuentinBisson
Copy link

@TheoBrigitte I think you added mixins already right? We are probably only missing sensible alerts?

@QuentinBisson
Copy link

QuentinBisson commented Oct 30, 2024

Current status:

Missing:

  • Opsrecipes for the alloy components health alerts
  • Alloy monitoring pipeline alerts and ops-recipes.

@giantswarm/team-atlas do we want to add alerts related to alloy clustering for this? Is it too soon?

@QuentinBisson
Copy link

QuentinBisson commented Nov 6, 2024

@QuentinBisson
Copy link

Blocked waiting for reviews

@QuentinBisson
Copy link

This is done, will need adjustsments

@github-project-automation github-project-automation bot moved this from Inbox 📥 to Done ✅ in Roadmap Nov 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
team/atlas Team Atlas
Projects
Archived in project
Development

No branches or pull requests

2 participants