Parallelize CVMFS monitoring to bring back the CVMFS Grafana dashboard #1280

Merged: 1 commit, Jul 25, 2024

@kysrpex kysrpex commented Jul 25, 2024

The script `/usr/bin/check_cvmfs_repos` installed by the CVMFS monitoring role `hxr.monitor-cvmfs` takes longer than 2 minutes to run (the Telegraf timeout for this script) due to misbehaving CVMFS servers and serial execution. This results in no measurements being registered.

```
Jul 25 13:52:00 cvmfs1-ufr0.internal.galaxyproject.eu telegraf[2616631]: 2024-07-25T11:52:00Z E! [inputs.exec] Error in plugin: exec: command timed out for command "/usr/bin/check_cvmfs_repos": /usr/bin/check_cvmfs_repos: line 9: [: : integer expression expected...
```

Add a timeout to the `curl` calls in the `check_cvmfs_repos` script from the CVMFS monitoring role `hxr.monitor-cvmfs`, and run all `check_repo` calls in parallel so that the script is guaranteed to exit before Telegraf's 2-minute timeout.
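
For illustration, the pattern described above can be sketched roughly as follows. This is not the role's actual template: the URL layout, the timeout values, the output format, and the `check_all` helper are assumptions made for the example. Only `check_repo`, `curl`, and `wait` come from this PR.

```shell
#!/usr/bin/env bash
# Sketch only: URL layout, timeout values and output format are hypothetical.

# Probe one repository. curl gives up after at most 10 seconds in total.
check_repo() {
    local server="$1" repo="$2"
    # --connect-timeout bounds the connection phase,
    # --max-time bounds the whole transfer.
    if curl --silent --fail --connect-timeout 5 --max-time 10 \
            --output /dev/null "${server}/cvmfs/${repo}/.cvmfspublished"; then
        echo "cvmfs_check,repo=${repo} ok=1"
    else
        echo "cvmfs_check,repo=${repo} ok=0"
    fi
}

# Hypothetical helper: start every probe as a background job,
# then block until all of them have finished.
check_all() {
    local server="$1"
    shift
    local repo
    for repo in "$@"; do
        check_repo "$server" "$repo" &
    done
    wait
}
```

Called as `check_all http://cvmfs.example.org repo-a.example.org repo-b.example.org`, the total runtime is bounded by the slowest single probe (about 10 seconds here) rather than by the sum of all probes, which is what keeps the script under the Telegraf timeout.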

kysrpex commented Jul 25, 2024

[Screenshot]

The outcome is a fixed dashboard.

{% endfor %}
{% endfor %}

wait

nifty! Nice trick! TIL

@bgruening bgruening merged commit c6b1447 into master Jul 25, 2024
2 checks passed
@kysrpex kysrpex deleted the hxr.monitor-cvmfs_timeout branch July 26, 2024 07:59