Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Snapstats Collector not properly handling connection errors to remote graphite server #2

Closed
Dieterbe opened this issue Sep 2, 2017 · 2 comments

Comments

@Dieterbe
Copy link

Dieterbe commented Sep 2, 2017

not sure if this is the correct repo, but last night I got paged for our grafanalabs alert [Alerting] Disabled Snap Tasks - task-agent-2 alert. the chart did not show any data .

looking at the snap tasks I noticed that the Snapstats Collector had this "last error":

Error: dial tcp xxx.xxx.xxx.xxx:2003: i/o timeout

so it seems to me this task needs to try harder to reconnect to the graphite server after it encounters a timeout, connection drop, etc. it should buffer data and try to flush later.
i presume the source code for that task lives in this repo but i'm not sure.

in the meantime I will change the "no data" condition to result in "no data" instead of "alerting".
we can change it back when this is fixed.

@woodsaj
Copy link

woodsaj commented Sep 3, 2017

The collector plugins dont send metrics to TSDB's/ that is the job of publisher plugins. In this case that would be the snap-plugin-publisher-graphite

@woodsaj woodsaj closed this as completed Sep 3, 2017
@Dieterbe
Copy link
Author

Dieterbe commented Sep 3, 2017

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants