check_logfiles on /var/log/messages resulting in frequent socket timeouts #59

Open
ChristopherP1221 opened this issue Oct 19, 2020 · 1 comment

Comments

@ChristopherP1221

Hello,

I'm looking for some guidance. This issue has been plaguing me for a while now, and I'm almost positive it's caused by something I'm doing inefficiently.

I am using the "check_logfiles" plugin against my syslog at /var/log/messages. I wanted the granularity of defining different properties and thresholds for different patterns, so I use a separate .cfg file and a separate Nagios service check for each pattern group. These service checks produce many socket timeouts. The timeouts are not constant, but they occur intermittently all day long, on different hosts each time.

It should be noted, there are also unrelated checks that are not exhibiting the same "socket timeout" behavior.

Here are the config files in question:

check_logfiles_messages_qla_critical.cfg
@searches = (
  {
    tag => 'critical qla',
    logfile => '/var/log/messages',
    criticalpatterns => 'Abort command issued nexus',
    options => "criticalthreshold=15",
  },
);

check_logfiles_messages_qla_warning.cfg
@searches = (
  {
    tag => 'warning qla',
    logfile => '/var/log/messages',
    warningpatterns => ['QUEUE FULL detected', 'FCPort state transitioned from'],
    options => "warningthreshold=8",
  },
);

Other examples that seem to run just fine (no intermittent socket timeouts):
@searches = (
  {
    tag => 'lpfc',
    logfile => '/var/log/messages',
    criticalpatterns => 'kernel: lpfc',
  },
);

Below is how the Nagios command is issued. sudoers has already been configured. I recently added the --rununique flag to see if that would help, but it hasn't.

Any help, guidance, or insight into what this plugin is doing that I might be overlooking would be extremely helpful! For example, I know that a temporary index file gets created. Is it possible that several of these index files are being created and conflicting with each other, or somehow confusing the script?

/usr/bin/sudo /usr/lib64/nagios/plugins/check_logfiles --rununique -f /etc/nagios/plugins/check_logfiles_messages_qla_critical.cfg
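
One way to narrow this down is to time the same command by hand on an affected host and compare the result with whatever timeout the caller enforces. This is only a minimal sketch: `time_check` is a hypothetical helper (not part of check_logfiles), and the limit passed to it is an assumption that should be replaced with the timeout actually configured for the check.

```shell
# time_check is a hypothetical helper, not part of check_logfiles.
# Usage: time_check LIMIT_SECONDS COMMAND [ARGS...]
# Runs COMMAND, discards its output, and reports its exit code and
# wall-clock duration in whole seconds, flagging runs that reach LIMIT_SECONDS.
time_check() {
  local limit=$1
  shift
  local start end rc elapsed
  start=$(date +%s)
  "$@" >/dev/null 2>&1
  rc=$?
  end=$(date +%s)
  elapsed=$((end - start))
  if [ "$elapsed" -ge "$limit" ]; then
    echo "SLOW exit=$rc elapsed=${elapsed}s limit=${limit}s"
  else
    echo "OK exit=$rc elapsed=${elapsed}s limit=${limit}s"
  fi
}

# Example call (the limit of 10 is an assumed value, substitute your own):
#   time_check 10 /usr/bin/sudo /usr/lib64/nagios/plugins/check_logfiles \
#     --rununique -f /etc/nagios/plugins/check_logfiles_messages_qla_critical.cfg
```

If the hand-run command regularly reports SLOW, the plugin run itself is the likely bottleneck. If it always reports OK, the delay is more likely somewhere between the Nagios server and the host.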

@lausser
Owner

lausser commented Oct 19, 2020 via email
