Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Thold 1.8.1 #675

Open
botts99 opened this issue Jun 8, 2024 · 10 comments
Open

Thold 1.8.1 #675

botts99 opened this issue Jun 8, 2024 · 10 comments

Comments

@botts99
Copy link

botts99 commented Jun 8, 2024

Describe the bug
A clear and concise description of what the bug is.
It appears running thold 1.8.1 on cacti 1.2.26 with remote pollers causes an issue where one a large amount of devices go offline, it never recovers and sends out emails about downed hosts but never stop. Disabling the plugin stops the emails.

Screenshots
If applicable, add screenshots to help explain your problem.

Plugin (please complete the following information):

  • Version: [1.8.1]
  • Source: [ github]
    downed host event emails are sent
    image

Most devices recover but 4 do not.
image

Thold continues to crash and time out so alerting continues untill disabled.
image

Login and disable thold and recovers
image

I did just notice though that thold comes back onine for some reason so I had to disable it a second time.

Prior to those events it runs and processes fine.

image

@bmfmancini
Copy link
Member

bmfmancini commented Jun 8, 2024 via email

@botts99
Copy link
Author

botts99 commented Jun 8, 2024 via email

@botts99
Copy link
Author

botts99 commented Jun 8, 2024

Here is more of the log strange that it cant pull data but if I go to those devices they are online.

Seems to do that over and over till I disable thold and turn back on.

One a side note, I updated thold from 1.6.0 to the new version just a few days ago. Had no issues on 1.6.0 but assume there have been many changes since then.

image

@botts99
Copy link
Author

botts99 commented Jun 8, 2024

Logs from the remote poller

Capture

@TheWitness
Copy link
Member

@botts99,

What is the availability method that you are using for these devices?

@TheWitness
Copy link
Member

Also, what are those snmp2_get() calls from? Some script I imagine. Are you using the Notification Queue in the latest THOLD also, have you selected to receive a single Email notification?

image

@TheWitness
Copy link
Member

These is also a feature that was withdrawn due to timing about suspending notification when X devices at a site go down. Next release I guess.

@botts99
Copy link
Author

botts99 commented Jun 20, 2024

I upgraded from 1.60 to the current version so perhaps somthing didnt update correctly. I remove everything with thold and resetup the the currnet release and have not had an issue since. I am wondeing if somthing didnt populate right to the remote poller and since I did a complete remove and setup if that corrected those issue.

image

@TheWitness
Copy link
Member

Well, it's not working, but it will soon. Look for a commit. I'll reference it here.

@TheWitness
Copy link
Member

Okay, you should have better luck now. Some of your Emails may be coming from Monitor. It's something you can disable too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants