We are running 4 pollers: 3 virtualized and one on a bare-metal server. All of them are up to date on v1.1.7 and run properly most of the time.
Occasionally (every 10 days or so) each of them will start failing pretty consistently. A reboot always fixes it.
When I log in to a failing poller and run “getWork.php”, I get:
Skipping ICMP polling, as it is still pending.
Skipping SNMP polling, as it is still pending.
Enqueued device mapping job and got token 771e7......
which seems to be what I’m supposed to get when it’s working.
I can run resetPolling.php and then getWork.php to get a new instruction set, but the poller continues to fail in Sonar with:
Pollers a minute ago
Poller "Poller2-VM AP7-Tik3 (4.5-7-8-9.14-13-14-15-16-32-36-39-60)" has not returned any network monitoring data to Sonar since Dec 11, 2018 06:08:07. Please check it for errors, or disable it to stop receiving this alert.
Pollers a minute ago
Poller "Poller1-VM AP7-TIK4 (2-3-4-18-19-20-29-30-34-35)" has not returned any network monitoring data to Sonar since Dec 11, 2018 06:51:09. Please check it for errors, or disable it to stop receiving this alert.
Is there anything I can script to make the pollers a little more reliable, aside from a daily/weekly reboot?
Also, a side note: the pollers are the only things running on the virtualized Ubuntu 16.04 VMs.
– EDIT Output of redis server status –
● redis-server.service - Advanced key-value store
   Loaded: loaded (/lib/systemd/system/redis-server.service; enabled; vendor preset: enabled)
   Active: active (running) since Tue 2018-12-11 06:51:08 MST; 6h ago
     Docs: http://redis.io/documentation, man:redis-server(1)
 Main PID: 30230 (redis-server)
    Tasks: 3
   Memory: 7.8M
      CPU: 19.448s
   CGroup: /system.slice/redis-server.service
           └─30230 /usr/bin/redis-server 127.0.0.1:6379

Dec 11 06:51:08 poller1 systemd: Starting Advanced key-value store...
Dec 11 06:51:08 poller1 run-parts: run-parts: executing /etc/redis/redis-server.pre-up.d/00_example
Dec 11 06:51:08 poller1 run-parts: run-parts: executing /etc/redis/redis-server.post-up.d/00_example
Dec 11 06:51:08 poller1 systemd: Started Advanced key-value store.
– ^ – It looks like the Redis server restarted at the same timestamp as Poller1's last update in Sonar (06:51:08 vs. 06:51:09), but systemd reports that it is still running.
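
To check that correlation without eyeballing it, a rough Python sketch like the one below could compare the two timestamps. It is only an illustration: the function names (redis_started_at, last_report_times, coincides) are made up for this example, the regular expressions are based solely on the output pasted above, and it assumes the Sonar alert time and the systemd time are in the same timezone, which the alert text does not confirm.

```python
import re
from datetime import datetime, timedelta

# Patterns derived only from the pasted output; other versions may format these lines differently.
ACTIVE_RE = re.compile(
    r"Active: active \(running\) since \w{3} (\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})"
)
ALERT_RE = re.compile(
    r'Poller "([^"]+)" has not returned any network monitoring data '
    r"to Sonar since (\w{3} \d{1,2}, \d{4} \d{2}:\d{2}:\d{2})"
)


def redis_started_at(status_text):
    """Return the 'Active: ... since' time from systemctl status text, or None."""
    match = ACTIVE_RE.search(status_text)
    if match is None:
        return None
    return datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")


def last_report_times(alert_text):
    """Map each poller name in the alert text to its last-report time."""
    return {
        name: datetime.strptime(stamp, "%b %d, %Y %H:%M:%S")
        for name, stamp in ALERT_RE.findall(alert_text)
    }


def coincides(redis_start, last_report, tolerance=timedelta(seconds=5)):
    """True if the two times are within `tolerance` of each other.

    Assumption: both timestamps are naive local times in the same timezone.
    """
    return abs(redis_start - last_report) <= tolerance


if __name__ == "__main__":
    status = "Active: active (running) since Tue 2018-12-11 06:51:08 MST; 6h ago"
    alerts = (
        'Poller "Poller1-VM AP7-TIK4 (2-3-4-18-19-20-29-30-34-35)" has not returned '
        "any network monitoring data to Sonar since Dec 11, 2018 06:51:09."
    )
    started = redis_started_at(status)
    for name, seen in last_report_times(alerts).items():
        print(name, "coincides with Redis start:", coincides(started, seen))
```

With the values above, Poller1's last report is one second after the Redis start time, while Poller2's last report (06:08:07) is not close to it. The Redis status shown is from poller1 only, so Poller2 would need its own status output for a fair comparison.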