xymonnet Fatal signal caught
list Scott J Post Sr
I am testing Xymon 4.3.7 and I am trying to monitor about 3000 host pcs. If I get over 2000 hosts, then xymonnet continually crashes. OS is Opensuse 12.1. I am simply trying to monitor the systems for connectivity right now, using fping. Is there a limit on the number of hosts that Xymon can monitor? Are there any other options that I can look at to see what is causing the error?
list Jeremy Laidman
Does xymonnet give a core dump, or any log messages? Does xymonnet crash when starting to probe, or part-way through?
▸
On Sun, Jul 8, 2012 at 3:04 PM, Scott J Post Sr <user-0db42f24127d@xymon.invalid> wrote:
I am testing Xymon 4.3.7 and I am trying to monitor about 3000 host pcs. If I get over 2000 hosts, then xymonnet continually crashes. OS is Opensuse 12.1.**** ** ** I am simply trying to monitor the systems for connectivity right now, using fping.**** ** ** Is there a limit on the number of hosts that Xymon can monitor?**** ** ** Are there any other options that I can look at to see what is causing the error? ****
list Scott Post
Sorry for the delay in responding to this: There does not appear to be any core file or log messagess that indicated why it crashes, However, most of the hosts that I am pinging are DHCP and I am seeing a lot of issues where is it unable to resolve hostname. Statistics: Hosts total : 2552 Hosts with no tests : 0 Total test count : 2554 Status messages : 2555 Alert status msgs : 0 Transmissions : 28 DNS statistics: # hostnames resolved : 2553 # succesful : 3902 # failed : 1202 # calls to dnsresolve : 2554 TCP test statistics: # TCP tests total : 2 # HTTP tests : 1 # Simple TCP tests : 1 # Connection attempts : 2 # bytes written : 128 # bytes read : 320 ...... <list of hosts.. cannot resolve hostnames> WARNING: Runtime 621 longer than time limit (300) TIME SPENT Event Start time Duration xymonnet startup 574671.360650 - Service definitions loaded 574671.436934 0.076284 Tests loaded, hostname lookups done 575212.690797 541.253862 Test engine setup completed 575212.757481 0.066683 TCP tests completed 575213.058966 0.301485 PING test completed (2552 hosts) 575282.431094 69.372128 PING test results sent 575291.815708 9.384613 Test result collection completed 575291.859808 0.044100 LDAP test engine setup completed 575291.859810 0.000001 LDAP tests executed 575291.859814 0.000003 LDAP tests result collection completed 575291.859816 0.000001 Test results transmitted 575292.261780 0.401964 xymonnet completed 575292.522109 0.260329 TIME TOTAL 621.161459 I make backups of the hosts.cfg file before adding hosts, so when it crashes, I just restore the last working hosts.cfg file. I am trying to add hosts in smaller increments to determine just at what point the the program crashes again.
▸
On Sun, Jul 8, 2012 at 12:04 AM, Scott J Post Sr <user-0db42f24127d@xymon.invalid> wrote:
I am testing Xymon 4.3.7 and I am trying to monitor about 3000 host pcs. If I get over 2000 hosts, then xymonnet continually crashes. OS is Opensuse 12.1.**** ** ** I am simply trying to monitor the systems for connectivity right now, using fping.**** ** ** Is there a limit on the number of hosts that Xymon can monitor?**** ** ** Are there any other options that I can look at to see what is causing the error? ****
list Henrik Størner
▸
On 13-07-2012 17:43, Scott Post wrote:
There does not appear to be any core file or log messagess that indicated why it crashes,
They should appear in the ~xymon/data/tmp/ directory
▸
However, most of the hosts that I am pinging are DHCP and I am seeing a lot of issues where is it unable to resolve hostname. ...... <list of hosts.. cannot resolve hostnames> WARNING: Runtime 621 longer than time limit (300)
Definitely some issue with DNS lookups. I would be interested to hear if the soon-to-be-released 4.3.8 version works better for you ? You can download it from http://xymon.svn.sourceforge.net/viewvc/xymon/branches/4.3.8/?view=tar
Is there a limit on the number of hosts that Xymon can monitor?____
No.
▸
Are there any other options that I can look at to see what is
causing the error?Anything in the xymonnet logfile ? Regards, Henrik
list Scott Post
I can definitely try downloading it/updating I don't have a ~xymon/data/tmp/ directory
list Scott Post
Ok, I tried loading all the host again today and now have a corefile. I know that I have a lot of hosts that are failing DNS resolves, but that is a whole different issue that we are working on. Some hosts have been "hard-coded" to an IP address and this is is not updating DNS. Thus far, I have found that I can resolve some of the names from Windows, but not the Linux (Suse) Xymon server. I have create a vbscript that pulls a list of all the hosts that I want from Active Directory and then resolves the hostname to IP. It builds the config file and then I can just upload it to the Xymon server.