Xymon Mailing List Archive

xymonnet Fatal signal caught

6 messages in this thread

list Scott J Post Sr · Sun, 8 Jul 2012 00:04:17 -0500 ·
I am testing Xymon 4.3.7 and trying to monitor about 3000 host PCs.  Once I
go over 2000 hosts, xymonnet crashes continually.  The OS is openSUSE
12.1.

 
I am simply trying to monitor the systems for connectivity right now, using
fping.

 
Is there a limit on the number of hosts that Xymon can monitor?

 
Are there any other options that I can look at to see what is causing the
error?
list Jeremy Laidman · Mon, 9 Jul 2012 14:04:58 +1000 ·
Does xymonnet give a core dump, or any log messages?  Does xymonnet crash
when starting to probe, or part-way through?
list Scott Post · Fri, 13 Jul 2012 10:43:53 -0500 ·
Sorry for the delay in responding to this:

There does not appear to be any core file or log messages that indicate
why it crashes.

However, most of the hosts that I am pinging are DHCP clients, and I am
seeing a lot of cases where xymonnet is unable to resolve the hostname.

Statistics:
 Hosts total           :     2552
 Hosts with no tests   :        0
 Total test count      :     2554
 Status messages       :     2555
 Alert status msgs     :        0
 Transmissions         :       28

DNS statistics:
 # hostnames resolved  :     2553
 # succesful           :     3902
 # failed              :     1202
 # calls to dnsresolve :     2554

TCP test statistics:
 # TCP tests total     :        2
 # HTTP tests          :        1
 # Simple TCP tests    :        1
 # Connection attempts :        2
 # bytes written       :      128
 # bytes read          :      320

......
<list of hosts.. cannot resolve hostnames>
WARNING: Runtime 621 longer than time limit (300)


TIME SPENT
Event                                           Start time          Duration
xymonnet startup                             574671.360650                 -
Service definitions loaded                   574671.436934          0.076284
Tests loaded, hostname lookups done          575212.690797        541.253862
Test engine setup completed                  575212.757481          0.066683
TCP tests completed                          575213.058966          0.301485
PING test completed (2552 hosts)             575282.431094         69.372128
PING test results sent                       575291.815708          9.384613
Test result collection completed             575291.859808          0.044100
LDAP test engine setup completed             575291.859810          0.000001
LDAP tests executed                          575291.859814          0.000003
LDAP tests result collection completed       575291.859816          0.000001
Test results transmitted                     575292.261780          0.401964
xymonnet completed                           575292.522109          0.260329
TIME TOTAL                                                        621.161459
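
To make the timing table easier to read, the share of the run spent in each phase can be worked out from the Duration column. The following is only a minimal sketch: the figures are copied from the table above, and the function name `phase_shares` is hypothetical, not part of Xymon.

```python
# Durations (seconds) copied from the "TIME SPENT" table above.
DURATIONS = {
    "Service definitions loaded": 0.076284,
    "Tests loaded, hostname lookups done": 541.253862,
    "Test engine setup completed": 0.066683,
    "TCP tests completed": 0.301485,
    "PING test completed (2552 hosts)": 69.372128,
    "PING test results sent": 9.384613,
    "Test results transmitted": 0.401964,
    "xymonnet completed": 0.260329,
}
TIME_TOTAL = 621.161459  # "TIME TOTAL" line of the same report


def phase_shares(durations, total):
    """Return (phase, percent of total) pairs, largest share first."""
    shares = [(name, 100.0 * secs / total) for name, secs in durations.items()]
    return sorted(shares, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, pct in phase_shares(DURATIONS, TIME_TOTAL):
        print(f"{pct:6.1f}%  {name}")
```

On these figures, the hostname lookup phase accounts for roughly 87% of the 621 seconds, and the ping test itself for about 11%.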


I make a backup of the hosts.cfg file before adding hosts, so when it
crashes, I just restore the last working hosts.cfg file.  I am trying
to add hosts in smaller increments to determine exactly at what point
the program crashes again.
list Henrik Størner · Sat, 14 Jul 2012 11:58:16 +0200 ·
On 13-07-2012 17:43, Scott Post wrote:
> There does not appear to be any core file or log messages that
> indicate why it crashes,

They should appear in the ~xymon/data/tmp/ directory.

> However, most of the hosts that I am pinging are DHCP and I am seeing a
> lot of issues where it is unable to resolve hostname.
> ......
> <list of hosts.. cannot resolve hostnames>
> WARNING: Runtime 621 longer than time limit (300)

Definitely some issue with DNS lookups. I would be interested to hear
whether the soon-to-be-released 4.3.8 version works better for you. You
can download it from
http://xymon.svn.sourceforge.net/viewvc/xymon/branches/4.3.8/?view=tar

> Is there a limit on the number of hosts that Xymon can monitor?

No.

> Are there any other options that I can look at to see what is
> causing the error?

Anything in the xymonnet logfile?


Regards,
Henrik
list Scott Post · Mon, 16 Jul 2012 18:25:41 -0500 ·
I can definitely try downloading it and updating.

I don't have a ~xymon/data/tmp/ directory.
list Scott Post · Wed, 25 Jul 2012 14:25:30 -0500 ·
OK, I tried loading all the hosts again today and now have a core file.

I know that I have a lot of hosts that are failing DNS resolves, but that
is a whole different issue that we are working on.
Some hosts have been "hard-coded" to an IP address, and this is not
updating DNS.

Thus far, I have found that I can resolve some of the names from Windows,
but not the Linux (Suse) Xymon server.

I have created a VBScript that pulls a list of all the hosts that I want
from Active Directory and then resolves each hostname to an IP address.

It builds the config file and then I can just upload it to the Xymon server.
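
The VBScript itself was not posted. As a rough illustration of the same idea (resolve each name once, up front, and write the result into the config file), here is a minimal Python sketch. It assumes the usual `IP hostname # tags` line layout of hosts.cfg. The function names and the output file name `hosts.cfg.generated` are hypothetical, and the Active Directory query is left out: the host list is simply passed in.

```python
import socket
from typing import Callable, Iterable, List, Optional, Tuple


def resolve(hostname: str) -> Optional[str]:
    """Return the IPv4 address for hostname, or None if lookup fails."""
    try:
        return socket.gethostbyname(hostname)
    except OSError:  # socket.gaierror is a subclass of OSError
        return None


def build_hosts_cfg(
    hostnames: Iterable[str],
    resolver: Callable[[str], Optional[str]] = resolve,
) -> Tuple[List[str], List[str]]:
    """Build hosts.cfg lines; also return the names that did not resolve."""
    lines: List[str] = []
    unresolved: List[str] = []
    for name in hostnames:
        ip = resolver(name)
        if ip is None:
            unresolved.append(name)
        else:
            # Assumed layout: "<ip> <hostname> # <tags>", with no tags here.
            lines.append(f"{ip} {name} #")
    return lines, unresolved


if __name__ == "__main__":
    # Hypothetical host list; in the thread it comes from Active Directory.
    lines, missing = build_hosts_cfg(["localhost"])
    with open("hosts.cfg.generated", "w") as cfg:
        cfg.write("\n".join(lines) + "\n")
    print(f"{len(lines)} hosts written, {len(missing)} unresolved")
```

Keeping the unresolved names in a separate list makes it easy to review them by hand instead of silently adding hosts that the Xymon server cannot look up.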