Xymon Mailing List Archive search

Devmon causing core dumps

10 messages in this thread

list Vernon Everett · Fri, 31 Oct 2008 12:51:42 +0900 ·
Hi all

Devmon was causing the hobbitd_rrd module to crash and burn.
Now this could be a bug, but it could also be a PEBKAC. I am hoping somebody can assist either way.

I added a Cisco 2851 to Hobbit, using devmon.
Now here is the possible PEBKAC
Since Devmon doesn't have templates for the 2851, I used the template for the Cisco 2811. (Network guru told me they are pretty much the same, except for a few extra bells and whistles on the 2851.)

The data for the device started appearing in Hobbit, and all looked good. Devmon even created the rrd files for the new Cisco device.

However, the hobbitd_rrd module started core dumping, and the Hobbit server page started displaying red for hobbitd_rrd with the crash detected message.
See core data below.
Took the new Cisco device out of Hobbit, and cores stopped, and life was good again.

Is there a significant enough difference between the 2851 and the 2811 to cause this, or are we looking at a genuine bug?
I am leaning towards a bug, because even if the collected data was complete rubbish, should it cause the module to core?

Regards
     Vernon

My Linux guy reckons this is the important stuff from the core.
uname -a
Linux las006 2.6.18-92.1.1.el5 #1 SMP Thu May 22 09:01:47 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux cat /etc/redhat-release Red Hat Enterprise Linux Client release 5.2 (Tikanga)

gdb -c core.8550 /usr/lib/hobbit/server/bin/hobbitd_rrd
GNU gdb Red Hat Linux (6.5-37.el5_2.1rh) Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu"...Using host libthread_db library "/lib64/libthread_db.so.1".

Reading symbols from /usr/lib64/librrd.so.2...done.
Loaded symbols for /usr/lib64/librrd.so.2 Reading symbols from /usr/lib64/libpng12.so.0...done.
Loaded symbols for /usr/lib64/libpng12.so.0 Reading symbols from /lib64/libpcre.so.0...done.
Loaded symbols for /lib64/libpcre.so.0
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /usr/lib64/libfreetype.so.6...done.
Loaded symbols for /usr/lib64/libfreetype.so.6 Reading symbols from /usr/lib64/libz.so.1...done.
Loaded symbols for /usr/lib64/libz.so.1
Reading symbols from /usr/lib64/libart_lgpl_2.so.2...done.
Loaded symbols for /usr/lib64/libart_lgpl_2.so.2 Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib64/libm.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2 Core was generated by `hobbitd_rrd --rrddir=/var/lib/hobbit/rrd --debug'.
Program terminated with signal 6, Aborted.
#0  0x0000003db7a30155 in raise () from /lib64/libc.so.6
(gdb) bt
#0  0x0000003db7a30155 in raise () from /lib64/libc.so.6
#1  0x0000003db7a31bf0 in abort () from /lib64/libc.so.6
#2  0x00000000004119f3 in sigsegv_handler (signum=<value optimized out>) at sig.c:57
#3  <signal handler called>
#4  0x0000003db7a77ac0 in strcat () from /lib64/libc.so.6
#5  0x000000000040462a in do_devmon_rrd (hostname=0x2ada311e2806 "PERIR205", testname=0x2ada311e280f "if_load", msg=<value optimized out>, tstamp=<value optimized out>)
    at rrd/do_devmon.c:87
#6  0x000000000040b656 in update_rrd (hostname=0x2ada311e2806 "PERIR205", testname=0x2ada311e280f "if_load",
    msg=0x2ada311e2842 "status PERIR205.if_load green Fri Oct 31 10:31:39 2008", tstamp=1225416699, sender=<value optimized out>, ldef=0xfeffffffffffff00) at do_rrd.c:372
#7  0x000000000040261d in main (argc=<value optimized out>, argv=0x7fff7a088318) at hobbitd_rrd.c:153
(gdb)


NOTICE: This email and any attachments are confidential. 
They may contain legally privileged information or 
copyright material. You must not read, copy, use or 
disclose them without authorisation. If you are not an 
intended recipient, please contact us at once by return 
email and then delete both messages and all attachments.
list Buchan Milne · Fri, 31 Oct 2008 12:15:27 +0200 ·
quoted from Vernon Everett
On Friday 31 October 2008 05:51:42 Everett, Vernon wrote:
Hi all

Devmon was causing the hobbitd_rrd module to crash and burn.
Now this could be a bug, but it could also be a PEBKAC. I am hoping
somebody can assist either way.

I added a Cisco 2851 to Hobbit, using devmon.
Now here is the possible PEBKAC
Since Devmon doesn't have templates for the 2851, I used the template for
the Cisco 2811. (Network guru told me they are pretty much the same, except
for a few extra bells and whistles on the 2851.)

The data for the device started appearing in Hobbit, and all looked good.
Devmon even created the rrd files for the new Cisco device.

However, the hobbitd_rrd module started core dumping, and the Hobbit server
page started displaying red for hobbitd_rrd with the crash detected
message. See core data below.
Took the new Cisco device out of Hobbit, and cores stopped, and life was
good again.

Is there a significant enough difference between the 2851 and the 2811 to
cause this, or are we looking at a genuine bug?
Real bug. I see it on the temperature tests on a new IOS.
quoted from Vernon Everett
I am leaning towards a bug,
because even if the collected data was complete rubbish, should it cause
the module to core?

Regards
     Vernon

My Linux guy reckons this is the important stuff from the core.
uname -a
Linux las006 2.6.18-92.1.1.el5 #1 SMP Thu May 22 09:01:47 EDT 2008 x86_64
x86_64 x86_64 GNU/Linux cat /etc/redhat-release Red Hat Enterprise Linux
Client release 5.2 (Tikanga)

gdb -c core.8550 /usr/lib/hobbit/server/bin/hobbitd_rrd
GNU gdb Red Hat Linux (6.5-37.el5_2.1rh) Copyright (C) 2006 Free Software
Foundation, Inc. GDB is free software, covered by the GNU General Public
License, and you are welcome to change it and/or distribute copies of it
under certain conditions. Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu"...Using host
libthread_db library "/lib64/libthread_db.so.1".

Reading symbols from /usr/lib64/librrd.so.2...done.
Loaded symbols for /usr/lib64/librrd.so.2 Reading symbols from
/usr/lib64/libpng12.so.0...done. Loaded symbols for
/usr/lib64/libpng12.so.0 Reading symbols from /lib64/libpcre.so.0...done.
Loaded symbols for /lib64/libpcre.so.0
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /usr/lib64/libfreetype.so.6...done.
Loaded symbols for /usr/lib64/libfreetype.so.6 Reading symbols from
/usr/lib64/libz.so.1...done. Loaded symbols for /usr/lib64/libz.so.1
Reading symbols from /usr/lib64/libart_lgpl_2.so.2...done.
Loaded symbols for /usr/lib64/libart_lgpl_2.so.2 Reading symbols from
/lib64/libm.so.6...done. Loaded symbols for /lib64/libm.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2 Core was generated by
`hobbitd_rrd --rrddir=/var/lib/hobbit/rrd --debug'. Program terminated with
signal 6, Aborted.
#0  0x0000003db7a30155 in raise () from /lib64/libc.so.6
(gdb) bt
#0  0x0000003db7a30155 in raise () from /lib64/libc.so.6
#1  0x0000003db7a31bf0 in abort () from /lib64/libc.so.6
#2  0x00000000004119f3 in sigsegv_handler (signum=<value optimized out>) at
sig.c:57 #3  <signal handler called>
#4  0x0000003db7a77ac0 in strcat () from /lib64/libc.so.6
#5  0x000000000040462a in do_devmon_rrd (hostname=0x2ada311e2806
"PERIR205", testname=0x2ada311e280f "if_load", msg=<value optimized out>,
tstamp=<value optimized out>) at rrd/do_devmon.c:87
#6  0x000000000040b656 in update_rrd (hostname=0x2ada311e2806 "PERIR205",
testname=0x2ada311e280f "if_load", msg=0x2ada311e2842 "status
PERIR205.if_load green Fri Oct 31 10:31:39 2008", tstamp=1225416699,
sender=<value optimized out>, ldef=0xfeffffffffffff00) at do_rrd.c:372 #7 
0x000000000040261d in main (argc=<value optimized out>,
argv=0x7fff7a088318) at hobbitd_rrd.c:153 (gdb)

Could you show the Devmon RRD section of the message for the if_load test on 
the PERIR205 host? I can confirm the cause, and maybe offer a workaround.

I am actually (constantly) reproducing the issue on my workstation against the 
new IOS that can trigger this, I have a workaround in place in production, and 
was hoping to get around to fixing this next week.

Regards,
Buchan
list Robert Holden · Fri, 31 Oct 2008 08:35:44 -0700 ·
I have seen this as well.  I finally determined it was caused by ATM
interfaces.  Devmon does not give different components of an ATM circuit
(the physical interface, the -atm layer, .0 sub interface, -aal5 layer)
unique names.  So rrd was receiving data for 5 interfaces all with the same
name.  As a temporary interface, I stopped monitoring the atm interfaces,
but this is a bug.

Interface names:
ATM5/0/0
ATM5/0/0-atm layer
ATM5/0/0.0-atm subif
ATM5/0/0-aal5 layer
ATM5/0/0.0-aal5 layer

Devmon sees these all as: ATM5/0/0  because devmon templates (atleast for
6509's) are looking at ifName as the main identifier, which is not always
unique.  Not sure on a solution yet.  MRTG uses ifIndex as it's unique key.

Robert
quoted from Buchan Milne

On Fri, Oct 31, 2008 at 3:15 AM, Buchan Milne <user-9b139aff4dec@xymon.invalid>wrote:
On Friday 31 October 2008 05:51:42 Everett, Vernon wrote:
Hi all

Devmon was causing the hobbitd_rrd module to crash and burn.
Now this could be a bug, but it could also be a PEBKAC. I am hoping
somebody can assist either way.

I added a Cisco 2851 to Hobbit, using devmon.
Now here is the possible PEBKAC
Since Devmon doesn't have templates for the 2851, I used the template for
the Cisco 2811. (Network guru told me they are pretty much the same,
except
for a few extra bells and whistles on the 2851.)

The data for the device started appearing in Hobbit, and all looked good.
Devmon even created the rrd files for the new Cisco device.

However, the hobbitd_rrd module started core dumping, and the Hobbit
server
page started displaying red for hobbitd_rrd with the crash detected
message. See core data below.
Took the new Cisco device out of Hobbit, and cores stopped, and life was
good again.

Is there a significant enough difference between the 2851 and the 2811 to
cause this, or are we looking at a genuine bug?
Real bug. I see it on the temperature tests on a new IOS.
I am leaning towards a bug,
because even if the collected data was complete rubbish, should it cause
the module to core?

Regards
     Vernon

My Linux guy reckons this is the important stuff from the core.
uname -a
Linux las006 2.6.18-92.1.1.el5 #1 SMP Thu May 22 09:01:47 EDT 2008 x86_64
x86_64 x86_64 GNU/Linux cat /etc/redhat-release Red Hat Enterprise Linux
Client release 5.2 (Tikanga)

gdb -c core.8550 /usr/lib/hobbit/server/bin/hobbitd_rrd
GNU gdb Red Hat Linux (6.5-37.el5_2.1rh) Copyright (C) 2006 Free Software
Foundation, Inc. GDB is free software, covered by the GNU General Public
License, and you are welcome to change it and/or distribute copies of it
under certain conditions. Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for
details.
This GDB was configured as "x86_64-redhat-linux-gnu"...Using host
libthread_db library "/lib64/libthread_db.so.1".

Reading symbols from /usr/lib64/librrd.so.2...done.
Loaded symbols for /usr/lib64/librrd.so.2 Reading symbols from
/usr/lib64/libpng12.so.0...done. Loaded symbols for
/usr/lib64/libpng12.so.0 Reading symbols from /lib64/libpcre.so.0...done.
Loaded symbols for /lib64/libpcre.so.0
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /usr/lib64/libfreetype.so.6...done.
Loaded symbols for /usr/lib64/libfreetype.so.6 Reading symbols from
/usr/lib64/libz.so.1...done. Loaded symbols for /usr/lib64/libz.so.1
Reading symbols from /usr/lib64/libart_lgpl_2.so.2...done.
Loaded symbols for /usr/lib64/libart_lgpl_2.so.2 Reading symbols from
/lib64/libm.so.6...done. Loaded symbols for /lib64/libm.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2 Core was generated by
`hobbitd_rrd --rrddir=/var/lib/hobbit/rrd --debug'. Program terminated
with
signal 6, Aborted.
#0  0x0000003db7a30155 in raise () from /lib64/libc.so.6
(gdb) bt
#0  0x0000003db7a30155 in raise () from /lib64/libc.so.6
#1  0x0000003db7a31bf0 in abort () from /lib64/libc.so.6
#2  0x00000000004119f3 in sigsegv_handler (signum=<value optimized out>)
at
sig.c:57 #3  <signal handler called>
#4  0x0000003db7a77ac0 in strcat () from /lib64/libc.so.6
#5  0x000000000040462a in do_devmon_rrd (hostname=0x2ada311e2806
"PERIR205", testname=0x2ada311e280f "if_load", msg=<value optimized out>,
tstamp=<value optimized out>) at rrd/do_devmon.c:87
#6  0x000000000040b656 in update_rrd (hostname=0x2ada311e2806 "PERIR205",
testname=0x2ada311e280f "if_load", msg=0x2ada311e2842 "status
PERIR205.if_load green Fri Oct 31 10:31:39 2008", tstamp=1225416699,
sender=<value optimized out>, ldef=0xfeffffffffffff00) at do_rrd.c:372 #7
0x000000000040261d in main (argc=<value optimized out>,
argv=0x7fff7a088318) at hobbitd_rrd.c:153 (gdb)

Could you show the Devmon RRD section of the message for the if_load test
on
the PERIR205 host? I can confirm the cause, and maybe offer a workaround.

I am actually (constantly) reproducing the issue on my workstation against
the
new IOS that can trigger this, I have a workaround in place in production,
and
was hoping to get around to fixing this next week.

Regards,
Buchan

list Buchan Milne · Fri, 31 Oct 2008 18:19:46 +0200 ·
quoted from Robert Holden
On Friday 31 October 2008 17:35:44 Robert Holden wrote:
I have seen this as well.  I finally determined it was caused by ATM
interfaces.  Devmon does not give different components of an ATM circuit
(the physical interface, the -atm layer, .0 sub interface, -aal5 layer)
unique names.  So rrd was receiving data for 5 interfaces all with the same
name.  As a temporary interface, I stopped monitoring the atm interfaces,
but this is a bug.

Interface names:
ATM5/0/0
ATM5/0/0-atm layer
ATM5/0/0.0-atm subif
ATM5/0/0-aal5 layer
ATM5/0/0.0-aal5 layer
It's the spaces in the interface names.
quoted from Robert Holden
Devmon sees these all as: ATM5/0/0  because devmon templates (atleast for
6509's) are looking at ifName as the main identifier, which is not always
unique.
No, on earlier IOSs, there will still many interface names for ATM interfaces, 
but there were no spaces in the names. I have adjusted my template in this 
case as follows to avoid the problem with spaces:

in transforms:

ifNameFixed     : REGSUB        : {ifName} /(\S+)(\s*)?(\S+)$/$1$3/

then replace ifName with ifNameFixed in the message file.
Not sure on a solution yet.  MRTG uses ifIndex as it's unique key.
Well, this is not the issue, but using ifIndex is not a solution either.

(BTW, there isn't a bug filed on this issue, so so far I've just been wanting 
to fix this for my own setup ...)

Regards,
Buchan
list Robert Holden · Fri, 31 Oct 2008 10:55:35 -0700 ·
When I run snmpwalk on my interfaces I get the following:
(I believe that the devmon template is using this ... notice that they are
all the same.  A transform won't help)
IF-MIB::ifName.70 = STRING: AT5/0/0
IF-MIB::ifName.71 = STRING: AT5/0/0
IF-MIB::ifName.72 = STRING: AT5/0/0
IF-MIB::ifName.73 = STRING: AT5/0/0
IF-MIB::ifName.74 = STRING: AT5/0/0

(This could be used, but you would end up w/ very long names.  G0/0 would
become GigabitEthernet0/0)
IF-MIB::ifDescr.70 = STRING: ATM5/0/0
IF-MIB::ifDescr.71 = STRING: ATM5/0/0-atm layer
IF-MIB::ifDescr.72 = STRING: ATM5/0/0.0-atm subif
IF-MIB::ifDescr.73 = STRING: ATM5/0/0-aal5 layer
IF-MIB::ifDescr.74 = STRING: ATM5/0/0.0-aal5 layer

(This may be helpful ... somehow ignore all type 37 and type 49, and sub
interface 0 ??)
IF-MIB::ifType.70 = INTEGER: sonet(39)
IF-MIB::ifType.71 = INTEGER: atm(37)
IF-MIB::ifType.72 = INTEGER: atmSubInterface(134)
IF-MIB::ifType.73 = INTEGER: aal5(49)
IF-MIB::ifType.74 = INTEGER: aal5(49)

I will open a bug if you like.  Is the issue that originated this thread the
same as the one I am experiencing?
quoted from Buchan Milne

Robert

On Fri, Oct 31, 2008 at 9:19 AM, Buchan Milne <user-9b139aff4dec@xymon.invalid>wrote:
On Friday 31 October 2008 17:35:44 Robert Holden wrote:
I have seen this as well.  I finally determined it was caused by ATM
interfaces.  Devmon does not give different components of an ATM circuit
(the physical interface, the -atm layer, .0 sub interface, -aal5 layer)
unique names.  So rrd was receiving data for 5 interfaces all with the
same
name.  As a temporary interface, I stopped monitoring the atm interfaces,
but this is a bug.

Interface names:
ATM5/0/0
ATM5/0/0-atm layer
ATM5/0/0.0-atm subif
ATM5/0/0-aal5 layer
ATM5/0/0.0-aal5 layer
It's the spaces in the interface names.
Devmon sees these all as: ATM5/0/0  because devmon templates (atleast for
6509's) are looking at ifName as the main identifier, which is not always
unique.
No, on earlier IOSs, there will still many interface names for ATM
interfaces,
but there were no spaces in the names. I have adjusted my template in this
case as follows to avoid the problem with spaces:

in transforms:

ifNameFixed     : REGSUB        : {ifName} /(\S+)(\s*)?(\S+)$/$1$3/

then replace ifName with ifNameFixed in the message file.
Not sure on a solution yet.  MRTG uses ifIndex as it's unique key.
Well, this is not the issue, but using ifIndex is not a solution either.

(BTW, there isn't a bug filed on this issue, so so far I've just been
wanting
to fix this for my own setup ...)

Regards,
Buchan
list Buchan Milne · Mon, 3 Nov 2008 09:46:42 +0200 ·
quoted from Robert Holden
On Friday 31 October 2008 19:55:35 Robert Holden wrote:
When I run snmpwalk on my interfaces I get the following:
(I believe that the devmon template is using this ... notice that they are
all the same.
Depending on the SNMP implementation (and may differ on different IOS 
versions). 
quoted from Robert Holden
A transform won't help)
IF-MIB::ifName.70 = STRING: AT5/0/0
IF-MIB::ifName.71 = STRING: AT5/0/0
IF-MIB::ifName.72 = STRING: AT5/0/0
IF-MIB::ifName.73 = STRING: AT5/0/0
IF-MIB::ifName.74 = STRING: AT5/0/0
One of our 7613's has:

IF-MIB::ifName.1 = STRING: Gi3/1
[...]
IF-MIB::ifName.128 = STRING: ATM10/1/0.0-atm subif
IF-MIB::ifName.129 = STRING: ATM10/1/0-aal5 layer
IF-MIB::ifName.130 = STRING: ATM10/1/0.0-aal5 layer
IF-MIB::ifName.131 = STRING: VLAN-1
quoted from Robert Holden
(This could be used, but you would end up w/ very long names.  G0/0 would
become GigabitEthernet0/0)
IF-MIB::ifDescr.70 = STRING: ATM5/0/0
IF-MIB::ifDescr.71 = STRING: ATM5/0/0-atm layer
IF-MIB::ifDescr.72 = STRING: ATM5/0/0.0-atm subif
IF-MIB::ifDescr.73 = STRING: ATM5/0/0-aal5 layer
IF-MIB::ifDescr.74 = STRING: ATM5/0/0.0-aal5 layer
On the same device as above:

IF-MIB::ifDescr.1 = STRING: GigabitEthernet3/1
[...]
IF-MIB::ifDescr.128 = STRING: ATM10/1/0.0-atm subif
IF-MIB::ifDescr.129 = STRING: ATM10/1/0-aal5 layer
IF-MIB::ifDescr.130 = STRING: ATM10/1/0.0-aal5 layer
IF-MIB::ifDescr.131 = STRING: unrouted VLAN 1
quoted from Robert Holden
(This may be helpful ... somehow ignore all type 37 and type 49, and sub
interface 0 ??)
IF-MIB::ifType.70 = INTEGER: sonet(39)
IF-MIB::ifType.71 = INTEGER: atm(37)
IF-MIB::ifType.72 = INTEGER: atmSubInterface(134)
IF-MIB::ifType.73 = INTEGER: aal5(49)
IF-MIB::ifType.74 = INTEGER: aal5(49)
But, in my case, I want to graph:
IF-MIB::ifName.103 = STRING: ATM10/1/0.1-aal5 layer

(it is the only interface which shows the correct traffic)
quoted from Robert Holden
I will open a bug if you like.  Is the issue that originated this thread
the same as the one I am experiencing?
Well, there are two aspects to the bug:

1)Devmon should strip invalid characters out of interface names for the RRD 
data sent with the status message.

2)The Hobbit RRD collector module for devmon should not segfault if data is 
not in the format:

<name_without_spaces> value[:value[:value]]


Regards,
Buchan
list Buchan Milne · Fri, 7 Nov 2008 11:28:12 +0200 ·
quoted from Buchan Milne
On Monday 03 November 2008 09:46:42 Buchan Milne wrote:
On Friday 31 October 2008 19:55:35 Robert Holden wrote:
When I run snmpwalk on my interfaces I get the following:
(I believe that the devmon template is using this ... notice that they
are all the same.
Depending on the SNMP implementation (and may differ on different IOS
versions).
A transform won't help)
IF-MIB::ifName.70 = STRING: AT5/0/0
IF-MIB::ifName.71 = STRING: AT5/0/0
IF-MIB::ifName.72 = STRING: AT5/0/0
IF-MIB::ifName.73 = STRING: AT5/0/0
IF-MIB::ifName.74 = STRING: AT5/0/0
One of our 7613's has:

IF-MIB::ifName.1 = STRING: Gi3/1
[...]
IF-MIB::ifName.128 = STRING: ATM10/1/0.0-atm subif
IF-MIB::ifName.129 = STRING: ATM10/1/0-aal5 layer
IF-MIB::ifName.130 = STRING: ATM10/1/0.0-aal5 layer
IF-MIB::ifName.131 = STRING: VLAN-1
(This could be used, but you would end up w/ very long names.  G0/0 would
become GigabitEthernet0/0)
IF-MIB::ifDescr.70 = STRING: ATM5/0/0
IF-MIB::ifDescr.71 = STRING: ATM5/0/0-atm layer
IF-MIB::ifDescr.72 = STRING: ATM5/0/0.0-atm subif
IF-MIB::ifDescr.73 = STRING: ATM5/0/0-aal5 layer
IF-MIB::ifDescr.74 = STRING: ATM5/0/0.0-aal5 layer
On the same device as above:

IF-MIB::ifDescr.1 = STRING: GigabitEthernet3/1
[...]
IF-MIB::ifDescr.128 = STRING: ATM10/1/0.0-atm subif
IF-MIB::ifDescr.129 = STRING: ATM10/1/0-aal5 layer
IF-MIB::ifDescr.130 = STRING: ATM10/1/0.0-aal5 layer
IF-MIB::ifDescr.131 = STRING: unrouted VLAN 1
(This may be helpful ... somehow ignore all type 37 and type 49, and sub
interface 0 ??)
IF-MIB::ifType.70 = INTEGER: sonet(39)
IF-MIB::ifType.71 = INTEGER: atm(37)
IF-MIB::ifType.72 = INTEGER: atmSubInterface(134)
IF-MIB::ifType.73 = INTEGER: aal5(49)
IF-MIB::ifType.74 = INTEGER: aal5(49)
But, in my case, I want to graph:
IF-MIB::ifName.103 = STRING: ATM10/1/0.1-aal5 layer

(it is the only interface which shows the correct traffic)
I will open a bug if you like.  Is the issue that originated this thread
the same as the one I am experiencing?
Well, there are two aspects to the bug:

1)Devmon should strip invalid characters out of interface names for the RRD
data sent with the status message.

2)The Hobbit RRD collector module for devmon should not segfault if data is
not in the format:

<name_without_spaces> value[:value[:value]]
I fixed this one last night. You can either grab the new do_devmon.c, and run 
'make' again, and copy the hobbitd_rrd over the previous binary, or you can 
grab the new complete patch for a build from scratch:

http://devmon.svn.sourceforge.net/viewvc/devmon?view=rev&revision=90

(at present, viewvc seems to be a bit bust on sourceforge, but these should be 
the URLs to the current versions of the files when it is working:
http://devmon.svn.sourceforge.net/viewvc/devmon/trunk/extras/do_devmon.c
http://devmon.svn.sourceforge.net/viewvc/devmon/trunk/extras/hobbit-4.2.0-
devmon-complete.patch
)

I will update my Hobbit packages with this once I've upgraded my production 
Hobbit box (but the SRPM is in Mandriva cooker already).

Regards,
Buchan
list Mario Andre · Thu, 19 Nov 2009 15:49:28 -0200 ·
Hi,

Bunchan.

I'm getting hobbitd_rrd crash. I'm running 4.2.3 RC1 with your do_devmon.c
revision 154.
I figured that the core file are repeating for the same equipments. cisco
6513 and cisco 4507 for if_load.

THis could be because of the number of interfaces? What do you think?

Thanks in advance,

Mario.

Cisco 4507:

Alarming on (Gi1/1,Gi1/2,Gi2/1,Gi2/2,Gi3/1,Gi3/2,Gi3/3,Gi3/4,Gi3/5,Gi3/6,Gi3/7)
Alarming on (Gi3/8,Gi3/9,Gi3/10,Gi3/11,Gi3/12,Gi3/13,Gi3/14,Gi3/15)
Alarming on (Gi3/16,Gi3/17,Gi3/18,Gi3/19,Gi3/20,Gi3/21,Gi3/22)
Alarming on (Gi3/23,Gi3/24,Gi3/25,Gi3/26,Gi3/27,Gi3/28,Gi3/29)
Alarming on (Gi3/30,Gi3/31,Gi3/32,Gi3/33,Gi3/34,Gi3/35,Gi3/36)
Alarming on (Gi3/37,Gi3/38,Gi3/39,Gi3/40,Gi3/41,Gi3/42,Gi3/43)
Alarming on (Gi3/44,Gi3/45,Gi3/46,Gi3/47,Gi3/48,Gi4/1,Gi4/2,Gi4/3)
Alarming on (Gi4/4,Gi4/5,Gi4/6,Gi4/7,Gi4/8,Gi4/9,Gi4/10,Gi4/11)
Alarming on (Gi4/12,Gi4/13,Gi4/14,Gi4/15,Gi4/16,Gi4/17,Gi4/18)
Alarming on (Gi4/19,Gi4/20,Gi4/21,Gi4/22,Gi4/23,Gi4/24,Gi4/25)
Alarming on (Gi4/26,Gi4/27,Gi4/28,Gi4/29,Gi4/30,Gi4/31,Gi4/32)
Alarming on (Gi4/33,Gi4/34,Gi4/35,Gi4/36,Gi4/37,Gi4/38,Gi4/39)
Alarming on (Gi4/40,Gi4/41,Gi4/42,Gi4/43,Gi4/44,Gi4/45,Gi4/46)
Alarming on (Gi4/47,Gi4/48,Gi5/1,Gi5/2,Gi5/3,Gi5/4,Gi5/5,Gi5/6)
Alarming on (Gi5/7,Gi5/8,Gi5/9,Gi5/10,Gi5/11,Gi5/12,Gi5/13,Gi5/14)
Alarming on (Gi5/15,Gi5/16,Gi5/17,Gi5/18,Gi5/19,Gi5/20,Gi5/21)
Alarming on (Gi5/22,Gi5/23,Gi5/24,Gi5/25,Gi5/26,Gi5/27,Gi5/28)
Alarming on (Gi5/29,Gi5/30,Gi5/31,Gi5/32,Gi5/33,Gi5/34,Gi5/35)
Alarming on (Gi5/36,Gi5/37,Gi5/38,Gi5/39,Gi5/40,Gi5/41,Gi5/42)
Alarming on (Gi5/43,Gi5/44,Gi5/45,Gi5/46,Gi5/47,Gi5/48,Gi6/1)
Alarming on (Gi6/2,Gi6/3,Gi6/4,Gi6/5,Gi6/6,Gi6/7,Gi6/8,Gi6/9)
Alarming on (Gi6/10,Gi6/11,Gi6/12,Gi6/13,Gi6/14,Gi6/15,Gi6/16)
Alarming on (Gi6/17,Gi6/18,Gi6/19,Gi6/20,Gi6/21,Gi6/22,Gi6/23)
Alarming on (Gi6/24,Gi6/25,Gi6/26,Gi6/27,Gi6/28,Gi6/29,Gi6/30)
Alarming on (Gi6/31,Gi6/32,Gi6/33,Gi6/34,Gi6/35,Gi6/36,Gi6/37)
Alarming on (Gi6/38,Gi6/39,Gi6/40,Gi6/41,Gi6/42,Gi6/43,Gi6/44)
Alarming on (Gi6/45,Gi6/46,Gi6/47,Gi6/48)


And Cisco 6513:

Input load:  yellow=75%, red=95%
Output load: yellow=75%, red=95%
Alarming on (Gi1/1,Gi1/2,Gi2/1,Gi2/2,Gi3/1,Gi3/2,Gi3/3,Gi3/4,Gi3/5,Gi3/6,Gi3/7)
Alarming on (Gi3/8,Gi3/9,Gi3/10,Gi3/11,Gi3/12,Gi3/13,Gi3/14,Gi3/15)
Alarming on (Gi3/16,Gi3/17,Gi3/18,Gi3/19,Gi3/20,Gi3/21,Gi3/22)
Alarming on (Gi3/23,Gi3/24,Gi3/25,Gi3/26,Gi3/27,Gi3/28,Gi3/29)
Alarming on (Gi3/30,Gi3/31,Gi3/32,Gi3/33,Gi3/34,Gi3/35,Gi3/36)
Alarming on (Gi3/37,Gi3/38,Gi3/39,Gi3/40,Gi3/41,Gi3/42,Gi3/43)
Alarming on (Gi3/44,Gi3/45,Gi3/46,Gi3/47,Gi3/48,Fa6/1,Fa6/2,Fa6/3)
Alarming on (Fa6/4,Fa6/5,Fa6/6,Fa6/7,Fa6/8,Fa6/9,Fa6/10,Fa6/11)
Alarming on (Fa6/12,Fa6/13,Fa6/14,Fa6/15,Fa6/16,Fa6/17,Fa6/18)
Alarming on (Fa6/19,Fa6/20,Fa6/21,Fa6/22,Fa6/23,Fa6/24,Fa6/25)
Alarming on (Fa6/26,Fa6/27,Fa6/28,Fa6/29,Fa6/30,Fa6/31,Fa6/32)
Alarming on (Fa6/33,Fa6/34,Fa6/35,Fa6/36,Fa6/37,Fa6/38,Fa6/39)
Alarming on (Fa6/40,Fa6/41,Fa6/42,Fa6/43,Fa6/44,Fa6/45,Fa6/46)
Alarming on (Fa6/47,Fa6/48,Fa9/1,Fa9/2,Fa9/3,Fa9/4,Fa9/5,Fa9/6)
Alarming on (Fa9/7,Fa9/8,Fa9/9,Fa9/10,Fa9/11,Fa9/12,Fa9/13,Fa9/14)
Alarming on (Fa9/15,Fa9/16,Fa9/17,Fa9/18,Fa9/19,Fa9/20,Fa9/21)
Alarming on (Fa9/22,Fa9/23,Fa9/24,Fa9/25,Fa9/26,Fa9/27,Fa9/28)
Alarming on (Fa9/29,Fa9/30,Fa9/31,Fa9/32,Fa9/33,Fa9/34,Fa9/35)
Alarming on (Fa9/36,Fa9/37,Fa9/38,Fa9/39,Fa9/40,Fa9/41,Fa9/42)
Alarming on (Fa9/43,Fa9/44,Fa9/45,Fa9/46,Fa9/47,Fa9/48,Fa10/1)
Alarming on (Fa10/2,Fa10/3,Fa10/4,Fa10/5,Fa10/6,Fa10/7,Fa10/8)
Alarming on (Fa10/9,Fa10/10,Fa10/11,Fa10/12,Fa10/13,Fa10/14,Fa10/15)
Alarming on (Fa10/16,Fa10/17,Fa10/18,Fa10/19,Fa10/20,Fa10/21)
Alarming on (Fa10/22,Fa10/23,Fa10/24,Fa10/25,Fa10/26,Fa10/27)
Alarming on (Fa10/28,Fa10/29,Fa10/30,Fa10/31,Fa10/32,Fa10/33)
Alarming on (Fa10/34,Fa10/35,Fa10/36,Fa10/37,Fa10/38,Fa10/39)
Alarming on (Fa10/40,Fa10/41,Fa10/42,Fa10/43,Fa10/44,Fa10/45)
Alarming on (Fa10/46,Fa10/47,Fa10/48,Gi11/1,Gi11/2,Gi11/3,Gi11/4)
Alarming on (Gi11/5,Gi11/6,Gi11/7,Gi11/8,Gi11/9,Gi11/10,Gi11/11)
Alarming on (Gi11/12,Gi11/13,Gi11/14,Gi11/15,Gi11/16,Gi12/1,Gi12/2)
Alarming on (Gi12/3,Gi12/4,Gi12/5,Gi12/6,Gi12/7,Gi12/8,Gi12/9)
Alarming on (Gi12/10,Gi12/11,Gi12/12,Gi12/13,Gi12/14,Gi12/15)
Alarming on (Gi12/16,Gi13/1,Gi13/2,Gi13/3,Gi13/4,Gi13/5,Gi13/6)
Alarming on (Gi13/7,Gi13/8,Gi13/9,Gi13/10,Gi13/11,Gi13/12,Gi13/13)
Alarming on (Gi13/14,Gi13/15,Gi13/16,EO0/0,Po1,Po2,Po7,Po8,Gi5/1)
Alarming on (Gi5/2,Gi5/3,Gi5/4,Gi5/5,Gi5/6,Gi5/7,Gi5/8,Gi5/9)
Alarming on (Gi5/10,Gi5/11,Gi5/12,Gi5/13,Gi5/14,Gi5/15,Gi5/16)
Alarming on (Gi5/17,Gi5/18,Gi5/19,Gi5/20,Gi5/21,Gi5/22,Gi5/23)
Alarming on (Gi5/24,Gi5/25,Gi5/26,Gi5/27,Gi5/28,Gi5/29,Gi5/30)
Alarming on (Gi5/31,Gi5/32,Gi5/33,Gi5/34,Gi5/35,Gi5/36,Gi5/37)
Alarming on (Gi5/38,Gi5/39,Gi5/40,Gi5/41,Gi5/42,Gi5/43,Gi5/44)
Alarming on (Gi5/45,Gi5/46,Gi5/47,Gi5/48,Po10,Po30,Po50)
quoted from Buchan Milne


On Fri, Nov 7, 2008 at 7:28 AM, Buchan Milne <user-9b139aff4dec@xymon.invalid>wrote:
On Monday 03 November 2008 09:46:42 Buchan Milne wrote:
On Friday 31 October 2008 19:55:35 Robert Holden wrote:
When I run snmpwalk on my interfaces I get the following:
(I believe that the devmon template is using this ... notice that they
are all the same.
Depending on the SNMP implementation (and may differ on different IOS
versions).
A transform won't help)
IF-MIB::ifName.70 = STRING: AT5/0/0
IF-MIB::ifName.71 = STRING: AT5/0/0
IF-MIB::ifName.72 = STRING: AT5/0/0
IF-MIB::ifName.73 = STRING: AT5/0/0
IF-MIB::ifName.74 = STRING: AT5/0/0
One of our 7613's has:

IF-MIB::ifName.1 = STRING: Gi3/1
[...]
IF-MIB::ifName.128 = STRING: ATM10/1/0.0-atm subif
IF-MIB::ifName.129 = STRING: ATM10/1/0-aal5 layer
IF-MIB::ifName.130 = STRING: ATM10/1/0.0-aal5 layer
IF-MIB::ifName.131 = STRING: VLAN-1
(This could be used, but you would end up w/ very long names.  G0/0
would
become GigabitEthernet0/0)
IF-MIB::ifDescr.70 = STRING: ATM5/0/0
IF-MIB::ifDescr.71 = STRING: ATM5/0/0-atm layer
IF-MIB::ifDescr.72 = STRING: ATM5/0/0.0-atm subif
IF-MIB::ifDescr.73 = STRING: ATM5/0/0-aal5 layer
IF-MIB::ifDescr.74 = STRING: ATM5/0/0.0-aal5 layer
On the same device as above:

IF-MIB::ifDescr.1 = STRING: GigabitEthernet3/1
[...]
IF-MIB::ifDescr.128 = STRING: ATM10/1/0.0-atm subif
IF-MIB::ifDescr.129 = STRING: ATM10/1/0-aal5 layer
IF-MIB::ifDescr.130 = STRING: ATM10/1/0.0-aal5 layer
IF-MIB::ifDescr.131 = STRING: unrouted VLAN 1
(This may be helpful ... somehow ignore all type 37 and type 49, and
sub
interface 0 ??)
IF-MIB::ifType.70 = INTEGER: sonet(39)
IF-MIB::ifType.71 = INTEGER: atm(37)
IF-MIB::ifType.72 = INTEGER: atmSubInterface(134)
IF-MIB::ifType.73 = INTEGER: aal5(49)
IF-MIB::ifType.74 = INTEGER: aal5(49)
But, in my case, I want to graph:
IF-MIB::ifName.103 = STRING: ATM10/1/0.1-aal5 layer

(it is the only interface which shows the correct traffic)
I will open a bug if you like.  Is the issue that originated this
thread
the same as the one I am experiencing?
Well, there are two aspects to the bug:

1)Devmon should strip invalid characters out of interface names for the
RRD
data sent with the status message.

2)The Hobbit RRD collector module for devmon should not segfault if data
is
not in the format:

<name_without_spaces> value[:value[:value]]
I fixed this one last night. You can either grab the new do_devmon.c, and
run
'make' again, and copy the hobbitd_rrd over the previous binary, or you can
grab the new complete patch for a build from scratch:

http://devmon.svn.sourceforge.net/viewvc/devmon?view=rev&revision=90

(at present, viewvc seems to be a bit bust on sourceforge, but these should
be
the URLs to the current versions of the files when it is working:
http://devmon.svn.sourceforge.net/viewvc/devmon/trunk/extras/do_devmon.c
http://devmon.svn.sourceforge.net/viewvc/devmon/trunk/extras/hobbit-4.2.0-

devmon-complete.patch<http://devmon.svn.sourceforge.net/viewvc/devmon/trunk/extras/hobbit-4.2.0-%0Adevmon-complete.patch>;
quoted from Buchan Milne
)

I will update my Hobbit packages with this once I've upgraded my production
Hobbit box (but the SRPM is in Mandriva cooker already).

Regards,
Buchan

list Buchan Milne · Mon, 23 Nov 2009 16:37:13 +0100 ·
On Thursday, 19 November 2009 18:49:28 mario andre wrote:
Hi,

Bunchan.

I'm getting hobbitd_rrd crash. I'm running 4.2.3 RC1
Any reason not to use the final release of 4.2.3?
quoted from Mario Andre
with your do_devmon.c
revision 154.
I figured that the core file are repeating for the same equipments. cisco
6513 and cisco 4507 for if_load.
How did you determine this?
THis could be because of the number of interfaces? What do you think?
Unlikely. Some 6509's and 7613's I was monitoring had about the same number or 
more ports and didn't have problems.
quoted from Mario Andre
Thanks in advance,

Mario.

Cisco 4507:

Alarming on
(Gi1/1,Gi1/2,Gi2/1,Gi2/2,Gi3/1,Gi3/2,Gi3/3,Gi3/4,Gi3/5,Gi3/6,Gi3/7)
Alarming on (Gi3/8,Gi3/9,Gi3/10,Gi3/11,Gi3/12,Gi3/13,Gi3/14,Gi3/15)
Alarming on (Gi3/16,Gi3/17,Gi3/18,Gi3/19,Gi3/20,Gi3/21,Gi3/22)
Alarming on (Gi3/23,Gi3/24,Gi3/25,Gi3/26,Gi3/27,Gi3/28,Gi3/29)
Alarming on (Gi3/30,Gi3/31,Gi3/32,Gi3/33,Gi3/34,Gi3/35,Gi3/36)
Alarming on (Gi3/37,Gi3/38,Gi3/39,Gi3/40,Gi3/41,Gi3/42,Gi3/43)
Alarming on (Gi3/44,Gi3/45,Gi3/46,Gi3/47,Gi3/48,Gi4/1,Gi4/2,Gi4/3)
Alarming on (Gi4/4,Gi4/5,Gi4/6,Gi4/7,Gi4/8,Gi4/9,Gi4/10,Gi4/11)
Alarming on (Gi4/12,Gi4/13,Gi4/14,Gi4/15,Gi4/16,Gi4/17,Gi4/18)
Alarming on (Gi4/19,Gi4/20,Gi4/21,Gi4/22,Gi4/23,Gi4/24,Gi4/25)
Alarming on (Gi4/26,Gi4/27,Gi4/28,Gi4/29,Gi4/30,Gi4/31,Gi4/32)
Alarming on (Gi4/33,Gi4/34,Gi4/35,Gi4/36,Gi4/37,Gi4/38,Gi4/39)
Alarming on (Gi4/40,Gi4/41,Gi4/42,Gi4/43,Gi4/44,Gi4/45,Gi4/46)
Alarming on (Gi4/47,Gi4/48,Gi5/1,Gi5/2,Gi5/3,Gi5/4,Gi5/5,Gi5/6)
Alarming on (Gi5/7,Gi5/8,Gi5/9,Gi5/10,Gi5/11,Gi5/12,Gi5/13,Gi5/14)
Alarming on (Gi5/15,Gi5/16,Gi5/17,Gi5/18,Gi5/19,Gi5/20,Gi5/21)
Alarming on (Gi5/22,Gi5/23,Gi5/24,Gi5/25,Gi5/26,Gi5/27,Gi5/28)
Alarming on (Gi5/29,Gi5/30,Gi5/31,Gi5/32,Gi5/33,Gi5/34,Gi5/35)
Alarming on (Gi5/36,Gi5/37,Gi5/38,Gi5/39,Gi5/40,Gi5/41,Gi5/42)
Alarming on (Gi5/43,Gi5/44,Gi5/45,Gi5/46,Gi5/47,Gi5/48,Gi6/1)
Alarming on (Gi6/2,Gi6/3,Gi6/4,Gi6/5,Gi6/6,Gi6/7,Gi6/8,Gi6/9)
Alarming on (Gi6/10,Gi6/11,Gi6/12,Gi6/13,Gi6/14,Gi6/15,Gi6/16)
Alarming on (Gi6/17,Gi6/18,Gi6/19,Gi6/20,Gi6/21,Gi6/22,Gi6/23)
Alarming on (Gi6/24,Gi6/25,Gi6/26,Gi6/27,Gi6/28,Gi6/29,Gi6/30)
Alarming on (Gi6/31,Gi6/32,Gi6/33,Gi6/34,Gi6/35,Gi6/36,Gi6/37)
Alarming on (Gi6/38,Gi6/39,Gi6/40,Gi6/41,Gi6/42,Gi6/43,Gi6/44)
Alarming on (Gi6/45,Gi6/46,Gi6/47,Gi6/48)


And Cisco 6513:

Input load:  yellow=75%, red=95%
Output load: yellow=75%, red=95%
Alarming on
(Gi1/1,Gi1/2,Gi2/1,Gi2/2,Gi3/1,Gi3/2,Gi3/3,Gi3/4,Gi3/5,Gi3/6,Gi3/7)
Alarming on (Gi3/8,Gi3/9,Gi3/10,Gi3/11,Gi3/12,Gi3/13,Gi3/14,Gi3/15)
Alarming on (Gi3/16,Gi3/17,Gi3/18,Gi3/19,Gi3/20,Gi3/21,Gi3/22)
Alarming on (Gi3/23,Gi3/24,Gi3/25,Gi3/26,Gi3/27,Gi3/28,Gi3/29)
Alarming on (Gi3/30,Gi3/31,Gi3/32,Gi3/33,Gi3/34,Gi3/35,Gi3/36)
Alarming on (Gi3/37,Gi3/38,Gi3/39,Gi3/40,Gi3/41,Gi3/42,Gi3/43)
Alarming on (Gi3/44,Gi3/45,Gi3/46,Gi3/47,Gi3/48,Fa6/1,Fa6/2,Fa6/3)
Alarming on (Fa6/4,Fa6/5,Fa6/6,Fa6/7,Fa6/8,Fa6/9,Fa6/10,Fa6/11)
Alarming on (Fa6/12,Fa6/13,Fa6/14,Fa6/15,Fa6/16,Fa6/17,Fa6/18)
Alarming on (Fa6/19,Fa6/20,Fa6/21,Fa6/22,Fa6/23,Fa6/24,Fa6/25)
Alarming on (Fa6/26,Fa6/27,Fa6/28,Fa6/29,Fa6/30,Fa6/31,Fa6/32)
Alarming on (Fa6/33,Fa6/34,Fa6/35,Fa6/36,Fa6/37,Fa6/38,Fa6/39)
Alarming on (Fa6/40,Fa6/41,Fa6/42,Fa6/43,Fa6/44,Fa6/45,Fa6/46)
Alarming on (Fa6/47,Fa6/48,Fa9/1,Fa9/2,Fa9/3,Fa9/4,Fa9/5,Fa9/6)
Alarming on (Fa9/7,Fa9/8,Fa9/9,Fa9/10,Fa9/11,Fa9/12,Fa9/13,Fa9/14)
Alarming on (Fa9/15,Fa9/16,Fa9/17,Fa9/18,Fa9/19,Fa9/20,Fa9/21)
Alarming on (Fa9/22,Fa9/23,Fa9/24,Fa9/25,Fa9/26,Fa9/27,Fa9/28)
Alarming on (Fa9/29,Fa9/30,Fa9/31,Fa9/32,Fa9/33,Fa9/34,Fa9/35)
Alarming on (Fa9/36,Fa9/37,Fa9/38,Fa9/39,Fa9/40,Fa9/41,Fa9/42)
Alarming on (Fa9/43,Fa9/44,Fa9/45,Fa9/46,Fa9/47,Fa9/48,Fa10/1)
Alarming on (Fa10/2,Fa10/3,Fa10/4,Fa10/5,Fa10/6,Fa10/7,Fa10/8)
Alarming on (Fa10/9,Fa10/10,Fa10/11,Fa10/12,Fa10/13,Fa10/14,Fa10/15)
Alarming on (Fa10/16,Fa10/17,Fa10/18,Fa10/19,Fa10/20,Fa10/21)
Alarming on (Fa10/22,Fa10/23,Fa10/24,Fa10/25,Fa10/26,Fa10/27)
Alarming on (Fa10/28,Fa10/29,Fa10/30,Fa10/31,Fa10/32,Fa10/33)
Alarming on (Fa10/34,Fa10/35,Fa10/36,Fa10/37,Fa10/38,Fa10/39)
Alarming on (Fa10/40,Fa10/41,Fa10/42,Fa10/43,Fa10/44,Fa10/45)
Alarming on (Fa10/46,Fa10/47,Fa10/48,Gi11/1,Gi11/2,Gi11/3,Gi11/4)
Alarming on (Gi11/5,Gi11/6,Gi11/7,Gi11/8,Gi11/9,Gi11/10,Gi11/11)
Alarming on (Gi11/12,Gi11/13,Gi11/14,Gi11/15,Gi11/16,Gi12/1,Gi12/2)
Alarming on (Gi12/3,Gi12/4,Gi12/5,Gi12/6,Gi12/7,Gi12/8,Gi12/9)
Alarming on (Gi12/10,Gi12/11,Gi12/12,Gi12/13,Gi12/14,Gi12/15)
Alarming on (Gi12/16,Gi13/1,Gi13/2,Gi13/3,Gi13/4,Gi13/5,Gi13/6)
Alarming on (Gi13/7,Gi13/8,Gi13/9,Gi13/10,Gi13/11,Gi13/12,Gi13/13)
Alarming on (Gi13/14,Gi13/15,Gi13/16,EO0/0,Po1,Po2,Po7,Po8,Gi5/1)
Alarming on (Gi5/2,Gi5/3,Gi5/4,Gi5/5,Gi5/6,Gi5/7,Gi5/8,Gi5/9)
Alarming on (Gi5/10,Gi5/11,Gi5/12,Gi5/13,Gi5/14,Gi5/15,Gi5/16)
Alarming on (Gi5/17,Gi5/18,Gi5/19,Gi5/20,Gi5/21,Gi5/22,Gi5/23)
Alarming on (Gi5/24,Gi5/25,Gi5/26,Gi5/27,Gi5/28,Gi5/29,Gi5/30)
Alarming on (Gi5/31,Gi5/32,Gi5/33,Gi5/34,Gi5/35,Gi5/36,Gi5/37)
Alarming on (Gi5/38,Gi5/39,Gi5/40,Gi5/41,Gi5/42,Gi5/43,Gi5/44)
Alarming on (Gi5/45,Gi5/46,Gi5/47,Gi5/48,Po10,Po30,Po50)
There are no funny names here, which is what would have caused problems in 
previous versions of the devmon collector (but rev 154 shouldn't).

Can you provide a backtrace from one of the core files? It will provide at 
least the devmon test name that triggered the core, and possibly information 
that would help to fix the issue. You can do that with something like:

gdb /path/to/xymon-4.2.3RC1/hobbitd/hobbitd_rrd -c /path/to/core.xxxx

Where the path to hobbitd above is the hobbitd in the source directory where 
you built the hobbitd in question. Then, in the gdb prompt, type 'bt full'

It would also help more to provide the output of something like:

bb localhost 'hobbitdlog hostname.if_load'. This would allow me to test with 
the exact data your server is getting.

Regards,
Buchan
list Mario Andre · Mon, 5 Apr 2010 18:11:49 -0300 ·
Hi all,

I´m getting a lot of core files because of devmon if_load tests for a cisco
6509.

I´m running xymon4.3.0-beta2, I´ve already tried xymon-4.3.0.-beta2 latest
branch with do_devmon.c revision 6222 with no success.
My rrdtool version is 1.3.9.

This core is generated with the default package from xymon-4.3.0-beta2

#0  0x0063b402 in __kernel_vsyscall ()
#1  0x00513040 in raise () from /lib/i686/nosegneg/libc.so.6
#2  0x00514a21 in abort () from /lib/i686/nosegneg/libc.so.6
#3  0x08069e93 in sigsegv_handler (signum=11) at sig.c:57
#4  <signal handler called>
#5  0x0055a4c9 in strcat () from /lib/i686/nosegneg/libc.so.6
#6  0x0804e6e4 in do_devmon_rrd (hostname=0xb7b42207 "neuss-r1",
testname=0xb7b42210 "if_load",
    classname=0xb7b42244 "imat/imat_network/Infrastructure_Devices",
pagepaths=0x807009e "",
    msg=0xb7b42273 "status neuss-r1.if_load green Mon Apr  5 14:20:17 2010",
tstamp=1270488357) at rrd/do_devmon.c:83
#7  0x080598c4 in update_rrd (hostname=0xb7b42207 "neuss-r1",
testname=0xb7b42210 "if_load",
    msg=0xb7b42273 "status neuss-r1.if_load green Mon Apr  5 14:20:17 2010",
tstamp=1270488357, sender=0xb7b421f8 "199.200.11.51",
    ldef=0x8506588, classname=0xb7b42244
"imat/imat_network/Infrastructure_Devices", pagepaths=0x807009e "") at
do_rrd.c:649
#8  0x0804a400 in main (argc=3, argv=0xbf88c524) at hobbitd_rrd.c:349

Does anyone here is facing the same problem?

Thanks in advance,

Mario.


On Wed, Mar 31, 2010 at 5:28 PM, Mario Andre Panza
<user-82c7780661a4@xymon.invalid>wrote:
Buchan,


The revision 164 do_devmon.c from the devmon svn was working good with
4.2.3

After upgrade to 4.3.0 beta2 the revision 6171 and 6222 ( from xymon svn)
do not work.
Even with the last branch 4.3.0 from the svn I'm having lot of core files
and rrdctl. files?
Do you know why the rrdctl files?

But my question is what changes are really necessary at the revision 164 in
order to work with 4.3.0 ?


The diff is between 164(devmon) and 6222(svn xymon) :

diff xymon-4.2.3/hobbitd/rrd/do_devmon.c diff/6222
4c4
< /* Copyright (C) 2004-2006 Henrik Storner <user-ce4a2c883f75@xymon.invalid>
*/
---
/* Copyright (C) 2004-2009 Henrik Storner <user-ce4a2c883f75@xymon.invalid>
*/
14c14
< int do_devmon_rrd(char *hostname, char *testname, char *msg, time_t
tstamp)
---
int do_devmon_rrd(char *hostname, char *testname, char *classname, char
*pagepaths, char *msg, time_t tstamp)
18c18
<       static char *devmon_tpl      = NULL;
---
      static void *devmon_tpl      = NULL;
68,69d67
<                       devmon_params[0] = "rrdcreate";
<                       devmon_params[1] = rrdfn;
74c72
<                               devmon_params[numds+2] =
xstrdup(columns[numds]);
---
                              devmon_params[numds] =
xstrdup(columns[numds]);
78,82c76
<                       devmon_params[numds+2] = rra1;
<                       devmon_params[numds+3] = rra2;
<                       devmon_params[numds+4] = rra3;
<                       devmon_params[numds+5] = rra4;
<                       devmon_params[numds+6] = NULL;
---
                      devmon_params[numds] = NULL;
115,116c109
<               snprintf(rrdfn, sizeof(rrdfn)-1, "%s.%s.rrd", rrdbasename,
ifname);
<               rrdfn[sizeof(rrdfn)-1] = '\0';
---
              setupfn2("%s.%s.rrd", rrdbasename, ifname);
118c111
<               create_and_update_rrd(hostname, rrdfn, devmon_params,
devmon_tpl);
---
              create_and_update_rrd(hostname, testname, classname,
pagepaths, devmon_params, devmon_tpl);
127a121
Thanks in advance,

Mario.