Xymon Mailing List Archive search

rrd error messages

5 messages in this thread

list Steve Holmes · Mon, 21 May 2007 09:57:16 -0400 ·
Greetings,

I am getting a lot of errors in the rrd-status.log file. There are 2 actually that are appearing at an alarming frequency (and have been for quite a while):

2007-05-21 09:42:52 RRD error updating /var/hobbit/data/rrd/client.purdue.edu/disk,export,video.rrd from 128.210.xxx.xxx: unknown consolidation function '^?<F8>'

2007-05-21 09:42:52 RRD error updating /var/hobbit/data/rrd/client.purdue.edu/memory.real.rrd from 128.210.xxx.xxx: illegal
attempt to update using time 1179754972 when last update time is 2146959360 (minimum one second step)

Note that the system time on the hobbit server and on the clients are all synchronized with xntpd so the clocks aren't different.

I am also having some trouble with a lot of NAN values in the disk history graphs. I'm wondering a) how to fix these errors, and b) if the errors have anything to do with not getting good data in the graphs.

Thanks,
Steve Holmes
list Steve Holmes · Mon, 25 Jun 2007 12:03:49 -0400 ·
I'm still getting these errors. Does no one have an answer? I have searched
the archives and found nothing. They are occurring for multiple hosts and
multiple services.

Thanks,
Steve.
quoted from Steve Holmes


On 5/21/07, Steve Holmes <user-08c0215782b3@xymon.invalid> wrote:
Greetings,

I am getting a lot of errors in the rrd-status.log file. There are 2
actually that are appearing at an alarming frequency (and have been for
quite a while):

2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/disk,export,video.rrd from
128.210.xxx.xxx: unknown consolidation function '^?<F8>'

2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/memory.real.rrd from
128.210.xxx.xxx: illegal
attempt to update using time 1179754972 when last update time is
2146959360 (minimum one second step)

Note that the system time on the hobbit server and on the clients are all
synchronized with xntpd so the clocks aren't different.

I am also having some trouble with a lot of NAN values in the disk history
graphs. I'm wondering a) how to fix these errors, and b) if the errors
have anything to do with not getting good data in the graphs.

Thanks,
Steve
Holmes

-- 

Nonviolence means avoiding not only external physical violence but also
internal violence of spirit. You not only refuse to shoot a man, but you
refuse to hate him. -Martin Luther King, Jr., civil-rights leader
(1929-1968)
list Henrik Størner · Mon, 25 Jun 2007 22:56:34 +0200 ·
quoted from Steve Holmes
On Mon, Jun 25, 2007 at 12:03:49PM -0400, Steve Holmes wrote:
I'm still getting these errors. Does no one have an answer? I have searched
the archives and found nothing. They are occurring for multiple hosts and
multiple services.
2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/disk,export,video.rrd from
128.210.xxx.xxx: unknown consolidation function '^?<F8>'
This looks like a corrupted RRD file.
quoted from Steve Holmes
2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/memory.real.rrd from
128.210.xxx.xxx: illegal
attempt to update using time 1179754972 when last update time is
2146959360 (minimum one second step)
There are several potential reasons for this, but right now you should
just take my word that it's harmless.
quoted from Steve Holmes
I am also having some trouble with a lot of NAN values in the disk history
graphs. I'm wondering a) how to fix these errors, and b) if the errors
have anything to do with not getting good data in the graphs.
What's the load on your server ? I've seen this happen on my own system,
where it turned out that the amount of RRD file-updates was pushing the
disks beyond capacity. vmstat/iostat data usually picks this up. But
then, this would affect all types of graphs.


Regards,
Henrik
list Ralph Mitchell · Mon, 25 Jun 2007 16:22:31 -0500 ·
quoted from Henrik Størner
On 6/25/07, Henrik Stoerner <user-ce4a2c883f75@xymon.invalid> wrote:
On Mon, Jun 25, 2007 at 12:03:49PM -0400, Steve Holmes wrote:
2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/memory.real.rrd from
128.210.xxx.xxx: illegal
attempt to update using time 1179754972 when last update time is
2146959360 (minimum one second step)
There are several potential reasons for this, but right now you should
just take my word that it's harmless.
Are you sure it's harmless in this particular case??  As a time value,
that number evaluates to Tue Jan 12 19:36:00 2038.  If the rrd has an
entry with that datestamp, it won't accept new input for the next 30
years...

Ralph Mitchell
list Steve Holmes · Thu, 28 Jun 2007 09:22:39 -0400 ·
Thanks, Henrik.

There appear to be only 16 of the rrd files which might be corrupt. Is there
any way to repair them, or am I stuck with just removing them?

There are no dates on the files further into the future than August of 2007.
But that is more than one second off.

As to the server load. No the server isn't stressed at all.

Thanks,
Steve.
quoted from Henrik Størner


On 6/25/07, Henrik Stoerner <user-ce4a2c883f75@xymon.invalid> wrote:
On Mon, Jun 25, 2007 at 12:03:49PM -0400, Steve Holmes wrote:
I'm still getting these errors. Does no one have an answer? I have
searched
the archives and found nothing. They are occurring for multiple hosts
and
multiple services.
2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/disk,export,video.rrd from
128.210.xxx.xxx: unknown consolidation function '^?<F8>'
This looks like a corrupted RRD file.
2007-05-21 09:42:52 RRD error updating
/var/hobbit/data/rrd/client.purdue.edu/memory.real.rrd from
128.210.xxx.xxx: illegal
attempt to update using time 1179754972 when last update time is
2146959360 (minimum one second step)
There are several potential reasons for this, but right now you should
just take my word that it's harmless.
I am also having some trouble with a lot of NAN values in the disk
history
graphs. I'm wondering a) how to fix these errors, and b) if the errors
have anything to do with not getting good data in the graphs.
What's the load on your server ? I've seen this happen on my own system,
where it turned out that the amount of RRD file-updates was pushing the
disks beyond capacity. vmstat/iostat data usually picks this up. But
then, this would affect all types of graphs.


Regards,
Henrik

-- 
Nonviolence means avoiding not only external physical violence but also
internal violence of spirit. You not only refuse to shoot a man, but you
refuse to hate him. -Martin Luther King, Jr., civil-rights leader
(1929-1968)

The great thing about getting older is that you don't lose all the other
ages you've been. -Madeleine L'Engle, writer (1918- )