TCP/IP stats (bits/s) limited to 100M

11 messages in this thread

list Nicolas Dorfsman · Wed, 28 Jun 2006 10:20:11 +0200 ·

Hi,

	I have data suspicious with TCP/IP stats on Solaris and AIX.

	Graphs told me that my hosts don't send/receive more than ~110 Mbits/ s . This raise an alert on the backup server which is installed with  a GigEth interface and need to eat many backups flow simulteanously.

	So, I'm checking other GigEth equipped host.
	I have some solaris hosts with GigEth. Graphs are inconsistent  between interfaces details and TCP/IP general graphs. Take a look :

	http://www.unikservice.com/frp/tcpip.png
	http://www.unikservice.com/frp/i1.png
	http://www.unikservice.com/frp/i2.png

	So, I'm suspecting an issue with collect or graphs.  Could somebody  tell me where I should start to debug ?


	Nicolas

list Beau Olivier · Wed, 28 Jun 2006 11:42:14 +0200 ·

Hi,

this looks like tcp-data going arround the 32bit counter problem...
are your counters 32 bit ? could you give us a copy of them ?


olivier

▸ quoted from Nicolas Dorfsman

-----Message d'origine-----
De : Nicolas Dorfsman [mailto:user-0b8cdfcc881d@xymon.invalid]
Envoyé : mercredi 28 juin 2006 10:20
À : user-ae9b8668bcde@xymon.invalid
Objet : [hobbit] TCP/IP stats (bits/s) limited to 100M


Hi,

	I have data suspicious with TCP/IP stats on Solaris and AIX.

	Graphs told me that my hosts don't send/receive more than ~110 Mbits/ s . This raise an alert on the backup server which is installed with  a GigEth interface and need to eat many backups flow simulteanously.

	So, I'm checking other GigEth equipped host.
	I have some solaris hosts with GigEth. Graphs are inconsistent  between interfaces details and TCP/IP general graphs. Take a look :

	http://www.unikservice.com/frp/tcpip.png
	http://www.unikservice.com/frp/i1.png
	http://www.unikservice.com/frp/i2.png

	So, I'm suspecting an issue with collect or graphs.  Could somebody  tell me where I should start to debug ?


	Nicolas

list Nicolas Dorfsman · Wed, 28 Jun 2006 11:54:27 +0200 ·

▸ quoted from Beau Olivier

Le 28 juin 06 à 11:42, Beau Olivier a écrit :

Hi,

this looks like tcp-data going arround the 32bit counter problem...
are your counters 32 bit ? could you give us a copy of them ?

I'd be glad to.

Which counter is used by hobbit ?


Nicolas

list Henrik Størner · Wed, 28 Jun 2006 12:35:17 +0200 ·

▸ quoted from Nicolas Dorfsman

On Wed, Jun 28, 2006 at 10:20:11AM +0200, Nicolas Dorfsman wrote:

	Graphs told me that my hosts don't send/receive more than ~110 
	Mbits/ s . This raise an alert on the backup server which is installed with 
a GigEth interface and need to eat many backups flow simulteanously.
	So, I'm checking other GigEth equipped host.
	I have some solaris hosts with GigEth. Graphs are inconsistent  
between interfaces details and TCP/IP general graphs. Take a look :

	http://www.unikservice.com/frp/tcpip.png
	http://www.unikservice.com/frp/i1.png
	http://www.unikservice.com/frp/i2.png

	So, I'm suspecting an issue with collect or graphs.  Could somebody  
tell me where I should start to debug ?

Let me explain where these data come from.

The first graph ("TCP/IP statistics") are fed by data from the "netstat
-s" command. This is (from a Solaris host):

TCP	tcpRtoAlgorithm     =     4	tcpRtoMin           =   400
<snip>
	tcpCurrEstab        =     0	tcpOutSegs          =51380214
	tcpOutDataSegs      =17936799	tcpOutDataBytes     =4114388778
<more snip>
	tcpInSegs           =59097243
	tcpInAckSegs        =19928198	tcpInAckBytes       =4108598170
	tcpInDupAck         =9794396	tcpInAckUnsent      =     0
	tcpInInorderSegs    =34384580	tcpInInorderBytes   =1273412387
	tcpInUnorderSegs    =970394	tcpInUnorderBytes   =694993056
	tcpInDupSegs        = 70767	tcpInDupBytes       =20764736

Hobbit tracks the "tcpOutDataBytes" and "tcpInInorderBytes" for the
first graph. These are fed into an RRD file which computes the
difference between two measurements, and from that it computes an
average number of bytes sent over a 5 minute period. For the graph,
this is then multiplied by 8 to go from bytes/second to bits/second.

What this means is that Hobbit does not count UDP traffic or other
non-TCP traffic in this graph. If you have lots of streaming data
which typically uses UDP, this can be a significant amount of data.

Also, it doesn't count out-of-order packets (retransmits, duplicate 
packets - see your OS documentation to learn exactly what goes into 
the "tcpInUnorderBytes" counter).


The second graph is fed by data from the Solaris' "kstat" utility, or
AIX's "netstat -v" output. As far I understand, this counts raw Ethernet
packet bytes - i.e. all protocols. They are fed into RRD files just like
the TCP statistics.


So - most likely the difference is in what protocols are counted for
each of the graphs.


Regards,
Henrik

list Henrik Størner · Wed, 28 Jun 2006 13:09:13 +0200 ·

▸ quoted from Beau Olivier

On Wed, Jun 28, 2006 at 11:42:14AM +0200, Beau Olivier wrote:

Hi,

this looks like tcp-data going arround the 32bit counter problem...
are your counters 32 bit ? could you give us a copy of them ?

The RRD files are created as "DERIVE" datatypes with a minimum value of
0, which should handle 32/64-bit counter overflows automatically.
(See the rrdcreate man-page).

Note that Hobbit never does any calculations on these values, it passes
them directly (as strings) to the RRDtool functions. 


Regards,
Henrik

list Werner Gmail Lists · Wed, 28 Jun 2006 10:42:51 -0300 ·

Hi,

	The 110Mbits/s value you get, does really point to 32bit counter
wrap, because with 32bit BYTE counter, measured every 5 minutes,
110Mbits/s is (aprox) the maximum you can count without wrapping the
counter.

	As Henrik explained bellow, it should not be a "wrap" done by
hobbit nor RRD. I'm think you need to look at you OS counters directly
and see if they're wrapping in less than 5 minutes.

	If you on Solaris / SunOS you could use something like the
bellow, and watch if any of the counters wraps 32 bit value (4294967295
if i recall correctly)

(host)$> while [ 1 ]; do date; netstat -s | \
egrep "(tcpInInorderBytes|tcpOutDataBytes)" ; sleep 60; done

Example output:
Wed Jun 28 09:28:28 EDT 2006
        tcpOutDataSegs      =5864959    tcpOutDataBytes     =4273878800
        tcpInInorderSegs    =2670997    tcpInInorderBytes   =725993348

	(CARE) With RRD it's possible to come around of this OS
limitation by feeding the data in shorter times, lets say every 2
minutes. RRD will take care of computing (making) the values "correct"
for the steep size used to create the RRD (in hobbit's case 300 secs). 

	I'm not exactly sure how /and or if hobbit will be happy in
receiving client info quicker than every 5 minutes, but i think it
should be transparent.

	Hope this can give you some help.

	Regards
	Werner

▸ quoted from Henrik Størner


----------------------- Original Message -----------------------
From:    user-ce4a2c883f75@xymon.invalid (Henrik Stoerner)
To:      user-ae9b8668bcde@xymon.invalid
Date:    Wed, 28 Jun 2006 13:09:13 +0200
Subject: Re: [hobbit] TCP/IP stats (bits/s) limited to 100M
----

On Wed, Jun 28, 2006 at 11:42:14AM +0200, Beau Olivier wrote:

Hi,

this looks like tcp-data going arround the 32bit counter problem...
are your counters 32 bit ? could you give us a copy of them ?

The RRD files are created as "DERIVE" datatypes with a minimum value of
0, which should handle 32/64-bit counter overflows automatically.
(See the rrdcreate man-page).

Note that Hobbit never does any calculations on these values, it passes
them directly (as strings) to the RRDtool functions. 


Regards,
Henrik

list Nicolas Dorfsman · Wed, 28 Jun 2006 16:07:44 +0200 ·

▸ quoted from Werner Gmail Lists

Le 28 juin 06 à 15:42, Werner (gmail Lists) a écrit :

Hi,

	The 110Mbits/s value you get, does really point to 32bit counter
wrap, because with 32bit BYTE counter, measured every 5 minutes,
110Mbits/s is (aprox) the maximum you can count without wrapping the
counter.

	As Henrik explained bellow, it should not be a "wrap" done by
hobbit nor RRD. I'm think you need to look at you OS counters directly
and see if they're wrapping in less than 5 minutes.

	If you on Solaris / SunOS you could use something like the
bellow, and watch if any of the counters wraps 32 bit value  
(4294967295
if i recall correctly)

Correct.
Found this document which approves what you're saying :

http://sunsolve.sun.com/search/document.do?assetkey=1-25-72535-1

▸ quoted from Werner Gmail Lists



Le 28 juin 06 à 13:09, Henrik Stoerner a écrit :

On Wed, Jun 28, 2006 at 11:42:14AM +0200, Beau Olivier wrote:

Hi,

this looks like tcp-data going arround the 32bit counter problem...
are your counters 32 bit ? could you give us a copy of them ?

The RRD files are created as "DERIVE" datatypes with a minimum  
value of
0, which should handle 32/64-bit counter overflows automatically.
(See the rrdcreate man-page).

Well...the man is not so confident :

              COUNTER
                  is for continuous incrementing counters like the
                  ifInOctets counter in a router. The COUNTER data
                  source assumes that the counter never decreases,
                  except when a counter overflows.  The update
                  function takes the overflow into account.  The
                  counter is stored as a per-second rate. When the
                  counter overflows, RRDtool checks if the
                  overflow happened at the 32bit or 64bit border
                  and acts accordingly by adding an appropriate
                  value to the result.

              DERIVE
                  will store the derivative of the line going from
                  the last to the current value of the data
                  source. This can be useful for gauges, for
                  example, to measure the rate of people entering
                  or leaving a room. Internally, derive works
                  exactly like COUNTER but without overflow
                  checks. So if your counter does not reset at 32
                  or 64 bit you might want to use DERIVE and
                  combine it with a MIN value of 0.

                  NOTE on COUNTER vs DERIVE
                      by Don Baarda <user-b0c673992076@xymon.invalid>

                      If you cannot tolerate ever mistaking the
                      occasional counter reset for a legitimate
                      counter wrap, and would prefer "Unknowns"
                      for all legitimate counter wraps and resets,
                      always use DERIVE with min=0. Otherwise,
                      using COUNTER with a suitable max will
                      return correct values for all legitimate
                      counter wraps, mark some counter resets as
                      "Unknown", but can mistake some counter
                      resets for a legitimate counter wrap.

                      For a 5 minute step and 32-bit counter, the
                      probability of mistaking a counter reset for
                      a legitimate wrap is arguably about 0.8% per
                      1Mbps of maximum bandwidth. Note that this
                      equates to 80% for 100Mbps interfaces, so
                      for high bandwidth interfaces and a 32bit
                      counter, DERIVE with min=0 is probably
                      preferable. If you are using a 64bit
                      counter, just about any max setting will
                      eliminate the possibility of mistaking a
                      reset for a counter wrap.


  In my particular case (and maybe in any large GigEth flow) COUNTER  
with max set to 4294967295 should be the solution

▸ quoted from Werner Gmail Lists



Le 28 juin 06 à 15:42, Werner (gmail Lists) a écrit :

	(CARE) With RRD it's possible to come around of this OS
limitation by feeding the data in shorter times, lets say every 2
minutes. RRD will take care of computing (making) the values "correct"
for the steep size used to create the RRD (in hobbit's case 300 secs).

	I'm not exactly sure how /and or if hobbit will be happy in
receiving client info quicker than every 5 minutes, but i think it
should be transparent.


Mmmm. I'd prefer to try to fix the RRD file. May be tricky (export,  
import, etc), but more reliable.

	Hope this can give you some help.

it definitively helps, thanks !


Nicolas

list Henrik Størner · Sun, 9 Jul 2006 18:18:13 +0200 ·

▸ quoted from Nicolas Dorfsman

On Wed, Jun 28, 2006 at 04:07:44PM +0200, Nicolas Dorfsman wrote:

The RRD files are created as "DERIVE" datatypes with a minimum  
value of
0, which should handle 32/64-bit counter overflows automatically.
(See the rrdcreate man-page).

Well...the man is not so confident :

                     If you cannot tolerate ever mistaking the
                     occasional counter reset for a legitimate
                     counter wrap, and would prefer "Unknowns"
                     for all legitimate counter wraps and resets,
                     always use DERIVE with min=0. Otherwise,
                     using COUNTER with a suitable max will
                     return correct values for all legitimate
                     counter wraps, mark some counter resets as
                     "Unknown", but can mistake some counter
                     resets for a legitimate counter wrap.

OK, you got me on that one.

It seems that using COUNTER for the byte-counts in both the
netstat- and ifstat-RRD's might be a good idea. The question then
becomes "what's a suitable max" for these data ? Should I 
assume they are 32-bit counters ? I know some of them are not
(e.g. Solaris has 64-bit counters for bytes in/out per interface).

I'll change it to a counter now, with MAX set to "unknown". The overflow
handling should still work correctly, if I understand the RRD
docs right.

Note: This doesn't affect all of the existing RRD's, only new ones 
created.


Regards,
Henrik

list Scott Walters · Sun, 9 Jul 2006 16:02:32 -0400 ·

On Jul 9, 2006, at 12:18 PM, Henrik Stoerner wrote:

OK, you got me on that one.

Not really, you inherited this ;)   He is trying to get me, and his  point is valid, but the tool 'works as designed', read on . . .

▸ quoted from Henrik Størner

It seems that using COUNTER for the byte-counts in both the
netstat- and ifstat-RRD's might be a good idea.

*might* being the operative word there

▸ quoted from Henrik Størner

The question then
becomes "what's a suitable max" for these data ? Should I
assume they are 32-bit counters ? I know some of them are not
(e.g. Solaris has 64-bit counters for bytes in/out per interface).

exactly, and it is even more complicated than that . . . see below

▸ quoted from Henrik Størner

I'll change it to a counter now, with MAX set to "unknown". The  overflow
handling should still work correctly, if I understand the RRD
docs right.

I would not recommend this. Another major issue is counter resets instead of overflows (e.g reboot) get mistaken as wraps if the MAX is not correct. From what I recall, if you use counter and anything gets mistaken, you get a massive spike in the RRD making all the data relatively useless because the y axis autoscales to the spike.

With DERIVE=0 you acknowledge you won't handle counter wraps correctly (which are not that common anyway) but the result for all wraps/resets are benign with the NaN, which does *not* cause a spike. I am a firm believer in no data is better than bad data.

I am not opposing the ideal that COUNTER with correct max is the 'right way'. The problem with software that runs on so many platforms is the correct max is impossible to know for certain. Defining the MAX as just whatever 32/64 bits value is not adequate because reboots will cause spikes, you'd need to now the MAX for the particular metric and that is completely impossible to know absolutely. inbytes MAX would need to be different for 10Mb/s 100 1000, Token Ring 16Mb/s, etc, etc.

DERIVE=0 and NaN is a much better compromise than the spikes. And I would bet the farm reboots are a much more common event than counter wraps for the majority of environments.

And Henrik, the net result to you will be answering an endless stream of emails regarding why every COUNTER RRD has spikes . . . I've been there, done that ;) I am almost 100% positive there is not *one* counter RRD in the larrd stuff, all DERIVE. It's not impossible rrdtool has changed to alleviate some of this, but from what I have read of your email streams it I haven't seen anything to support that.

scott

list Henrik Størner · Sun, 9 Jul 2006 22:23:57 +0200 ·

Hi Scott,

▸ quoted from Scott Walters


On Sun, Jul 09, 2006 at 04:02:32PM -0400, Scott Walters wrote:

The question then
becomes "what's a suitable max" for these data ? Should I
assume they are 32-bit counters ? I know some of them are not
(e.g. Solaris has 64-bit counters for bytes in/out per interface).

exactly, and it is even more complicated than that . . . see below

[snip explanation]

▸ quoted from Scott Walters

And Henrik, the net result to you will be answering an endless stream  
of emails regarding why every COUNTER RRD has spikes . . . I've been  
there, done that ;)  I am almost 100% positive there is not *one*  
counter RRD in the larrd stuff, all DERIVE.  It's not impossible  
rrdtool has changed to alleviate some of this, but from what I have  
read of your email streams it I haven't seen anything to support that.

Your experience certainly carries a lot of weight. Since I've never
used any COUNTER datasets I haven't seen this problem (you're right:
all the LARRD DS definitions use DERIVE - I copied those just about
verbatim into Hobbit).

So - I've undone the change. Back to DERIVE with MIN=0, and we'll
see how much trouble that gives us. So far, only one person has
noticed bad effects from this.


Thanks,
Henrik

list Nicolas Dorfsman · Mon, 10 Jul 2006 09:36:28 +0200 ·

▸ quoted from Henrik Størner

Le 9 juil. 06 à 22:23, Henrik Stoerner a écrit :

So - I've undone the change. Back to DERIVE with MIN=0, and we'll
see how much trouble that gives us. So far, only one person has
noticed bad effects from this.

   Sorry to be this one  ;) .


   The choice is not easy. The better could be to have a specific  
manual on this, with some option to have this rrd correctly set up.


	Nicolas

TCP/IP stats (bits/s) limited to 100M 🔗 link

TCP/IP stats (bits/s) limited to 100M