Xymon Mailing List Archive search

Load Averages

9 messages in this thread

list Gavin Leonard · Wed, 8 Oct 2008 15:16:21 -0600 ·
I am getting pages from my hobbit server when some of my linux systems are hitting 15 on their load avg. does that equate to something worse than it sounds? Cause 15% load average on a linux system does not seem that bad? Or is it not a percentage?


Gavin Leonard

[cid:image001.gif at 01C92958.D1EDC010]

Director, Systems-Network Engineering

T

 XXX-XXX-XXXX

F

 XXX-XXX-XXXX

E

 user-d65663809eb4@xymon.invalid<mailto:user-d65663809eb4@xymon.invalid>


Research | Marketing | Sales Generation

www.progrexion.com<http://www.progrexion.com/>;


This email and its contents are confidential. If you are not the intended recipient, delete this email and do not use or disclose the information within this email or its attachments. Thank you.
list Michael McCullen · Wed, 8 Oct 2008 17:24:11 -0400 ·
Load is a measure of how many processes are running or runnable and is  not directly related to % of CPU used.  There are cases where Load  average is through the roof but CPU % is very low, usually when when  the server is having issues talking to NFS storage or accessing some  kind of I/O resource.

We generally monitor load average based on number of CPUs in the box.   Our standard is yellow at 1.5 * # CPUs and red at 2 * # CPUs.  YMMV.
quoted from Gavin Leonard

On Oct 8, 2008, at 5:16 PM, Gavin Leonard wrote:
I am getting pages from my hobbit server when some of my linux  systems are hitting 15 on their load avg. does that equate to  something worse than it sounds? Cause 15% load average on a linux  system does not seem that bad? Or is it not a percentage?


Gavin Leonard

<image001.gif>
Director, Systems-Network Engineering
T
 XXX-XXX-XXXX
F
 XXX-XXX-XXXX
E
 user-d65663809eb4@xymon.invalid


Research | Marketing | Sales Generation
www.progrexion.com
quoted from Gavin Leonard

This email and its contents are confidential. If you are not the  intended recipient, delete this email and do not use or disclose the  information within this email or its attachments. Thank you.

list Rob MacGregor · Wed, 8 Oct 2008 22:25:59 +0100 ·
quoted from Gavin Leonard
On Wed, Oct 8, 2008 at 22:16, Gavin Leonard <user-d65663809eb4@xymon.invalid> wrote:
I am getting pages from my hobbit server when some of my linux systems are
hitting 15 on their load avg. does that equate to something worse than it
sounds? Cause 15% load average on a linux system does not seem that bad? Or
is it not a percentage?
http://en.wikipedia.org/wiki/Load_(computing)

The Load Average is the number of processes requiring processor time.
What it means depends on the number of processors - a load average of
6 is fine for a system with 8 processors, but not good news for one
with only a single processor.  Fifteen on an 8 processor box isn't
good news, but isn't the end of the world.  On a single processor box
it's a sign that you've got problems ;)

-- 
                 Please keep list traffic on the list.

Rob MacGregor
      Whoever fights monsters should see to it that in the process he
        doesn't become a monster.                  Friedrich Nietzsche
list Josh Luthman · Wed, 8 Oct 2008 17:27:53 -0400 ·
It is not a percentage.

http://www.luv.asn.au/overheads/NJG_LUV_2002/luvSlides.html

Simply put the load is how much calculator power your box is doing.  I have
seen servers that have a value of several hundred load on a day to day
basis.

The biggest reason for it being monitored is to watch what is usual and when
you see a spike, ascend, descend, etc, you have a history to compare it to.
This value will be different for nearly every single server.

Josh Luthman
Office: XXX-XXX-XXXX
Direct: XXX-XXX-XXXX
XXXX Wayne St
Suite XXXX
Troy, OH XXXXX

Those who don't understand UNIX are condemned to reinvent it, poorly.
--- Henry Spencer
quoted from Gavin Leonard


On Wed, Oct 8, 2008 at 5:16 PM, Gavin Leonard <user-d65663809eb4@xymon.invalid>wrote:
 I am getting pages from my hobbit server when some of my linux systems
are hitting 15 on their load avg. does that equate to something worse than
it sounds? Cause 15% load average on a linux system does not seem that bad?
Or is it not a percentage?


*Gavin Leonard***

[image: cid:image001.gif at 01C856AD.922EF120]
quoted from Michael McCullen

Director, Systems-Network Engineering

*T*

 XXX-XXX-XXXX

*F*

 XXX-XXX-XXXX

*E*

 user-d65663809eb4@xymon.invalid


Research | Marketing | Sales Generation

*www.progrexion.com* <http://www.progrexion.com/>;
quoted from Michael McCullen


This email and its contents are confidential. If you are not the intended
recipient, delete this email and do not use or disclose the information
within this email or its attachments. Thank you.

list Rich Smrcina · Wed, 08 Oct 2008 16:33:28 -0500 ·
quoted from Gavin Leonard
Gavin Leonard wrote:
I am getting pages from my hobbit server when some of my linux systems are hitting 15 on their load avg. does that equate to something worse than it sounds? Cause 15% load average on a linux system does not seem that bad? Or is it not a percentage?
Your thinking CPU Utilization.  Load Average measures the number of running processes.

-- 

Rich Smrcina
VM Assist, Inc.
Phone: XXX-XXX-XXXX
Ans Service:  XXX-XXX-XXXX
user-61add9955ef9@xymon.invalid
http://www.linkedin.com/in/richsmrcina

Catch the WAVV!  http://www.wavv.org
WAVV 2009 - Orlando, FL - May 15-19, 2009
list Greg L Hubbard · Wed, 8 Oct 2008 16:33:53 -0500 ·
Gavin,
 
Load average is (more or less) the average length of the run queue for a
UNIX-style system (someone once posted a link to a more precise
definition on this forum).  For your system, a load average value of 15
means that an average of 15 processes are waiting to use the CPU -- a
problem for most systems.
 
If you look at the trends button, you will probably find a CPU
utilization graph that shows a split between user, system, and idle CPU
usage.
 
CPU utilization shows how busy the CPU(s) is/are, but load average shows
the impact of a busy CPU on the system's workload.
 
I once had a system that spiked to 200+ in the load average display.
This problem was traced to a problem in some multi-threaded code where
the threads were going into infinite loops one by one until all of them
were locked up.  Amazingly enough, the system still responded to
interactive usage, but v-e-r-y s-l-o-w-l-y.
 
GLH
quoted from Gavin Leonard


	From: Gavin Leonard [mailto:user-d65663809eb4@xymon.invalid] 
	Sent: Wednesday, October 08, 2008 4:16 PM
	To: user-ae9b8668bcde@xymon.invalid
	Subject: [hobbit] Load Averages
	
	
	I am getting pages from my hobbit server when some of my linux
systems are hitting 15 on their load avg. does that equate to something
worse than it sounds? Cause 15% load average on a linux system does not
seem that bad? Or is it not a percentage?

	 
Gavin Leonard

 
Director, Systems-Network Engineering

T

 XXX-XXX-XXXX

F

 XXX-XXX-XXXX

E

 user-d65663809eb4@xymon.invalid <mailto:user-d65663809eb4@xymon.invalid> 
quoted from Josh Luthman

 
Research | Marketing | Sales Generation

www.progrexion.com <http://www.progrexion.com/>; 

 
This email and its contents are confidential. If you are not the
intended recipient, delete this email and do not use or disclose the
information within this email or its attachments. Thank you.
list Gavin Leonard · Wed, 8 Oct 2008 15:38:37 -0600 ·
It is based on physical processors or cores?

-Gavin
quoted from Josh Luthman

From: Josh Luthman [mailto:user-4c45a83f15cb@xymon.invalid]
Sent: Wednesday, October 08, 2008 3:28 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] Load Averages

It is not a percentage.

http://www.luv.asn.au/overheads/NJG_LUV_2002/luvSlides.html

Simply put the load is how much calculator power your box is doing.  I have seen servers that have a value of several hundred load on a day to day basis.

The biggest reason for it being monitored is to watch what is usual and when you see a spike, ascend, descend, etc, you have a history to compare it to.  This value will be different for nearly every single server.

Josh Luthman
Office: XXX-XXX-XXXX
Direct: XXX-XXX-XXXX
XXXX Wayne St
Suite XXXX
Troy, OH XXXXX

Those who don't understand UNIX are condemned to reinvent it, poorly.
--- Henry Spencer

On Wed, Oct 8, 2008 at 5:16 PM, Gavin Leonard <user-d65663809eb4@xymon.invalid<mailto:user-d65663809eb4@xymon.invalid>> wrote:

I am getting pages from my hobbit server when some of my linux systems are hitting 15 on their load avg. does that equate to something worse than it sounds? Cause 15% load average on a linux system does not seem that bad? Or is it not a percentage?


Gavin Leonard


Error! Filename not specified.
quoted from Greg L Hubbard


Director, Systems-Network Engineering


T


 XXX-XXX-XXXX


F


 XXX-XXX-XXXX


E


 user-d65663809eb4@xymon.invalid<mailto:user-d65663809eb4@xymon.invalid>


Research | Marketing | Sales Generation


www.progrexion.com<http://www.progrexion.com/>;


This email and its contents are confidential. If you are not the intended recipient, delete this email and do not use or disclose the information within this email or its attachments. Thank you.
list Hobbit User in Richmond · Wed, 8 Oct 2008 21:09:56 -0400 (EDT) ·
quoted from Josh Luthman
On Wed, October 8, 2008 17:27, Josh Luthman wrote:
Simply put the load is how much calculator power your box is doing.  I
have seen servers that have a value of several hundred load on a day to
day basis.
Well, it depends on what sort of work the box is doing, and what your
throughput expectations are, but any server at a load average of 200+ is
going to have abysmal response times for online users.  I've seen that on
dedicated mail servers in a spam storm, but when our Oracle database
servers go above about 30, we start getting calls from application users.
The default alert levels in Hobbit work pretty well for us, letting us
check whether anything's going awry or if it's a true spike in demand,
before the phones start ringing.  I do keep the dedicated Internet-facing
sendmail servers at significantly higher thresholds, because mail doesn't
necessarily have to move in near real time, and customers are more
tolerant of slow delivery than "unable to deliver" warnings and bounces.
list Rob MacGregor · Thu, 9 Oct 2008 07:04:18 +0100 ·
quoted from Gavin Leonard
On Wed, Oct 8, 2008 at 22:38, Gavin Leonard <user-d65663809eb4@xymon.invalid> wrote:
It is based on physical processors or cores?
The number of CPU cores in total - so a pair of single core processors
and one dual core processor are effectively the same.
quoted from Rob MacGregor

-- 
                 Please keep list traffic on the list.

Rob MacGregor
      Whoever fights monsters should see to it that in the process he
        doesn't become a monster.                  Friedrich Nietzsche