Summary ithr in mpstat increasing

From: Bigadmin <bigadmin_at_unixplanet.biz>
Date: Sat Jan 17 2009 - 11:47:19 EST
Hello Sun Managers,

Thanks everyone for quick answer, really appreciate your help.
And I was lucky to talk to someone very knowledgeable at SUN Kernel support.
  
In most cases having abnormal number of interrupts or CPU usage
Indicates an offensive process, however our case was related to 
high network traffic usage. We are doing some heavy work on network and 
network interface send its interrupts to one specific CPU which causes 
the number of interrupts on one CPU goes higher than others. This wont 
cause any problem unless CPU power is less than Network speed.
In this case high number of interrupts pins out other processes 
that are running on that CPU.

I found following website very useful:
http://blogs.sun.com/sunay/entry/the_solaris_networking_the_magic

and SUN support mentioned about increasing number of FIFO on network
interface.

I am going to copy all answers the I received from SUN Managers.
Once again thanks every one.


****************************************************************************
******

You can do this in a multi step process.

1. First select the process you think is causing the high ithr - 
   it will be a process using pthreads, or java, or some lib that allows it
to thread 
   (or it might be a custom thread lib, hence the bad behavior)

2. Select a group of 2 or more processors and create a processor group using
'pset' - 
this processor group will be used to isolate execution.
 If you have several processors (T1 or T2 archs) you should create one
process group 
 for each suspect process to save time.  Having a smaller number of
processors helps 
 increase the frequency of interrupts and context switches.

3.  Also using the pset command, bind your suspect processes to a process
group.  
    Only one process group per process so you can easily see which one is
causing the problem.

4. Run mpstat using the -a option to monitor by processor set.

Hope this helps.

JayJay Florendo


****************************************************************************
******

Hello,

It looks like a thread contention on CPU 2. I think prstat has the
possibility to sort PID based on CPU. 
Just look the PID with the max number of threads that use CPU 2. It should
be your guilty PID.

Kind regards,
Eric Deblon

****************************************************************************
********


Hi,

hopefully this is a Sol10 box.. in that case dtrace might help you find the
process producing these interrupts...

http://prefetch.net/articles/dtracecookbook.html

You can get the dtrace toolkit from

http://www.brendangregg.com/dtrace.html#DTraceToolkit

Cheers

Thom





-----Original Message-----
From: Bigadmin [mailto:bigadmin@unixplanet.biz] 
Sent: Tuesday, December 23, 2008 9:15 PM
To: sunmanagers@sunmanagers.org
Cc: bigadmin@unixplanet.biz
Subject: ithr in mpstat increasing

Hi

mpstat output shows Number of intr and ithr are increasing and it 
Reaches to the point that System disable the CPU.

I don't think it is a hard ware problem and looking for a way to find the
exact
Process that cause this problem and then find the application.


CPU minf mjf xcal  intr ithr  csw icsw migr smtx  srw syscl  usr sys  wt idl
  0  101   2  255  1003  645 5575  345  520  575    0  6214   11  24   0  64
  1   61   1  149   609   73 5768  483  555  553    0  4569   10  24   0  66
  2   55   2 1259 14530 14485 1253   86  185 2141    0  3063    6  39   0
56
  3   77   1  197   833  259 5634  469  544  554    0  5050   11  25   0  65

Thanks
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers
Received on Sat Jan 17 11:47:37 2009

This archive was generated by hypermail 2.1.8 : Thu Mar 03 2016 - 06:44:13 EST