[Linux-cluster] gfs_controld, aisexec and cman_tool

Ed Sanborn Ed.Sanborn at genband.com
Fri Mar 20 18:29:26 UTC 2009


Hi folks,

I have an 8-node cluster running on IBM BladeCenter HS21 blades, using
RHEL 5.2 and GFS (not GFS2).

The nodes are exhibiting high CPU load from the following apps: aisexec
and cman_tool.

Both of these apps race the CPU while no other user apps are doing much
at all.

Effectively, the user experience is dog-slow.

After I reboot one of the nodes it clears up, and these apps (aisexec and
cman_tool) seem to behave for a while.  Eventually, days to weeks later,
they race the CPU again.

Has anyone ever experienced this?  Top output is below.

Thanks,

Ed

   [root@blade1]# top
top - 13:47:51 up 40 days, 22:16, 37 users,  load average: 4.17, 3.94, 3.86
Tasks: 372 total,   2 running, 369 sleeping,   1 stopped,   0 zombie
Cpu(s):  5.9%us, 32.6%sy,  0.0%ni, 61.4%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:   8311372k total,  1934844k used,  6376528k free,    76332k buffers
Swap:  8388600k total,   322976k used,  8065624k free,   443172k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 4352 root      RT   0 37404  35m 2020 R  100  0.4  10519:34 aisexec
20806 root      16   0  1684  560  484 S   42  0.0   8324:49 cman_tool
12501 root      15   0  1680  556  484 S   31  0.0 609:38.46 cman_tool
27245 root      16   0  1688  560  484 S   30  0.0 508:14.31 cman_tool
 4635 root      34  19     0    0    0 S    2  0.0   1271:52 kipmi0
 5047 root      18   0  405m  17m 6260 S    1  0.2  21:57.04 cimserver
28975 root      15   0  2564 1296  900 R    1  0.0   0:00.05 top
    1 root      15   0  2064  576  524 S    0  0.0   0:02.91 init
    2 root      RT  -5     0    0    0 S    0  0.0   0:02.98 migration/0
    3 root      34  19     0    0    0 S    0  0.0   0:00.11 ksoftirqd/0
    4 root      RT  -5     0    0    0 S    0  0.0   0:00.00 watchdog/0
    5 root      RT  -5     0    0    0 S    0  0.0   0:01.29 migration/1
