head node has an extremely high load average.

m.roth at 5-cent.us m.roth at 5-cent.us
Thu Jun 27 14:25:11 UTC 2013


Jonathan Billings wrote:
> I saw a 'gmetad' running on your head node, which means you're running
> ganglia.  You can probably monitor your NFS traffic client-side on each
> node and have ganglia collect that information.
>
> Just keep in mind, your CPU load doesn't look that unusual for a NFS
> server with many nodes running jobs writing to it.  You said you had 10
nfsd
> threads, and your top output above said there were 3 tasks running on a
> 2-core system, with a load average between 13 and 14.  Sounds about right.
> Don't focus on the load average, you'd be better off trying to improve
> storage and network speed.
<snip>
Oh, this reminds me: one of my users is, right now, running a Gaussian (I
think) on our of our clusters - he's using a few nodes, but one node he's
running on is currently, according to top as I type this, at 64.16, though
this is *not* the head node. (And we use torque.)

         mark





More information about the redhat-list mailing list