[Linux-cluster] GFS tuning for combined batch / interactive use
Ben Turner
bturner at redhat.com
Thu Dec 16 22:25:56 UTC 2010
There is some helpful stuff here on the tuning side:
http://sources.redhat.com/cluster/wiki/FAQ/GFS#gfs_tuning
-b
----- "Bob Peterson" <rpeterso at redhat.com> wrote:
> ----- "Kevin Maguire" <kmaguire at eso.org> wrote:
> | Hi
> |
> | We are running a 20 node cluster, using Scientific Linux 5.3, with
> a
> | GFS
> | shared filesystem hosted on our SAN. Cluster nodes are dual core
> units
> |
> | with 4 GB of RAM, and a standard Qlogic FC HBA.
> |
> | Most of the 20 nodes form a batch-processing cluster, and our users
> | are
> | happy enough with the performance they get, but some nodes are used
>
> | interactively. When the filesystem is under stress due to large
> batch
> |
> | processing jobs running on other nodes, interactive use becomes
> very
> | slow
> | and painful.
> |
> | Is there any tuning I (the sysadmin) can do that might help in this
>
> | situation? Would a migration to gfs2 make a difference? Are all
> nodes
> |
> | treated identically, or can hosts mounting the filesystem have any
> | kind of
> | priority/QoS? Which tools could I use to track down any
> bottlenecks?
> |
> | In theory we could update kernel+gfs bits to a later release,
> though
> | we
> | saw the same issues when using the same cluster with a SL4.x stack,
> | but
> | for now it's
> |
> | kernel-2.6.18-128.1.1.el5.i686
> | kmod-gfs-0.1.31-3.el5.i686
> | gfs-utils-0.1.20-7.el5.i386
> | gfs2-utils-0.1.53-1.el5_3.1.i386
> |
> | Thanks for any help/suggestions,
> | Kevin
>
> Hi Kevin,
>
> We recently identified a slowdown in RHEL5.x that involves DLM
> traffic.
> There is a patch to speed dlm up, and it's being tested now. The
> patch is built into RHEL5 kernels starting with 2.6.18-232 and newer.
> That means it is currently scheduled to be released in RHEL5.6.
>
> It's also being z-streamed back to 5.5.z, but I don't know when that
> is scheduled to go out. Unfortunately, since the problem was
> opened by a customer, the bugzilla record is private to protect the
> customer's confidential information. The patch is public though.
> If you are a Red Hat customer, you can probably call Red Hat Support
> and ask to be put on the list for bugzilla bug 604139 and
> maybe find out when the fix will be available.
>
> There is no guarantee this is what your problem is, and there is
> no guarantee that the patch will speed you up. But it might be.
>
> Regards,
>
> Bob Peterson
> Red Hat File Systems
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
More information about the Linux-cluster
mailing list