[Linux-cluster] DLM Problem
Patrick Caulfeld
pcaulfie at redhat.com
Wed Jan 30 15:42:45 UTC 2008
isplist at logicore.net wrote:
>> That means that something else is using port 21064 - the TCP port that
>> the DLM uses. If the DLM can't bind to its port then it cannot start.
>
> What's weird is that I didn't change anything, just the storage, then needing
> to get the cluster quorum so that I could set up gfs.
>
>> Use netstat -tap or lsof to find out what is using that port. If you
>> can't stop that particular application that is using it, then you'll
>
> #lsof -i | grep 21064
> Reveals nothing.
>
> # netstat -anp | grep 21064
> Seems to show port 21064 being available.
>
> tcp 0 0 192.168.1.58:21064 0.0.0.0:* LISTEN -
> tcp 0 0 192.168.1.58:6809 192.168.1.62:21064 ESTABLISHED -
> tcp 0 0 192.168.1.58:21064 192.168.1.63:32780 ESTABLISHED -
> tcp 0 0 192.168.1.58:21064 192.168.1.92:6809 ESTABLISHED -
> tcp 0 0 192.168.1.58:21064 192.168.1.62:6809 ESTABLISHED -
> tcp 0 0 192.168.1.58:21064 192.168.1.40:6809 ESTABLISHED -
>
>
That's not showing available, that's showing it already in use. The port
6809 on the other end is very suspicious, like there is maybe some
confusion about cman & dlm ports going on. Check cluster.conf and your
startup scripts for port changing things. It's very unusual. Also check
that all the nodes are using the same configuration.
There certainly is some port conflict going on with that system. it
might be a bizarre cluster suite misconfiguration or it might be some
other application - it's hard to be sure.
Patrick
More information about the Linux-cluster
mailing list