[Linux-cluster] cluster suite crashing

Chris Harms chris at cmiware.com
Thu Aug 2 23:23:55 UTC 2007


I grabbed the RHEL5 branch out of CVS, but compilation fails with

make[2]: Entering directory `/usr/src/cluster-cvs/cluster/dlm/lib'
gcc -Wall  -g -I. -O2  -D_REENTRANT -c -o libdlm.o libdlm.c
libdlm.c: In function âset_version_v5â:
libdlm.c:324: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:325: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:326: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âset_version_v6â:
libdlm.c:335: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:336: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:337: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âdetect_kernel_versionâ:
libdlm.c:443: error: storage size of âvâ isnât known
libdlm.c:446: error: invalid application of âsizeofâ to incomplete type 
âstruct dlm_device_versionâ
libdlm.c:448: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:449: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:450: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:452: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:453: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:454: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:443: warning: unused variable âvâ
libdlm.c: In function âdo_dlm_dispatchâ:
libdlm.c:590: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âls_lock_v6â:
libdlm.c:835: error: âstruct dlm_lock_paramsâ has no member named âxidâ
libdlm.c:837: error: âstruct dlm_lock_paramsâ has no member named âtimeoutâ
libdlm.c: In function âls_lockâ:
libdlm.c:892: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âdlm_ls_lockxâ:
libdlm.c:916: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âdlm_ls_unlockâ:
libdlm.c:1067: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âdlm_ls_deadlock_cancelâ:
libdlm.c:1099: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:1115: error: âDLM_USER_DEADLOCKâ undeclared (first use in this 
function)
libdlm.c:1115: error: (Each undeclared identifier is reported only once
libdlm.c:1115: error: for each function it appears in.)
libdlm.c: In function âdlm_ls_purgeâ:
libdlm.c:1134: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:1145: error: âDLM_USER_PURGEâ undeclared (first use in this 
function)
libdlm.c:1146: error: âunion <anonymous>â has no member named âpurgeâ
libdlm.c:1147: error: âunion <anonymous>â has no member named âpurgeâ
libdlm.c: In function âcreate_lockspaceâ:
libdlm.c:1311: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function ârelease_lockspaceâ:
libdlm.c:1415: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c: In function âdlm_kernel_versionâ:
libdlm.c:1501: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:1502: error: invalid use of undefined type âstruct 
dlm_device_versionâ
libdlm.c:1503: error: invalid use of undefined type âstruct 
dlm_device_versionâ
make[2]: *** [libdlm.o] Error 1
make[2]: Leaving directory `/usr/src/cluster-cvs/cluster/dlm/lib'
make[1]: *** [all] Error 2
make[1]: Leaving directory `/usr/src/cluster-cvs/cluster/dlm'
make: *** [all] Error 2

I guess it doesn't like the officially supported RHEL kernel (2.6.18-8.1.8).

We also are trying to get the 5.1 Beta rpms going with no success.  So 
far a kernel panic on 5.1 kernel (2.6.18-36)



Lon Hohberger wrote:
> On Thu, Aug 02, 2007 at 11:08:51AM -0500, Chris Harms wrote:
>   
>> rgmanager-2.0.24-1.el5
>>
>> I'm not sure if this is useful or not, but I had just rebooted Node B 
>> when we pulled the cables on Node A.  It is possible not all of the 
>> services / inter-node communication had completed.
>>     
>
> Could you pull from CVS (RHEL5 or 51 branches)?  The current code has a
> couple of crash bugs fixed.
>
> Note that if you store:
>
> DAEMON_COREFILE_LIMIT="unlimited"
> RGMGR_OPTS="-w"
>
> ... in /etc/sysconfig/cluster, rgmanager will generate a core file in
> the root directory.  Attaching the core to the bug report will help
> determine whether it's something already fixed in CVS.
>
> But seriously, if you see 'daemon died, rebooting' it's either user
> error (you did a 'kill -9' of only one rgmanager pid) or a bug (crash).
>
> -- Lon
>
>   




More information about the Linux-cluster mailing list