Description of problem:
clvmd init script permanently hangs (can't be interrupted)
on a 64-bit (x86_64) node when the same command is done
simultaneously by three 32-bit nodes in a cluster.
The 32-bit nodes work fine, but the 64-bit node hangs.
Version-Release number of selected component (if applicable):
FC6test1
How reproducible:
I've gotten this hang several times, and am sure I can do it again.
Steps to Reproduce:
1. Hard power-cycle all four nodes in a cluster of three i686's
and one x86_64.
2. Using cssh, do "service cman start" simultaneously.
3. Using cssh, use group_tool -v to make sure all nodes are
talking properly in the cluster.
4. Using cssh, do "service clvmd start" simultaneously.
Actual results:
Starting clvmd: [ OK ]
(followed by a hang that's uninterruptible by <ctrl-c>)
The clvmd service hangs on the 64-bit nodes, but not the others.
Expected results:
Starting clvmd: [ OK ]
Activating VGs: 2 logical volume(s) in volume group "VolGroup00" now active
3 logical volume(s) in volume group "Smoke_Cluster" now active
[ OK ]
Additional info:
I was using the "smoke" cluster in the Minneapolis lab:
camel, merit, winston and kool. Kool is the x86_64 node that
hangs. All machines have built LVM2 with clustering, and all
share the same SAN.
I used the sysrq trigger to dump the state of all tasks on the system,
and I'm adding that as an attachment.

Created attachment 133640[details]
Sysrq output from yesterday's hang doing the same thing
This output is from the same sequence of events I did yesterday.
This time, the hang appears to be in vgchange rather than vgscan,
but the result is the same: a hung "service clvmd start" command
that can't be interrupted.

Lenny, RHEL4 has a completely different DLM so that patch is not relevant. The
cause will be something different
Could you please open up a new bug against RHEL4 and post as much evidence as
you can find: syslogs, /proc/cluster/dlm_debug & /proc/cluster/services.
Thanks.