On Wed, Oct 12, 2005 at 04:28:25PM +1000, Bojan Smojver wrote:
> Quoting Bojan Smojver <bojan@xxxxxxxxxxxxx>:
>
> >I have a 5 node experimental cluster running RHEL4 U1 and GFS 6.1.0. I
> >upgraded one box to RHEL U2 (kernel 2.6.9-22.ELsmp) and to GFS 6.1.2.
> >When the box boots up with the new kernel and GFS, it joins the cluster
> >OK (I can see that on other members), but clvmd and fenced won't start,
> >so the system hangs.
> >
> >Did anyone else experience similar stuff? Or is this intentional (i.e.
> >is the new version of GFS/cluster binary incompatible with U1 version)?
>
> BTW, this is what I get on the upgraded machine when I attempt to start
> fenced:
>
> Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=3
> Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=2
> Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=4
> Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=1
>
> Fenced never starts...

A bug fix required a minor change to the cman/sm message formats between
U1 and U2 that makes the two versions incompatible, so all nodes need to
be running the U2 version.

Dave

--
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
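
For anyone hitting the same messages during a partial upgrade, here is a minimal Python sketch that pulls the node IDs out of the "process_reply invalid" lines quoted above. It is not part of cman or GFS: the function name rejected_reply_nodes and the regular expression are hypothetical, and it assumes the kernel messages look exactly like the four lines in the report. The IDs it lists are presumably the nodes still running the U1 version, though the log alone does not confirm that.

```python
import re
from collections import Counter

# Matches the kernel message quoted in the report; the exact format is an
# assumption based only on those four log lines.
PATTERN = re.compile(r"SM: process_reply invalid id=(\d+)\s+nodeid=(\d+)")


def rejected_reply_nodes(log_text):
    """Count 'process_reply invalid' messages per sending node ID."""
    # Collapse whitespace so a message wrapped across two lines still matches.
    flat = " ".join(log_text.split())
    return Counter(int(m.group(2)) for m in PATTERN.finditer(flat))


sample = """\
Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=3
Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=2
Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=4
Oct 12 16:24:42 matrix1-5 kernel: SM: process_reply invalid id=0 nodeid=1
"""

counts = rejected_reply_nodes(sample)
print(sorted(counts))  # -> [1, 2, 3, 4]
```

Run against the syslog of the upgraded node, the sorted list gives the members to check and upgrade so that every node ends up on U2.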