Re: Desperate for Help - Cluster Node randomly reboots

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 26/11/14 09:24, Tan Ban Wee wrote:
Hi,

This is a 2 nodes cluster and they are randomly rebooting itself. I hope
someone can help me to narrow down to the cause.

Nov 25 23:40:17 qdiskd Node 1 missed an update (3/4)

Nov 25 23:40:18 qdiskd Node 1 missed an update (4/4)

Nov 25 23:40:19 qdiskd Node 1 missed an update (5/4)

Nov 25 23:40:19 qdiskd Node 1 DOWN

Nov 25 23:40:19 qdiskd Writing eviction notice for node 1

Nov 25 23:40:19 qdiskd Telling CMAN to kill the node

Nov 25 23:40:20 qdiskd Node 1 evicted

Nov 25 23:44:19 qdiskd Node 1 is UP

Nov 25 23:44:20 qdiskd Node 1 shutdown

Nov 25 23:44:26 qdiskd Node 1 is UP

Nov 25 23:44:37 qdiskd Node 1 missed an update (2/4)

Nov 25 23:44:38 qdiskd Node 1 missed an update (3/4)

Nov 25 23:44:39 qdiskd Node 1 missed an update (4/4)

Nov 25 23:44:40 qdiskd Node 1 missed an update (5/4)

Nov 25 23:44:40 qdiskd Node 1 DOWN

Nov 25 23:44:40 qdiskd Writing eviction notice for node 1

Nov 25 23:44:40 qdiskd Telling CMAN to kill the node

Nov 25 23:44:41 qdiskd Node 1 evicted

Nov 25 23:50:48 qdiskd Loading dynamic configuration

Nov 25 23:50:49 qdiskd Setting autocalculated votes to 1

Nov 25 23:50:49 qdiskd Loading static configuration

Nov 25 23:50:49 qdiskd Auto-configured TKO as 4 based on token=10000
interval=1

Nov 25 23:50:49 qdiskd Timings: 4 tko, 1 interval

Nov 25 23:50:49 qdiskd Timings: 2 tko_up, 3 master_wait, 2 upgrade_wait

Nov 25 23:50:49 qdiskd Heuristic: 'ping -c3 -w5 10.101.210.250' score=1
interval=3 tko=5

Nov 25 23:50:49 qdiskd 1 heuristics loaded

Nov 25 23:50:49 qdiskd Quorum Daemon: 1 heuristics, 1 interval, 4 tko, 1
votes

Nov 25 23:50:49 qdiskd Run Flags: 00000231

Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad

Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512

Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad

Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512

Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad

Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512

Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad

Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512

Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad

Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512

Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad


From that limited information I would guess that your quorum disk partition is either offline or corrupted. First check that the drive is online and if it seems OK physically then check that it's not been formatted as a filesystem or something else by mistake and rebuild the header using mkqdisk.

Chrissie


_______________________________________________
discuss mailing list
discuss@xxxxxxxxxxxx
http://lists.corosync.org/mailman/listinfo/discuss




[Index of Archives]     [Linux Clusters]     [Corosync Project]     [Linux USB Devel]     [Linux Audio Users]     [Photo]     [Yosemite News]    [Yosemite Photos]    [Linux Kernel]     [Linux SCSI]     [X.Org]

  Powered by Linux