[linux-cluster] multipath issue... Smells of hardware issue.


Hi,

I have a setup with two identical RX200s3 FuSi servers talking to a SAN (SX60 + extra controller), and that works fine with gfs1.

I do, however, see some errors on one of the servers. They show up in my message log only now and then (always under load, but I can't reproduce the error by loading the server deliberately).

The error says:
Jun 28 15:44:17 app02 multipathd: 8:16: mark as failed
Jun 28 15:44:17 app02 multipathd: main_disk_volume1: remaining active paths: 1
Jun 28 15:44:17 app02 kernel: sd 2:0:0:0: SCSI error: return code = 0x00070000
Jun 28 15:44:17 app02 kernel: end_request: I/O error, dev sdb, sector 705160231
Jun 28 15:44:17 app02 kernel: device-mapper: multipath: Failing path 8:16.
Jun 28 15:44:22 app02 multipathd: sdb: readsector0 checker reports path is up
Jun 28 15:44:22 app02 multipathd: 8:16: reinstated
Jun 28 15:44:22 app02 multipathd: main_disk_volume1: remaining active paths: 2
Jun 28 15:46:02 app02 multipathd: 8:32: mark as failed
Jun 28 15:46:02 app02 multipathd: main_disk_volume1: remaining active paths: 1
Jun 28 15:46:02 app02 kernel: sd 3:0:0:0: SCSI error: return code = 0x00070000
Jun 28 15:46:02 app02 kernel: end_request: I/O error, dev sdc, sector 739870727
Jun 28 15:46:02 app02 kernel: device-mapper: multipath: Failing path 8:32.
Jun 28 15:46:06 app02 multipathd: sdc: readsector0 checker reports path is up
Jun 28 15:46:06 app02 multipathd: 8:32: reinstated
Jun 28 15:46:06 app02 multipathd: main_disk_volume1: remaining active paths: 2
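
For reference, the return code in the kernel lines can be split into its four bytes. The sketch below assumes the usual Linux SCSI midlayer layout (driver byte, host byte, message byte, status byte, from high to low) and the host-byte names from the kernel's scsi.h; the function name decode_scsi_result is only illustrative. Under that assumption 0x00070000 has host byte 0x07, DID_ERROR, meaning the host adapter driver reported a generic error rather than the disk returning a SCSI status.

```python
# Host-byte codes as defined in the Linux kernel's include/scsi/scsi.h.
HOST_BYTE_NAMES = {
    0x00: "DID_OK",
    0x01: "DID_NO_CONNECT",
    0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT",
    0x04: "DID_BAD_TARGET",
    0x05: "DID_ABORT",
    0x06: "DID_PARITY",
    0x07: "DID_ERROR",
    0x08: "DID_RESET",
    0x09: "DID_BAD_INTR",
}


def decode_scsi_result(result):
    """Split a 32-bit SCSI result word into its four bytes (illustrative helper)."""
    host = (result >> 16) & 0xFF
    return {
        "status_byte": result & 0xFF,         # SCSI status from the target
        "msg_byte": (result >> 8) & 0xFF,     # message byte
        "host_byte": host,                    # set by the host adapter driver
        "driver_byte": (result >> 24) & 0xFF, # set by the midlayer
        "host_name": HOST_BYTE_NAMES.get(host, "unknown"),
    }


if __name__ == "__main__":
    print(decode_scsi_result(0x00070000))
```

A zero status byte together with a non-zero host byte points at the transport side (HBA, driver, cable, or SAN port) rather than at the disk itself, though it does not say which of those is at fault.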

To me it looks like a fiber link that bounces up and down. (There is no switch involved.)

Sometimes I only get a slightly shorter version:
Jun 29 09:04:32 app02 kernel: sd 2:0:0:0: SCSI error: return code = 0x00070000
Jun 29 09:04:32 app02 kernel: end_request: I/O error, dev sdb, sector 2782490295
Jun 29 09:04:32 app02 kernel: device-mapper: multipath: Failing path 8:16.
Jun 29 09:04:32 app02 multipathd: 8:16: mark as failed
Jun 29 09:04:32 app02 multipathd: main_disk_volume1: remaining active paths: 1
Jun 29 09:04:37 app02 multipathd: sdb: readsector0 checker reports path is up
Jun 29 09:04:37 app02 multipathd: 8:16: reinstated
Jun 29 09:04:37 app02 multipathd: main_disk_volume1: remaining active paths: 2
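
To see whether one path flaps more than the other, the multipathd lines can be tallied per path. This is only a rough sketch: it assumes the exact line format shown in these excerpts, the function name summarize_flaps is made up for illustration, and since syslog timestamps carry no year, a placeholder year is added purely so the timestamps can be parsed.

```python
import re
from collections import defaultdict
from datetime import datetime

# Matches lines such as:
#   "Jun 28 15:44:17 app02 multipathd: 8:16: mark as failed"
#   "Jun 28 15:44:22 app02 multipathd: 8:16: reinstated"
EVENT_RE = re.compile(
    r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d) \S+ multipathd: "
    r"(\d+:\d+): (mark as failed|reinstated)$"
)

PLACEHOLDER_YEAR = "2000"  # assumption: syslog omits the year


def summarize_flaps(lines):
    """Return {path: [seconds between failure and reinstatement, ...]}."""
    failed_at = {}
    outages = defaultdict(list)
    for line in lines:
        match = EVENT_RE.match(line.strip())
        if not match:
            continue  # kernel lines and "remaining active paths" are skipped
        stamp, path, event = match.groups()
        when = datetime.strptime(
            f"{PLACEHOLDER_YEAR} {stamp}", "%Y %b %d %H:%M:%S"
        )
        if event == "mark as failed":
            failed_at[path] = when
        elif path in failed_at:
            outages[path].append((when - failed_at.pop(path)).total_seconds())
    return dict(outages)


if __name__ == "__main__":
    # Path to the log is an assumption; adjust to wherever syslog writes.
    with open("/var/log/messages") as log:
        for path, seconds in summarize_flaps(log).items():
            print(path, len(seconds), "flaps, outages (s):", seconds)
```

If the failures turn out to be spread evenly over both paths (8:16 and 8:32), a single cable or HBA port becomes a less likely culprit than something the two paths share.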


Any suggestions, other than to start swapping hardware?

Mvh / Kind regards

Kristoffer Lippert
Systemansvarlig
JP/Politiken A/S
Online Magasiner

Tlf. +45 8738 3032
Cell. +45 6062 8703

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
