Re: RHEL6.2: path failures during good path I/O

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



So, I've started on my RHEL6.1 system filesystem I/O against 5 multipath devices. Each multipath device contains 4 partition - two of them were mounted and used for fs-I/O. The exerciser was running for appr. 2 hours without a single path failure message. Then I'v started block I/O against the other 5 multipath devices.
Pretty soon the first path failure was reported:

Jun 18 12:51:18 jabulan-lp5 root: block IO starts here
Jun 18 12:51:34 jabulan-lp5 root: ./blocktest.pl *************** kaupie:./blocktest.pl test starting *************** Jun 18 12:51:34 jabulan-lp5 root: ./blocktest.pl *************** Starting pass 1 *************** Jun 18 12:51:34 jabulan-lp5 root: ./blocktest.pl *************** function function_512k_RI starting *************** Jun 18 12:52:12 jabulan-lp5 multipathd: mpathk: sde - directio checker reports path is down Jun 18 12:52:12 jabulan-lp5 multipathd: checker failed path 8:64 in map mpathk
Jun 18 12:52:12 jabulan-lp5 multipathd: mpathk: remaining active paths: 1
Jun 18 12:52:12 jabulan-lp5 kernel: device-mapper: multipath: Failing path 8:64. Jun 18 12:52:18 jabulan-lp5 multipathd: mpathk: sde - directio checker reports path is down Jun 18 12:52:23 jabulan-lp5 multipathd: mpathk: sde - directio checker reports path is up
Jun 18 12:52:23 jabulan-lp5 multipathd: 8:64: reinstated
Jun 18 12:52:23 jabulan-lp5 multipathd: mpathk: remaining active paths: 2
Jun 18 12:53:53 jabulan-lp5 multipathd: mpathk: sde - directio checker reports path is down Jun 18 12:53:53 jabulan-lp5 multipathd: checker failed path 8:64 in map mpathk
Jun 18 12:53:53 jabulan-lp5 multipathd: mpathk: remaining active paths: 1
Jun 18 12:53:53 jabulan-lp5 kernel: device-mapper: multipath: Failing path 8:64. Jun 18 12:53:58 jabulan-lp5 multipathd: mpathk: sde - directio checker reports path is down Jun 18 12:54:03 jabulan-lp5 multipathd: mpathk: sde - directio checker reports path is up
Jun 18 12:54:03 jabulan-lp5 multipathd: 8:64: reinstated
Jun 18 12:54:03 jabulan-lp5 multipathd: mpathk: remaining active paths: 2
Jun 18 12:55:04 jabulan-lp5 multipathd: mpathi: sdj - directio checker reports path is down Jun 18 12:55:04 jabulan-lp5 multipathd: checker failed path 8:144 in map mpathi
Jun 18 12:55:04 jabulan-lp5 multipathd: mpathi: remaining active paths: 1
Jun 18 12:55:04 jabulan-lp5 kernel: device-mapper: multipath: Failing path 8:144.


No I/O error messages were logged...following the dmesg output after filesystem creation (preparing the partitions for fs-I/O) until block-I/O got started:

EXT3-fs (dm-39): using internal journal
EXT3-fs (dm-39): mounted filesystem with ordered data mode
kjournald starting.  Commit interval 5 seconds
EXT3-fs (dm-25): using internal journal
EXT3-fs (dm-25): mounted filesystem with ordered data mode
kjournald starting.  Commit interval 5 seconds
EXT3-fs (dm-22): using internal journal
EXT3-fs (dm-22): mounted filesystem with ordered data mode
kjournald starting.  Commit interval 5 seconds
EXT3-fs (dm-46): using internal journal
EXT3-fs (dm-46): mounted filesystem with ordered data mode
kjournald starting.  Commit interval 5 seconds
EXT3-fs (dm-38): using internal journal
EXT3-fs (dm-38): mounted filesystem with ordered data mode
device-mapper: multipath: Failing path 8:64.
device-mapper: multipath: Failing path 8:64.
device-mapper: multipath: Failing path 8:144.
device-mapper: multipath: Failing path 8:32.
device-mapper: multipath: Failing path 8:32.
device-mapper: multipath: Failing path 8:144.
[root@jabulan-lp5 ~]#


Really strange...



Am 18.06.2012 12:03, schrieb Christian May:
Sorry, I did a mistake when testing with RHEL6.1....the multipath daemon was stopped. So, also with RHEL6.1. I've noticed path failures. I'm trying to do some more testing - maybe it's related to I/O workload. For my regular tests I'm attaching 10 SCSI LUns to the host system via two pathes. On each multipath device 4 partitions will be created. Five multipath devices are going to be used for filesystem I/O, the remaining five for block I/O. Currently I have just fs-I/O started and so far - 1h - no path failures occured. I will start block I/O later....

My guess why it could be device-mapper related is based on SLES11SP1 bugzilla 62249 multipath-tools: multipath device path failure without error injection.



Am 14.06.2012 23:19, schrieb Mike Snitzer:
On Thu, Jun 14 2012 at  3:15pm -0400,
Christian May<cmay@xxxxxxxxxxxxxxxxxx>  wrote:

I couldn't recreate the path failures on the exact same test setup
with RHEL6.1.

Something must have been changed in the device-mapper package!?
It is good to know 6.1 works and 6.2 doesn't for you.  But I'm not
understanding why you're thinking it is a device-mapper-multipath
issue (not kernel issue).

You could do a couple things:
1) run the RHEL6.2 kernel on a RHEL6.1 install
2) install RHEL6.1 device-mapper-multipath package on RHEL6.2 install

If both work then it implicates the RHEL6.2 device-mapper-multipath
package.

But all being said, please just file a BZ at bugzilla.redhat.com



--
dm-devel mailing list
dm-devel@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/dm-devel



--
dm-devel mailing list
dm-devel@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/dm-devel


[Index of Archives]     [DM Crypt]     [Fedora Desktop]     [ATA RAID]     [Fedora Marketing]     [Fedora Packaging]     [Fedora SELinux]     [Yosemite Discussion]     [KDE Users]     [Fedora Docs]

  Powered by Linux