Re: ESXi 6.0u2 losing connection to targets

Hi William,

On Tue, 2017-03-07 at 10:48 -0500, William Thompson wrote:
> Please keep me in CC, I am not subscribed.
> 
> Sorry for the delay in replying.
> 
> On Sun, Feb 19, 2017 at 10:23:53PM -0800, Nicholas A. Bellinger wrote:
> > On Thu, 2017-02-16 at 07:30 -0500, William Thompson wrote:
> > > Kernel is:
> > > Linux it-iscsi 4.6.0-1-amd64 #1 SMP Debian 4.6.2-2 (2016-06-25) x86_64 GNU/Linux
> > > 
> > 
> > Thanks for your bug-report.
> > 
> > Using a v4.6.x kernel, I think there are two issues to consider.
> > First, note v4.6.x is not a long term stable kernel, so it's missing a
> > number of regression bug-fixes from the past year that you'll likely
> > encounter against ESX hosts with VAAI enabled.
> > 
> > https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/drivers/target?h=linux-4.4.y&id=60ba156dda2c11ff7a44d78ec64abd21b9813115
> > https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/drivers/target?h=linux-4.4.y&id=f318588b758514c35f0a9227195178a3b2b4b733
> > https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/drivers/target?h=linux-4.4.y&id=56661d2b89b2a549be04f37dcf824c39d7aca9c6
> 
> Debian kernel, I just upgraded to 4.9.x

Great.  To confirm, which v4.9.x kernel are you using..?

> 
> > These three are the minimum patches to v4.6.y that you'll need.
> > However, considering v4.6.y is not a long-term stable kernel, I'd very
> > much recommend instead you use the latest v4.1.y, v4.4.y or anything >=
> > v4.8.y in order to get the full set of bug-fixes from upstream.
> > 
> > Second, there is a well known ESX 5.5u2+ bug with VMFS5 that is
> > triggered by targets that support VAAI AtomicTestandSet (emulate_caw=1
> > in LIO backend attribute speak).
> > 
> > https://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2113956
> > http://cormachogan.com/2015/04/17/heads-up-ats-miscompare-detected-between-test-and-set-hb-images/
> > http://www.thevirtualist.org/alert-application-outages-using-vaai-ats-on-vsphere-5-5-update2-vsphere-6-0/
> > https://www-304.ibm.com/support/docview.wss?uid=ssg1S1005201
> > http://h20565.www2.hpe.com/hpsc/doc/public/display?sp4ts.oid=75953&docId=mmr_sf-EN_US000005979&lang=en-us&cc=us&docLocale=en_US
> > 
> > The result is that data stores will eventually go offline, if the ATS
> > heartbeat logic for VMFS5 is not explicitly disabled on all ESX hosts.
> > There have been many, many users who have hit this, and based on the
> > recommendation of all the other vendors above, it's a must disable in
> > order to get a stable working ESX 5.5u2+ + VAAI setup.
> 
> If it keeps up with 4.9.x, I'll take a look at this.
> 

Just a heads up.  The second issue mentioned here is *not* a LIO target
issue, but a well-known ESX host side issue that requires you to
manually disable the ATS heartbeat on all of your ESX hosts using VAAI.

Note this is required in order to get a stable setup.

The ESX side instructions for doing this are:

# esxcli system settings advanced set -i 0 -o /VMFS3/UseATSForHBOnVMFS5
# esxcli system settings advanced list -o /VMFS3/UseATSForHBOnVMFS5
   Path: /VMFS3/UseATSForHBOnVMFS5
   Type: integer
   Int Value: 0
   Default Int Value: 1
   Min Value: 0
   Max Value: 1
   String Value:
   Default String Value:
   Valid Characters:
   Description: Use ATS for HB on ATS supported VMFS5 volumes
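
Since the setting has to be changed on every ESX host, it can help to
check the listing mechanically rather than by eye.  The following is
only a rough sketch, assuming a POSIX sh with awk is available wherever
the listing is inspected; ats_hb_disabled is a hypothetical helper name
of my own, not something esxcli provides.  It reads the "list" output
shown above on stdin and succeeds only when "Int Value" is 0:

```shell
# Sketch only: ats_hb_disabled is a hypothetical helper, not part of esxcli.
# It reads `esxcli system settings advanced list -o ...` output on stdin and
# returns 0 only if the current "Int Value" field is 0 (ATS heartbeat off).
ats_hb_disabled() {
    awk -F': *' '
        # Match the current value only; "Default Int Value" does not match
        # because the line must start with optional blanks then "Int Value:".
        /^[[:space:]]*Int Value:/ { found = 1; value = $2 }
        END { exit !(found && value == 0) }
    '
}
```

Usage on a host would then look something like:

# esxcli system settings advanced list -o /VMFS3/UseATSForHBOnVMFS5 | ats_hb_disabled && echo "ATS heartbeat disabled"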
