Re: [PATCH 0/5] xfs: quota deadlock fixes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Feb 17, 2017 at 12:54:54PM -0500, Brian Foster wrote:
> On Fri, Feb 17, 2017 at 02:53:15PM +0800, Eryu Guan wrote:
> > On Wed, Feb 15, 2017 at 10:40:42AM -0500, Brian Foster wrote:
> > > Hi all,
> > > 
> > > This is a collection of several quota related deadlock fixes for
> > > problems that have been reported to the list recently.
> > > 
> > > Patch 1 fixes the low memory quotacheck problem reported by Martin[1].
> > > Dave is CC'd as he had comments on this particular thread that started a
> > > discussion, but I hadn't heard anything back since my last response.
> > > 
> > > Patch 2 fixes a separate problem I ran into while attempting to
> > > reproduce Eryu's xfs/305 hang report[2]. 
> > > 
> > > Patches 3-5 fix the actual problem reported by Eryu, which is a quotaoff
> > > deadlock reproduced by xfs/305.
> > > 
> > > Further details are included in the individual commit log descriptions.
> > > Thoughts, reviews, flames appreciated.
> > > 
> > > Eryu,
> > > 
> > > I've run several hundred iterations of this on your reproducer system
> > > without reproducing the hang. I have reproduced a reset overnight but
> > > still haven't been able to grab a stack trace from that occurrence (I'll
> > > try again today/tonight with better console logging). I suspect this is
> > 
> > I hit a NULL pointer dereference while testing your fix, I was running
> > xfs/305 for 1000 iterations and host crashed at the 639th run. Not sure
> > if it's the same issue you've met here. I posted dmesg log at the end of
> > mail. I haven't tried to see if I can reproduce it on stock linus tree
> > yet.
> > 
> 
> Interesting, thanks. I don't know for sure because I didn't hit anything
> on my second overnight run, but I wouldn't be surprised if it's the
> same, particularly if you hit this again. This does look like an
> independent problem to me, though. A kdump might be nice, if possible,
> given the difficulty to reproduce...

Unfortunately, my second round of 1000 iteration run hit hang too, at
the 824th loop.  Test configuration is all default, crc enabled XFS with
4k block size, no rmapbt no reflink no finobt no sparse inode.

I attached the dmesg log and sysrq-w output. I also left the host in the
hang state, you can login and take a look if you have interest.

Thanks,
Eryu

Attachment: upstream-4.10-rc4-xfs-305-testpatch-sysrq-w.log.bz2
Description: BZip2 compressed data


[Index of Archives]     [XFS Filesystem Development (older mail)]     [Linux Filesystem Development]     [Linux Audio Users]     [Yosemite Trails]     [Linux Kernel]     [Linux RAID]     [Linux SCSI]


  Powered by Linux