The reason 490 seems to be failing is that it is using the blocksize as allocation size and then doing truncate/seek. Could be buggy test case but easier to just set it to 2MB which solves both problems. On Wed, Feb 20, 2019 at 8:31 PM Steve French <smfrench@xxxxxxxxx> wrote: > > The problem with dropping the patch is that fixes another critical > problem where cp and scp can go VERY slowly (since they use blocksize > to determine the I/O size to use). 1MB is probably the minimum we > want to set (which is what NFS uses) > > On Wed, Feb 20, 2019 at 7:48 PM ronnie sahlberg > <ronniesahlberg@xxxxxxxxx> wrote: > > > > This patch is also the reason why generic/490 fails, so lets drop it for now. > > > > On Thu, Feb 21, 2019 at 10:22 AM Steve French <smfrench@xxxxxxxxx> wrote: > > > > > > We could also try with 1mb instead of 4mb for uncached I/o > > > > > > On Wed, Feb 20, 2019, 18:12 ronnie sahlberg <ronniesahlberg@xxxxxxxxx> wrote: > > >> > > >> What I mean is that generic/013 still works, it just takes ~3 times > > >> as long to run so the buildbot is timing it out. > > >> > > >> Steve, > > >> Can we revert d3de48233978524514d3b605ad55bb21d1ecd706 from for-next > > >> and re-run the buildbot and see it it now passes? > > >> > > >> > > >> On Thu, Feb 21, 2019 at 10:09 AM Ronnie Sahlberg <lsahlber@xxxxxxxxxx> wrote: > > >> > > > >> > I ran a git bisect and the performance drop came with commit d3de48233978524514d3b605ad55bb21d1ecd706 > > >> > > > >> > > > >> > ----- Original Message ----- > > >> > From: "ronnie sahlberg" <ronniesahlberg@xxxxxxxxx> > > >> > To: "Steve French" <smfrench@xxxxxxxxx> > > >> > Cc: "Pavel Shilovsky" <piastryyy@xxxxxxxxx>, "Aurélien Aptel" <aaptel@xxxxxxxx>, "CIFS" <linux-cifs@xxxxxxxxxxxxxxx> > > >> > Sent: Thursday, 21 February, 2019 8:47:44 AM > > >> > Subject: Re: xfstests and current cifs for-next patch set > > >> > > > >> > generic/013 and generic/014 failed with timeout. These tests used to > > >> > take several minutes. > > >> > > > >> > Testing locally, generic/013 is successful but it takes almost > > >> > three times longer than it used to just a week ago. > > >> > > > >> > On Thu, Feb 21, 2019 at 7:18 AM Steve French <smfrench@xxxxxxxxx> wrote: > > >> > > > > >> > > Looks like Pavel's latest fix (unrelated to credits it turns out, the > > >> > > problem in this case was skipping a mid) does fix xfstest 310. Azure > > >> > > test bucket passes, no reconnects that I spotted: > > >> > > > > >> > > http://smb3-test-rhel-75.southcentralus.cloudapp.azure.com/#/builders/4/builds/94 > > >> > > > > >> > > Running cifs-testing buildbot bucket now. > > >> > > > > >> > > http://smb3-test-rhel-75.southcentralus.cloudapp.azure.com/#/builders/2/builds/134 > > >> > > > > >> > > On Sun, Feb 17, 2019 at 1:30 PM Steve French <smfrench@xxxxxxxxx> wrote: > > >> > > > > > >> > > > Retrying the same test run it worked. Rerunning the same set of > > >> > > > patches but this time with larger (cifs-testing) collection of tests > > >> > > > on the buildbot > > >> > > > > > >> > > > On Sat, Feb 16, 2019 at 10:38 PM Steve French <smfrench@xxxxxxxxx> wrote: > > >> > > > > > > >> > > > > The test (310 and subsequent) seemed to start failing with this in dmesg: > > >> > > > > > > >> > > > > [root@fedora29 ~]# dmesg > > >> > > > > [ 2969.016552] CIFS VFS: Cancelling wait for mid 29640 cmd: 14 > > >> > > > > [ 2979.449426] CIFS VFS: disabling echoes and oplocks > > >> > > > > [ 2999.109655] CIFS VFS: Cancelling wait for mid 1494 cmd: 6 > > >> > > > > [ 3225.207488] CIFS VFS: Server > > >> > > > > linuxsmb3testshares.file.core.windows.net has not responded in 120 > > >> > > > > seconds. Reconnecting... > > >> > > > > > > >> > > > > On Sat, Feb 16, 2019 at 8:30 PM Steve French <smfrench@xxxxxxxxx> wrote: > > >> > > > > > > > >> > > > > > So (unless there is a random factor involved) - I narrowed it down to this patch > > >> > > > > > > > >> > > > > > Author: Pavel Shilovsky <pshilov@xxxxxxxxxxxxx> > > >> > > > > > Date: Wed Jan 16 11:12:41 2019 -0800 > > >> > > > > > > > >> > > > > > CIFS: Respect reconnect in MTU credits calculations > > >> > > > > > > > >> > > > > > Every time after a session reconnect we don't need to account for > > >> > > > > > credits obtained in previous sessions. Introduce new struct cifs_credits > > >> > > > > > which contains both credits value and reconnect instance of the > > >> > > > > > time those credits were taken. Modify a routine that add credits > > >> > > > > > back to handle the reconnect instance by assuming zero credits > > >> > > > > > if the reconnect happened after the credits were obtained and > > >> > > > > > before we decided to add them back due to some errors during sending. > > >> > > > > > > > >> > > > > > This patch fixes the MTU credits cases. The subsequent patch > > >> > > > > > will handle non-MTU ones. > > >> > > > > > > > >> > > > > > Signed-off-by: Pavel Shilovsky <pshilov@xxxxxxxxxxxxx> > > >> > > > > > Signed-off-by: Steve French <stfrench@xxxxxxxxxxxxx> > > >> > > > > > > > >> > > > > > > > >> > > > > > ---------- Forwarded message --------- > > >> > > > > > From: Steve French <smfrench@xxxxxxxxx> > > >> > > > > > Date: Sat, Feb 16, 2019 at 6:46 PM > > >> > > > > > Subject: Re: xfstests and current cifs for-next patch set > > >> > > > > > To: CIFS <linux-cifs@xxxxxxxxxxxxxxx> > > >> > > > > > > > >> > > > > > > > >> > > > > > Narrowed the xfstest 310 possible regression in current for-next down > > >> > > > > > to three patches, rerunning with this one of the three added (see > > >> > > > > > http://smb3-test-rhel-75.southcentralus.cloudapp.azure.com/#/builders/4/builds/85) > > >> > > > > > > > >> > > > > > Author: Pavel Shilovsky <pshilov@xxxxxxxxxxxxx> > > >> > > > > > Date: Wed Jan 16 11:12:41 2019 -0800 > > >> > > > > > > > >> > > > > > CIFS: Respect reconnect in MTU credits calculations > > >> > > > > > > > >> > > > > > On Sat, Feb 16, 2019 at 1:40 PM Steve French <smfrench@xxxxxxxxx> wrote: > > >> > > > > > > > > >> > > > > > > With 5.0-rc5 and current for-next (29 paches) two tests 310 (read and > > >> > > > > > > readdir simultaneously) and 422 (delayed allocation stat, number of > > >> > > > > > > blocks) fail I see this in the azure test bucket in the buildbot). > > >> > > > > > > see this run: http://smb3-test-rhel-75.southcentralus.cloudapp.azure.com/#/builders/4/builds/80 > > >> > > > > > > > > >> > > > > > > These don't fail when I select only the first 8 cifs fixes in for-next > > >> > > > > > > ontop of 5.0-rc5. See > > >> > > > > > > http://smb3-test-rhel-75.southcentralus.cloudapp.azure.com/#/builders/4/builds/82 > > >> > > > > > > so am trying to narrow it down. This run (in progress) > > >> > > > > > > http://smb3-test-rhel-75.southcentralus.cloudapp.azure.com/#/builders/4/builds/83 > > >> > > > > > > has the first 19 (of the 29) cifs patches (ontop of 5.0-rc5 mainline > > >> > > > > > > as with the runs above) so we can bisect which commit causes the > > >> > > > > > > problem with tests 310 and 422. > > >> > > > > > > > > >> > > > > > > This seems unrelated to the problem I see in slightly more current > > >> > > > > > > mainline (that we can see with no cifs changes) in xfstest 422 that > > >> > > > > > > was introduced with 5.0-rc6. > > >> > > > > > > > > >> > > > > > > Let me know if others (or other scenario problems) see the tests > > >> > > > > > > 310/422 failure. > > >> > > > > > > > > >> > > > > > > -- > > >> > > > > > > Thanks, > > >> > > > > > > > > >> > > > > > > Steve > > >> > > > > > > > >> > > > > > > > >> > > > > > > > >> > > > > > -- > > >> > > > > > Thanks, > > >> > > > > > > > >> > > > > > Steve > > >> > > > > > > > >> > > > > > > > >> > > > > > -- > > >> > > > > > Thanks, > > >> > > > > > > > >> > > > > > Steve > > >> > > > > > > >> > > > > > > >> > > > > > > >> > > > > -- > > >> > > > > Thanks, > > >> > > > > > > >> > > > > Steve > > >> > > > > > >> > > > > > >> > > > > > >> > > > -- > > >> > > > Thanks, > > >> > > > > > >> > > > Steve > > >> > > > > >> > > > > >> > > > > >> > > -- > > >> > > Thanks, > > >> > > > > >> > > Steve > > > > -- > Thanks, > > Steve -- Thanks, Steve