Re: Glusto failures with dispersed volumes + Samba

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I've retried it more than a couple of times and the failures are consistent.

On Wed, Jul 5, 2017 at 6:26 PM, Amar Tumballi <atumball@xxxxxxxxxx> wrote:


On Wed, Jul 5, 2017 at 6:16 PM, Ashish Pandey <aspandey@xxxxxxxxxx> wrote:
Hi Nigel,

As Pranith has already mentioned, we are getting different gfid's in loc and loc->inode.
It looks like issue with DHT. If a re validate fails for gfid, a fresh look up should be done.

I don't know if it is related or not but a similar bug was fixed by Pranith
https://review.gluster.org/#/c/16986/

Ashish


Thanks for this info Ashish & Pranith. Also thanks for looking into this Anoop.

Nigel, lets retry these things, and see if its still the case! if not great, but if it is, then I will help you sort this out!

Regards,
Amar
 



From: "Pranith Kumar Karampuri" <pkarampu@xxxxxxxxxx>
To: "Anoop C S" <anoopcs@xxxxxxxxxxxxx>
Cc: "gluster-devel" <gluster-devel@xxxxxxxxxxx>
Sent: Thursday, June 29, 2017 7:36:45 PM
Subject: Re: [Gluster-devel] Glusto failures with dispersed volumes + Samba




On Thu, Jun 29, 2017 at 6:49 PM, Anoop C S <anoopcs@xxxxxxxxxxxxx> wrote:
On Thu, 2017-06-29 at 16:35 +0530, Nigel Babu wrote:
> Hi Pranith and Xavi,
>
> We seem to be running into a problem with glusto tests when we try to run them against dispersed
> volumes over a CIFS mount[1].

Is this a new test case? If not was it running successfully before?

> You can find the logs attached to the job [2].

VFS stat call failures are seen in Samba logs:

[2017/06/29 11:01:55.959374,  0] ../source3/modules/vfs_glusterfs.c:870(vfs_gluster_stat)
  glfs_stat(.) failed: Invalid argument

I could also see the following errors(repeatedly..) in glusterfs client logs:

[2017-06-29 10:33:43.031198] W [MSGID: 122019] [ec-helpers.c:412:ec_loc_gfid_check] 0-
testvol_distributed-dispersed-disperse-0: Mismatching GFID's in loc
[2017-06-29 10:33:43.031303] I [MSGID: 109094] [dht-common.c:1016:dht_revalidate_cbk] 0-
testvol_distributed-dispersed-dht: Revalidate: subvolume testvol_distributed-dispersed-disperse-0
for /user11 (gfid = 665c515b-3940-480f-af7c-6aaf37731eaa) returned -1 [Invalid argument]

This log basically says that EC received loc which has different gfids in loc->inode->gfid and loc->gfid.
 

> I've triggered a fresh job[3] to confirm that it only fails in these particular conditions and
> certainly seems to be the case. The job is currently ongoing, so you may want to take a look when
> you get some time how this job went.
>
> Let me know if you have any questions or need more debugging information. 
>
> [1]: https://ci.centos.org/job/gluster_glusto/325/testReport/
> [2]: https://ci.centos.org/job/gluster_glusto/325/artifact/
> [3]: https://ci.centos.org/job/gluster_glusto/326/console
>
>
> _______________________________________________
> Gluster-devel mailing list
> Gluster-devel@xxxxxxxxxxx
> http://lists.gluster.org/mailman/listinfo/gluster-devel



--
Pranith

_______________________________________________
Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
http://lists.gluster.org/mailman/listinfo/gluster-devel


_______________________________________________
Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
http://lists.gluster.org/mailman/listinfo/gluster-devel



--
Amar Tumballi (amarts)



--
nigelb
_______________________________________________
Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
http://lists.gluster.org/mailman/listinfo/gluster-devel

[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux