Re: Input/output error - would not heal


 



----- Original Message -----
From: "lejeczek" <peljasz@xxxxxxxxxxx>
To: "Nag Pavan Chilakam" <nchilaka@xxxxxxxxxx>
Cc: gluster-users@xxxxxxxxxxx
Sent: Wednesday, 8 February, 2017 7:15:29 PM
Subject: Re:  Input/output error - would not heal



On 08/02/17 06:11, Nag Pavan Chilakam wrote:
> The output of "gluster volume info" and "gluster vol status" would help us debug faster.
>
> However, coming to the gfid mismatch: yes, the file "abbreviations.log" (I assume the copy on the other brick is also "abbreviations.log" and not "bbreviations.log", i.e. a typo?) has a gfid mismatch, which leads to the I/O error (gfid split-brain).
> Resolving data and metadata split-brains from the backend brick is not recommended.
> But in the case of a gfid split-brain (like the file abbreviations.log), the only method available is to resolve it from the backend brick.
> You can read more about this in http://gluster.readthedocs.io/en/latest/Troubleshooting/split-brain/?highlight=gfid (the "Fixing Directory entry split-brain" section).
> (There is already a bug open for resolving gfid split-brain using the CLI.)
>
>   
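A small aside on reading the gfid values: the trusted.gfid xattrs quoted further down in this thread are the same IDs the log reports, just without dashes. The sketch below (Python, with helper names made up for illustration) converts the two abbreviations.log values to the dashed form and also builds the path where a brick normally keeps its gfid link, assuming the usual .glusterfs/<first two hex digits>/<next two>/<gfid> layout. Please verify that layout on your own bricks before removing anything.

```python
import uuid


def gfid_from_xattr(hex_value):
    """Turn a getfattr -e hex value such as 0x773b... into a UUID."""
    digits = hex_value[2:] if hex_value.lower().startswith("0x") else hex_value
    return uuid.UUID(hex=digits)


def gfid_backend_path(gfid):
    """Relative path of the gfid link under the brick root.

    Assumes the usual .glusterfs/<aa>/<bb>/<gfid> layout; check it on the brick.
    """
    text = str(gfid)
    return ".glusterfs/{}/{}/{}".format(text[0:2], text[2:4], text)


# trusted.gfid values of abbreviations.log as posted for the two bricks
master = gfid_from_xattr("0x773b7ea331cf4b2494f00b61b573b082")
slave = gfid_from_xattr("0x6e9a7fa1bfbe4a59ad06a78ee1625649")

print(master)                     # dashed form, as it appears in the log
print(slave)
print("mismatch:", master != slave)
print(gfid_backend_path(master))
```

The dashed forms are the two IDs named in the "GFID mismatch" log line, which is what makes this a gfid split-brain rather than a data or metadata one.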
I've read that doc, however I'm not sure what to do with 
the bits that are not mentioned there, namely:
when some xattr does not exist on one copy but does on the 
other, like:

3]$ getfattr -d -m . -e hex .vim.backup/.bash_profile.swp
# file: .vim.backup/.bash_profile.swp
security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000
trusted.afr.USER-HOME-client-0=0x000000010000000100000000
trusted.afr.USER-HOME-client-5=0x000000010000000100000000

2]$ getfattr -d -m . -e hex .vim.backup/.bash_profile.swp
# file: .vim.backup/.bash_profile.swp
security.selinux=0x73797374656d5f753a6f626a6563745f723a64656661756c745f743a733000
trusted.afr.USER-HOME-client-5=0x000000010000000100000000
trusted.afr.USER-HOME-client-6=0x000000010000000100000000

That means the file .bash_profile.swp is possibly in a data and metadata split-brain.
I need to understand the volume configuration; that is why I am asking for the volume info.
Judging by the above, I am guessing that it is a replica 3 volume (3 replica copies).
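To show how the "data and metadata" reading comes out of those values: each trusted.afr.<volume>-client-N xattr is 12 bytes, read as three big-endian 32-bit counters of pending data, metadata and entry operations, in that order. A minimal Python sketch (the function name is only for illustration) that decodes the value posted above:

```python
import struct


def decode_afr_changelog(hex_value):
    """Split a 12-byte trusted.afr.* value into its three pending counters."""
    digits = hex_value[2:] if hex_value.lower().startswith("0x") else hex_value
    raw = bytes.fromhex(digits)
    if len(raw) != 12:
        raise ValueError("expected 12 bytes, got {}".format(len(raw)))
    # Three unsigned 32-bit big-endian integers: data, metadata, entry
    data, metadata, entry = struct.unpack(">III", raw)
    return {"data": data, "metadata": metadata, "entry": entry}


# Value seen on both copies of .bash_profile.swp in the getfattr output
pending = decode_afr_changelog("0x000000010000000100000000")
print(pending)
```

Here the data and metadata counters are both 1 and the entry counter is 0, so each copy records pending data and metadata changes against another brick. Which brick each client-N index refers to depends on the volume layout, which is one more reason the volume info is needed.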


Unless the doc talks about it and I've gone (temporarily) 
blind; but if it does not, it would be great to include 
more scenarios/cases there.
many thx.
L.

>
>
> thanks,
> nagpavan
>
>
> ----- Original Message -----
> From: "lejeczek" <peljasz@xxxxxxxxxxx>
> To: "Nag Pavan Chilakam" <nchilaka@xxxxxxxxxx>
> Cc: gluster-users@xxxxxxxxxxx
> Sent: Tuesday, 7 February, 2017 10:53:07 PM
> Subject: Re:  Input/output error - would not heal
>
>
>
> On 07/02/17 12:50, Nag Pavan Chilakam wrote:
>> Hi,
>> Can you help us with more information on the volume, like volume status and volume info
>> One possible cause of the "Transport endpoint is not connected" error is that a brick is down
>>
>> Also, I see that the syntax used for healing is wrong.
>> You need to use it as below:
>> gluster v heal <vname> split-brain source-brick <brick path> <filename considering brick path as />
>>
>> In your case, if the brick path is "/G-store/1" and the file to be healed is "that_file", then use the syntax below (here I am assuming "that_file" lies directly under the brick path):
>>
>> gluster volume heal USER-HOME split-brain source-brick 10.5.6.100:/G-store/1 /that_file
> that was that, my copy-paste typo, it does not heal.
> Interestingly, that file is not reported by heal.
>
> I've replied to -  GFID Mismatch - Automatic Correction ? -
> I think my problem is similar, here is a file the heal
> actually sees:
>
>
> $ gluster vol heal USER-HOME info
> Brick
> 10.5.6.100:/__.aLocalStorages/3/0-GLUSTERs/0-USER.HOME/aUser/.vim.backup/.bash_profile.swp
>
> Status: Connected
> Number of entries: 1
>
> Brick
> 10.5.6.49:/__.aLocalStorages/3/0-GLUSTERs/0-USER.HOME/aUser/.vim.backup/.bash_profile.swp
>
> Status: Connected
> Number of entries: 1
>
> I'm copying+pasting what I said in that reply to that thread:
> ...
>
> yep, I'm seeing the same:
> as follows:
> 3]$ getfattr -d -m . -e hex .
> # file: .
> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000
> trusted.afr.USER-HOME-client-2=0x000000000000000000000000
> trusted.afr.USER-HOME-client-3=0x000000000000000000000000
> trusted.afr.USER-HOME-client-5=0x000000000000000000000000
> trusted.afr.dirty=0x000000000000000000000000
> trusted.gfid=0x06341b521ba94ab7938eca57f7a1824f
> trusted.glusterfs.9e4ed9b7-373a-413b-bc82-b6f978e82ec4.xtime=0x5898e0cf000dd2fe
> trusted.glusterfs.dht=0x000000010000000000000000ffffffff
> trusted.glusterfs.quota.00000000-0000-0000-0000-000000000001.contri.1=0x00701c90fcb11200fffffef6f08c798e0000006a99819205
> trusted.glusterfs.quota.dirty=0x3000
> trusted.glusterfs.quota.size.1=0x00701c90fcb11200fffffef6f08c798e0000006a99819205
> 3]$ getfattr -d -m . -e hex .vim.backup
> # file: .vim.backup
> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000
> trusted.afr.USER-HOME-client-3=0x000000000000000000000000
> trusted.gfid=0x0b3a223955534de89086679a4dce8156
> trusted.glusterfs.9e4ed9b7-373a-413b-bc82-b6f978e82ec4.xtime=0x5898621c0005d720
> trusted.glusterfs.dht=0x000000010000000000000000ffffffff
> trusted.glusterfs.quota.06341b52-1ba9-4ab7-938e-ca57f7a1824f.contri.1=0x000000000000040000000000000000020000000000000001
> trusted.glusterfs.quota.dirty=0x3000
> trusted.glusterfs.quota.size.1=0x000000000000040000000000000000020000000000000001
> 3]$ getfattr -d -m . -e hex .vim.backup/.bash_profile.swp
> # file: .vim.backup/.bash_profile.swp
> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000
> trusted.afr.USER-HOME-client-0=0x000000010000000100000000
> trusted.afr.USER-HOME-client-5=0x000000010000000100000000
> trusted.gfid=0xc2693670fc6d4fed953f21dcb77a02cf
> trusted.glusterfs.9e4ed9b7-373a-413b-bc82-b6f978e82ec4.xtime=0x5896043c000baa55
> trusted.glusterfs.quota.0b3a2239-5553-4de8-9086-679a4dce8156.contri.1=0x00000000000000000000000000000001
> trusted.pgfid.0b3a2239-5553-4de8-9086-679a4dce8156=0x00000001
>
> 2]$ getfattr -d -m . -e hex .
> # file: .
> security.selinux=0x73797374656d5f753a6f626a6563745f723a64656661756c745f743a733000
> trusted.afr.USER-HOME-client-1=0x000000000000000000000000
> trusted.afr.USER-HOME-client-2=0x000000000000000000000000
> trusted.afr.USER-HOME-client-3=0x000000000000000000000000
> trusted.afr.USER-HOME-client-5=0x000000000000000000000000
> trusted.afr.dirty=0x000000000000000000000000
> trusted.gfid=0x06341b521ba94ab7938eca57f7a1824f
> trusted.glusterfs.9e4ed9b7-373a-413b-bc82-b6f978e82ec4.xtime=0x5898e0d000016f82
> trusted.glusterfs.dht=0x000000010000000000000000ffffffff
> trusted.glusterfs.quota.00000000-0000-0000-0000-000000000001.contri.1=0xa5e66200a7a45000cb96fbf7d6336229fae7152d8851097b
> trusted.glusterfs.quota.dirty=0x3000
> trusted.glusterfs.quota.size.1=0xa5e66200a7a45000cb96fbf7d6336229fae7152d8851097b
> 2]$ getfattr -d -m . -e hex .vim.backup
> # file: .vim.backup
> security.selinux=0x73797374656d5f753a6f626a6563745f723a64656661756c745f743a733000
> trusted.afr.USER-HOME-client-3=0x000000000000000000000000
> trusted.gfid=0x0b3a223955534de89086679a4dce8156
> trusted.glusterfs.9e4ed9b7-373a-413b-bc82-b6f978e82ec4.xtime=0x5898621b000855fe
> trusted.glusterfs.dht=0x000000010000000000000000ffffffff
> trusted.glusterfs.quota.06341b52-1ba9-4ab7-938e-ca57f7a1824f.contri.1=0x000000000000040000000000000000020000000000000001
> trusted.glusterfs.quota.dirty=0x3000
> trusted.glusterfs.quota.size.1=0x000000000000040000000000000000020000000000000001
> 2]$ getfattr -d -m . -e hex .vim.backup/.bash_profile.swp
> # file: .vim.backup/.bash_profile.swp
> security.selinux=0x73797374656d5f753a6f626a6563745f723a64656661756c745f743a733000
> trusted.afr.USER-HOME-client-5=0x000000010000000100000000
> trusted.afr.USER-HOME-client-6=0x000000010000000100000000
> trusted.gfid=0x8a5b6e4ad18a49d0bae920c9cf8673a5
> trusted.glusterfs.9e4ed9b7-373a-413b-bc82-b6f978e82ec4.xtime=0x5896041400058191
> trusted.glusterfs.quota.0b3a2239-5553-4de8-9086-679a4dce8156.contri.1=0x00000000000000000000000000000001
> trusted.pgfid.0b3a2239-5553-4de8-9086-679a4dce8156=0x00000001
>
>
> and the log bit:
>
> GFID mismatch for
> <gfid:335bf026-68bd-4bf4-9cba-63b65b12c0b1>/abbreviations.xlsx
> 6e9a7fa1-bfbe-4a59-ad06-a78ee1625649 on USER-HOME-client-6
> and 773b7ea3-31cf-4b24-94f0-0b61b573b082 on USER-HOME-client-0
>
> most importantly, is there a workaround for the problem, as
> of now? Before the bug, if that is what it is, gets fixed.
> b.w.
> L.
>
> -- end of paste
>
> but I have a few more files which also report I/O errors and
> heal does NOT even mention them:
> on the brick that is a "master"(samba was sharing to the users)
>
> # file: abbreviations.log
> security.selinux=0x73797374656d5f753a6f626a6563745f723a64656661756c745f743a733000
> trusted.afr.dirty=0x000000000000000000000000
> trusted.bit-rot.version=0x0200000000000000589081fd00060376
> trusted.gfid=0x773b7ea331cf4b2494f00b61b573b082
> trusted.glusterfs.quota.335bf026-68bd-4bf4-9cba-63b65b12c0b1.contri.1=0x0000000000002a000000000000000001
> trusted.pgfid.335bf026-68bd-4bf4-9cba-63b65b12c0b1=0x00000001
>
> on the "slave" brick, was not serving files (certainly not
> that file) to any users:
>
> # file: bbreviations.log
> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000
> trusted.afr.dirty=0x000000000000000000000000
> trusted.bit-rot.version=0x0200000000000000588c958a000b67ea
> trusted.gfid=0x6e9a7fa1bfbe4a59ad06a78ee1625649
> trusted.glusterfs.quota.335bf026-68bd-4bf4-9cba-63b65b12c0b1.contri.1=0x0000000000002a000000000000000001
> trusted.pgfid.335bf026-68bd-4bf4-9cba-63b65b12c0b1=0x00000001
>
> Question that probably was answered many times: is it OK to
> tamper with(remove in my case) files directly from bricks?
> many thanks,
> L.
>
>
>> regards,
>> nag pavan
>>
>> ----- Original Message -----
>> From: "lejeczek"<peljasz@xxxxxxxxxxx>
>> To:gluster-users@xxxxxxxxxxx
>> Sent: Tuesday, 7 February, 2017 2:00:51 AM
>> Subject:  Input/output error - would not heal
>>
>> hi all
>>
>> I'm hitting such problem:
>>
>> $ gluster vol heal USER-HOME split-brain source-brick
>> 10.5.6.100:/G-store/1
>> Healing gfid:8a5b6e4a-d18a-49d0-bae9-20c9cf8673a5
>> failed:Transport endpoint is not connected.
>> Status: Connected
>> Number of healed entries: 0
>>
>>
>>
>>
>> $ gluster vol heal USER-HOME split-brain source-brick
>> 10.5.6.100:/G-store/1/that_file
>> Lookup failed on /that_file:Input/output  error
>> Volume heal failed.
>>
>> v3.9. it's a two-brick volume, was three but removed one I
>> think a few hours before the problem was first noticed.
>> what to do now?
>> many thanks,
>> L
>>
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users@xxxxxxxxxxx
>> http://lists.gluster.org/mailman/listinfo/gluster-users



