Re: Rebalance task fails

SOLVED:
The problem was caused by a bug affecting servers with more than 40 visible
CPUs: https://review.gluster.org/#/c/15062/
Rebalance works after upgrading to 3.10.4.

Many thanks to Nithya for help!

Best regards,
Szymon Miotk
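
For anyone hitting the same symptom, here is a minimal sketch for checking whether a host matches the condition described above (more than 40 visible CPUs) and which version is installed. The threshold of 40 comes from the message above; the variable name CPU_THRESHOLD and the helper exceeds_cpu_threshold are hypothetical names used only for illustration, and the sketch assumes a Linux host where getconf reports online CPUs.

```shell
#!/bin/sh
# Threshold taken from the report above ("more than 40 CPUs"); illustrative only.
CPU_THRESHOLD=40

# Hypothetical helper: succeeds (exit status 0) when the given CPU count
# is strictly greater than the threshold.
exceeds_cpu_threshold() {
    [ "$1" -gt "$CPU_THRESHOLD" ]
}

# Number of CPUs currently online, as seen by this host.
cpus=$(getconf _NPROCESSORS_ONLN)

if exceeds_cpu_threshold "$cpus"; then
    echo "$cpus CPUs visible: more than $CPU_THRESHOLD, matches the reported condition"
else
    echo "$cpus CPUs visible: not more than $CPU_THRESHOLD"
fi

# Print the installed version only if the gluster CLI is available.
if command -v gluster >/dev/null 2>&1; then
    gluster --version | head -n 1
fi
```

This only reports the CPU count and the installed version; it does not confirm that a given failure is the same bug.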

On Thu, Jul 13, 2017 at 10:32 AM, Szymon Miotk <szymon.miotk@xxxxxxxxx> wrote:
> Hi Nithya,
> I see 'index' in this context:
> [2017-07-07 10:07:18.230202] E [MSGID: 106062]
> [glusterd-utils.c:7997:glusterd_volume_rebalance_use_rsp_dict]
> 0-glusterd: failed to get index
>
> I wonder if there is anything I can do to fix it.
> I tried to strace the gluster process, but I still have no clue what
> exactly the gluster index is.
>
> Best regards,
> Szymon Miotk
>
> On Thu, Jul 13, 2017 at 10:12 AM, Nithya Balachandran
> <nbalacha@xxxxxxxxxx> wrote:
>> Hi Szymon,
>>
>> I have received the files and will take a look and get back to you.
>>
>> In what context are you seeing index?
>>
>> Thanks,
>> Nithya
>>
>> On 11 July 2017 at 01:15, Szymon Miotk <szymon.miotk@xxxxxxxxx> wrote:
>>>
>>> Hi Nithya,
>>>
>>> I sent the files privately to avoid spamming the list with large
>>> attachments.
>>> Could someone explain what an index is in Gluster?
>>> Unfortunately, 'index' is a popular word, so googling is not very helpful.
>>>
>>> Best regards,
>>> Szymon Miotk
>>>
>>> On Sun, Jul 9, 2017 at 6:37 PM, Nithya Balachandran <nbalacha@xxxxxxxxxx>
>>> wrote:
>>> >
>>> > On 7 July 2017 at 15:42, Szymon Miotk <szymon.miotk@xxxxxxxxx> wrote:
>>> >>
>>> >> Hello everyone,
>>> >>
>>> >>
>>> >> I have a problem rebalancing a Gluster volume.
>>> >> Gluster version is 3.7.3.
>>> >> My 1x3 replicated volume became full, so I added three more bricks
>>> >> to make it 2x3 and wanted to rebalance it.
>>> >> But every time I start rebalancing, it fails immediately.
>>> >> Rebooting Gluster nodes doesn't help.
>>> >>
>>> >> # gluster volume rebalance  gsae_artifactory_cluster_storage start
>>> >> volume rebalance: gsae_artifactory_cluster_storage: success: Rebalance
>>> >> on gsae_artifactory_cluster_storage has been started successfully. Use
>>> >> rebalance status command to check status of the rebalance process.
>>> >> ID: b22572ff-7575-4557-8317-765f7e52d445
>>> >>
>>> >> # gluster volume rebalance  gsae_artifactory_cluster_storage status
>>> >> Node         Rebalanced-files  size    scanned  failures  skipped  status  run time in secs
>>> >> -----------  ----------------  ------  -------  --------  -------  ------  ----------------
>>> >> localhost    0                 0Bytes  0        0         0        failed  0.00
>>> >> 10.239.40.9  0                 0Bytes  0        0         0        failed  0.00
>>> >> 10.239.40.8  0                 0Bytes  0        0         0        failed  0.00
>>> >> volume rebalance: gsae_artifactory_cluster_storage: success:
>>> >>
>>> >> The messages in logfiles mention 'failed to get index':
>>> >> [2017-07-07 10:07:18.230202] E [MSGID: 106062]
>>> >> [glusterd-utils.c:7997:glusterd_volume_rebalance_use_rsp_dict]
>>> >> 0-glusterd: failed to get index
>>> >>
>>> >> and then the rebalance process crashes:
>>> >> pending frames:
>>> >> frame : type(0) op(0)
>>> >> frame : type(0) op(0)
>>> >> frame : type(0) op(0)
>>> >> patchset: git://git.gluster.com/glusterfs.git
>>> >> signal received: 4
>>> >> time of crash:
>>> >> 2017-07-07 10:07:23
>>> >> configuration details:
>>> >> argp 1
>>> >> backtrace 1
>>> >> dlfcn 1
>>> >> libpthread 1
>>> >> llistxattr 1
>>> >> setfsid 1
>>> >> spinlock 1
>>> >> epoll.h 1
>>> >> xattr.h 1
>>> >> st_atim.tv_nsec 1
>>> >> package-string: glusterfs 3.7.3
>>> >>
>>> >>
>>> >> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(_gf_msg_backtrace_nomem+0x92)[0x7f24de214502]
>>> >> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(gf_print_trace+0x31d)[0x7f24de23059d]
>>> >> /lib/x86_64-linux-gnu/libc.so.6(+0x36d40)[0x7f24dd612d40]
>>> >> /lib/x86_64-linux-gnu/libpthread.so.0(+0x11f6b)[0x7f24dd9b2f6b]
>>> >> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(inode_ref+0x19)[0x7f24de234e69]
>>> >> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(loc_copy+0x4a)[0x7f24de21291a]
>>> >> /usr/lib/x86_64-linux-gnu/glusterfs/3.7.3/xlator/cluster/distribute.so(dht_local_init+0x4b)[0x7f24d851f51b]
>>> >> /usr/lib/x86_64-linux-gnu/glusterfs/3.7.3/xlator/cluster/distribute.so(dht_lookup+0x91)[0x7f24d8550521]
>>> >> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(syncop_lookup+0x1a2)[0x7f24de258fc2]
>>> >> /usr/lib/x86_64-linux-gnu/glusterfs/3.7.3/xlator/cluster/distribute.so(gf_defrag_fix_layout+0x87)[0x7f24d85289e7]
>>> >> /usr/lib/x86_64-linux-gnu/glusterfs/3.7.3/xlator/cluster/distribute.so(gf_defrag_start_crawl+0x6d3)[0x7f24d8529ce3]
>>> >> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(synctask_wrap+0x12)[0x7f24de255822]
>>> >> /lib/x86_64-linux-gnu/libc.so.6(+0x498b0)[0x7f24dd6258b0]
>>> >>
>>> >>
>>> >> Does anybody have a clue how to fix the 'failed to get index' error?
>>> >
>>> >
>>> > Do you have the coredump? If yes, please send that and the rebalance
>>> > logs across.
>>> >
>>> > Regards,
>>> > Nithya
>>> >>
>>> >>
>>> >> Thank you in advance!
>>> >> Szymon Miotk
>>> >> _______________________________________________
>>> >> Gluster-users mailing list
>>> >> Gluster-users@xxxxxxxxxxx
>>> >> http://lists.gluster.org/mailman/listinfo/gluster-users
>>> >
>>> >
>>
>>


