Re: Gluster 3.12.11 geo-replication connection to peer is broken

Dear Kotresh,

On Jul 24, 2018, at 12:44 AM, Kotresh Hiremath Ravishankar <khiremat@xxxxxxxxxx> wrote:

Hi Pablo,

The geo-rep status should go to Faulty if the connection to peer is broken.

The geo-rep status doesn’t go to “Faulty” after “connection to peer is broken” appears in the event log.

Are the log files on the other nodes failing with the same error? Are these log entries repeating?

The “connection to peer is broken” error is in the following log file. No new events are added after “connection to peer is broken” on the master.

/var/log/glusterfs/geo-replication/vol_replicated/ssh%3A%2F%2Fgeoaccount1%4010.20.220.12%3Agluster%3A%2F%2F127.0.0.1%3Ageorep_1.log
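The log file name above is percent-encoded, which makes it hard to read at a glance. As a small aid, here is a sketch that decodes it with Python's standard library; the only input is the file name copied from the path above, and nothing else is assumed about how Gluster builds the name.

```python
from urllib.parse import unquote

# Log file name copied from the path above (directory part omitted).
encoded = "ssh%3A%2F%2Fgeoaccount1%4010.20.220.12%3Agluster%3A%2F%2F127.0.0.1%3Ageorep_1.log"

# unquote() turns each %XX escape back into the character it stands for.
decoded = unquote(encoded)
print(decoded)  # ssh://geoaccount1@10.20.220.12:gluster://127.0.0.1:georep_1.log
```

The decoded name shows the session this log belongs to: the SSH user and host, followed by the slave volume georep_1.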

Does stopping and starting geo-rep give the same error?

I restarted the geo-rep process and it keeps giving the same error.

Another user reported the same problem last month.



Thanks,
Kotresh HR

On Tue, Jul 24, 2018 at 1:47 AM, Pablo J Rebollo Sosa <pablo.rebollo@xxxxxxx> wrote:
Hi,

I’m having a problem with Gluster 3.12.11 geo-replication on CentOS 7.5. The process starts the geo-replication, but after a few minutes the log shows “connection to peer is broken”.

The “status detail” looks ok but no files are replicated.

[root@gluster1 vol_replicated]#  gluster volume geo-replication vol_replicated geoaccount1@10.20.220.12::georep_1 status detail | sort

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
MASTER NODE    MASTER VOL        MASTER BRICK                     SLAVE USER     SLAVE                                   SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED    ENTRY    DATA    META    FAILURES    CHECKPOINT TIME    CHECKPOINT COMPLETED    CHECKPOINT COMPLETION TIME
gluster1     vol_replicated    /export/brick1/vol_replicated    geoaccount1    geoaccount1@10.20.220.12::georep_1    10.20.220.12    Active     Hybrid Crawl    N/A            8191     6550    0       0           N/A                N/A                     N/A
gluster2     vol_replicated    /export/brick1/vol_replicated    geoaccount1    geoaccount1@10.20.220.12::georep_1    10.20.220.13    Passive    N/A             N/A            N/A      N/A     N/A     N/A         N/A                N/A                     N/A
gluster3     vol_replicated    /export/brick1/vol_replicated    geoaccount1    geoaccount1@10.20.220.12::georep_1    10.20.220.12    Passive    N/A             N/A            N/A      N/A     N/A     N/A         N/A                N/A                     N/A
gluster4     vol_replicated    /export/brick1/vol_replicated    geoaccount1    geoaccount1@10.20.220.12::georep_1    10.20.220.13    Active     Hybrid Crawl    N/A            8191     6532    0       0           N/A                N/A                     N/A
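The situation described here (workers reported as Active while nothing is replicated) can be spotted mechanically in this output. Below is a rough sketch, not an official tool: it assumes columns are separated by two or more spaces, as in the output above, and the helper names `parse_status` and `never_synced` are made up for illustration. The sample holds two rows from above with the column padding shortened.

```python
import re

def parse_status(text):
    """Split 'status detail' output into one dict per row, keyed by column header."""
    lines = [ln for ln in text.splitlines()
             if ln.strip() and not ln.startswith("---")]
    # With '| sort' the header is not necessarily the first line, so look for it.
    header = next(ln for ln in lines if ln.startswith("MASTER NODE"))
    columns = re.split(r"\s{2,}", header.strip())
    records = []
    for ln in lines:
        if ln is header:
            continue
        fields = re.split(r"\s{2,}", ln.strip())
        if len(fields) == len(columns):
            records.append(dict(zip(columns, fields)))
    return records

def never_synced(records):
    """Master nodes whose worker is Active but has no LAST_SYNCED time yet."""
    return [r["MASTER NODE"] for r in records
            if r["STATUS"] == "Active" and r["LAST_SYNCED"] == "N/A"]

SAMPLE = """\
MASTER NODE  MASTER VOL  MASTER BRICK  SLAVE USER  SLAVE  SLAVE NODE  STATUS  CRAWL STATUS  LAST_SYNCED  ENTRY  DATA  META  FAILURES  CHECKPOINT TIME  CHECKPOINT COMPLETED  CHECKPOINT COMPLETION TIME
gluster1  vol_replicated  /export/brick1/vol_replicated  geoaccount1  geoaccount1@10.20.220.12::georep_1  10.20.220.12  Active  Hybrid Crawl  N/A  8191  6550  0  0  N/A  N/A  N/A
gluster2  vol_replicated  /export/brick1/vol_replicated  geoaccount1  geoaccount1@10.20.220.12::georep_1  10.20.220.13  Passive  N/A  N/A  N/A  N/A  N/A  N/A  N/A  N/A  N/A
"""

stalled = never_synced(parse_status(SAMPLE))
print(stalled)
```

An Active worker with LAST_SYNCED still at N/A is not proof of a fault on its own, since a fresh session starts that way, but if it stays like this across several checks it matches the symptom reported here.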

These are the messages on the log file.

[2018-07-23 19:35:50.18026] I [gsyncdstatus(/export/brick1/vol_replicated):276:set_active] GeorepStatus: Worker Status Change   status=Active
[2018-07-23 19:35:50.19126] I [gsyncdstatus(/export/brick1/vol_replicated):248:set_worker_crawl_status] GeorepStatus: Crawl Status Change       status=History Crawl
[2018-07-23 19:35:50.19480] I [master(/export/brick1/vol_replicated):1432:crawl] _GMaster: starting history crawl       turns=1 stime=(0, 0)    entry_stime=None        etime=1532374550
[2018-07-23 19:35:50.20056] E [repce(/export/brick1/vol_replicated):117:worker] <top>: call failed:
Traceback (most recent call last):
  File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in worker
    res = getattr(self.obj, rmeth)(*in_data[2:])
  File "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", line 54, in history
    num_parallel)
  File "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line 103, in cl_history_changelog
    raise ChangelogHistoryNotAvailable()
ChangelogHistoryNotAvailable
[2018-07-23 19:35:50.20999] E [repce(/export/brick1/vol_replicated):209:__call__] RepceClient: call failed on peer      call=39755:140602890745664:1532374550.02        method=history  error=ChangelogHistoryNotAvailable
[2018-07-23 19:35:50.21156] I [resource(/export/brick1/vol_replicated):1675:service_loop] GLUSTER: Changelog history not available, using xsync
[2018-07-23 19:35:50.28688] I [master(/export/brick1/vol_replicated):1543:crawl] _GMaster: starting hybrid crawl        stime=(0, 0)
[2018-07-23 19:35:50.30505] I [gsyncdstatus(/export/brick1/vol_replicated):248:set_worker_crawl_status] GeorepStatus: Crawl Status Change       status=Hybrid Crawl
[2018-07-23 19:35:54.35396] I [master(/export/brick1/vol_replicated):1554:crawl] _GMaster: processing xsync changelog   path=/var/lib/misc/glusterfsd/vol_replicated/ssh%3A%2F%2Fgeoaccount1%4010.20.220.12%3Agluster%3A%2F%2F127.0.0.1%3Ageorep_1/a68ebfef8cdf86c3c6e9a0d85969cd3f/xsync/XSYNC-CHANGELOG.1532374550
[2018-07-23 19:36:11.590595] E [syncdutils(/export/brick1/vol_replicated):304:log_raise_exception] <top>: connection to peer is broken
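To check whether the same errors repeat in the logs on other nodes, the entries can be filtered by level. The following is a minimal sketch that assumes every entry follows the `[timestamp] LEVEL [origin] message` layout seen in the excerpt above; the regular expression and the `error_entries` helper are illustrative and not part of Gluster.

```python
import re

# One log entry: [timestamp] LEVEL [origin] message
LINE_RE = re.compile(
    r"^\[(?P<ts>[^\]]+)\] (?P<level>[A-Z]) \[(?P<origin>[^\]]+)\] (?P<msg>.*)$"
)

def error_entries(lines):
    """Return (timestamp, message) for every entry logged at level E."""
    found = []
    for line in lines:
        m = LINE_RE.match(line)
        # Traceback lines do not match the pattern and are skipped.
        if m and m.group("level") == "E":
            found.append((m.group("ts"), m.group("msg").strip()))
    return found

# A few lines copied from the excerpt above.
SAMPLE = [
    "[2018-07-23 19:35:50.18026] I [gsyncdstatus(/export/brick1/vol_replicated):276:set_active] GeorepStatus: Worker Status Change   status=Active",
    "[2018-07-23 19:35:50.20056] E [repce(/export/brick1/vol_replicated):117:worker] <top>: call failed:",
    "Traceback (most recent call last):",
    "[2018-07-23 19:36:11.590595] E [syncdutils(/export/brick1/vol_replicated):304:log_raise_exception] <top>: connection to peer is broken",
]

errors = error_entries(SAMPLE)
for ts, msg in errors:
    print(ts, msg)
```

To run it against a real log, pass an open file object instead of `SAMPLE`, for example `error_entries(open(path))` with `path` pointing at the geo-replication log on each node.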

Does anyone have any clues as to what might be wrong?

Best regards,

Pablo J. Rebollo-Sosa

_______________________________________________
Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-users



--
Thanks and Regards,
Kotresh H R

