Sounds related to this:
Reminder: 3.5.2 centos7
[2015-06-02 15:26:18.373590] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 20 crawls, 0 turns
[2015-06-02 15:27:20.3036] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 20 crawls, 0 turns
[2015-06-02 15:27:51.6190] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433283827, 0)
[2015-06-02 15:28:22.90625] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 20 crawls, 1 turns
[2015-06-02 15:28:49.910826] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284067, 0)
[2015-06-02 15:29:24.195203] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 20 crawls, 1 turns
[2015-06-02 15:29:49.40649] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284127, 0)
[2015-06-02 15:30:26.267199] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 20 crawls, 1 turns
[2015-06-02 15:30:51.149673] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284187, 0)
[2015-06-02 15:31:28.343223] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 20 crawls, 1 turns
[2015-06-02 15:31:50.39115] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284247, 0)
[2015-06-02 15:31:59.995592] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:08.631409] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:14.611870] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:19.32982] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:23.172465] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:27.220045] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:31.250354] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:35.346721] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:41.444981] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284308
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:32:46.512193] W [master(/export/raid/usr_global):860:process] _GMaster: changelogs CHANGELOG.1433284308 could not be processed - moving on...
[2015-06-02 15:32:46.556715] W [master(/export/raid/usr_global):862:process] _GMaster: SKIPPED GFID =
[2015-06-02 15:32:49.584790] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 8 crawls, 1 turns
[2015-06-02 15:32:49.673245] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284307, 0)
[2015-06-02 15:33:19.665309] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:33:28.593414] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:33:37.65300] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:33:44.746570] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:33:52.161550] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:34:05.730026] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:34:17.92046] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:34:26.22125] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:34:33.609838] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: CHANGELOG.1433284368
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:34:42.241246] W [master(/export/raid/usr_global):860:process] _GMaster: changelogs CHANGELOG.1433284368 could not be processed - moving on...
[2015-06-02 15:34:42.267297] W [master(/export/raid/usr_global):862:process] _GMaster: SKIPPED GFID =
[2015-06-02 15:34:45.295624] I [master(/export/raid/usr_global):445:crawlwrap] _GMaster: 1 crawls, 1 turns
[2015-06-02 15:34:45.384219] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284367, 0)
[2015-06-02 15:34:47.878042] E [repce(/export/raid/usr_global):188:__call__] RepceClient: call 26948:139904768341824:1433284485.44 (entry_ops) failed on peer with OSError
[2015-06-02 15:34:47.878254] E [syncdutils(/export/raid/usr_global):240:log_raise_exception] <top>: FAIL:
Traceback (most recent call last):
File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 150, in main
main_i()
File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 542, in main_i
local.service_loop(*[r for r in [remote] if r])
File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 1178, in service_loop
g2.crawlwrap()
File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 467, in crawlwrap
self.crawl()
File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1068, in crawl
self.process(changes)
File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 825, in process
self.process_change(change, done, retry)
File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 793, in process_change
self.slave.server.entry_ops(entries)
File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 204, in __call__
return self.ins(self.meth, *a)
File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 189, in __call__
raise res
OSError: [Errno 39] Directory not empty
[2015-06-02 15:34:48.34485] I [syncdutils(/export/raid/usr_global):192:finalize] <top>: exiting.
[2015-06-02 15:34:48.750009] I [monitor(monitor):81:set_state] Monitor: new state: faulty
[2015-06-02 15:34:58.877173] I [monitor(monitor):129:monitor] Monitor: ------------------------------------------------------------
[2015-06-02 15:34:58.877352] I [monitor(monitor):130:monitor] Monitor: starting gsyncd worker
[2015-06-02 15:34:59.75263] I [gsyncd(/export/raid/usr_global):532:main_i] <top>: syncing:
gluster://localhost:usr_global ->
ssh://root@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx:gluster://localhost:usr_global
[2015-06-02 15:35:04.116229] I [master(/export/raid/usr_global):58:gmaster_builder] <top>: setting up xsync change detection mode
[2015-06-02 15:35:04.116432] I [master(/export/raid/usr_global):357:__init__] _GMaster: using 'rsync' as the sync engine
[2015-06-02 15:35:04.126663] I [master(/export/raid/usr_global):58:gmaster_builder] <top>: setting up changelog change detection mode
[2015-06-02 15:35:04.126804] I [master(/export/raid/usr_global):357:__init__] _GMaster: using 'rsync' as the sync engine
[2015-06-02 15:35:04.127224] I [master(/export/raid/usr_global):1105:register] _GMaster: xsync temp directory: /var/run/gluster/usr_global/ssh%3A%2F%2Froot%40172.31.222.136%3Agluster%3A%2F%2F127.0.0.1%3Ausr_global/ce749a38ba30d4171cd674ec00ab24f9/xsync
[2015-06-02 15:35:04.127363] I [resource(/export/raid/usr_global):614:changelog_register] <top>: ##### <class 'resource.brickserver'> - /export/raid/usr_global - /var/run/gluster/usr_global/ssh%3A%2F%2Froot%40172.31.222.136%3Agluster%3A%2F%2F127.0.0.1%3Ausr_global/ce749a38ba30d4171cd674ec00ab24f9
- /var/run/gluster/usr_global/ssh%3A%2F%2Froot%40172.31.222.136%3Agluster%3A%2F%2F127.0.0.1%3Ausr_global/ce749a38ba30d4171cd674ec00ab24f9/changes.log - 9 - 5
[2015-06-02 15:35:04.155560] I [master(/export/raid/usr_global):421:crawlwrap] _GMaster: primary master with volume id f8f5ef65-4678-47be-bcb1-6c3cdaef545e ...
[2015-06-02 15:35:04.209250] I [master(/export/raid/usr_global):432:crawlwrap] _GMaster: crawl interval: 60 seconds
[2015-06-02 15:35:04.257262] I [master(/export/raid/usr_global):1126:crawl] _GMaster: starting hybrid crawl...
[2015-06-02 15:35:05.291480] I [master(/export/raid/usr_global):1135:crawl] _GMaster: processing xsync changelog /var/run/gluster/usr_global/ssh%3A%2F%2Froot%40172.31.222.136%3Agluster%3A%2F%2F127.0.0.1%3Ausr_global/ce749a38ba30d4171cd674ec00ab24f9/xsync/XSYNC-CHANGELOG.1433284504
[2015-06-02 15:35:33.109668] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:35:43.180270] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:35:52.679383] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:35:59.172976] I [monitor(monitor):81:set_state] Monitor: new state: Stable
[2015-06-02 15:36:02.804461] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:36:13.447704] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:36:23.184911] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:36:32.980029] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:36:44.702897] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:37:00.601946] W [master(/export/raid/usr_global):877:process] _GMaster: incomplete sync, retrying changelogs: XSYNC-CHANGELOG.1433284504
— reduced these lines (several thousand) >[2015-06-02 15:37:15.897319] W [master(/export/raid/usr_global):250:regjob] <top>: Rsync: .gfid/xxxxxxx-xxxxx-xxxxx-xxxx-xxxxxxxxx [errcode: 23]
[2015-06-02 15:37:15.897825] W [master(/export/raid/usr_global):860:process] _GMaster: changelogs XSYNC-CHANGELOG.1433284504 could not be processed - moving on...
[2015-06-02 15:37:15.931947] W [master(/export/raid/usr_global):862:process] _GMaster: SKIPPED GFID =
[2015-06-02 15:37:15.966494] I [master(/export/raid/usr_global):1132:crawl] _GMaster: finished hybrid crawl syncing
[2015-06-02 15:37:15.967732] I [master(/export/raid/usr_global):421:crawlwrap] _GMaster: primary master with volume id f8f5ef65-4678-47be-bcb1-6c3cdaef545e ...
[2015-06-02 15:37:16.471] I [master(/export/raid/usr_global):432:crawlwrap] _GMaster: crawl interval: 3 seconds
[2015-06-02 15:37:16.99163] I [master(/export/raid/usr_global):1060:crawl] _GMaster: slave's time: (1433284497, 270246)
[2015-06-02 15:37:16.99271] I [master(/export/raid/usr_global):1063:crawl] _GMaster: skipping already processed change: CHANGELOG.1433284428...
Changelog stoped yesterday around 3:30pm and was no longer process changes (changelog.XXXXXX files were stacking in /var/run/gluster/……/.processing/).
Sounds like it crashed, fallback to xsync, but keep failing and then does nothing more (after xsync is done, goes back to changelog but not processing ./processing files anymore).
Sounds like we will have to do our own geo-replication scripts based on rsync as it’s not reliable at all :(
--
Cyril Peponnet
|
_______________________________________________ Gluster-users mailing list Gluster-users@xxxxxxxxxxx http://www.gluster.org/mailman/listinfo/gluster-users