I now have a situation similar to
https://bugzilla.redhat.com/show_bug.cgi?id=1202649 but trying to
register to report the bug, I don't receive the confirmation email
to my account so I can't register. Stopping and starting geo-replication has no effect and in fact now shows no status at all. root@james:~# gluster volume geo-replication static ssh://gluster-b1::static status No active geo-replication sessions between static and ssh://gluster-b1::static root@james:~# gluster volume geo-replication static ssh://gluster-b1::static stop Stopping geo-replication session between static & ssh://gluster-b1::static has been successful root@james:~# gluster volume geo-replication static ssh://gluster-b1::static status No active geo-replication sessions between static and ssh://gluster-b1::static root@james:~# gluster volume geo-replication static ssh://gluster-b1::static start Starting geo-replication session between static & ssh://gluster-b1::static has been successful root@james:~# gluster volume geo-replication static ssh://gluster-b1::static status No active geo-replication sessions between static and ssh://gluster-b1::static root@james:~# gluster volume geo-replication static ssh://gluster-b1::static status No active geo-replication sessions between static and ssh://gluster-b1::static This is what is reported in /var/log/glusterfs/geo-replication/static/ssh%3A%2F%2Froot%40palace%3Agluster%3A%2F%2F127.0.0.1%3Astatic.log [2015-10-16 12:31:35.679045] I [monitor(monitor):222:monitor] Monitor: starting gsyncd worker [2015-10-16 12:31:41.453392] I [monitor(monitor):282:monitor] Monitor: worker(/data/gluster1/static/brick1) died in startup phase [2015-10-16 12:31:51.595781] I [monitor(monitor):221:monitor] Monitor: ------------------------------------------------------------ [2015-10-16 12:31:51.596124] I [monitor(monitor):222:monitor] Monitor: starting gsyncd worker [2015-10-16 12:31:51.680993] I [changelogagent(agent):75:__init__] ChangelogAgent: Agent listining... [2015-10-16 12:31:51.684289] I [gsyncd(/data/gluster1/static/brick1):649:main_i] <top>: syncing: gluster://localhost:static -> ssh://root@palace:gluster://localhost:static [2015-10-16 12:31:54.378592] I [master(/data/gluster1/static/brick1):83:gmaster_builder] <top>: setting up xsync change detection mode [2015-10-16 12:31:54.379020] I [master(/data/gluster1/static/brick1):401:__init__] _GMaster: using 'tar over ssh' as the sync engine [2015-10-16 12:31:54.379853] I [master(/data/gluster1/static/brick1):83:gmaster_builder] <top>: setting up changelog change detection mode [2015-10-16 12:31:54.380121] I [master(/data/gluster1/static/brick1):401:__init__] _GMaster: using 'tar over ssh' as the sync engine [2015-10-16 12:31:54.381195] I [master(/data/gluster1/static/brick1):83:gmaster_builder] <top>: setting up changeloghistory change detection mode [2015-10-16 12:31:54.381473] I [master(/data/gluster1/static/brick1):401:__init__] _GMaster: using 'tar over ssh' as the sync engine [2015-10-16 12:31:56.395081] E [repce(agent):117:worker] <top>: call failed: Traceback (most recent call last): File "/usr/lib/x86_64-linux-gnu/glusterfs/python/syncdaemon/repce.py", line 113, in worker res = getattr(self.obj, rmeth)(*in_data[2:]) File "/usr/lib/x86_64-linux-gnu/glusterfs/python/syncdaemon/changelogagent.py", line 41, in register return Changes.cl_register(cl_brick, cl_dir, cl_log, cl_level, retries) File "/usr/lib/x86_64-linux-gnu/glusterfs/python/syncdaemon/libgfchangelog.py", line 45, in cl_register cls.raise_changelog_err() File "/usr/lib/x86_64-linux-gnu/glusterfs/python/syncdaemon/libgfchangelog.py", line 27, in raise_changelog_err raise ChangelogException(errn, os.strerror(errn)) ChangelogException: [Errno 111] Connection refused [2015-10-16 12:31:56.396080] E [repce(/data/gluster1/static/brick1):207:__call__] RepceClient: call 4297:140312069371648:1444959114.39 (register) failed on peer with ChangelogException [2015-10-16 12:31:56.396344] E [resource(/data/gluster1/static/brick1):1428:service_loop] GLUSTER: Changelog register failed, [Errno 111] Connection refused [2015-10-16 12:31:56.396723] I [syncdutils(/data/gluster1/static/brick1):220:finalize] <top>: exiting. [2015-10-16 12:31:56.398370] I [repce(agent):92:service_loop] RepceServer: terminating on reaching EOF. [2015-10-16 12:31:56.398675] I [syncdutils(agent):220:finalize] <top>: exiting. [2015-10-16 12:31:57.381922] I [monitor(monitor):282:monitor] Monitor: worker(/data/gluster1/static/brick1) died in startup phase [2015-10-16 12:32:01.250627] I [gsyncd(/data/gluster1/static/brick1):649:main_i] <top>: syncing: gluster://localhost:static -> ssh://root@palace:gluster://localhost:static [2015-10-16 12:32:01.252258] I [changelogagent(agent):75:__init__] ChangelogAgent: Agent listining... [2015-10-16 12:32:03.950707] I [master(/data/gluster1/static/brick1):83:gmaster_builder] <top>: setting up xsync change detection mode [2015-10-16 12:32:03.951102] I [master(/data/gluster1/static/brick1):401:__init__] _GMaster: using 'tar over ssh' as the sync engine [2015-10-16 12:32:03.952385] I [master(/data/gluster1/static/brick1):83:gmaster_builder] <top>: setting up changelog change detection mode [2015-10-16 12:32:03.952636] I [master(/data/gluster1/static/brick1):401:__init__] _GMaster: using 'tar over ssh' as the sync engine [2015-10-16 12:32:03.953428] I [master(/data/gluster1/static/brick1):83:gmaster_builder] <top>: setting up changeloghistory change detection mode [2015-10-16 12:32:03.953665] I [master(/data/gluster1/static/brick1):401:__init__] _GMaster: using 'tar over ssh' as the sync engine Also /var/lib/glusterd/geo-replication/static_gluster-b1_static/brick_%2Fdata%2Fgluster1%2Fstatic%2Fbrick1.status: {"checkpoint_time": 0, "last_synced": 1444950684, "checkpoint_completed": "No", "meta": 0, "failures": 1952064, "entry": 0, "slave_node": "N/A", "data": 0, "worker_status": "Faulty", "crawl_status": "N/A", "checkpoint_completion_time": 0} /var/lib/glusterd/geo-replication/static_gluster-b1_static/ssh%3A%2F%2Froot%40palace%3Agluster%3A%2F%2F127.0.0.1%3Astatic.status: Started /var/lib/glusterd/geo-replication/static_gluster-b1_static/ssh%3A%2F%2Froot%40madonna%3Agluster%3A%2F%2F127.0.0.1%3Astatic.status: Started On 15/10/2015 10:25 pm, Wade
Fitzpatrick wrote:
Well I'm kind of worried about the 3 million failures listed in the FAILURES column, the timestamp showing that syncing "stalled" 2 days ago and the fact that only half of the files have been transferred to the remote volume. |
_______________________________________________ Gluster-users mailing list Gluster-users@xxxxxxxxxxx http://www.gluster.org/mailman/listinfo/gluster-users