Re: Adding a monitor to

On 2014-10-30 08:23, Joao Eduardo Luis wrote:
On 10/27/2014 06:37 PM, Patrick Darley wrote:
Hi there

Over the last week or so, I've been trying to connect a ceph monitor
node running on a baserock system
to a simple 3-node ubuntu ceph cluster.

The 3-node ubuntu cluster was created by following the documented Quick
installation guide using 3 VMs running ubuntu Trusty.

After the ubuntu cluster had been deployed, I followed the
directions below, which I derived from comparing the ceph-deploy debug
information, the ceph documentation on adding monitor nodes to an
existing system and the ceph documentation on bootstrapping monitor nodes.

  1. scp the /etc/ceph/* from admin node
  2. create the dir: mkdir /var/lib/ceph/mon/ceph-bcc08
  3. generate mon keyring: sudo ceph auth get mon. -o /var/lib/ceph/tmp/ceph-bcc08.mon.keyring
  4. generate monmap: sudo ceph mon getmap -o /var/lib/ceph/tmp/monmap
  5. That filesystem thingy (mkfs): sudo ceph-mon --cluster ceph --mkfs -i bcc08 --keyring /var/lib/ceph/tmp/ceph-bcc08.mon.keyring --monmap /var/lib/ceph/tmp/monmap
  6. Unlink keys and old monmap: rm /var/lib/ceph/tmp/*
  7. touch things: touch /var/lib/ceph/mon/ceph-bcc08/done and touch /var/lib/ceph/mon/ceph-bcc08/sysvinit
  8. Then start the mon: sudo /etc/init.d/ceph start mon.bcc08
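
The eight steps above can be parameterised by monitor id, which helps avoid mixing up names such as bcc07 and bcc08 between runs. What follows is only a sketch that builds the command strings and prints them; it does not run anything. The function name, the `admin_host` parameter and the exact form of the scp command are assumptions made for illustration (the original step only says to scp /etc/ceph/* from the admin node); every other command is copied from the list.

```python
def mon_add_commands(mon_id, admin_host, cluster="ceph"):
    """Return the shell commands for steps 1-8 as a list of strings.

    `admin_host` is a hypothetical name for the admin node. Nothing is
    executed here; the caller decides whether and how to run the commands.
    """
    mon_dir = f"/var/lib/ceph/mon/{cluster}-{mon_id}"
    tmp_dir = "/var/lib/ceph/tmp"
    keyring = f"{tmp_dir}/{cluster}-{mon_id}.mon.keyring"
    monmap = f"{tmp_dir}/monmap"
    return [
        # Step 1: the exact scp invocation is an assumption.
        f"scp {admin_host}:/etc/ceph/* /etc/ceph/",
        # Step 2: monitor data directory.
        f"mkdir {mon_dir}",
        # Steps 3 and 4: fetch the mon. keyring and the current monmap.
        f"sudo ceph auth get mon. -o {keyring}",
        f"sudo ceph mon getmap -o {monmap}",
        # Step 5: mkfs for the new monitor.
        f"sudo ceph-mon --cluster {cluster} --mkfs -i {mon_id} "
        f"--keyring {keyring} --monmap {monmap}",
        # Step 6: remove the temporary keyring and monmap.
        f"rm {tmp_dir}/*",
        # Step 7: marker files.
        f"touch {mon_dir}/done {mon_dir}/sysvinit",
        # Step 8: start the monitor.
        f"sudo /etc/init.d/ceph start mon.{mon_id}",
    ]


if __name__ == "__main__":
    for line in mon_add_commands("bcc08", "admin-node"):
        print(line)
```

Printing the commands first makes it easy to check that the same id appears in the data directory, the keyring path, the `-i` argument and the `mon.<id>` service name before anything is run by hand.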

I have a feeling that the issue is your ceph.conf, which you copy on
the first step.

Have you added the monitor you're adding to it?

I hadn't. I had thought that the monitors communicate via the monmap and did not look at ceph.conf so much, so it hadn't occurred to me to change it.


Most commonly the initial configuration will be based on either
the 'mon_initial_members' and 'mon_host' config options, or
monitor-specific sections.  Say you initially have something like
this:

mon_initial_members = ucc01
mon_host = 192.168.122.95

[mon.ucc01]
host = ucc01

then you'll want to add 'bcc07' to the list before you fire up the
new monitor (possibly you'll even want to do it prior to mkfs):

mon_initial_members = ucc01,bcc07
mon_host = 192.168.122.95,whatever-ip-for-bcc07:port-if-not-default

[mon.ucc01]
host = ucc01

[mon.bcc07]
host = bcc07

Let me know if you still have problems after doing this.
  -Joao
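
The ceph.conf change described above could also be scripted, which makes it easier to push an identical file to every node. This is a minimal sketch using Python's standard configparser, under some assumptions that are not from the thread: the options live in a `[global]` section, option names are written with underscores, and the host list option is spelled `mon_host` as in Ceph's configuration reference (it can be overridden through the hypothetical `hosts_key` parameter). The function name and the address used in the demo are illustrative only.

```python
import configparser
import io


def _split(value):
    """Split a comma-separated option value into a clean list."""
    return [item.strip() for item in value.split(",") if item.strip()]


def add_monitor(conf_text, name, addr, hosts_key="mon_host"):
    """Return ceph.conf text with monitor `name` at `addr` added.

    Assumes a [global] section exists; raises KeyError otherwise.
    """
    cp = configparser.ConfigParser(delimiters=("=",), interpolation=None)
    cp.optionxform = str  # keep option names exactly as written
    cp.read_string(conf_text)

    glob = cp["global"]
    members = _split(glob.get("mon_initial_members", ""))
    hosts = _split(glob.get(hosts_key, ""))
    if name not in members:  # do nothing if the monitor is already listed
        members.append(name)
        hosts.append(addr)
    glob["mon_initial_members"] = ",".join(members)
    glob[hosts_key] = ",".join(hosts)

    section = f"mon.{name}"
    if not cp.has_section(section):
        cp.add_section(section)
    cp[section]["host"] = name

    out = io.StringIO()
    cp.write(out)
    return out.getvalue()


if __name__ == "__main__":
    original = (
        "[global]\n"
        "mon_initial_members = ucc01\n"
        "mon_host = 192.168.122.95\n"
        "\n"
        "[mon.ucc01]\n"
        "host = ucc01\n"
    )
    # The address below is a placeholder, not a value confirmed for bcc07.
    print(add_monitor(original, "bcc07", "192.0.2.7"))
```

The function works on text rather than on /etc/ceph/ceph.conf directly, so the result can be reviewed before it is copied to the nodes.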


I adjusted ceph.conf as described and pushed it to all the nodes in the cluster at step 1. I then continued with the procedure above, and when I started bcc07 it entered the probing state.

Checking the log of ucc01 I get the following line repeated:

2014-10-30 18:02:42.617404 7ff6d4b2a700 1 mon.ucc01@0(leader) e1 peer 192.168.122.42:6789/0 missing features 824633720832

And the log of bcc07 has this line repeated:

2014-10-30 18:07:44.782907 7fb0fe22a700 1 mon.bcc07@0(probing).paxos(paxos recovering c 0..0) is_readable now=2014-10-30 18:07:44.782912 lease_expire=0.000000 has v0 lc 0
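
The "missing features" number in the ucc01 log line looks like a feature bitmask, so it may help to see which bits are set. The sketch below only decodes the integer into bit positions; it makes no attempt to name the features, since the mapping from bit to feature name depends on the Ceph release and would need to be checked against that release's feature definitions. The function name is an assumption for illustration.

```python
def set_bits(mask):
    """Return the positions of the bits set in a non-negative integer."""
    return [i for i in range(mask.bit_length()) if (mask >> i) & 1]


if __name__ == "__main__":
    missing = 824633720832  # value from the ucc01 log line
    print(set_bits(missing))  # → [38, 39]
```

Since 2^38 + 2^39 = 824633720832, the leader is reporting that the peer lacks two feature bits. That would be consistent with the two systems running different Ceph versions or builds, though nothing in the thread confirms this.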

This is the most communication I've seen between them so this is on the right track I guess!

Do you know what the problem might be here?

Thanks very much for your time!


Patrick



When I carry out these steps to add a baserock system to the ubuntu cluster, the monitor node is not added to the cluster,
and the admin socket mon_status gives the following output.

   ~ # ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.bcc07.asok mon_status
   { "name": "bcc07",
     "rank": -1,
     "state": "probing",
     "election_epoch": 0,
     "quorum": [],
     "outside_quorum": [],
     "extra_probe_peers": [],
     "sync_provider": [],
     "monmap": { "epoch": 0,
         "fsid": "4460079d-42f4-4e3a-8ce3-e2a7fa2685e6",
         "modified": "2014-10-27 12:37:25.531542",
         "created": "2014-10-27 12:37:25.531542",
         "mons": [
               { "rank": 0,
                 "name": "ucc01",
                 "addr": "192.168.122.95:6789\/0"}]}}

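The mon_status output above can be summarised programmatically, which is handy when comparing several attempts. This is a rough sketch, assuming the output is valid JSON as shown; the function name and the keys of the returned summary are made up for illustration. For the output above it reports that bcc07 is probing, is not listed in its own monmap (only ucc01 is), and is not in quorum.

```python
import json

# The mon_status output from the message, kept as a raw string so the
# JSON escape "\/" is passed through to the parser unchanged.
SAMPLE = r'''
{ "name": "bcc07",
  "rank": -1,
  "state": "probing",
  "election_epoch": 0,
  "quorum": [],
  "outside_quorum": [],
  "extra_probe_peers": [],
  "sync_provider": [],
  "monmap": { "epoch": 0,
      "fsid": "4460079d-42f4-4e3a-8ce3-e2a7fa2685e6",
      "modified": "2014-10-27 12:37:25.531542",
      "created": "2014-10-27 12:37:25.531542",
      "mons": [
            { "rank": 0,
              "name": "ucc01",
              "addr": "192.168.122.95:6789\/0"}]}}
'''


def summarize_mon_status(raw):
    """Pick out the fields that matter when a monitor is stuck probing."""
    status = json.loads(raw)
    known = [mon["name"] for mon in status["monmap"]["mons"]]
    return {
        "name": status["name"],
        "state": status["state"],
        "in_monmap": status["name"] in known,
        "in_quorum": status["rank"] in status["quorum"],
        "monmap_mons": known,
    }


if __name__ == "__main__":
    for key, value in summarize_mon_status(SAMPLE).items():
        print(f"{key}: {value}")
```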

And the newly added monitor remains stuck in the probing state
indefinitely. To try and resolve
this issue I have worked through the monitor troubleshooting page of
the ceph documentation, e.g. ntp synchronisation and checking network
connectivity (to the best of my ability :-s ).
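
One simple way to check the network connectivity mentioned above is to try a TCP connection to the existing monitor's port from the new node. The sketch below assumes port 6789, taken from the monitor address in the monmap shown in this thread; the function name is illustrative. A successful connection only shows that the port is reachable, not that the monitors can complete their handshake.

```python
import socket


def can_connect(host, port=6789, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False
```

From the new monitor node this could be called as `can_connect("192.168.122.95")`, using the ucc01 address from the monmap.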

It is also worth mentioning that I have created a 3-node ceph cluster on baserock machines (1 mon, 2 osds) and then successfully added monitor nodes running baserock and ubuntu systems using the same 8-step process given
above.

This leaves me confused as to why adding a monitor running on baserock to
the all-ubuntu cluster specifically is causing problems.

Are there any reasons why this 'probing' problem could be occurring? I'm feeling a little stuck on how to proceed and would welcome any suggestions.

Thanks for your help,

Patrick
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
