OK, that change makes the error message go away, but the ceph command then seemingly hangs:

[centos@cnode-01 ~]$ time sudo ceph --cluster es-c1 --status
^CCluster connection aborted

real    6m33.555s
user    0m0.140s
sys     0m0.041s

________________________________
From: Eugen Block <eblock@xxxxxx>
Sent: Thursday, May 13, 2021 9:15 AM
To: Tuffli, Chuck <chuck.tuffli@xxxxxxx>
Cc: ceph-users@xxxxxxx <ceph-users@xxxxxxx>
Subject: Re: Re: monitor connection error

Sorry, hit the send button too early. Could you try and rename the
keyring file like this?

es-c1.client.admin.keyring
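A minimal sketch of the rename being suggested, assuming the keyring
sits in the default /etc/ceph location shown later in the thread:

# hypothetical one-liner; adjust the path if your keyring lives elsewhere
sudo mv /etc/ceph/ceph.client.admin.keyring /etc/ceph/es-c1.client.admin.keyring

With a non-default cluster name the client looks for
/etc/ceph/$cluster.client.admin.keyring, so the file name needs the
es-c1 prefix.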
Zitat von Eugen Block <eblock@xxxxxx>:

> I think you need to rename the admin keyring file according to the
> cluster name, too.
>
> Zitat von "Tuffli, Chuck" <chuck.tuffli@xxxxxxx>:
>
>> Thank you for trying to reproduce my issue. I think I did that step:
>>
>> [centos@cnode-01 ~]$ ls -l /etc/ceph/ceph.client.admin.keyring
>> -rw-------. 1 root root 151 May 10 23:20 /etc/ceph/ceph.client.admin.keyring
>> [centos@cnode-01 ~]$ sudo cat /etc/ceph/ceph.client.admin.keyring
>> [client.admin]
>>     key = AQC6v5lgi0JBAhAAJ9Duj11SufIKidydIgO82Q==
>>     caps mds = "allow *"
>>     caps mgr = "allow *"
>>     caps mon = "allow *"
>>     caps osd = "allow *"
>> [centos@cnode-01 ~]$ sudo grep -r AQC6v5 /etc/ceph /tmp
>> /etc/ceph/ceph.client.admin.keyring:  key = AQC6v5lgi0JBAhAAJ9Duj11SufIKidydIgO82Q==
>> /tmp/ceph.mon.keyring:  key = AQC6v5lgi0JBAhAAJ9Duj11SufIKidydIgO82Q==
>> [centos@cnode-01 ~]$
>>
>> ________________________________
>> From: Eugen Block <eblock@xxxxxx>
>> Sent: Thursday, May 13, 2021 12:37 AM
>> To: Tuffli, Chuck <chuck.tuffli@xxxxxxx>
>> Cc: ceph-users@xxxxxxx <ceph-users@xxxxxxx>
>> Subject: Re: Re: monitor connection error
>>
>> I could reproduce your issue; it seems you're missing a keyring file
>> in /etc/ceph. Did you miss step 9 from the manual deployment guide?
>>
>> sudo ceph-authtool --create-keyring /etc/ceph/ceph.client.admin.keyring \
>>     --gen-key -n client.admin --cap mon 'allow *' --cap osd 'allow *' \
>>     --cap mds 'allow *' --cap mgr 'allow *'
>>
>> Zitat von "Tuffli, Chuck" <chuck.tuffli@xxxxxxx>:
>>
>>>> -----Original Message-----
>>>> From: Eugen Block [mailto:eblock@xxxxxx]
>>>> Sent: Tuesday, May 11, 2021 11:39 PM
>>>> To: ceph-users@xxxxxxx
>>>> Subject: Re: monitor connection error
>>>>
>>>> Hi,
>>>>
>>>>> What is this error trying to tell me? TIA
>>>>
>>>> It tells you that the cluster is not reachable from the client; this
>>>> can have various reasons.
>>>>
>>>> Can you show the output of your conf file?
>>>>
>>>> cat /etc/ceph/es-c1.conf
>>>
>>> [centos@cnode-01 ~]$ cat /etc/ceph/es-c1.conf
>>> [global]
>>> fsid = 3c5da069-2a03-4a5a-8396-53776286c858
>>> mon_initial_members = cnode-01,cnode-02,cnode-03
>>> mon_host = 192.168.122.39
>>> public_network = 192.168.122.0/24
>>> auth_cluster_required = cephx
>>> auth_service_required = cephx
>>> auth_client_required = cephx
>>> osd_journal_size = 1024
>>> osd_pool_default_size = 3
>>> osd_pool_default_min_size = 2
>>> osd_pool_default_pg_num = 333
>>> osd_pool_default_pgp_num = 333
>>> osd_crush_chooseleaf_type = 1
>>> [centos@cnode-01 ~]$
>>>
>>>> Is the monitor service up and running? I take it you don't use
>>>> cephadm yet, so it's not a containerized environment?
>>>
>>> Correct, this is bare metal and not a containerized environment. And
>>> I believe it is running:
>>>
>>> [centos@cnode-01 ~]$ sudo systemctl --all | grep ceph
>>> ceph-crash.service         loaded active running Ceph crash dump collector
>>> ceph-mon@cnode-01.service  loaded active running Ceph cluster monitor daemon
>>> system-ceph\x2dmon.slice   loaded active active  system-ceph\x2dmon.slice
>>> ceph-mon.target            loaded active active  ceph target allowing to start/stop all ceph-mon@.service instances at once
>>> ceph.target                loaded active active  ceph target allowing to start/stop all ceph*@.service instances at once
>>> [centos@cnode-01 ~]$
>>>
>>>> Regards,
>>>> Eugen
>>>>
>>>> Zitat von "Tuffli, Chuck" <chuck.tuffli@xxxxxxx>:
>>>>
>>>>> Hi
>>>>>
>>>>> I'm new to ceph and have been following the Manual Deployment
>>>>> document [1]. The process seems to work correctly until step 18
>>>>> ("Verify that the monitor is running"):
>>>>>
>>>>> [centos@cnode-01 ~]$ uname -a
>>>>> Linux cnode-01 3.10.0-693.5.2.el7.x86_64 #1 SMP Fri Oct 20 20:32:50 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
>>>>> [centos@cnode-01 ~]$ ceph -v
>>>>> ceph version 15.2.11 (e3523634d9c2227df9af89a4eac33d16738c49cb) octopus (stable)
>>>>> [centos@cnode-01 ~]$ sudo ceph --cluster es-c1 -s
>>>>> [errno 2] RADOS object not found (error connecting to the cluster)
>>>>> [centos@cnode-01 ~]$
>>>>>
>>>>> What is this error trying to tell me? TIA
>>>>>
>>>>> [1] https://docs.ceph.com/en/latest/install/manual-deployment/
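One way to verify the monitor independently of the client connection
path (the check step 18 is about, and a useful cross-check when the
client hangs as above) is the monitor's admin socket. A minimal sketch,
assuming the default /var/run/ceph run directory and the standard
$cluster-$name.asok naming for this es-c1 cluster:

# hypothetical socket path, built from cluster name es-c1 and mon id cnode-01
sudo ceph --admin-daemon /var/run/ceph/es-c1-mon.cnode-01.asok mon_status

If this prints monitor state while "ceph --cluster es-c1 -s" still
hangs, the monitor itself is up and the problem lies in the client's
network or cephx authentication path.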