Hi! Am 25.03.2019 um 15:07 schrieb Brian Topping:
Did you check port access from other nodes? My guess is a forgotten firewall re-emerged on that node after reboot.
I am pretty sure it's not the firewall. To be extra sure, I switched it off for testing.
I found this in the mon-logs: On the working MONs2019-03-25 14:10:41.386 7fb0322c4ec0 0 starting mon.cephtmon01 rank 0 at public addrs [v2:172.17.0.35:3300/0,v1:172.17.0.35:6789/0] at bind addrs [v2:172.17.0.35:3300/0,v1:172.17.0.35:6789/0] mon_data /var/lib/ceph/mon/ceph-cephtmon01 fsid f8d766ec-6306-4442-bc08-97facc64e1d8
and2019-03-25 14:10:36.651 7f65b9344ec0 0 starting mon.cephtmon02 rank 1 at public addrs [v2:172.17.0.36:3300/0,v1:172.17.0.36:6789/0] at bind addrs [v2:172.17.0.36:3300/0,v1:172.17.0.36:6789/0] mon_data /var/lib/ceph/mon/ceph-cephtmon02 fsid f8d766ec-6306-4442-bc08-97facc64e1d8
but on the defunct MON this line is repeated several times each second:2019-03-25 15:44:09.747 7f5daabf1ec0 0 starting mon.cephtmon03 rank 2 at public addrs v1:172.17.0.37:6789/0 at bind addrs v1:172.17.0.37:6789/0 mon_data /var/lib/ceph/mon/ceph-cephtmon03 fsid f8d766ec-6306-4442-bc08-97facc64e1d8
All hosts have identical ceph.conf. Did this host miss the switch to msgr2? Can I enforce it somehow?
-- Jörn Clausen Daten- und Rechenzentrum GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel Düsternbrookerweg 20 24105 Kiel
Attachment:
smime.p7s
Description: S/MIME Cryptographic Signature
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com