On Fri, 25 Mar 2022 14:22:07 +0100 grin <cephlist@xxxxxxxxxxxx> wrote:

> Adding back the hosts (`ceph orch add host <name> <ip> _admin`) gave
> the result of all the daemons in "starting" on master view, while
> visibly running on the host, and not joining the cluster. It stays
> "starting" forever.

I believe I have found the cause of that, but I cannot see the reason
for it: cephadm chokes on a command it starts over ssh:

cephadm ['--env', 'CEPH_VOLUME_OSDSPEC_AFFINITY=all-available-devices', '--image', 'quay.io/ceph/ceph@sha256:0cae510dabfda2c03a3cedcf0e8af1c66cddad9e18b1634725efb3c228897f84', 'ceph-volume', '--fsid', '61a42e5e-abb5-11ec-ae5a-ca5232992f14', '--config-json', '-', '--', 'lvm', 'batch', '--no-auto', '/dev/sda', '/dev/sdb', '/dev/sdc', '/dev/sdd', '/dev/sde', '/dev/sdf', '/dev/sdg', '/dev/sdh', '/dev/sdi', '/dev/sdj', '--yes', '--no-systemd']

This command gets stuck in an endless syscall and blocks the ssh channel.

Interestingly, the other "stuck" host chokes on a related but slightly
different part of the OSD creation (a result or part of the same
command, I believe):

 \_ /usr/libexec/platform-python -s /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring -i - osd new f8d92ec7-ee18-4427-be5b-820407aa88de

All 4 machines have identical configurations, though. Neither of these
two hosts had anything LVM-related created by ceph yet. No problems are
visible in syslog.

g
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx