Re: Getting started with cephadm

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Podman is fine (preferably 3.0+). What were those variables set to
before? With most recent distributions and kernels we've not noticed a
problem with the defaults. Did you notice errors that lead to you
changing them? We have many clusters of 21 nodes, 24 HDDs each,
multiple NVMEs serving as WAL/DB which were on 15.2.7 and prior, but
now all are 15.2.9, running in podman 3.0.1 (fixes issues with the 2.2
series on upgrade). We have less RAM (128G) per node without issues.

On the OSDs that will not start - what error(s) do you see? You can
inspect the OSDs with "podman logs <id>" if they've started inside of
podman but just aren't joining the cluster; if they haven't, then
looking at the systemctl status for the service or journalctl will
normally give more insight. Hopefully the root cause of your problems
can be identified so it can be addressed directly.

On Sat, Feb 27, 2021 at 11:34 AM Peter Childs <pchilds@xxxxxxx> wrote:
>
> I'm new to ceph, and I've been trying to set up a new cluster with 16
> computers with 30 disks each and 6 SSD (plus boot disks), 256G of memory,
> IB Networking. (ok its currently 15 but never mind)
>
> When I take them over about 10 OSD's each they start having problems
> starting the OSD up and I can normally fix this by rebooting them and it
> will continue again for a while, and it is possible to get them up to the
> full complement with a bit of poking around. (Once its working it fne
> unless you start adding services or moving the OSD's around
>
> Is there anything I can change to make it a bit more stable.
>
> I've already set
>
> fs.aio-max-nr = 1048576
> kernel.pid_max = 4194303
> fs.file-max = 500000
>
> which made it a bit better, but I feel it could be even better.
>
> I'm currently trying to upgrade to 15.2.9 from the default cephadm version
> of octopus.  The upgrade is going very very slowly. I'm currently using
> podman if that helps, I'm not sure if docker would be better? (I've mainly
> used singularity when I've handled containers before)
>
> Thanks in advance
>
> Peter Childs
> _______________________________________________
> ceph-users mailing list -- ceph-users@xxxxxxx
> To unsubscribe send an email to ceph-users-leave@xxxxxxx
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux