Hi, I have a problem I hope is possible to solve… I upgraded to 9.2.0 a couple of days back and I missed this part: “If your systems already have a ceph user, upgrading the package will cause problems. We suggest you
first remove or rename the existing ‘ceph’ user and ‘ceph’ group before upgrading.” I guess that might be the reason why my OSD:s has started to die on me. I can get the osd-services when having the file permissions as root:root and using: setuser match path = /var/lib/ceph/$type/$cluster-$i I am really not sure where to look to find out what is wrong. First when I had upgraded and the OSD:s were restarted then I got a permission denied on the ods-directories and that was solve then adding the “setuser match” in ceph.conf. With 5 of 12 OSD:s down I am starting to worry and since I only have one replica I might lose som data. As I mentioned the OSD-services start and “ceph osd in” does not give me any error but the OSD never comes up. Any suggestions or helpful tips are most welcome, /Claes ID WEIGHT TYPE NAME UP/DOWN REWEIGHT PRIMARY-AFFINITY -1 24.00000 root default -2 8.00000 host black 3 2.00000 osd.3 up 1.00000 1.00000 2 2.00000 osd.2 up 1.00000 1.00000 0 2.00000 osd.0 up 1.00000 1.00000 1 2.00000 osd.1 up 1.00000 1.00000 -3 8.00000 host purple 7 2.00000 osd.7 down 0 1.00000 6 2.00000 osd.6 up 1.00000 1.00000 4 2.00000 osd.4 up 1.00000 1.00000 5 2.00000 osd.5 up 1.00000 1.00000 -4 8.00000 host orange 11 2.00000 osd.11 down 0 1.00000 10 2.00000 osd.10 down 0 1.00000 8 2.00000 osd.8 down 0 1.00000 9 2.00000 osd.9 down 0 1.00000 root@black:/var/log/ceph# ceph -s 2015-11-15 21:55:27.919339 7ffb38446700 0 -- :/1336310814 >> 172.16.0.203:6789/0 pipe(0x7ffb34064550 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7ffb3405e000).fault cluster ee8eae7a-5994-48bc-bd43-aa07639a543b health HEALTH_WARN 1591 pgs backfill 38 pgs backfilling 2439 pgs degraded 105 pgs down 106 pgs peering 138 pgs stale 2439 pgs stuck degraded 106 pgs stuck inactive 138 pgs stuck stale 2873 pgs stuck unclean 2439 pgs stuck undersized 2439 pgs undersized recovery 1694156/6668499 objects degraded (25.405%) recovery 2315800/6668499 objects misplaced (34.727%) too many PGs per OSD (1197 > max 350) 1 mons down, quorum 0,1 black,purple monmap e3: 3 mons at {black=172.16.0.201:6789/0,orange=172.16.0.203:6789/0,purple=172.16.0.202:6789/0} election epoch 448, quorum 0,1 black,purple mdsmap e5: 0/0/1 up osdmap e34098: 12 osds: 7 up, 7 in; 2024 remapped pgs pgmap v8211622: 4608 pgs, 3 pools, 12027 GB data, 3029 kobjects 17141 GB used, 8927 GB / 26069 GB avail 1694156/6668499 objects degraded (25.405%) 2315800/6668499 objects misplaced (34.727%) 1735 active+clean 1590 active+undersized+degraded+remapped+wait_backfill 637 active+undersized+degraded 326 active+remapped 137 stale+active+undersized+degraded 101 down+peering 38 active+undersized+degraded+remapped+backfilling 37 active+undersized+degraded+remapped 4 down+remapped+peering 1 stale+remapped+peering 1 active 1 active+remapped+wait_backfill recovery io 66787 kB/s, 16 objects/s |
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com