Re: PG is in 'stuck unclean' state, but all acting OSD are up

Goncalo Borges <goncalo.borges@xxxxxxxxxxxxx> · Tue, 16 Aug 2016 13:36:54 +1000



    Hi Chris...
    The precise osd set you see now [79,8,74] was obtained on epoch
      104536 but this was after a lot of tries as showed by the recovery
      section. 

    
    Actually, in the first try (on epoch 100767) osd 116 was selected
      somehow (maybe it was up at the time?) and probably the pg got
      stuck because it went down during the recover process?
    
      recovery_state": [

                {

                    "name": "Started\/Primary\/Peering\/GetInfo",

                    "enter_time": "2016-08-11 11:45:06.052568",

                    "requested_info_from": []

                },

                {

                    "name": "Started\/Primary\/Peering",

                    "enter_time": "2016-08-11 11:45:06.052558",

                    "past_intervals": [

                        {

                            "first": 100767,

                            "last": 100777,

                            "maybe_went_rw": 1,

                            "up": [

                                79,

                                116,

                                74

                            ],

                            "acting": [

                                79,

                                116,

                                74

                            ],

                            "primary": 79,

                            "up_primary": 79

                        },

      
    The pg query also shows

    
      peering_blocked_by": [

                        {

                            "osd": 116,

                            "current_lost_at": 0,

                            "comment": "starting or marking this osd
        lost may let us proceed"

                        }

      
    Maybe, you can check the documentation in [1] and see if you
      think you could follow the suggestion inside the pg and mark osd
      116 as lost. This should be done after proper evaluation from you.
    Another thing I found strange is that in the recovery section,
      there are a lot of tries where you do not get a proper osd set.
      The very last recover try was on epoch 104540. 

      
                    {

                          "first": 104536,

                          "last": 104540,

                          "maybe_went_rw": 1,

                          "up": [

                              2147483647,

                              8,

                              74

                          ],

                          "acting": [

                              2147483647,

                              8,

                              74

                          ],

                          "primary": 8,

                          "up_primary": 8

                      }

    
    From [2], "When CRUSH fails to find enough OSDs to map to a PG,
      it will show as a 2147483647 which is ITEM_NONE or no OSD found.".
      

    This could be an artifact of the peering being blocked by
      osd.116, or a genuine problem where you are not being able to get
      a proper osd set. That could be for a variety of reasons: from
      network issues, to osds being almost full or simply because the
      system can't get 3 osds in 3 different hosts.
    Cheers
    Goncalo

    
    [1]
http://docs.ceph.com/docs/master/rados/troubleshooting/troubleshooting-pg/#placement-group-down-peering-failure
    [2]
http://docs.ceph.com/docs/master/rados/troubleshooting/troubleshooting-pg/
    

    On 08/16/2016 11:42 AM, Heller, Chris
      wrote:

    
        Output of
            `ceph pg dump_stuck`
         
        # ceph pg
            dump_stuck
        ok
        pg_stat
            state   up      up_primary      acting  acting_primary
        4.2a8  
            down+peering    [79,8,74]       79      [79,8,74]       79
        4.c3   
            down+peering    [56,79,67]      56      [56,79,67]      56
         
        -Chris
         
        
          From: Goncalo Borges
              <goncalo.borges@xxxxxxxxxxxxx>

              Date: Monday, August 15, 2016 at 9:03 PM

              To: "ceph-users@xxxxxxxxxxxxxx"
              <ceph-users@xxxxxxxxxxxxxx>, "Heller, Chris"
              <cheller@xxxxxxxxxx>

              Subject: Re:  PG is in 'stuck unclean'
              state, but all acting OSD are up
        
        
            Hi Heller...
            Can you actually post the result of
               ceph pg dump_stuck ?
            Cheers
            G.
             
             
              On 08/15/2016 10:19 PM, Heller, Chris
                wrote:
            
            
              I’d like to better understand
                  the current state of my CEPH cluster.
               
              I currently have 2 PG that
                  are in the ‘stuck unclean’ state:
               
              # ceph health detail
              HEALTH_WARN 2 pgs down; 2 pgs
                  peering; 2 pgs stuck inactive; 2 pgs stuck unclean
              pg 4.2a8 is stuck inactive
                  for 124516.777791, current state down+peering, last
                  acting [79,8,74]
              pg 4.c3 is stuck inactive
                  since forever, current state down+peering, last acting
                  [56,79,67]
              pg 4.2a8 is stuck unclean for
                  124536.223284, current state down+peering, last acting
                  [79,8,74]
              pg 4.c3 is stuck unclean
                  since forever, current state down+peering, last acting
                  [56,79,67]
              pg 4.2a8 is down+peering,
                  acting [79,8,74]
              pg 4.c3 is down+peering,
                  acting [56,79,67]
               
              While my cluster does
                  currently have some down OSD, none are in the acting
                  set for either PG:
               
              ceph osd tree | grep down
              73   1.00000        
                  osd.73              down        0          1.00000
              96   1.00000        
                  osd.96              down        0          1.00000
              110   1.00000        
                  osd.110             down        0          1.00000
              116   1.00000        
                  osd.116             down        0          1.00000
              120   1.00000        
                  osd.120             down        0          1.00000
              126   1.00000        
                  osd.126             down        0          1.00000
              124   1.00000        
                  osd.124             down        0          1.00000
              119   1.00000        
                  osd.119             down        0          1.00000
               
              I’ve queried one of the two
                  PG, and see that recovery is currently blocked on
                  OSD.116, which is indeed down, but is not part of the
                  acting set of OSD for that PG:
               
              http://pastebin.com/Rg2hK9GE
               
              This is all with CEPH version
                  0.94.3:
               
              # ceph version
              ceph version 0.94.3
                  (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
               
              Why does this PG remain
                  ‘stuck unclean’?
              Is there some steps I can
                  take to unstick it, given that all the acting OSD are
                  up and in?
               
              (* Re-sent, now that I’m
                  subscribed to list *)
              -Chris
              

              _______________________________________________
              ceph-users mailing list
              ceph-users@xxxxxxxxxxxxxx
              http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
            
            
            -- 
            Goncalo Borges
            Research Computing
            ARC Centre of Excellence for Particle Physics at the Terascale
            School of Physics A28 | University of Sydney, NSW  2006
            T: +61 2 93511937
          
        
    -- 
Goncalo Borges
Research Computing
ARC Centre of Excellence for Particle Physics at the Terascale
School of Physics A28 | University of Sydney, NSW  2006
T: +61 2 93511937
  

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com