Re: Help: pool not responding

Hello Mario,

This kind of problem usually has one of the following causes:

1-) One of the OSD nodes has a network problem.
2-) A disk has failed.
3-) The OSD nodes do not have enough resources.
4-) The OSD disks are slow.

This has happened to me before. The cause was a bad network cable. As soon as I replaced the cable, everything was fine and dandy.
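
To decide which OSD node to check first, it can help to tally the health output per OSD. Below is a minimal sketch, not an official Ceph tool: it parses a few lines copied from the health output quoted further down and counts, per OSD, how many incomplete PGs it leads and how many blocked ops it holds. The function name `summarize_health` and the regular expressions are my own assumptions about the text layout, and it relies on the first OSD in an acting set being the primary.

```python
import re
from collections import Counter

# Lines copied from the health output quoted in this thread.
SAMPLE = """\
pg 0.40 is incomplete, acting [1,0,3]
pg 0.3f is incomplete, acting [0,3,1]
pg 0.3b is incomplete, acting [0,3,1]
pg 0.0 is incomplete, acting [0,1,3]
7 ops are blocked > 2097.15 sec on osd.0
"""

# Assumed line layouts; adjust if your Ceph release prints them differently.
PG_RE = re.compile(r"^pg (\S+) is incomplete, acting \[([\d,]+)\]$")
OPS_RE = re.compile(r"^(\d+) ops are blocked > [\d.]+ sec on osd\.(\d+)$")


def summarize_health(text):
    """Return (incomplete PGs per primary OSD, blocked ops per OSD)."""
    primaries = Counter()
    blocked = Counter()
    for line in text.splitlines():
        line = line.strip()
        m = PG_RE.match(line)
        if m:
            acting = [int(x) for x in m.group(2).split(",")]
            primaries[acting[0]] += 1  # first OSD in the acting set
            continue
        m = OPS_RE.match(line)
        if m:
            blocked[int(m.group(2))] += int(m.group(1))
    return primaries, blocked


if __name__ == "__main__":
    primaries, blocked = summarize_health(SAMPLE)
    print("incomplete PGs per primary OSD:", dict(primaries))
    print("blocked ops per OSD:", dict(blocked))
```

If one OSD dominates both counts, the host of that OSD (its cable, NIC, and disk) is a reasonable place to start looking. This is only a hint, not a diagnosis.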

On Sun, Feb 14, 2016 at 9:56 PM, Mario Giammarco <mgiammarco@xxxxxxxxx> wrote:
Hello,
I am using Ceph Hammer under Proxmox.
I have a working cluster that I have been using for several months.
For reasons I have yet to discover, I am now in this situation:

HEALTH_WARN 4 pgs incomplete; 4 pgs stuck inactive; 4 pgs stuck unclean; 7 requests are blocked > 32 sec; 1 osds have slow requests
pg 0.0 is stuck inactive for 3541712.444492, current state incomplete, last acting [0,1,3]
pg 0.40 is stuck inactive for 1478467.695684, current state incomplete, last acting [1,0,3]
pg 0.3f is stuck inactive for 3541852.000546, current state incomplete, last acting [0,3,1]
pg 0.3b is stuck inactive for 3541865.897979, current state incomplete, last acting [0,3,1]
pg 0.0 is stuck unclean for 3555526.301120, current state incomplete, last acting [0,1,3]
pg 0.40 is stuck unclean for 3555526.301128, current state incomplete, last acting [1,0,3]
pg 0.3f is stuck unclean for 3555545.066879, current state incomplete, last acting [0,3,1]
pg 0.3b is stuck unclean for 3555579.201819, current state incomplete, last acting [0,3,1]
pg 0.40 is incomplete, acting [1,0,3]
pg 0.3f is incomplete, acting [0,3,1]
pg 0.3b is incomplete, acting [0,3,1]
pg 0.0 is incomplete, acting [0,1,3]
7 ops are blocked > 2097.15 sec
7 ops are blocked > 2097.15 sec on osd.0
1 osds have slow requests


The problem is that when I try to read from or write to the pool "rbd" (where I have
all my virtual machines), Ceph starts to log "slow reads" and the system hangs.
If I create another pool in the same cluster and create an image inside it, I can
read and write correctly (and fast), so it seems the cluster is working and only
the "rbd" pool is not.

Can you help me?
Thanks,
Mario



_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
