Re: rbd map hangs

On Thu, Jun 07, 2018 at 09:30:23AM PDT, Jason Dillaman spake thusly:
> I think what Ilya is saying is that it's a very old RHEL 7-based
> kernel (RHEL 7.1?). For example, the current RHEL 7.5 kernel includes
> numerous improvements that have been backported from the current
> upstream kernel.

Ah, I understand now. My VM servers tend not to get upgraded often as
restarting all of the VMs is a hassle. I'll fix that. Do we think that
is related to my issues? It has worked reliably for ages as far as
mapping rbd goes.

I still have the following in-flight requests. I set osd.73 out as
suggested and also restarted the OSD process on its node, but neither
seems to have had any effect, and I still have unkillable processes
blocked on mapped rbd devices. I could patch and reboot this box, which
would likely clear things up, but that will have to wait a week or so
and means downtime for 21 VMs, which is less than ideal. I would love to
get this fixed and finish transferring images from iSCSI storage to Ceph
RBD. Then I can retire the iSCSI storage, which frees up some amps to
bring more VM servers online, so that in the future I can live migrate
these VMs for easier reboots and upgrades. That's the real limiting
factor here.

# find /sys/kernel/debug/ceph -type f -print -exec cat {} \;
/sys/kernel/debug/ceph/b2b00aae-f00d-41b4-a29b-58859aa41375.client31276017/osdmap
epoch 232501
flags
pool 0 pg_num 2500 (4095) read_tier -1 write_tier -1
pool 2 pg_num 512 (511) read_tier -1 write_tier -1
pool 3 pg_num 128 (127) read_tier -1 write_tier -1
pool 4 pg_num 100 (127) read_tier -1 write_tier -1
osd0    10.0.5.3:6801    54%    (exists, up)    100%
osd1    10.0.5.3:6812    57%    (exists, up)    100%
osd2    (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd3    10.0.5.4:6812    50%    (exists, up)    100%
osd4    (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd5    (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd6    10.0.5.9:6861    37%    (exists, up)    100%
osd7    10.0.5.9:6876    28%    (exists, up)    100%
osd8    10.0.5.9:6864    43%    (exists, up)    100%
osd9    10.0.5.9:6836    30%    (exists, up)    100%
osd10   10.0.5.9:6820    22%    (exists, up)    100%
osd11   10.0.5.9:6844    54%    (exists, up)    100%
osd12   10.0.5.9:6803    43%    (exists, up)    100%
osd13   10.0.5.9:6826    41%    (exists, up)    100%
osd14   10.0.5.9:6853    37%    (exists, up)    100%
osd15   10.0.5.9:6872    36%    (exists, up)    100%
osd16   (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd17   10.0.5.9:6812    44%    (exists, up)    100%
osd18   10.0.5.9:6817    48%    (exists, up)    100%
osd19   10.0.5.9:6856    33%    (exists, up)    100%
osd20   10.0.5.9:6808    46%    (exists, up)    100%
osd21   10.0.5.9:6871    41%    (exists, up)    100%
osd22   10.0.5.9:6816    49%    (exists, up)    100%
osd23   10.0.5.9:6823    56%    (exists, up)    100%
osd24   10.0.5.9:6800    54%    (exists, up)    100%
osd25   10.0.5.9:6848    54%    (exists, up)    100%
osd26   10.0.5.9:6840    37%    (exists, up)    100%
osd27   10.0.5.9:6883    69%    (exists, up)    100%
osd28   10.0.5.9:6833    39%    (exists, up)    100%
osd29   10.0.5.9:6809    38%    (exists, up)    100%
osd30   10.0.5.9:6829    51%    (exists, up)    100%
osd31   10.0.5.11:6828   47%    (exists, up)    100%
osd32   10.0.5.11:6848   25%    (exists, up)    100%
osd33   10.0.5.11:6802   56%    (exists, up)    100%
osd34   10.0.5.11:6840   35%    (exists, up)    100%
osd35   10.0.5.11:6856   32%    (exists, up)    100%
osd36   10.0.5.11:6832   26%    (exists, up)    100%
osd37   10.0.5.11:6868   42%    (exists, up)    100%
osd38   (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd39   10.0.5.11:6812   52%    (exists, up)    100%
osd40   10.0.5.11:6864   44%    (exists, up)    100%
osd41   10.0.5.11:6801   25%    (exists, up)    100%
osd42   10.0.5.11:6872   39%    (exists, up)    100%
osd43   10.0.5.13:6809   38%    (exists, up)    100%
osd44   10.0.5.11:6844   47%    (exists, up)    100%
osd45   10.0.5.11:6816   20%    (exists, up)    100%
osd46   10.0.5.3:6800    58%    (exists, up)    100%
osd47   10.0.5.2:6808    43%    (exists, up)    100%
osd48   10.0.5.2:6804    44%    (exists, up)    100%
osd49   10.0.5.2:6812    44%    (exists, up)    100%
osd50   10.0.5.2:6800    47%    (exists, up)    100%
osd51   10.0.5.4:6808    43%    (exists, up)    100%
osd52   10.0.5.12:6815   41%    (exists, up)    100%
osd53   10.0.5.11:6820   24%    (up)    100%
osd54   10.0.5.11:6876   34%    (exists, up)    100%
osd55   10.0.5.11:6836   48%    (exists, up)    100%
osd56   10.0.5.11:6824   31%    (exists, up)    100%
osd57   10.0.5.11:6860   48%    (exists, up)    100%
osd58   10.0.5.11:6852   35%    (exists, up)    100%
osd59   10.0.5.11:6800   42%    (exists, up)    100%
osd60   10.0.5.11:6880   58%    (exists, up)    100%
osd61   10.0.5.3:6803    52%    (exists, up)    100%
osd62   10.0.5.12:6800   42%    (exists, up)    100%
osd63   10.0.5.12:6819   46%    (exists, up)    100%
osd64   10.0.5.12:6809   44%    (exists, up)    100%
osd65   10.0.5.13:6800   44%    (exists, up)    100%
osd66   (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd67   10.0.5.13:6808   50%    (exists, up)    100%
osd68   10.0.5.4:6804    41%    (exists, up)    100%
osd69   10.0.5.4:6800    39%    (exists, up)    100%
osd70   10.0.5.13:6804   42%    (exists, up)    100%
osd71   (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd72   (unknown sockaddr family 0)       0%    (doesn't exist) 100%
osd73   10.0.5.16:6826   92%    (exists, up)    100%
osd74   10.0.5.16:6846  100%    (exists, up)    100%
osd75   10.0.5.16:6811   98%    (exists, up)    100%
osd76   10.0.5.16:6815  100%    (exists, up)    100%
osd77   10.0.5.16:6835   93%    (exists, up)    100%
osd78   10.0.5.16:6802   97%    (exists, up)    100%
osd79   10.0.5.16:6858  100%    (exists, up)    100%
osd80   10.0.5.16:6839   91%    (exists, up)    100%
osd81   10.0.5.16:6801  100%    (exists, up)    100%
osd82   10.0.5.16:6820   99%    (exists, up)    100%
osd83   10.0.5.16:6852   98%    (exists, up)    100%
osd84   10.0.5.16:6862   93%    (exists, up)    100%
osd85   10.0.5.16:6800   96%    (exists, up)    100%
/sys/kernel/debug/ceph/b2b00aae-f00d-41b4-a29b-58859aa41375.client31276017/monmap
epoch 12
    mon0    10.0.5.2:6789
    mon1    10.0.5.4:6789
    mon2    10.0.5.13:6789
/sys/kernel/debug/ceph/b2b00aae-f00d-41b4-a29b-58859aa41375.client31276017/osdc
34533231        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533233        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533234        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533235        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533236        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533237        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533238        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533239        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533241        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34919983        osd67   0.f4cdfa38 rbd_header.51f32238e1f29 5613'998386622791680    watch
34919984        osd6    2.5aca5ef2 rbd_header.93285b6b8b4567 4422885'943544185389056 watch
34919985        osd67   2.4dbc6037 rbd_header.5f75476b8b4567 28922'998386622791680   watch
34919986        osd1    2.ba8d973e rbd_header.dd3b556b8b4567 5305738'894263730634752 watch
/sys/kernel/debug/ceph/b2b00aae-f00d-41b4-a29b-58859aa41375.client31276017/monc
have osdmap 232501
want next osdmap
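
The osdc listing above is easier to read when grouped by OSD and
operation. A minimal sketch in Python, assuming the whitespace-separated
layout seen in this dump (tid, OSD, PG, object name, with the operation
list as the last field). The function name inflight_by_osd is
hypothetical, and other kernel versions may format the file differently:

```python
from collections import Counter


def inflight_by_osd(osdc_text):
    """Count in-flight requests per (osd, operation) pair.

    Assumes the line layout seen in the dump above:
    tid, osdN, pgid, object name, [other fields], operation list.
    Lines that do not match that shape are skipped.
    """
    counts = Counter()
    for line in osdc_text.splitlines():
        fields = line.split()
        if len(fields) < 5:
            continue
        tid, osd, ops = fields[0], fields[1], fields[-1]
        if tid.isdigit() and osd.startswith("osd"):
            counts[(osd, ops)] += 1
    return counts


# Three lines copied from the osdc output in this message.
sample = """\
34533231        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34533233        osd73   0.f0ae1f02 rbd_data.51f32238e1f29.00000000000013de set-alloc-hint,write
34919983        osd67   0.f4cdfa38 rbd_header.51f32238e1f29 5613'998386622791680    watch
"""

for (osd, ops), n in inflight_by_osd(sample).most_common():
    print(osd, ops, n)
```

On a live host the same function could be fed the contents of the osdc
file under /sys/kernel/debug/ceph/ (reading it requires root). In the
full dump above, this grouping would show all nine write requests queued
against osd73, with only watch entries on the other OSDs.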


-- 
Tracy Reed
http://tracyreed.org
Digital signature attached for your safety.


_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com