Re: Hanging VMs with Qemu + RBD

Robert LeBlanc <robert@xxxxxxxxxxxxx> · Fri, 19 Dec 2014 11:46:55 -0700

I think smaller clusters get chocked up with the default backfill. I've seen latency on a four node cluster with 10 OSD each improve by setting osd_max_backfills to 2. I would try lowering it and see if it helps.
Also, if you are running both cluster and VM traffic on the same network, you could get congestion especially on a 1 Gb network.

Robert LeBlanc

On Fri, Dec 19, 2014 at 9:33 AM, Nico Schottelius <nico-ceph-users@xxxxxxxxxxxxxxx> wrote:Hello,

another issue we have experienced with qemu VMs

(qemu 2.0.0) with ceph-0.80 on Ubuntu 14.04

managed by opennebula 4.10.1:

The VMs are completly frozen when rebalancing takes place,

they do not even respond to ping anymore.

Looking at the qemu processes they are in state "Sl".

Is this a known problem / have others seen this behaviour?

I have not yet tuned any backfilling parameters and it is a

cluster of 3 hosts with one host having 6 osds and two 1 one (so 8 osds

in total).

Our qemu runs with these rbd related options:

qemu-system-x86_64 ... -drive

    file=rbd:one/one-38:id=libvirt:key=...:auth_supported=cephx\;none:mon_host=kaffee.private.ungleich.ch\;wein.private.ungleich.ch\;tee.private.ungleich.ch,if=none,id=drive-ide0-0-0,format=raw,cache=none

Cheers,

Nico

--

New PGP key: 659B 0D91 E86E 7E24 FD15  69D0 C729 21A1 293F 2D24

_______________________________________________

ceph-users mailing list

ceph-users@xxxxxxxxxxxxxx

http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com