Re: ./osd/OSDMap.h:460: FAILED assert(exists(osd) && is_up(osd))

Yes, one OSD crashed approximately an hour before this happened.

Christian

-----Original Message-----
From: gregf@xxxxxxxxxxxxxxx [mailto:ceph-devel-owner@xxxxxxxxxxxxxxx] On behalf of Gregory Farnum
Sent: Tuesday, 26 October 2010 20:00
To: Christian Brunner
Cc: ceph-devel@xxxxxxxxxxxxxxx
Subject: Re: ./osd/OSDMap.h:460: FAILED assert(exists(osd) && is_up(osd))

Do you have any down OSDs? Looking through the code I am noticing an issue with down OSDs where op_submit can get an error code that it isn't watching for, but there may be checks in other places to prevent that actually happening.
-Greg
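
To illustrate the kind of check described above, here is a minimal C++ sketch. It is not Ceph's actual code: ToyOsdMap, pick_target, and the addresses are hypothetical stand-ins, and only the precondition exists(osd) && is_up(osd) is taken from the assertion in the trace. The idea is that the submit path tests the OSD's state itself and reports failure to the caller, instead of reaching get_inst() with a down OSD.

```cpp
#include <cassert>
#include <map>
#include <string>

// Toy stand-in for an OSD map (hypothetical, not Ceph's OSDMap):
// maps an OSD id to its up/down state and an address string.
struct ToyOsdMap {
    struct Entry {
        bool up;
        std::string addr;
    };
    std::map<int, Entry> osds;

    bool exists(int osd) const { return osds.count(osd) != 0; }
    bool is_up(int osd) const { return exists(osd) && osds.at(osd).up; }

    // Same precondition as the assertion in the trace:
    // the caller must pass an OSD that exists and is up.
    std::string get_inst(int osd) const {
        assert(exists(osd) && is_up(osd));
        return osds.at(osd).addr;
    }
};

// Hypothetical submit-side helper: returns false instead of asserting
// when the target OSD is invalid, missing, or down, so the caller can
// wait for a newer map and retry.
bool pick_target(const ToyOsdMap& m, int osd, std::string* out) {
    if (osd < 0 || !m.exists(osd) || !m.is_up(osd))
        return false;
    *out = m.get_inst(osd);
    return true;
}
```

With a map holding one up OSD and one down OSD, pick_target succeeds only for the up one; a down, missing, or negative id yields false rather than an abort.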

On Tue, Oct 26, 2010 at 5:13 AM, Christian Brunner <chb@xxxxxx> wrote:
> When accessing multiple RBD volumes from one VM in parallel, we hit
> an assertion failure:
>
> ./osd/OSDMap.h: In function 'entity_inst_t OSDMap::get_inst(int)':
> ./osd/OSDMap.h:460: FAILED assert(exists(osd) && is_up(osd))
> ceph version 0.22.1 (commit:c6f403a6f441184956e00659ce713eaee7014279)
> 1: (Objecter::op_submit(Objecter::Op*)+0x6c2) [0x38658854c2]
> 2: /usr/lib64/librados.so.1() [0x3865855dc9]
> 3: (RadosClient::aio_write(RadosClient::PoolCtx&, object_t, long, ceph::buffer::list const&, unsigned long, RadosClient::AioCompletion*)+0x24b) [0x386585724b]
> 4: (rados_aio_write()+0x9a) [0x386585741a]
> 5: /usr/bin/qemu-kvm() [0x45a305]
> 6: /usr/bin/qemu-kvm() [0x45a430]
> 7: /usr/bin/qemu-kvm() [0x43bb73]
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
> ./osd/OSDMap.h: In function 'entity_inst_t OSDMap::get_inst(int)':
> ./osd/OSDMap.h:460: FAILED assert(exists(osd) && is_up(osd))
> ceph version 0.22.1 (commit:c6f403a6f441184956e00659ce713eaee7014279)
> 1: (Objecter::op_submit(Objecter::Op*)+0x6c2) [0x38658854c2]
> 2: /usr/lib64/librados.so.1() [0x3865855dc9]
> 3: (RadosClient::aio_write(RadosClient::PoolCtx&, object_t, long, ceph::buffer::list const&, unsigned long, RadosClient::AioCompletion*)+0x24b) [0x386585724b]
> 4: (rados_aio_write()+0x9a) [0x386585741a]
> 5: /usr/bin/qemu-kvm() [0x45a305]
> 6: /usr/bin/qemu-kvm() [0x45a430]
> 7: /usr/bin/qemu-kvm() [0x43bb73]
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
> terminate called after throwing an instance of 'ceph::FailedAssertion'
> *** Caught signal (ABRT) ***
> ceph version 0.22.1 (commit:c6f403a6f441184956e00659ce713eaee7014279)
> 1: (sigabrt_handler(int)+0x91) [0x3865922b91]
> 2: /lib64/libc.so.6() [0x3c0c032a30]
> 3: (gsignal()+0x35) [0x3c0c0329b5]
> 4: (abort()+0x175) [0x3c0c034195]
> 5: (__gnu_cxx::__verbose_terminate_handler()+0x12d) [0x3c110beaad]
>
> This is reproducible by doing the following inside a VM:
>
> # mkfs.btrfs /dev/vdb /dev/vdc /dev/vdd /dev/vde
> # mount /dev/vdb /mnt
> # cd /mnt
> # bonnie++ -u root -d /mnt -f
>
> Any hints are welcome...
>
> Thanks,
>
> Christian
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>

