Hello, we operate 4 equal servers (HP DL360 G5), each with 6 hard disks. These disks are used for ceph OSD's and they are configured to use bcache. Once per week one of the servers stops with an Oops. System is Ubuntu 12.04 LTS 64bit with kernel package linux-image-3.11.0-15-generic. Here are the kernel messages of 2 incidents on different servers: (done with OCR, so there may be wrong characters) [1335759.070167] BUG: unable to handle kernel NULL pointer dereference at 0000000000000008 [1335759.070340] IP: [<ffffffffa021b59f>] bch_btree_node_read_done+0x4f/0x430 [bcache] [1335759.070494] PGD 802573067 PUD 800a1c067 PMD 0 [1335759.070612] Oops: 0002 [#1] SMP [1335759.070685] Modules linked in: ufs qnx4 hfsplus hfs minix ntfs msdos jfs reiserfs ext2 mptctl mptbase ipmi_de_intf bonding radeon xfs coretemp ttm i5000_edac edac_core drm_kms_helper bridge ipmi_si drm gpio_ich psmouse ipmi_msghandler i5k_amb hpwdt sh pchp lpc_ich i2c_algo_bit hpilo serio_raw stp mac_hid llc lp parport bcache ses enclosure btrfs hid_eneric usbhid hid hpsa raid 6_pq mpt2sas bnx2 scsi_transport_sas raid_class cciss xor zlib_deflate [1335759.072181] CPU: 1 PID: 18619 Comm: kworker/1:2 Not tainted 3.11.0-15-generic #25~precise1-Ubuntu [1335759.072323] Hardware name: HP ProLiant DL360 G5, BIOS P58 05/02/2011 [1335759.072427] Workqueue: bcache bch_journal [bcache] [1335759.072518] task: ffff880782425dc0 ti: ffff880776cfa000 task.ti: ffff880776cfa000 [1335759.072640] RIP: 0010:[<ffffffffa021b59f>] [<ffffffffa021b59f>] bch_btree_node_read_done+0x4f/0x430 [bcache] [1335759.072798] RSP: 0018:ffff880776cfbb38 EFLAGS: 00010246 [1335759.072896] RAX: 0000000000000400 RBX: ffff8807f97bc000 RCX: ffff8807f5480000 [1335759.073007] RDX: 0000000000000000 RSI: 0000000000000282 RDI: 0000000000000282 [1335759.073117] RBP: ffff880776cfbb68 R08: 0000000000011200 R09: 0000000000000f7d [1335759.073226] R10: 0000000000007d24 R11: 0000000000000000 R12: 0000000000000000 [1335759.073336] R13: ffff880004980000 R14: ffff88030f4e2020 R15: ffff880782425dc0 [1335759.073446] FS: 0000000000000000(0000) GS:ffff88082fa40000(0000) knlGS:0000000000000000 [1335759.073575] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [1335759.073665] CR2: 0000000000000008 CR3: 0000000802224000 CR4: 00000000000006e0 [1335759.073774] Stack: [1335759.073820] ffff880776cfbb68 ffff8807f97bc000 ffff880776cfbb78 0004bedda4b293fc [1335759.074011] ffff88030f4e2020 ffff880782425dc0 ffff880776cfbbd8 ffffffffa021d420 [1335759.074140] 0000000000000000 ffff880782425dc0 0000000000000000 0000000000000000 [1335759.074140] Call Trace: [1335759.074140] [<ffffffffa021d420>] bch_btree_node_read+0x150/0x1a0 [bcache] [1335759.074140] [<ffffffffa021e737>] bch_btree_node_et+0x147/0x290 [bcache] [1335759.074140] [<ffffffffa0220f5b>] bch_btree_insert_recurse+0x36b/0x510 [bcache] [1335759.074140] [<ffffffffa0221256>] bch_btree_insert+0x156/0x330 [bcache] [1335759.074140] [<ffffffffa022d61a>] bch_btree_insert_async+0x1a/0xe0 [bcache] [1335759.074140] [<ffffffffa0228139>] bch_journal+0x49/0x300 [bcache] [1335759.074140] [<ffffffff81081060>] process_one_work+0x170/0x4a0 [1335759.074140] [<ffffffff81082121>] worker_thread+0x121/0x390 [1335759.074140] [<ffffffff81082000>] ? manage_workers.isra.21+0x170/0x170 [1335759.074140] [<ffffffff81088fe0>] kthread+0xc0/0xd0 [1335759.074140] [<ffffffff81088f20>] ? flush_kthread_worker+0xb0/0xb0 [1335759.074140] [<ffffffff817508ec>] ret_from_fork+0x7c/0xb0 [1335759.074140] [<ffffffff81088f20>] ? flush_kthread_worker+0xb0/0xb0 [1335759.074140] Code: af c0 00 00 00 48 8b b8 50 35 00 00 e8 8b 04 f3 e0 48 8b 8b 80 00 00 00 49 89 c4 31 d2 0f b7 81 32 04 00 00 66 f7 b1 30 04 00 00 <49> c7 44 24 08 00 00 00 00 0f b7 c0 49 89 04 24 49 8b 45 10 48 [1335759.074140] RIP [<ffffffffa021b59f>] bch_btree_node_read_done+0x4f/0x430 [bcache] [1335759.074140] RSP <ffff880776cfbb38> [1335759.074140] CR2: 0000000000000008 [922864.546138] BUG: unable to handle kernel NULL pointer dereference at 0000000000000008 [922864.546303] IP: [<ffffffffa01dd59f>] bch_btree_node_read_done+0x4f/0x430 [bcache] [922864.546460] PGD 0 [922864.546509] Oops: 0002 [#1] SMP [922864.546578] Modules linked in: mptctl mptbase ipmi_de_intf xfs radeon ttm gpio_ich i5000_edac drm_kms_helper drm edac_core l pc_ich i2c_algo_bit coretemp i5k_amb psmouse ipmi_si shpchp serio_raw hpilo bonding ipmi_msghandler hpwdt mac_hid lp parport bca che ses enclosure btrfs hid_eneric usbhid hid hpsa raid6_q mpt2sas bnx2 scsi_transport_sas raid_class cciss xor zlib_deflate [922864.547799] CPU: 7 PID: 7846 Comm: kworker/7:0 Not tainted 3.11.0-15-generic #25~precise1-Ubuntu [922864.547935] Hardware name: HP ProLiant DL360 G5, BIOS P58 05/02/2011 [922864.548036] Workqueue: bcache bch_journal [bcache] [922864.548124] task: ffff88010d1f0000 ti: ffff8806058aa000 task.ti: ffff8806058aa000 [922864.548243] RIP: 0010:[<ffffffffa01dd59f>] [<ffffffffa01dd59f>] bch_btree_node_read_done+0x4f/0x430 [bcache] [922864.548412] RSP: 0018:ffff8806058abb38 EFLAGS: 00010246 [922864.548499] RAX: 0000000000000400 RBX: ffff8807f52cac00 RCX: ffff8807f7360000 [922864.548613] RDX: 0000000000000000 RSI: 0000000000000282 RDI: 0000000000000282 [922864.548720] RBP: ffff8806058abb68 R08: 0000000000011200 R09: 0000000000000f7d [922864.548827] R10: 0000000000007d24 R11: 0000000000000001 R12: 0000000000000000 [922864.548933] R13: ffff8800b40c0000 R14: ffff88009f735020 R15: ffff88010d1f0000 [922864.549041] FS: 0000000000000000(0000) GS:ffff88082fbc0000(0000) knlGS:0000000000000000 [922864.549166] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [922864.549253] CR2: 0000000000000008 CR3: 0000000001c0d000 CR4: 00000000000006e0 [922864.549360] Stack: [922864.549403] ffff8806058abb68 ffff8807f52cac00 ffff8806058abb78 00034757253e6300 [922864.549583] ffff88009f735020 ffff88010d1f0000 ffff8806058abbd8 ffffffffa01df420 [922864.549767] 0000000000000000 ffff88010d1f0000 0000000000000000 0000000000000000 [922864.549944] Call Trace: [922864.549996] [<ffffffffa01df420>] bch_btree_node_read+0x150/0x1a0 [bcache] [922864.550110] [<ffffffffa01e0737>] bch_btree_node_et+0x147/0x290 [bcache] [922864.550110] [<ffffffffa01e2f5b>] bch_btree_insert_recurse+0x36b/0x510 [bcache] [922864.550110] [<ffffffffa01e3256>] bch_btree_insert+0x156/0x330 [bcache] [922864.550110] [<ffffffffa01ef61a>] bch_btree_insert_async+0x1a/0xe0 [bcache] [922864.550110] [<ffffffffa01ea139>] bch_journal+0x49/0x300 [bcache] [922864.550110] [<ffffffff81081060>] process_one_work+0x170/0x4a0 [922864.550110] [<ffffffff81082121>] worker_thread+0x121/0x390 [922864.550110] [<ffffffff81082000>] ? manage_workers.isra.21+0x170/0x170 [922864.550110] [<ffffffff81088fe0>] kthread+0xc0/0xd0 [922864.550110] [<ffffffff81088f20>] ? flush_kthread_worker+0xb0/0xb0 [922864.550110] [<ffffffff817508ec>] ret_from_fork+0x7c/0xb0 [922864.550110] [<ffffffff81088f20>] ? flush_kthread_worker+0xb0/0xb0 [922864.550110] Code: af c0 00 00 00 48 8b b8 50 35 00 00 e8 8b e4 f6 e0 48 8b 8b 80 00 00 00 49 89 c4 31 d2 0f b7 81 32 04 00 0 0 66 f7 b1 30 04 00 00 <49> c7 44 24 08 00 00 00 00 0f b7 c0 49 89 04 24 49 8b 45 10 48 [922864.550110] RIP [<ffffffffa01dd59f>] bch_btree_node_read_done+0x4f/0x430 [bcache] [922864.550110] RSP <ffff8806058abb38> [922864.550110] CR2: 0000000000000008 Any help appreciated. Thanks. Tino -- Internal Services phone: +49 (0)341 594 00 5531 sprd.net AG Gießerstraße 27 04229 Leipzig Germany http://www.spreadshirt.net/ Vorstand/Executive Board: Matthias Spieß, Tobias Schaugg, Philip Rooke Aufsichtsratsvorsitzender/Chairman of the Supervisory Board: Lukasz Gadowski Handelsregister/Trade Register: Amtsgericht Leipzig, HRB 22478 Umsatzsteuer-IdentNummer/VAT-ID: DE 8138 7149 4 -- To unsubscribe from this list: send the line "unsubscribe linux-bcache" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html