On 03/25/2013 06:03 AM, Dan van der Ster wrote:
> Hi,
> Apologies if this is already a known bug (though I didn't find it).
>
> If we try to map a device that doesn't exist, we get an immediate and
> reproducible kernel BUG (see the P.S.). We hit this by accident
> because we forgot to add the --pool <ourpool>.

I have begun looking at this. I'd like to reproduce it
myself so I can more easily troubleshoot it.

> This works:
>
> [root@afs245 /]# rbd map afs254-vicepa --pool afs --id afs --keyring /etc/ceph/ceph.client.afs.keyring
> [root@afs245 /]# rbd showmapped
> id pool image         snap device
> 1  afs  afs254-vicepa -    /dev/rbd1
>
> But this BUGS:
>
> [root@afs245 /]# rbd map afs254-vicepa

You are doing this independent of the above command, right?
I.e., are you running the command after mapping it as shown
above, or are you doing it on a fairly pristine system?

Do you know if there is a problem using the default pool (rbd)?

I'll let you know if I am able to reproduce it before I hear
back from you.

Thanks a lot for reporting this. I've created an issue to track it.
http://tracker.ceph.com/issues/4559

-Alex

> BUG...
>
> Any clue?
>
> Cheers,
> Dan, CERN IT
>
>
> Mar 25 11:48:25 afs245 kernel: kernel BUG at mm/slab.c:3130!
> Mar 25 11:48:25 afs245 kernel: invalid opcode: 0000 [#1] SMP
> Mar 25 11:48:25 afs245 kernel: Modules linked in: rbd libceph libcrc32c cpufreq_ondemand ipv6 ext2 iTCO_wdt iTCO_vendor_support coretemp acpi_cpufreq freq_table mperf kvm_intel kvm crc32c_intel ghash_clmulni_intel microcode pcspkr serio_raw i2c_i801 lpc_ich joydev e1000e ses enclosure sg ixgbe hwmon dca ptp pps_core mdio ext3 jbd mbcache sd_mod crc_t10dif aesni_intel ablk_helper cryptd lrw aes_x86_64 xts gf128mul ahci libahci 3w_9xxx mpt2sas scsi_transport_sas raid_class video mgag200 ttm drm_kms_helper dm_mirror dm_region_hash dm_log dm_mod
> Mar 25 11:48:25 afs245 kernel: CPU 3
> Mar 25 11:48:25 afs245 kernel: Pid: 7444, comm: rbd Not tainted 3.8.4-1.el6.elrepo.x86_64 #1 Supermicro X9SCL/X9SCM/X9SCL/X9SCM
> Mar 25 11:48:25 afs245 kernel: RIP: 0010:[<ffffffff81179790>] [<ffffffff81179790>] cache_alloc_refill+0x270/0x3c0
> Mar 25 11:48:25 afs245 kernel: RSP: 0018:ffff8808028e5c48 EFLAGS: 00010082
> Mar 25 11:48:25 afs245 kernel: RAX: 00000000ffffffff RBX: ffff88082f000e00 RCX: ffff88082f000e00
> Mar 25 11:48:25 afs245 kernel: RDX: ffff8808055fba80 RSI: ffff88082f0028d0 RDI: ffff88082f002900
> Mar 25 11:48:25 afs245 kernel: RBP: ffff8808028e5ca8 R08: ffff88082f0028e0 R09: ffff8808010068c0
> Mar 25 11:48:25 afs245 kernel: R10: dead000000200200 R11: 0000000000000003 R12: 0000000000000000
> Mar 25 11:48:25 afs245 kernel: R13: ffff880807a71ec0 R14: ffff88082f0028c0 R15: 0000000000000004
> Mar 25 11:48:25 afs245 kernel: FS: 00007ff85056e760(0000) GS:ffff88082fd80000(0000) knlGS:0000000000000000
> Mar 25 11:48:25 afs245 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> Mar 25 11:48:25 afs245 kernel: CR2: 0000000000428220 CR3: 00000007eee7e000 CR4: 00000000001407e0
> Mar 25 11:48:25 afs245 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> Mar 25 11:48:25 afs245 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Mar 25 11:48:25 afs245 kernel: Process rbd (pid: 7444, threadinfo ffff8808028e4000, task ffff8807ef6fb520)
> Mar 25 11:48:25 afs245 kernel: Stack:
> Mar 25 11:48:25 afs245 kernel: ffff8808028e5d68 ffffffff8112fd5d ffff8808028e5de8 ffff880800ac7000
> Mar 25 11:48:25 afs245 kernel: ffffffff028e5c78 00000000000080d0 ffff8808028e5fd8 ffff88082f000e00
> Mar 25 11:48:25 afs245 kernel: 0000000000001078 0000000000000010 00000000000080d0 00000000000080d0
> Mar 25 11:48:25 afs245 kernel: Call Trace:
> Mar 25 11:48:25 afs245 kernel: [<ffffffff8112fd5d>] ? get_page_from_freelist+0x22d/0x710
> Mar 25 11:48:25 afs245 kernel: [<ffffffff8117ab08>] __kmalloc+0x168/0x340
> Mar 25 11:48:25 afs245 kernel: [<ffffffffa0380a35>] ? ceph_parse_options+0x65/0x410 [libceph]
> Mar 25 11:48:25 afs245 kernel: [<ffffffffa03ad2a0>] ? kzalloc+0x20/0x20 [rbd]
> Mar 25 11:48:25 afs245 kernel: [<ffffffffa0380a35>] ceph_parse_options+0x65/0x410 [libceph]
> Mar 25 11:48:25 afs245 kernel: [<ffffffff8117a2a4>] ? kmem_cache_alloc_trace+0x214/0x2e0
> Mar 25 11:48:25 afs245 kernel: [<ffffffff8117ac17>] ? __kmalloc+0x277/0x340
> Mar 25 11:48:25 afs245 kernel: [<ffffffffa03ad28f>] ? kzalloc+0xf/0x20 [rbd]
> Mar 25 11:48:25 afs245 kernel: [<ffffffffa03ae96a>] rbd_add_parse_args+0x1fa/0x250 [rbd]
> Mar 25 11:48:25 afs245 kernel: [<ffffffffa03b18d4>] rbd_add+0x84/0x2b4 [rbd]
> Mar 25 11:48:25 afs245 kernel: [<ffffffff813bc9a7>] bus_attr_store+0x27/0x30
> Mar 25 11:48:25 afs245 kernel: [<ffffffff81205d6f>] sysfs_write_file+0xef/0x170
> Mar 25 11:48:25 afs245 kernel: [<ffffffff81191904>] vfs_write+0xb4/0x130
> Mar 25 11:48:25 afs245 kernel: [<ffffffff811920af>] sys_write+0x5f/0xa0
> Mar 25 11:48:25 afs245 kernel: [<ffffffff810df656>] ? __audit_syscall_exit+0x246/0x2f0
> Mar 25 11:48:25 afs245 kernel: [<ffffffff815e1499>] system_call_fastpath+0x16/0x1b
> Mar 25 11:48:25 afs245 kernel: Code: 48 8b 00 48 8b 55 b0 8b 4d b8 48 8b 75 a8 4c 8b 45 a0 4c 8b 4d c0 a8 40 0f 84 b8 fe ff ff 49 83 cf 01 e9 af fe ff ff 0f 0b eb fe <0f> 0b eb fe 8b 75 c8 8b 55 cc 31 c9 48 89 df 81 ce 00 12 04 00
> Mar 25 11:48:25 afs245 kernel: RIP [<ffffffff81179790>] cache_alloc_refill+0x270/0x3c0
> Mar 25 11:48:25 afs245 kernel: RSP <ffff8808028e5c48>
> Mar 25 11:48:25 afs245 kernel: ---[ end trace 46b67e5b8b69abcb ]---
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com