I am trying to relieve load on a central NFS fileserver by pushing some disk IO to client machines that are clustered in a CDN.

I am using:

nfs-utils-1.2.0
cachefilesd-0.9

These are on AMD64 machines running Ubuntu Hardy.

I have applied the patch from:

http://bugzilla.kernel.org/show_bug.cgi?id=13469

against both Linus’ most recent tree retrieved with git, and also against a Debian backport 2.6.30 kernel as proposed elsewhere. I have applied the attached patches.

The issue I am seeing is:

Aug 17 20:07:00 dload04 kernel: [ 4238.561499] BUG: unable to handle kernel NULL pointer dereference at 0000000000000040
Aug 17 20:07:00 dload04 kernel: [ 4238.561612] IP: [<ffffffffa0308da5>] fscache_object_slow_work_execute+0x755/0xb70 [fscache]
Aug 17 20:07:00 dload04 kernel: [ 4238.561726] PGD 375a4067 PUD 375a2067 PMD 0
Aug 17 20:07:00 dload04 kernel: [ 4238.561785] Oops: 0002 [#1] SMP
Aug 17 20:07:00 dload04 kernel: [ 4238.561839] last sysfs file: /sys/kernel/uevent_seqnum
Aug 17 20:07:00 dload04 kernel: [ 4238.561896] CPU 2
Aug 17 20:07:00 dload04 kernel: [ 4238.561941] Modules linked in: cachefiles af_packet autofs4 ipv6 nfs lockd fscache nfs_acl auth_rpcgss sunrpc iptable_filter ip_tables x_tables usbhid hid uhci_hcd ehci_hcd usbkbd psmouse serio_raw parport_pc parport i2c_amd756 i2c_amd8111 amd_rng i2c_core k8temp button processor pcspkr shpchp pci_hotplug evdev ext3 jbd mbcache sd_mod sg sr_mod cdrom ata_generic pata_amd ohci_hcd pata_acpi sata_mv usbcore floppy tg3 libphy 3w_xxxx libata scsi_mod thermal fan thermal_sys fuse
Aug 17 20:07:00 dload04 kernel: [ 4238.562462] Pid: 4360, comm: kslowd Not tainted 2.6.30-iwin-lennybpo-cachefs #1 H8DAR-T
Aug 17 20:07:00 dload04 kernel: [ 4238.562560] RIP: 0010:[<ffffffffa0308da5>] [<ffffffffa0308da5>] fscache_object_slow_work_execute+0x755/0xb70 [fscache]
Aug 17 20:07:00 dload04 kernel: [ 4238.562678] RSP: 0018:ffff880037685ea0 EFLAGS: 00010246
Aug 17 20:07:00 dload04 kernel: [ 4238.562735] RAX: 0000000000000000 RBX: ffff8800bf590578 RCX: 06000000000000c1
Aug 17 20:07:00 dload04 kernel: [ 4238.562799] RDX: ffff88013a3c5390 RSI: ffffe200044bd318 RDI: ffff8800bf59051c
Aug 17 20:07:00 dload04 kernel: [ 4238.562863] RBP: ffff8800bf590500 R08: 0000000000000000 R09: ffffffff80683f00
Aug 17 20:07:00 dload04 kernel: [ 4238.562927] R10: 00000000ffffffff R11: 0000000000000000 R12: ffff8800bf59051c
Aug 17 20:07:00 dload04 kernel: [ 4238.562991] R13: 0000000000000000 R14: 00000001000f0640 R15: 0000000000000000
Aug 17 20:07:00 dload04 kernel: [ 4238.563055] FS: 0000000042331950(0000) GS:ffffc20000028000(0000) knlGS:0000000000000000
Aug 17 20:07:00 dload04 kernel: [ 4238.563157] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Aug 17 20:07:00 dload04 kernel: [ 4238.563215] CR2: 0000000000000040 CR3: 00000000375a5000 CR4: 00000000000006e0
Aug 17 20:07:00 dload04 kernel: [ 4238.563278] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 17 20:07:00 dload04 kernel: [ 4238.563342] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug 17 20:07:00 dload04 kernel: [ 4238.563407] Process kslowd (pid: 4360, threadinfo ffff880037684000, task ffff88007a338c80)
Aug 17 20:07:00 dload04 kernel: [ 4238.563506] Stack:
Aug 17 20:07:00 dload04 kernel: [ 4238.563548] ffff8800bf590578 0000000000000001 0000000000000004 0000000000000000
Aug 17 20:07:00 dload04 kernel: [ 4238.563617] 0000000000000000 ffffffff80297be3 0000000000000000 ffff88007a338c80
Aug 17 20:07:00 dload04 kernel: [ 4238.563721] ffffffff802524a0 ffff880037685ee8 ffff880037685ee8 ffffffff80297920
Aug 17 20:07:00 dload04 kernel: [ 4238.563859] Call Trace:
Aug 17 20:07:00 dload04 kernel: [ 4238.563905] [<ffffffff80297be3>] ? slow_work_thread+0x2c3/0x480
Aug 17 20:07:00 dload04 kernel: [ 4238.563968] [<ffffffff802524a0>] ? autoremove_wake_function+0x0/0x30
Aug 17 20:07:00 dload04 kernel: [ 4238.564035] [<ffffffff80297920>] ? slow_work_thread+0x0/0x480
Aug 17 20:07:00 dload04 kernel: [ 4238.564094] [<ffffffff80297920>] ? slow_work_thread+0x0/0x480
Aug 17 20:07:00 dload04 kernel: [ 4238.564153] [<ffffffff80252098>] ? kthread+0x58/0x90
Aug 17 20:07:00 dload04 kernel: [ 4238.564209] [<ffffffff8020cdba>] ? child_rip+0xa/0x20
Aug 17 20:07:00 dload04 kernel: [ 4238.564267] [<ffffffff80252040>] ? kthread+0x0/0x90
Aug 17 20:07:00 dload04 kernel: [ 4238.564323] [<ffffffff8020cdb0>] ? child_rip+0x0/0x20
Aug 17 20:07:00 dload04 kernel: [ 4238.564380] Code: ff ff 48 8b 45 60 48 83 65 28 fd 4c 8d 65 1c 48 89 ef 48 8b 00 ff 50 18 4c 89 e7 e8 e6 1e 1c e0 c7 45 00 06 00 00 00 48 8b 45 68 <f0> 0f ba 70 40 01 19 d2 85 d2 74 12 48 8b 7d 68 be 01 00 00 00
Aug 17 20:07:00 dload04 kernel: [ 4238.564689] RIP [<ffffffffa0308da5>] fscache_object_slow_work_execute+0x755/0xb70 [fscache]
Aug 17 20:07:00 dload04 kernel: [ 4238.564793] RSP <ffff880037685ea0>
Aug 17 20:07:00 dload04 kernel: [ 4238.564841] CR2: 0000000000000040
Aug 17 20:07:00 dload04 kernel: [ 4238.565287] ---[ end trace 4e7508a1fec25125 ]---

These oopses, in turn, lead to frozen machines.

I have opened a bug:

http://bugzilla.kernel.org/show_bug.cgi?id=13998

Any input would be appreciated.

Thank you for your time,

Joshua
--
Linux-cachefs mailing list
Linux-cachefs@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cachefs