[1.] One line summary of the problem: Networking Oops in 2.4.16 (tcp_recvmsg -> ... -> skb_release_data -> __free_pages) [2.] Full description of the problem/report: 2.4.16 oopsed and killed the process each time. I haven't seen this mentioned on either the linux-kernel or linux-net lists, or in any of the 2.4.17-pre changelogs (or in Dave's detailed "Networking updates" changelog for 2.4.17-pre5.) The oops has "Tainted: P", but I've loaded only modules from the kernel distribution. It looks like the nls modules weren't fixed until sometime in 2.4.17-pre. It's happened four times, as noted below, but I don't have a definate way to trigger it. It does seem to be related to heavy network traffic. I'm not doing any firewalling or masquerading or anything else other than vanilla TCP/IP. Please let me know if any information I haven't included would be useful, or if there's a patch I should try. Please cc me for a faster response. Linux version 2.4.16 (fortytwo@manetheren) (gcc version 2.95.3 20010315 (Debian release)) #3 Tue Nov 27 00:34:35 PST 2001 ksymoops 2.4.3 on i686 2.4.16. Options used -v /home/fortytwo/linux-2.4.16/vmlinux (specified) -k /proc/ksyms (default) -l /proc/modules (default) -o /lib/modules/2.4.16/ (default) -m /boot/System.map-2.4.16 (default) Dec 7 23:54:41 manetheren kernel: Unable to handle kernel paging request at virtual address 4c3d79fb Dec 7 23:54:41 manetheren kernel: c01290ce Dec 7 23:54:41 manetheren kernel: *pde = 00000000 Dec 7 23:54:41 manetheren kernel: Oops: 0000 Dec 7 23:54:41 manetheren kernel: CPU: 0 Dec 7 23:54:41 manetheren kernel: EIP: 0010:[<c01290ce>] Tainted: P Using defaults from ksymoops -t elf32-i386 -a i386 Dec 7 23:54:41 manetheren kernel: EFLAGS: 00010246 Dec 7 23:54:41 manetheren kernel: eax: 4c3d79e3 ebx: 00000000 ecx: 4c3d79e3 edx: 00000000 Dec 7 23:54:41 manetheren kernel: esi: cbb6ebe0 edi: cbb6ec3c ebp: 00000000 esp: c4b4be84 Dec 7 23:54:41 manetheren kernel: ds: 0018 es: 0018 ss: 0018 Dec 7 23:54:41 manetheren kernel: Process gtk-gnutella (pid: 24848, stackpage=c4b4b000) Dec 7 23:54:41 manetheren kernel: Stack: c019622d cbb6ebe0 000005b4 c019626b cbb6ebe0 cbb6ebe0 c0196385 cbb6ebe0 Dec 7 23:54:41 manetheren kernel: cbb6ebe0 00000000 c01a8afd cbb6ebe0 c02069c0 00001000 c4b4bf80 c4b4bf80 Dec 7 23:54:41 manetheren kernel: c2217b14 c2217af8 c2217af4 00000000 00000000 00000000 00000001 c2217c9c Dec 7 23:54:41 manetheren kernel: Call Trace: [<c019622d>] [<c019626b>] [<c0196385>] [<c01a8afd>] [<c01be9f5>] Dec 7 23:54:41 manetheren kernel: [<c0193731>] [<c019383e>] [<c012df36>] [<c0106b0b>] Dec 7 23:54:41 manetheren kernel: Code: 8b 41 18 f6 c4 40 75 11 ff 49 14 0f 94 c0 84 c0 74 07 89 c8 >>EIP; c01290ce <__free_pages+2/1c> <===== Trace; c019622c <skb_release_data+40/74> Trace; c019626a <kfree_skbmem+a/54> Trace; c0196384 <__kfree_skb+d0/d8> Trace; c01a8afc <tcp_recvmsg+60c/7e8> Trace; c01be9f4 <inet_recvmsg+3c/54> Trace; c0193730 <sock_recvmsg+3c/ac> Trace; c019383e <sock_read+92/a0> Trace; c012df36 <sys_read+96/cc> Trace; c0106b0a <system_call+32/38> Code; c01290ce <__free_pages+2/1c> 00000000 <_EIP>: Code; c01290ce <__free_pages+2/1c> <===== 0: 8b 41 18 mov 0x18(%ecx),%eax <===== Code; c01290d0 <__free_pages+4/1c> 3: f6 c4 40 test $0x40,%ah Code; c01290d4 <__free_pages+8/1c> 6: 75 11 jne 19 <_EIP+0x19> c01290e6 <__free_pages+1a/1c> Code; c01290d6 <__free_pages+a/1c> 8: ff 49 14 decl 0x14(%ecx) Code; c01290d8 <__free_pages+c/1c> b: 0f 94 c0 sete %al Code; c01290dc <__free_pages+10/1c> e: 84 c0 test %al,%al Code; c01290de <__free_pages+12/1c> 10: 74 07 je 19 <_EIP+0x19> c01290e6 <__free_pages+1a/1c> Code; c01290e0 <__free_pages+14/1c> 12: 89 c8 mov %ecx,%eax The second one was identical, except for these lines: Dec 8 00:54:11 manetheren kernel: <1>Unable to handle kernel paging request at virtual address b8b500a3 Dec 8 00:54:11 manetheren kernel: eax: b8b5008b ebx: 00000000 ecx: b8b5008b edx: 00000000 Dec 8 00:54:11 manetheren kernel: esi: c6b022c0 edi: c6b0231c ebp: 00000000 esp: c33dbe84 Dec 8 00:54:11 manetheren kernel: ds: 0018 es: 0018 ss: 0018 Dec 8 00:54:11 manetheren kernel: Process gtk-gnutella (pid: 26801, stackpage=c33db000) Dec 8 00:54:11 manetheren kernel: Stack: c019622d c6b022c0 00000009 c019626b c6b022c0 c6b022c0 c0196385 c6b022c0 Dec 8 00:54:11 manetheren kernel: c6b022c0 0000003c c01a8afd c6b022c0 c02069c0 00000009 c33dbf80 c33dbf80 Dec 8 00:54:11 manetheren kernel: d6262dd4 d6262db8 d6262db4 00000000 00000000 00000000 00000001 d6262f5c The third: Dec 8 09:44:52 manetheren kernel: <1>Unable to handle kernel paging request at virtual address 018000ae Dec 8 09:44:52 manetheren kernel: eax: 01800096 ebx: 00000000 ecx: 01800096 edx: 00000000 Dec 8 09:44:52 manetheren kernel: esi: d5cbe9a0 edi: d5cbe9fc ebp: 00000000 esp: c55dde84 Dec 8 09:44:52 manetheren kernel: ds: 0018 es: 0018 ss: 0018 Dec 8 09:44:52 manetheren kernel: Process squid (pid: 26659, stackpage=c55dd000) Dec 8 09:44:52 manetheren kernel: Stack: c019622d d5cbe9a0 000005a8 c019626b d5cbe9a0 d5cbe9a0 c0196385 d5cbe9a0 Dec 8 09:44:52 manetheren kernel: d5cbe9a0 00000000 c01a8afd d5cbe9a0 c02069c0 00015554 c55ddf80 c55ddf80 Dec 8 09:44:52 manetheren kernel: d6e66e54 d6e66e38 d6e66e34 00000000 00000000 00000000 00000001 d6e66fdc And the fourth: Dec 8 10:18:33 manetheren kernel: <1>Unable to handle kernel paging request at virtual address 494d5dd4 Dec 8 10:18:33 manetheren kernel: eax: 494d5dbc ebx: 00000000 ecx: 494d5dbc edx: 00000000 Dec 8 10:18:33 manetheren kernel: esi: c6b02a40 edi: c6b02a9c ebp: 00000000 esp: c4af7e84 Dec 8 10:18:33 manetheren kernel: ds: 0018 es: 0018 ss: 0018 Dec 8 10:18:33 manetheren kernel: Process gtk-gnutella (pid: 4346, stackpage=c4af7000) Dec 8 10:18:33 manetheren kernel: Stack: c019622d c6b02a40 0000024c c019626b c6b02a40 c6b02a40 c0196385 c6b02a40 Dec 8 10:18:33 manetheren kernel: c6b02a40 00000000 c01a8afd c6b02a40 c02069c0 00001000 c4af7f80 c4af7f80 Dec 8 10:18:33 manetheren kernel: c4d600d4 c4d600b8 c4d600b4 00000000 00000000 00000000 00000001 c4d6025c Linux manetheren 2.4.16 #3 Tue Nov 27 00:34:35 PST 2001 i686 unknown Gnu C 2.95.3 Gnu make 3.79.1 util-linux 2.11m mount 2.11m modutils 2.4.11 e2fsprogs 1.25 Linux C Library 2.2.4 Dynamic linker (ldd) 2.2.4 Procps 2.0.7 Net-tools 1.60 Console-tools 0.2.3 Sh-utils 2.0.11 Modules Loaded nls_iso8859-1 isofs inflate_fs ide-cd cdrom floppy i810_rng parport_pc lp parport rtc af_packet uhci usbcore processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 6 model name : Celeron (Mendocino) stepping : 5 cpu MHz : 465.267 cache size : 128 KB fdiv_bug : no hlt_bug : no f00f_bug : no coma_bug : no fpu : yes fpu_exception : yes cpuid level : 2 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr bogomips : 927.33 nls_iso8859-1 2848 1 (autoclean) isofs 24160 1 (autoclean) inflate_fs 17920 0 (autoclean) [isofs] ide-cd 26080 1 (autoclean) cdrom 26880 0 (autoclean) [ide-cd] floppy 44832 0 (autoclean) i810_rng 2520 1 (autoclean) parport_pc 21288 1 (autoclean) lp 6144 0 (autoclean) parport 24992 1 (autoclean) [parport_pc lp] rtc 5368 0 (autoclean) af_packet 11144 0 (autoclean) uhci 23496 0 (unused) usbcore 47904 1 [uhci] 0000-001f : dma1 0020-003f : pic1 0040-005f : timer 0060-006f : keyboard 0070-007f : rtc 0080-008f : dma page reg 00a0-00bf : pic2 00c0-00df : dma2 00f0-00ff : fpu 0170-0177 : ide1 01f0-01f7 : ide0 0376-0376 : ide1 0378-037a : parport0 037b-037f : parport0 03c0-03df : vga+ 03f6-03f6 : ide0 03f8-03ff : serial(set) 0cf8-0cff : PCI conf1 d000-dfff : PCI Bus #01 d800-d8ff : Lite-On Communications Inc LNE100TX d800-d8ff : tulip df00-df3f : Ensoniq ES1371 [AudioPCI-97] df00-df3f : es1371 ef80-ef9f : Intel Corp. 82801AA USB ef80-ef9f : usb-uhci efa0-efaf : Intel Corp. 82801AA SMBus ffa0-ffaf : Intel Corp. 82801AA IDE ffa0-ffa7 : ide0 ffa8-ffaf : ide1 00000000-0009efff : System RAM 000a0000-000bffff : Video RAM area 000c0000-000c7fff : Video ROM 000f0000-000fffff : System ROM 00100000-17ebffff : System RAM 00100000-001cb14d : Kernel code 001cb14e-0020849f : Kernel data f6a00000-f6afffff : PCI Bus #01 f8000000-fbffffff : Intel Corp. 82810-DC100 CGC [Chipset Graphics Controller] ff800000-ff8fffff : PCI Bus #01 ff8ffc00-ff8ffcff : Lite-On Communications Inc LNE100TX ff8ffc00-ff8ffcff : tulip ffa80000-ffafffff : Intel Corp. 82810-DC100 CGC [Chipset Graphics Controller] 01:08.0 Ethernet controller: Lite-On Communications Inc LNE100TX (rev 21) Subsystem: Netgear FA310TX Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 64 Interrupt: pin A routed to IRQ 11 Region 0: I/O ports at d800 [size=256] Region 1: Memory at ff8ffc00 (32-bit, non-prefetchable) [size=256] Expansion ROM at ff880000 [disabled] [size=256K] (other lspci entries skipped for brevity) -- Paul Cassella - : send the line "unsubscribe linux-net" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html