Re: Strange CPU occupation... and system hangs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



BERTRAND Joël wrote:

<snip>

and some process are in D state :
Root gershwin:[/etc] > ps auwx | grep D
USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
root       270  0.0  0.0      0     0 ?        D    Oct27   1:17 [pdflush]
root      3676  0.9  0.0      0     0 ?        D    Oct27  56:03 [nfsd]
root      5435  0.0  0.0      0     0 ?        D<   Oct27   3:16 [md7_raid1]
root      5438  0.0  0.0      0     0 ?        D<   Oct27   1:01 [kjournald]
root      5440  0.0  0.0      0     0 ?        D<   Oct27   0:33 [loop0]
root      5441  0.0  0.0      0     0 ?        D<   Oct27   0:05 [kjournald]
root 16442 0.0 0.0 20032 1208 pts/2 D+ 13:23 0:00 iftop -i eth2

	Why md7_raid is in D state ? Same question about iftop ?

Some bad news... After ten or eleven hours, kernel crashes on this server. The last top screen is :

top - 04:59:46 up 4 days, 16:24, 3 users, load average: 19.72, 19.22, 19.05
Tasks: 285 total,   5 running, 279 sleeping,   0 stopped,   1 zombie
Cpu(s): 0.0%us, 4.2%sy, 0.0%ni, 68.5%id, 27.3%wa, 0.0%hi, 0.0%si, 0.0%st
Mem:   4139024k total,  4130800k used,     8224k free,    38984k buffers
Swap:  7815536k total,      304k used,  7815232k free,    79056k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
5426 root      15  -5     0    0    0 R  100  0.0 970:17.21 md_d0_raid5
26923 root 20 0 3120 1568 1112 R 2 0.0 13:32.24 top
...

I have rebooted. I don't have any message in log files. I don't have any screen but I haven't seen anything on serial console. In ker.log, I have : Oct 31 15:36:15 gershwin kernel: swapper: page allocation failure. order:2, mode:0x4020
Oct 31 15:36:15 gershwin kernel: Call Trace:
Oct 31 15:36:15 gershwin kernel: [00000000004b6568] __slab_alloc+0x1b0/0x720 Oct 31 15:36:15 gershwin kernel: [00000000004b87a8] __kmalloc_track_caller+0xb0/0xe0
Oct 31 15:36:15 gershwin kernel:  [0000000000601d68] __alloc_skb+0x50/0x120
Oct 31 15:36:15 gershwin kernel: [0000000000642ee0] tcp_collapse+0x1e8/0x440 Oct 31 15:36:15 gershwin kernel: [0000000000643298] tcp_prune_queue+0x160/0x3a0 Oct 31 15:36:15 gershwin kernel: [0000000000643d08] tcp_data_queue+0x830/0xde0 Oct 31 15:36:15 gershwin kernel: [0000000000645d74] tcp_rcv_established+0x35c/0x840 Oct 31 15:36:15 gershwin kernel: [000000000064cf7c] tcp_v4_do_rcv+0xe4/0x4a0
Oct 31 15:36:15 gershwin kernel:  [000000000064fdd8] tcp_v4_rcv+0xb00/0xb20
Oct 31 15:36:15 gershwin kernel: [000000000062e2ac] ip_local_deliver+0x194/0x3a0
Oct 31 15:36:15 gershwin kernel:  [000000000062dd98] ip_rcv+0x360/0x6e0
Oct 31 15:36:15 gershwin kernel: [0000000000607f64] netif_receive_skb+0x1ec/0x480
Oct 31 15:36:15 gershwin kernel:  [00000000005a5fe0] tg3_poll+0x6c8/0xc40
Oct 31 15:36:15 gershwin kernel: [000000000060a940] net_rx_action+0x88/0x160
Oct 31 15:36:15 gershwin kernel:  [0000000000468078] __do_softirq+0x80/0x100
Oct 31 15:36:15 gershwin kernel:  [000000000046815c] do_softirq+0x64/0x80
Oct 31 15:36:15 gershwin kernel: Mem-info:
Oct 31 15:36:15 gershwin kernel: Normal per-cpu:
Oct 31 15:36:15 gershwin kernel: CPU 0: Hot: hi: 90, btch: 15 usd: 15 Cold: hi: 30, btch: 7 usd: 5 Oct 31 15:36:15 gershwin kernel: CPU 1: Hot: hi: 90, btch: 15 usd: 31 Cold: hi: 30, btch: 7 usd: 4 Oct 31 15:36:15 gershwin kernel: CPU 2: Hot: hi: 90, btch: 15 usd: 4 Cold: hi: 30, btch: 7 usd: 3 Oct 31 15:36:15 gershwin kernel: CPU 3: Hot: hi: 90, btch: 15 usd: 82 Cold: hi: 30, btch: 7 usd: 2 Oct 31 15:36:15 gershwin kernel: CPU 4: Hot: hi: 90, btch: 15 usd: 84 Cold: hi: 30, btch: 7 usd: 0 Oct 31 15:36:15 gershwin kernel: CPU 5: Hot: hi: 90, btch: 15 usd: 65 Cold: hi: 30, btch: 7 usd: 4 Oct 31 15:36:15 gershwin kernel: CPU 6: Hot: hi: 90, btch: 15 usd: 85 Cold: hi: 30, btch: 7 usd: 6 Oct 31 15:36:15 gershwin kernel: CPU 7: Hot: hi: 90, btch: 15 usd: 69 Cold: hi: 30, btch: 7 usd: 4 Oct 31 15:36:15 gershwin kernel: CPU 8: Hot: hi: 90, btch: 15 usd: 11 Cold: hi: 30, btch: 7 usd: 5 Oct 31 15:36:15 gershwin kernel: CPU 9: Hot: hi: 90, btch: 15 usd: 75 Cold: hi: 30, btch: 7 usd: 1 Oct 31 15:36:15 gershwin kernel: CPU 10: Hot: hi: 90, btch: 15 usd: 84 Cold: hi: 30, btch: 7 usd: 2 Oct 31 15:36:15 gershwin kernel: CPU 11: Hot: hi: 90, btch: 15 usd: 13 Cold: hi: 30, btch: 7 usd: 1 Oct 31 15:36:15 gershwin kernel: CPU 12: Hot: hi: 90, btch: 15 usd: 17 Cold: hi: 30, btch: 7 usd: 23 Oct 31 15:36:15 gershwin kernel: CPU 13: Hot: hi: 90, btch: 15 usd: 7 Cold: hi: 30, btch: 7 usd: 25 Oct 31 15:36:15 gershwin kernel: CPU 14: Hot: hi: 90, btch: 15 usd: 64 Cold: hi: 30, btch: 7 usd: 27 Oct 31 15:36:15 gershwin kernel: CPU 15: Hot: hi: 90, btch: 15 usd: 12 Cold: hi: 30, btch: 7 usd: 6 Oct 31 15:36:15 gershwin kernel: CPU 16: Hot: hi: 90, btch: 15 usd: 2 Cold: hi: 30, btch: 7 usd: 1 Oct 31 15:36:15 gershwin kernel: CPU 17: Hot: hi: 90, btch: 15 usd: 80 Cold: hi: 30, btch: 7 usd: 1 Oct 31 15:36:15 gershwin kernel: CPU 18: Hot: hi: 90, btch: 15 usd: 4 Cold: hi: 30, btch: 7 usd: 17 Oct 31 15:36:15 gershwin kernel: CPU 19: Hot: hi: 90, btch: 15 usd: 58 Cold: hi: 30, btch: 7 usd: 1 Oct 31 15:36:16 gershwin kernel: CPU 20: Hot: hi: 90, btch: 15 usd: 13 Cold: hi: 30, btch: 7 usd: 4 Oct 31 15:36:16 gershwin kernel: CPU 21: Hot: hi: 90, btch: 15 usd: 87 Cold: hi: 30, btch: 7 usd: 2 Oct 31 15:36:16 gershwin kernel: CPU 22: Hot: hi: 90, btch: 15 usd: 77 Cold: hi: 30, btch: 7 usd: 6 Oct 31 15:36:16 gershwin kernel: CPU 23: Hot: hi: 90, btch: 15 usd: 10 Cold: hi: 30, btch: 7 usd: 2 Oct 31 15:36:16 gershwin kernel: Active:72131 inactive:275943 dirty:6433 writeback:1041 unstable:0 Oct 31 15:36:16 gershwin kernel: free:2187 slab:131941 mapped:1732 pagetables:258 bounce:0 Oct 31 15:36:16 gershwin kernel: Normal free:17496kB min:8144kB low:10176kB high:12216kB active:577048kB inactive:2207544kB present:4149440kB pages_scanned:0 all_unreclaimable? no
Oct 31 15:36:16 gershwin kernel: lowmem_reserve[]: 0 0
Oct 31 15:36:16 gershwin kernel: Normal: 1786*8kB 173*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB 0*8192kB = 17632kB Oct 31 15:36:16 gershwin kernel: Swap cache: add 39, delete 39, find 0/0, race 0+0
Oct 31 15:36:16 gershwin kernel: Free swap  = 7815232kB
Oct 31 15:36:16 gershwin kernel: Free swap  = 7815232kB
Oct 31 15:36:16 gershwin kernel: Total swap = 7815536kB
Oct 31 15:36:16 gershwin kernel: Free swap:       7815232kB
Oct 31 15:36:17 gershwin kernel: 524260 pages of RAM
Oct 31 15:36:17 gershwin kernel: 6882 reserved pages
Oct 31 15:36:17 gershwin kernel: 278965 pages shared
Oct 31 15:36:17 gershwin kernel: 0 pages swap cached
Oct 31 15:36:17 gershwin kernel: 6435 pages dirty
Oct 31 15:36:17 gershwin kernel: 1041 pages writeback
Oct 31 15:36:17 gershwin kernel: 1732 pages mapped
Oct 31 15:36:17 gershwin kernel: 131941 pages slab
Oct 31 15:36:17 gershwin kernel: 258 pages pagetables

	Any idea ?

	Regards,

	JKB
-
To unsubscribe from this list: send the line "unsubscribe sparclinux" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Kernel Development]     [DCCP]     [Linux ARM Development]     [Linux]     [Photo]     [Yosemite Help]     [Linux ARM Kernel]     [Linux SCSI]     [Linux x86_64]     [Linux Hams]

  Powered by Linux