Re: random + 8250-omap + edma: dma_ccerr_handler "did not care"

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Matthijs,

On Thu, May 19, 2016 at 4:42 AM, Matthijs van Duin
<matthijsvanduin@xxxxxxxxx> wrote:
> I already had occasional random failures rebooting my bbb, but it
> happened rarely and I hadn't investigated yet.
>
> While debugging another issue I turned off the "quiet" option and as a
> side-effect I discovered the cause of the failures:
>
> random: nonblocking pool is initialized
> irq 187: nobody cared (try booting with the "irqpoll" option)
> CPU: 0 PID: $varies Comm: $varies Not tainted 4.6.0-bone3-dd1 #2
> Hardware name: Generic AM33XX (Flattened Device Tree)
> [<c010b059>] (unwind_backtrace) from [<c0109945>] (show_stack+0x11/0x14)
> [<c0109945>] (show_stack) from [<c014802f>] (__report_bad_irq+0x23/0x84)
> [<c014802f>] (__report_bad_irq) from [<c01482a1>] (note_interrupt+0x1c5/0x200)
> [<c01482a1>] (note_interrupt) from [<c0146a13>] (handle_irq_event_percpu+0xfb/0x154)
> [<c0146a13>] (handle_irq_event_percpu) from [<c0146a8d>] (handle_irq_event+0x21/0x2c)
> [<c0146a8d>] (handle_irq_event) from [<c01486d1>] (handle_level_irq+0x61/0xac)
> [<c01486d1>] (handle_level_irq) from [<c014633d>] (generic_handle_irq+0x1d/0x28)
> [<c014633d>] (generic_handle_irq) from [<c01464ff>] (__handle_domain_irq+0x3b/0x80)
> [<c01464ff>] (__handle_domain_irq) from [<c044284d>] (__irq_svc+0x4d/0x74)
> [<c044284d>] (__irq_svc) from [<c0121d52>] (__do_softirq+0x66/0x1c8)
> [<c0121d52>] (__do_softirq) from [<c0122229>] (irq_exit+0x95/0xbc)
> [<c0122229>] (irq_exit) from [<c0146503>] (__handle_domain_irq+0x3f/0x80)
> [<c0146503>] (__handle_domain_irq) from [<c044284d>] (__irq_svc+0x4d/0x74)
> [<c044284d>] (__irq_svc) from [<c0145656>] (console_unlock+0x26e/0x410)
> [<c0145656>] (console_unlock) from [<c01459b5>] (vprintk_emit+0x1bd/0x310)
>   (rest of traceback varies)
> handlers:
> [<c02c5e6d>] dma_ccerr_handler
> Disabling IRQ #187
>
> To be honest, I can't even begin to speculate what's going on here.  I
> checked dma_ccerr_handler but I don't see how it could fail to clear the
> error irq.  And I didn't include the "random: nonblocking pool is
> initialized" message right before the traceback by accident.  So far
> it's been there every single time.
>
> The exact moment this happens varies, I just made it easier to trigger
> by increasing the volume of console output.  Probably.  Repeatedly
> dumping a large pile of output to /dev/console failed to trigger it
> though.  It does however on rare occasion also happen on shutdown.
>
> Whenever it occurs during boot, often things eventually get stuck
> resulting in hung task tracebacks in out_of_line_wait_on_bit() in ext4
> code.  But not always.  I haven't seen it block shutdown.
>
> I've confirmed I can also reproduce it using mainline v4.6.  My config
> file can be found here:
> https://github.com/dutchanddutch/bb-kernel/blob/am33x-v4.6/patches/defconfig
> The only change needed to build with mainline is clearing EXTRA_FIRMWARE

I've also encountered such an error. Though I've seen it only twice
and couldn't reproduce it.

[  OK  ] Started Load Kernel Modules.[    9.283491] systemd[1]:
Started Load Kernel Modules.

[  OK  ] Started Create list of required static device nodes
...current kernel.[    9.324927] systemd[1]: Started Create list of
required static device nodes for the current kernel.

[    9.359464] random: nonblocking pool is initialized
[    9.704147] irq 22: nobody cared (try booting with the "irqpoll" option)
[    9.704160] CPU: 0 PID: 1 Comm: systemd Not tainted
4.6.0-rc2-00099-g3871144-dirty #4
[    9.704164] Hardware name: Generic AM33XX (Flattened Device Tree)
[    9.704204] [<c010feec>] (unwind_backtrace) from [<c010c10c>]
(show_stack+0x10/0x14)
[    9.704223] [<c010c10c>] (show_stack) from [<c046b194>]
(dump_stack+0xb0/0xe4)
[    9.704244] [<c046b194>] (dump_stack) from [<c019e484>]
(__report_bad_irq+0x24/0xc0)
[    9.704255] [<c019e484>] (__report_bad_irq) from [<c019e850>]
(note_interrupt+0x24c/0x2ac)
[    9.704265] [<c019e850>] (note_interrupt) from [<c019be10>]
(handle_irq_event_percpu+0x260/0x2e0)
[    9.704275] [<c019be10>] (handle_irq_event_percpu) from
[<c019bec8>] (handle_irq_event+0x38/0x5c)
[    9.704284] [<c019bec8>] (handle_irq_event) from [<c019efa8>]
(handle_level_irq+0xb8/0x14c)
[    9.704294] [<c019efa8>] (handle_level_irq) from [<c019b284>]
(generic_handle_irq+0x20/0x34)
[    9.704303] [<c019b284>] (generic_handle_irq) from [<c019b578>]
(__handle_domain_irq+0x64/0xe0)
[    9.704315] [<c019b578>] (__handle_domain_irq) from [<c078dab8>]
(__irq_svc+0x58/0x78)
[    9.704329] [<c078dab8>] (__irq_svc) from [<c013d100>]
(__do_softirq+0xb4/0x440)
[    9.704339] [<c013d100>] (__do_softirq) from [<c013d7ac>]
(irq_exit+0xbc/0x130)
[    9.704348] [<c013d7ac>] (irq_exit) from [<c019b580>]
(__handle_domain_irq+0x6c/0xe0)
[    9.704356] [<c019b580>] (__handle_domain_irq) from [<c078dab8>]
(__irq_svc+0x58/0x78)
[    9.704364] [<c078dab8>] (__irq_svc) from [<c0199e14>]
(console_unlock+0x464/0x62c)
[    9.704372] [<c0199e14>] (console_unlock) from [<c019a254>]
(vprintk_emit+0x278/0x568)
[    9.704386] [<c019a254>] (vprintk_emit) from [<c0224e70>]
(printk_emit+0x20/0x28)
[    9.704396] [<c0224e70>] (printk_emit) from [<c019a61c>]
(devkmsg_write+0xac/0x10c)
[    9.704407] [<c019a61c>] (devkmsg_write) from [<c0287804>]
(do_readv_writev+0x138/0x2f8)
[    9.704417] [<c0287804>] (do_readv_writev) from [<c0287ba4>]
(vfs_writev+0x40/0x58)
[    9.704424] [<c0287ba4>] (vfs_writev) from [<c0287c08>] (do_writev+0x4c/0xa8)
[    9.704438] [<c0287c08>] (do_writev) from [<c0107820>]
(ret_fast_syscall+0x0/0x1c)
[    9.704443] handlers:
[    9.704454] [<c04c77c8>] dma_ccerr_handler
[    9.704457] Disabling IRQ #22
[  OK  ] Started File System Check on Root Device.
[   10.050054] systemd[1]: Started File System Check on Root Device.
[   10.243231] systemd[1]: Starting Remount Root and Kernel File Systems...
         Starting Remount Root and Kernel File Systems...
[   10.286925] systemd[1]: Starting Create Static Device Nodes in /dev...
         Starting Create Static Device Nodes in /dev...
[   10.361410] systemd[1]: Mounted FUSE Control File System.
[   10.367168] systemd[1]: Starting Apply Kernel Variables...
         Starting Apply Kernel Variables...
[   10.435927] systemd[1]: Mounting Configuration File System...
         Mounting Configuration File System...[   10.487309] EXT4-fs
(mmcblk0p2): re-mounted. Opts: errors=remount-ro

Yegor
--
To unsubscribe from this list: send the line "unsubscribe linux-omap" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Arm (vger)]     [ARM Kernel]     [ARM MSM]     [Linux Tegra]     [Linux WPAN Networking]     [Linux Wireless Networking]     [Maemo Users]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite Trails]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux