Hi Matthijs,

On Thu, May 19, 2016 at 4:42 AM, Matthijs van Duin
<matthijsvanduin@xxxxxxxxx> wrote:
> I already had occasional random failures rebooting my bbb, but it
> happened rarely and I hadn't investigated yet.
>
> While debugging another issue I turned off the "quiet" option and as a
> side-effect I discovered the cause of the failures:
>
> random: nonblocking pool is initialized
> irq 187: nobody cared (try booting with the "irqpoll" option)
> CPU: 0 PID: $varies Comm: $varies Not tainted 4.6.0-bone3-dd1 #2
> Hardware name: Generic AM33XX (Flattened Device Tree)
> [<c010b059>] (unwind_backtrace) from [<c0109945>] (show_stack+0x11/0x14)
> [<c0109945>] (show_stack) from [<c014802f>] (__report_bad_irq+0x23/0x84)
> [<c014802f>] (__report_bad_irq) from [<c01482a1>] (note_interrupt+0x1c5/0x200)
> [<c01482a1>] (note_interrupt) from [<c0146a13>] (handle_irq_event_percpu+0xfb/0x154)
> [<c0146a13>] (handle_irq_event_percpu) from [<c0146a8d>] (handle_irq_event+0x21/0x2c)
> [<c0146a8d>] (handle_irq_event) from [<c01486d1>] (handle_level_irq+0x61/0xac)
> [<c01486d1>] (handle_level_irq) from [<c014633d>] (generic_handle_irq+0x1d/0x28)
> [<c014633d>] (generic_handle_irq) from [<c01464ff>] (__handle_domain_irq+0x3b/0x80)
> [<c01464ff>] (__handle_domain_irq) from [<c044284d>] (__irq_svc+0x4d/0x74)
> [<c044284d>] (__irq_svc) from [<c0121d52>] (__do_softirq+0x66/0x1c8)
> [<c0121d52>] (__do_softirq) from [<c0122229>] (irq_exit+0x95/0xbc)
> [<c0122229>] (irq_exit) from [<c0146503>] (__handle_domain_irq+0x3f/0x80)
> [<c0146503>] (__handle_domain_irq) from [<c044284d>] (__irq_svc+0x4d/0x74)
> [<c044284d>] (__irq_svc) from [<c0145656>] (console_unlock+0x26e/0x410)
> [<c0145656>] (console_unlock) from [<c01459b5>] (vprintk_emit+0x1bd/0x310)
> (rest of traceback varies)
> handlers:
> [<c02c5e6d>] dma_ccerr_handler
> Disabling IRQ #187
>
> To be honest, I can't even begin to speculate what's going on here. I
> checked dma_ccerr_handler but I don't see how it could fail to clear the
> error irq. And I didn't include the "random: nonblocking pool is
> initialized" message right before the traceback by accident. So far
> it's been there every single time.
>
> The exact moment this happens varies, I just made it easier to trigger
> by increasing the volume of console output. Probably. Repeatedly
> dumping a large pile of output to /dev/console failed to trigger it
> though. It does however on rare occasion also happen on shutdown.
>
> Whenever it occurs during boot, often things eventually get stuck
> resulting in hung task tracebacks in out_of_line_wait_on_bit() in ext4
> code. But not always. I haven't seen it block shutdown.
>
> I've confirmed I can also reproduce it using mainline v4.6. My config
> file can be found here:
> https://github.com/dutchanddutch/bb-kernel/blob/am33x-v4.6/patches/defconfig
> The only change needed to build with mainline is clearing EXTRA_FIRMWARE

I've also encountered such an error. Though I've seen it only twice and
couldn't reproduce it.

[ OK ] Started Load Kernel Modules.[ 9.283491] systemd[1]: Started Load Kernel Modules.
[ OK ] Started Create list of required static device nodes ...current kernel.[ 9.324927] systemd[1]: Started Create list of required static device nodes for the current kernel.
[ 9.359464] random: nonblocking pool is initialized
[ 9.704147] irq 22: nobody cared (try booting with the "irqpoll" option)
[ 9.704160] CPU: 0 PID: 1 Comm: systemd Not tainted 4.6.0-rc2-00099-g3871144-dirty #4
[ 9.704164] Hardware name: Generic AM33XX (Flattened Device Tree)
[ 9.704204] [<c010feec>] (unwind_backtrace) from [<c010c10c>] (show_stack+0x10/0x14)
[ 9.704223] [<c010c10c>] (show_stack) from [<c046b194>] (dump_stack+0xb0/0xe4)
[ 9.704244] [<c046b194>] (dump_stack) from [<c019e484>] (__report_bad_irq+0x24/0xc0)
[ 9.704255] [<c019e484>] (__report_bad_irq) from [<c019e850>] (note_interrupt+0x24c/0x2ac)
[ 9.704265] [<c019e850>] (note_interrupt) from [<c019be10>] (handle_irq_event_percpu+0x260/0x2e0)
[ 9.704275] [<c019be10>] (handle_irq_event_percpu) from [<c019bec8>] (handle_irq_event+0x38/0x5c)
[ 9.704284] [<c019bec8>] (handle_irq_event) from [<c019efa8>] (handle_level_irq+0xb8/0x14c)
[ 9.704294] [<c019efa8>] (handle_level_irq) from [<c019b284>] (generic_handle_irq+0x20/0x34)
[ 9.704303] [<c019b284>] (generic_handle_irq) from [<c019b578>] (__handle_domain_irq+0x64/0xe0)
[ 9.704315] [<c019b578>] (__handle_domain_irq) from [<c078dab8>] (__irq_svc+0x58/0x78)
[ 9.704329] [<c078dab8>] (__irq_svc) from [<c013d100>] (__do_softirq+0xb4/0x440)
[ 9.704339] [<c013d100>] (__do_softirq) from [<c013d7ac>] (irq_exit+0xbc/0x130)
[ 9.704348] [<c013d7ac>] (irq_exit) from [<c019b580>] (__handle_domain_irq+0x6c/0xe0)
[ 9.704356] [<c019b580>] (__handle_domain_irq) from [<c078dab8>] (__irq_svc+0x58/0x78)
[ 9.704364] [<c078dab8>] (__irq_svc) from [<c0199e14>] (console_unlock+0x464/0x62c)
[ 9.704372] [<c0199e14>] (console_unlock) from [<c019a254>] (vprintk_emit+0x278/0x568)
[ 9.704386] [<c019a254>] (vprintk_emit) from [<c0224e70>] (printk_emit+0x20/0x28)
[ 9.704396] [<c0224e70>] (printk_emit) from [<c019a61c>] (devkmsg_write+0xac/0x10c)
[ 9.704407] [<c019a61c>] (devkmsg_write) from [<c0287804>] (do_readv_writev+0x138/0x2f8)
[ 9.704417] [<c0287804>] (do_readv_writev) from [<c0287ba4>] (vfs_writev+0x40/0x58)
[ 9.704424] [<c0287ba4>] (vfs_writev) from [<c0287c08>] (do_writev+0x4c/0xa8)
[ 9.704438] [<c0287c08>] (do_writev) from [<c0107820>] (ret_fast_syscall+0x0/0x1c)
[ 9.704443] handlers:
[ 9.704454] [<c04c77c8>] dma_ccerr_handler
[ 9.704457] Disabling IRQ #22
[ OK ] Started File System Check on Root Device.
[ 10.050054] systemd[1]: Started File System Check on Root Device.
[ 10.243231] systemd[1]: Starting Remount Root and Kernel File Systems...
Starting Remount Root and Kernel File Systems...
[ 10.286925] systemd[1]: Starting Create Static Device Nodes in /dev...
Starting Create Static Device Nodes in /dev...
[ 10.361410] systemd[1]: Mounted FUSE Control File System.
[ 10.367168] systemd[1]: Starting Apply Kernel Variables...
Starting Apply Kernel Variables...
[ 10.435927] systemd[1]: Mounting Configuration File System...
Mounting Configuration File System...[ 10.487309] EXT4-fs (mmcblk0p2): re-mounted. Opts: errors=remount-ro

Yegor
--
To unsubscribe from this list: send the line "unsubscribe linux-omap" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html