Hi Hans,

On Mon, 2015-01-12 at 16:29 +0100, Hans Verkuil wrote:
> On 12/29/2014 03:38 PM, Jurgen Kramer wrote:
> >
> > On Sat, 2014-12-27 at 10:35 +0100, Jurgen Kramer wrote:
> >> I am seeing kernel oopses using recent media_builds on kernel 3.17:
> >>
> >> [ 506.969697] BUG: unable to handle kernel NULL pointer dereference at
> >> 0000000000000058
> >> [ 506.969720] IP: [<ffffffffa03a233a>] vb2_thread+0x17a/0x480
> >> [videobuf2_core]
> >> [ 506.969739] PGD 0
> >> [ 506.969746] Oops: 0002 [#1] SMP
> >> [ 506.969754] Modules linked in: nf_conntrack_netbios_ns
> >> nf_conntrack_broadcast cfg80211 rfkill ip6t_rpfilter ip6t_REJECT
> >> xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter
> >> ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6
> >> ip6table_mangle ip6table_security ip6table_raw ip6table_filter
> >> ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4
> >> nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw sp2(OE)
> >> si2157(OE) si2168(OE) i2c_mux nouveau cx25840(OE) cx23885(OE)
> >> altera_ci(OE) tda18271(OE) altera_stapl(OE) videobuf2_dvb(OE)
> >> videobuf2_core(OE) videobuf2_dma_sg(OE) videobuf2_memops(OE) snd_seq
> >> snd_seq_device snd_pcm snd_timer snd video i2c_algo_bit ttm
> >> drm_kms_helper soundcore iTCO_wdt ppdev gpio_ich iTCO_vendor_support
> >> tveeprom(OE) cx2341x(OE)
> >> [ 506.969871] coretemp dvb_core(OE) v4l2_common(OE) videodev(OE)
> >> media(OE) kvm crc32c_intel raid456 async_raid6_recov async_memcpy
> >> async_pq async_xor drm xor async_tx raid6_pq microcode serio_raw shpchp
> >> i7core_edac edac_core i2c_i801 lpc_ich mfd_core parport_pc parport
> >> ite_cir(OE) rc_core(OE) tpm_infineon tpm_tis tpm acpi_cpufreq nfsd
> >> auth_rpcgss nfs_acl lockd sunrpc mxm_wmi asix usbnet r8169 mii wmi
> >> [ 506.969970] CPU: 0 PID: 3160 Comm: vb2-cx23885[0] Tainted: G
> >> OE 3.17.4-200.fc20.x86_64 #1
> >> [ 506.969982] Hardware name: To Be Filled By O.E.M. To Be Filled By
> >> O.E.M./P55 Extreme, BIOS P2.70 08/20/2010
> >> [ 506.969993] task: ffff8800bc18e220 ti: ffff88020d36c000 task.ti:
> >> ffff88020d36c000
> >> [ 506.970002] RIP: 0010:[<ffffffffa03a233a>] [<ffffffffa03a233a>]
> >> vb2_thread+0x17a/0x480 [videobuf2_core]
> >> [ 506.970021] RSP: 0018:ffff88020d36fe68 EFLAGS: 00010246
> >> [ 506.970663] RAX: 0000000000000000 RBX: 0000000000000000 RCX:
> >> 000000000000000b
> >> [ 506.971305] RDX: 0000000000000058 RSI: ffff8800bc18e220 RDI:
> >> 0000000000000058
> >> [ 506.971952] RBP: ffff88020d36fec0 R08: ffff88020d36c000 R09:
> >> 000000000000158f
> >> [ 506.972611] R10: 00000000000030de R11: 0000000000000010 R12:
> >> 0000000000000058
> >> [ 506.973275] R13: ffff8800b81814a0 R14: 0000000000000000 R15:
> >> ffff880225c61028
> >> [ 506.973947] FS: 0000000000000000(0000) GS:ffff880233c00000(0000)
> >> knlGS:0000000000000000
> >> [ 506.974634] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> >> [ 506.975321] CR2: 0000000000000058 CR3: 0000000001c14000 CR4:
> >> 00000000000007f0
> >> [ 506.976021] Stack:
> >> [ 506.976723] ffff8800bc18e220 0000000000000070 00ffffff81c1b460
> >> 0000000000000000
> >> [ 506.977442] ffff880200000000 ffff880225c61028 ffff88020d1d8480
> >> ffff880225c61028
> >> [ 506.978165] ffffffffa03a21c0 0000000000000000 0000000000000000
> >> ffff88020d36ff48
> >> [ 506.979055] Call Trace:
> >> [ 506.979795] [<ffffffffa03a21c0>] ? vb2_internal_qbuf+0x210/0x210
> >> [videobuf2_core]
> >> [ 506.980545] [<ffffffff810b0498>] kthread+0xd8/0xf0
> >> [ 506.981293] [<ffffffff810b03c0>] ? kthread_create_on_node
> >> +0x190/0x190
> >> [ 506.982045] [<ffffffff8172e33c>] ret_from_fork+0x7c/0xb0
> >> [ 506.982806] [<ffffffff810b03c0>] ? kthread_create_on_node
> >> +0x190/0x190
> >> [ 506.983568] Code: 89 e7 ba 58 00 00 00 0f 85 94 01 00 00 40 f6 c7 02
> >> 0f 85 72 01 00 00 40 f6 c7 04 0f 85 50 01 00 00 89 d1 31 c0 c1 e9 03 f6
> >> c2 04 <f3> 48 ab 74 0a c7 07 00 00 00 00 48 83 c7 04 f6 c2 02 74 0a 31
> >> [ 506.984464] RIP [<ffffffffa03a233a>] vb2_thread+0x17a/0x480
> >> [videobuf2_core]
> >> [ 506.985306] RSP <ffff88020d36fe68>
> >> [ 506.986147] CR2: 0000000000000058
> >> [ 506.990986] ---[ end trace 1973fbcab83c3353 ]---
> >>
> >> First I thought is was related to CAM initialization but after removing
> >> the CAMS and doing a fresh cold start I am still seeing the oopses.
> >> After the oops everything is still functioning. I am using 3x DVBSKY
> >> T980C. How can I debug this further?
> >>
> > The problem persist while my system went through a motherboard/mem/cpu
> > upgrade. The oops occurs when one of the DVB-C cards get its first use
> > (in my case mythtv):
> >
> > [ 102.050294] si2157 18-0060: downloading firmware from file
> > 'dvb-tuner-si2158-a20-01.fw'
> > [ 181.460968] BUG: unable to handle kernel NULL pointer dereference at
> > 0000000000000058
> > [ 181.460991] IP: [<ffffffffa04d833a>] vb2_thread+0x17a/0x480
> > [videobuf2_core]
> > [ 181.461019] PGD 0
> > [ 181.461024] Oops: 0002 [#1] SMP
> > [ 181.461032] Modules linked in: nf_conntrack_netbios_ns
> > nf_conntrack_broadcast ip6t_rpfilter cfg80211 rfkill ip6t_REJECT
> > xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter
> > ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6
> > ip6table_mangle ip6table_security ip6table_raw ip6table_filter
> > ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4
> > nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw sp2(OE)
> > si2157(OE) si2168(OE) i2c_mux cx25840(OE) cx23885(OE) altera_ci(OE)
> > tda18271(OE) altera_stapl(OE) videobuf2_dvb(OE) videobuf2_core(OE)
> > videobuf2_dma_sg(OE) videobuf2_memops(OE) snd_seq snd_seq_device
> > x86_pkg_temp_thermal snd_pcm snd_timer coretemp snd soundcore
> > tveeprom(OE) kvm_intel kvm cx2341x(OE) dvb_core(OE) rc_core(OE)
> > v4l2_common(OE) videodev(OE)
> > [ 181.461264] crct10dif_pclmul raid456 crc32_pclmul async_raid6_recov
> > async_memcpy crc32c_intel media(OE) async_pq async_xor
> > ghash_clmulni_intel xor async_tx microcode i915 i2c_algo_bit
> > drm_kms_helper drm shpchp e1000e raid6_pq i2c_i801 ptp pps_core mei_me
> > serio_raw mei i2c_hid sdhci_acpi sdhci tpm_tis mmc_core dw_dmac
> > i2c_designware_platform dw_dmac_core i2c_designware_core tpm acpi_pad
> > nfsd auth_rpcgss nfs_acl lockd sunrpc mxm_wmi wmi video
> > [ 181.461374] CPU: 7 PID: 2279 Comm: vb2-cx23885[0] Tainted: G
> > OE 3.17.7-200.fc20.x86_64 #1
> > [ 181.461393] Hardware name: To Be Filled By O.E.M. To Be Filled By
> > O.E.M./Z97 Extreme4, BIOS P1.50 12/17/2014
> > [ 181.461413] task: ffff8803f5f3f5c0 ti: ffff8800367a4000 task.ti:
> > ffff8800367a4000
> > [ 181.461429] RIP: 0010:[<ffffffffa04d833a>] [<ffffffffa04d833a>]
> > vb2_thread+0x17a/0x480 [videobuf2_core]
> > [ 181.461460] RSP: 0018:ffff8800367a7e68 EFLAGS: 00010246
> > [ 181.461944] RAX: 0000000000000000 RBX: 0000000000000000 RCX:
> > 000000000000000b
> > [ 181.462481] RDX: 0000000000000058 RSI: ffff8803f5f3f5c0 RDI:
> > 0000000000000058
> > [ 181.462995] RBP: ffff8800367a7ec0 R08: ffff8800367a4000 R09:
> > 0000000000000000
> > [ 181.463621] R10: 0000000000000004 R11: 0000000000000005 R12:
> > 0000000000000058
> > [ 181.464151] R13: ffff880036875e80 R14: 0000000000000000 R15:
> > ffff880400999028
> > [ 181.464660] FS: 0000000000000000(0000) GS:ffff88041fbc0000(0000)
> > knlGS:0000000000000000
> > [ 181.465272] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > [ 181.465804] CR2: 0000000000000058 CR3: 0000000001c14000 CR4:
> > 00000000001407e0
> > [ 181.466441] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> > 0000000000000000
> > [ 181.466990] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> > 0000000000000400
> > [ 181.467541] Stack:
> > [ 181.468094] ffff8803f5f3f5c0 0000000000000070 00ff880409993ae0
> > 0000000000000000
> > [ 181.468672] ffff880300000000 ffff880400999028 ffff8803e81c83c0
> > ffff880400999028
> > [ 181.469334] ffffffffa04d81c0 0000000000000000 0000000000000000
> > ffff8800367a7f48
> > [ 181.469981] Call Trace:
> > [ 181.470668] [<ffffffffa04d81c0>] ? vb2_internal_qbuf+0x210/0x210
> > [videobuf2_core]
> > [ 181.471292] [<ffffffff810b04a8>] kthread+0xd8/0xf0
> > [ 181.471987] [<ffffffff810b03d0>] ? kthread_create_on_node
> > +0x190/0x190
> > [ 181.472690] [<ffffffff8172ebbc>] ret_from_fork+0x7c/0xb0
> > [ 181.473333] [<ffffffff810b03d0>] ? kthread_create_on_node
> > +0x190/0x190
> > [ 181.474081] Code: 89 e7 ba 58 00 00 00 0f 85 94 01 00 00 40 f6 c7 02
> > 0f 85 72 01 00 00 40 f6 c7 04 0f 85 50 01 00 00 89 d1 31 c0 c1 e9 03 f6
> > c2 04 <f3> 48 ab 74 0a c7 07 00 00 00 00 48 83 c7 04 f6 c2 02 74 0a 31
> > [ 181.475582] RIP [<ffffffffa04d833a>] vb2_thread+0x17a/0x480
> > [videobuf2_core]
> > [ 181.476271] RSP <ffff8800367a7e68>
> > [ 181.476939] CR2: 0000000000000058
> > [ 181.479707] ---[ end trace fc6ff9e31c18c55f ]---
> > [ 183.034654] dvb_ca adapter 0: DVB CAM detected and initialised
> > successfully
> > [ 369.140017] dvb_ca adapter 1: DVB CAM detected and initialised
> > successfully
> > [ 449.206849] dvb_ca adapter 2: DVB CAM detected and initialised
> > successfully
> >
> > The system works for a while after this oops but eventually gives out.
> > Any pointers how to debug this would be appreciated.
>
> Hmm, at first use. That's strange.
>
> Add some printk lines to vb2_thread (drivers/media/v4l2-core/videobuf2-core.c)
> in the for loop in order to narrow down which pointer is NULL.
>
> Also add a printk in vb2_thread_stop() to see if that is called before the
> oops occurs.

OK, I'll put in some printks to see if it reveals the problem.
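
Once the printks are in, the second check (whether vb2_thread_stop() runs before the oops) can be done mechanically on saved dmesg output. The following is only a minimal Python sketch: the marker text `vb2_thread_stop called` is hypothetical (it stands for whatever string the added printk emits), and the helper names `first_timestamp` and `stop_before_oops` are made up for illustration.

```python
import re

# Matches the leading "[  181.460968]" kernel timestamp on a dmesg line.
TS_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s*(.*)$")

OOPS_TEXT = "BUG: unable to handle kernel NULL pointer dereference"
STOP_MARKER = "vb2_thread_stop called"  # hypothetical printk text, an assumption


def first_timestamp(lines, needle):
    """Return the timestamp of the first line containing needle, or None."""
    for line in lines:
        m = TS_RE.match(line.strip())
        if m and needle in m.group(2):
            return float(m.group(1))
    return None


def stop_before_oops(lines):
    """True if the stop marker was logged before the first oops.

    Returns None when either event is missing from the log.
    """
    stop = first_timestamp(lines, STOP_MARKER)
    oops = first_timestamp(lines, OOPS_TEXT)
    if stop is None or oops is None:
        return None
    return stop < oops
```

Feeding it the lines of `dmesg` output (for example via `open("dmesg.txt").read().splitlines()`) gives a quick yes/no; it only compares the first occurrence of each event.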
It seems my issues have shifted somewhat. I now get these:

[570186.658591] vb2: counters for queue ffff880036876828: UNBALANCED!
[570186.658593] vb2: setup: 1 start_streaming: 1 stop_streaming: 1
[570186.658594] vb2: wait_prepare: 258956 wait_finish: 258957
[602624.204658] vb2: counters for queue ffff88040873b828: UNBALANCED!
[602624.204661] vb2: setup: 1 start_streaming: 1 stop_streaming: 1
[602624.204661] vb2: wait_prepare: 253535 wait_finish: 253536

And:

[639763.343177] page:ffffea000022bf80 count:1 mapcount:0 mapping: (null) index:0x0
[639763.343180] page flags: 0x3ffff800000000()
[639763.343181] page dumped because: VM_BUG_ON_PAGE(atomic_read(&page->_count) == 0)
[639763.343208] ------------[ cut here ]------------
[639763.344120] kernel BUG at include/linux/mm.h:321!
[639763.345072] invalid opcode: 0000 [#2] SMP
[639763.345999] Modules linked in: fuse cfg80211 rfkill nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ip6t_REJECT xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw sp2(OE) si2157(OE) si2168(OE) i2c_mux cx25840(OE) x86_pkg_temp_thermal coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel cx23885(OE) altera_ci(OE) tda18271(OE) altera_stapl(OE) videobuf2_dvb(OE) videobuf2_core(OE) videobuf2_dma_sg(OE) ghash_clmulni_intel videobuf2_memops(OE) snd_seq snd_seq_device snd_pcm snd_timer snd soundcore tveeprom(OE)
[639763.349166] cx2341x(OE) dvb_core(OE) rc_core(OE) v4l2_common(OE) videodev(OE) media(OE) raid456 async_raid6_recov async_memcpy async_pq async_xor xor async_tx raid6_pq microcode serio_raw i2c_i801 e1000e mei_me ptp mei shpchp pps_core tpm_tis tpm sdhci_acpi sdhci i2c_hid dw_dmac dw_dmac_core mmc_core i2c_designware_platform i2c_designware_core i915 i2c_algo_bit drm_kms_helper drm acpi_pad nfsd auth_rpcgss nfs_acl lockd sunrpc mxm_wmi wmi video
[639763.352438] CPU: 3 PID: 29599 Comm: DVBRead Tainted: G D OE 3.17.7-200.fc20.x86_64 #1
[639763.353571] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z97 Extreme4, BIOS P1.50 12/17/2014
[639763.354687] task: ffff880006266220 ti: ffff880009010000 task.ti: ffff880009010000
[639763.355813] RIP: 0010:[<ffffffff81193927>] [<ffffffff81193927>] __free_pages+0x47/0x50
[639763.356921] RSP: 0018:ffff880009013c40 EFLAGS: 00010246
[639763.358041] RAX: 0000000000000000 RBX: 000000000022bf80 RCX: 0000000000000006
[639763.359104] RDX: 0000000000000000 RSI: 0000000000000246 RDI: 000000000008afe0
[639763.360224] RBP: ffff880009013c40 R08: 000000000000000a R09: 0000000000000000
[639763.361247] R10: 000000000000065b R11: ffff880009013916 R12: ffff8804091f1098
[639763.362239] R13: 00000000000001f8 R14: ffff880008afe000 R15: 0000000000000000
[639763.363201] FS: 00007fd3d97fa700(0000) GS:ffff88041fac0000(0000) knlGS:0000000000000000
[639763.364141] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[639763.365055] CR2: 0000000002295028 CR3: 00000004079ca000 CR4: 00000000001407e0
[639763.365984] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[639763.366883] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[639763.367750] Stack:
[639763.368641] ffff880009013c60 ffffffff81193996 0000000009013c88 0000000008afe000
[639763.369510] ffff880009013c70 ffffffff8101aca5 ffff880009013ca8 ffffffff810598c1
[639763.370420] ffffffff81c2de00 ffff8804091f1098 00000000000001f8 ffff8803fdf77c00
[639763.371299] Call Trace:
[639763.372198] [<ffffffff81193996>] free_pages+0x66/0x70
[639763.373053] [<ffffffff8101aca5>] dma_generic_free_coherent+0x25/0x30
[639763.373907] [<ffffffff810598c1>] x86_swiotlb_free_coherent+0x41/0x70
[639763.374785] [<ffffffffa04f6ae9>] cx23885_free_buffer+0x79/0xc0 [cx23885]
[639763.375653] [<ffffffff810d1374>] ? __wake_up+0x44/0x50
[639763.376595] [<ffffffffa04f8d3f>] buffer_finish+0x1f/0x30 [cx23885]
[639763.377450] [<ffffffffa037d294>] __vb2_queue_cancel+0x114/0x230 [videobuf2_core]
[639763.378345] [<ffffffffa037f79e>] __reqbufs.isra.13+0xae/0x4b0 [videobuf2_core]
[639763.379196] [<ffffffffa037fbd0>] vb2_reqbufs+0x30/0x40 [videobuf2_core]
[639763.380073] [<ffffffffa037fe12>] vb2_thread_stop+0xa2/0x180 [videobuf2_core]
[639763.381014] [<ffffffffa03a612a>] vb2_dvb_stop_feed+0x4a/0x60 [videobuf2_dvb]
[639763.381898] [<ffffffffa0479315>] dmx_ts_feed_stop_filtering+0x45/0xb0 [dvb_core]
[639763.382778] [<ffffffffa0476a8a>] dvb_dmxdev_feed_stop+0x6a/0xc0 [dvb_core]
[639763.383659] [<ffffffffa0476c21>] dvb_dmxdev_filter_stop+0x31/0xe0 [dvb_core]
[639763.384552] [<ffffffffa0476f84>] dvb_demux_release+0x44/0x190 [dvb_core]
[639763.385411] [<ffffffff81201f9c>] __fput+0xdc/0x1e0
[639763.386342] [<ffffffff812020ee>] ____fput+0xe/0x10
[639763.387256] [<ffffffff810aeb47>] task_work_run+0xa7/0xe0
[639763.388104] [<ffffffff81013c97>] do_notify_resume+0x97/0xb0
[639763.388970] [<ffffffff8172ef22>] int_signal+0x12/0x17
[639763.389809] Code: 00 00 00 00 85 f6 74 0c e8 f7 fc ff ff 5d c3 0f 1f 44 00 00 31 f6 e8 c9 fd ff ff 5d c3 31 d2 48 c7 c6 18 ad a4 81 e8 69 ca ff ff <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 48 89
[639763.391703] RIP [<ffffffff81193927>] __free_pages+0x47/0x50
[639763.392595] RSP <ffff880009013c40>
[639763.398360] ---[ end trace ecac1729e2883c95 ]---
[680738.726616] traps: polkitd[32250] general protection ip:7fe9a222c022 sp:7fff38b02150 error:0 in libmozjs-17.0.so[7fe9a20ed000+3b3000]
[684294.949321] vb2: counters for queue ffff88040873b828: UNBALANCED!
[684294.949323] vb2: setup: 1 start_streaming: 1 stop_streaming: 1
[684294.949324] vb2: wait_prepare: 240061 wait_finish: 240062

And also some of these:

[195123.414590] cx23885[0]: mpeg risc op code error
[195123.414594] cx23885[0]: TS1 B - dma channel status dump
[195123.414598] cx23885[0]: cmds: init risc lo : 0x0261f000
[195123.414600] cx23885[0]: cmds: init risc hi : 0x00000000
[195123.414602] cx23885[0]: cmds: cdt base : 0x00010580
[195123.414605] cx23885[0]: cmds: cdt size : 0x0000000a
[195123.414607] cx23885[0]: cmds: iq base : 0x00010400
[195123.414609] cx23885[0]: cmds: iq size : 0x00000010
[195123.414612] cx23885[0]: cmds: risc pc lo : 0xb7262048
[195123.414615] cx23885[0]: cmds: risc pc hi : 0x00000000
[195123.414617] cx23885[0]: cmds: iq wr ptr : 0x00004105
[195123.414620] cx23885[0]: cmds: iq rd ptr : 0x00004109
[195123.414622] cx23885[0]: cmds: cdt current : 0x000105a8

Are those related issues?

Regards,
Jurgen
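
For reference, the "UNBALANCED!" reports above all show the same pattern: start_streaming and stop_streaming match, while wait_finish is one higher than wait_prepare. A minimal Python sketch for pulling that out of a dump is below. It assumes the log format shown above and that these two pairs are the ones worth comparing; the names `COUNTER_RE`, `PAIRS` and `unbalanced_pairs` are made up for illustration, and it handles one queue dump at a time.

```python
import re

# Picks out "name: 123" counters from a "vb2: ..." log line.
COUNTER_RE = re.compile(r"(\w+):\s*(\d+)")

# Pairs assumed to be expected to match (an assumption based on the log names).
PAIRS = [
    ("start_streaming", "stop_streaming"),
    ("wait_prepare", "wait_finish"),
]


def unbalanced_pairs(lines):
    """Return {(a, b): count_a - count_b} for every pair that differs."""
    counters = {}
    for line in lines:
        if "vb2:" not in line:
            continue
        for name, value in COUNTER_RE.findall(line):
            counters[name] = int(value)
    result = {}
    for a, b in PAIRS:
        if a in counters and b in counters and counters[a] != counters[b]:
            result[(a, b)] = counters[a] - counters[b]
    return result


dump = [
    "[570186.658591] vb2: counters for queue ffff880036876828: UNBALANCED!",
    "[570186.658593] vb2: setup: 1 start_streaming: 1 stop_streaming: 1",
    "[570186.658594] vb2: wait_prepare: 258956 wait_finish: 258957",
]
print(unbalanced_pairs(dump))  # {('wait_prepare', 'wait_finish'): -1}
```

The other two dumps above give the same -1 for the wait pair when run one at a time.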