On 10/12/07, Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote: > On Fri, 12 Oct 2007 10:31:42 +0200 "Torsten Kaiser" <just.for.lkml@xxxxxxxxxxxxxx> wrote: > > Oct 12 10:23:03 treogen smartd[6091]: Device: /dev/sdc, not found in > > smartd database. > > hm. > > > Oct 12 10:23:03 treogen [ 105.990000] WARNING: at > > drivers/ata/libata-core.c:5752 ata_qc_issue() > > Let's cc linux-ide. > > > Oct 12 10:23:03 treogen [ 105.990000] > > Oct 12 10:23:03 treogen [ 105.990000] Call Trace: > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804442ef>] > > ata_qc_issue+0x47f/0x540 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff80432e60>] scsi_done+0x0/0x20 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff80449c80>] > > ata_scsi_flush_xlat+0x0/0x30 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8044a6ea>] > > ata_scsi_translate+0xfa/0x180 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff80432e60>] scsi_done+0x0/0x20 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8044d84d>] > > ata_scsi_queuecmd+0x12d/0x210 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804333d0>] > > scsi_dispatch_cmd+0x150/0x250 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804391f1>] > > scsi_request_fn+0x1f1/0x360 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8039b827>] > > elv_insert+0x167/0x250 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff803a0ac2>] > > __make_request+0xe2/0x670 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8039d560>] > > generic_make_request+0x1d0/0x3c0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff802bc1b9>] > > bio_alloc_bioset+0xb9/0x140 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff802bc061>] > > __bio_clone+0x91/0xc0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8039d7b6>] > > submit_bio+0x66/0xf0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804cc06e>] > > write_page+0x16e/0x2c0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff80231b01>] > > dequeue_task_fair+0x51/0xb0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804c482d>] > > md_update_sb+0x18d/0x320 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804caa10>] md_thread+0x0/0x100 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804c9065>] > > md_check_recovery+0x1f5/0x550 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804caa10>] md_thread+0x0/0x100 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804bf1d3>] raid5d+0x23/0x490 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8023eb12>] > > try_to_del_timer_sync+0x52/0x60 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff805b0057>] > > schedule_timeout+0x67/0xd0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8023e740>] > > process_timeout+0x0/0x10 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff805b004a>] > > schedule_timeout+0x5a/0xd0 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804caa10>] md_thread+0x0/0x100 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804caa40>] > > md_thread+0x30/0x100 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8024a710>] > > autoremove_wake_function+0x0/0x30 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff804caa10>] md_thread+0x0/0x100 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8024a32b>] kthread+0x4b/0x80 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8020c9d8>] child_rip+0xa/0x12 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8024a2e0>] kthread+0x0/0x80 > > Oct 12 10:23:03 treogen [ 105.990000] [<ffffffff8020c9ce>] child_rip+0x0/0x12 > > Oct 12 10:23:03 treogen [ 105.990000] > > Oct 12 10:23:13 treogen [ 115.940000] ata3.00: exception Emask 0x0 > > SAct 0x0 SErr 0x0 action 0x2 frozen > > Oct 12 10:23:13 treogen [ 115.940000] ata3.00: cmd > > b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0 cdb 0x0 data 0 > > Oct 12 10:23:13 treogen [ 115.940000] res > > 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) > > Oct 12 10:23:13 treogen [ 115.940000] ata3.00: status: { DRDY } > > Oct 12 10:23:14 treogen [ 116.270000] ata3: soft resetting link > > Oct 12 10:23:14 treogen [ 116.430000] ata3: SATA link up 3.0 Gbps > > (SStatus 123 SControl 300) > > Oct 12 10:23:14 treogen [ 116.740000] ata3.00: configured for UDMA/133 > > Oct 12 10:23:14 treogen [ 116.740000] ata3: EH complete > > Oct 12 10:23:14 treogen [ 116.740000] WARNING: at > > drivers/ata/libata-core.c:5752 ata_qc_issue() > > Oct 12 10:23:14 treogen [ 116.740000] > > Oct 12 10:23:14 treogen [ 116.740000] Call Trace: > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff804442ef>] > > ata_qc_issue+0x47f/0x540 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80432e60>] scsi_done+0x0/0x20 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80449c80>] > > ata_scsi_flush_xlat+0x0/0x30 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8044a6ea>] > > ata_scsi_translate+0xfa/0x180 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80432e60>] scsi_done+0x0/0x20 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8044d84d>] > > ata_scsi_queuecmd+0x12d/0x210 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff804333d0>] > > scsi_dispatch_cmd+0x150/0x250 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff804391f1>] > > scsi_request_fn+0x1f1/0x360 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80436b80>] > > scsi_error_handler+0x0/0x310 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8039fe73>] > > blk_run_queue+0x43/0x80 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80438659>] > > scsi_run_host_queues+0x19/0x40 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80436d54>] > > scsi_error_handler+0x1d4/0x310 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff80436b80>] > > scsi_error_handler+0x0/0x310 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8024a32b>] kthread+0x4b/0x80 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8020c9d8>] child_rip+0xa/0x12 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8024a2e0>] kthread+0x0/0x80 > > Oct 12 10:23:14 treogen [ 116.740000] [<ffffffff8020c9ce>] child_rip+0x0/0x12 > > Oct 12 10:23:14 treogen [ 116.740000] > > Oct 12 10:23:14 treogen [ 116.770000] sd 2:0:0:0: [sdc] 625142448 > > 512-byte hardware sectors (320073 MB) > > Oct 12 10:23:14 treogen [ 116.770000] sd 2:0:0:0: [sdc] Write Protect is off > > Oct 12 10:23:14 treogen [ 116.770000] sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 > > Oct 12 10:23:14 treogen [ 116.770000] sd 2:0:0:0: [sdc] Write cache: > > enabled, read cache: enabled, doesn't support DPO or FUA > > Oct 12 10:23:24 treogen [ 126.740000] ata3.00: exception Emask 0x0 > > SAct 0x0 SErr 0x0 action 0x2 frozen > > Oct 12 10:23:24 treogen [ 126.740000] ata3.00: cmd > > b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0 cdb 0x0 data 0 > > Oct 12 10:23:24 treogen [ 126.740000] res > > 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) > > Oct 12 10:23:24 treogen [ 126.740000] ata3.00: status: { DRDY } > > Oct 12 10:23:24 treogen [ 127.070000] ata3: soft resetting link > > Oct 12 10:23:25 treogen [ 127.230000] ata3: SATA link up 3.0 Gbps > > (SStatus 123 SControl 300) > > Oct 12 10:23:25 treogen [ 127.370000] ata3.00: configured for UDMA/133 > > Oct 12 10:23:25 treogen [ 127.370000] ata3: EH complete > > Oct 12 10:23:25 treogen [ 127.370000] sd 2:0:0:0: [sdc] 625142448 > > 512-byte hardware sectors (320073 MB) > > Oct 12 10:23:25 treogen [ 127.370000] sd 2:0:0:0: [sdc] Write Protect is off > > Oct 12 10:23:25 treogen [ 127.370000] sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 > > Oct 12 10:23:25 treogen [ 127.370000] sd 2:0:0:0: [sdc] Write cache: > > enabled, read cache: enabled, doesn't support DPO or FUA > > Oct 12 10:23:25 treogen smartd[6091]: Device: /dev/sdc, is SMART > > capable. Adding to "monitor" list. > > ... but I can still access the filesystem and the RAID device on that drive. > > (sdc is MAXTOR STM332082 3.AA sata-drive on a MCP55 using sata_nv with > > swncq activated) > > > > Torsten > On the next boot no WARNING show up. On the third boot with 2.6.23-mm1 the drive failed completely: First I got this WARNING: Oct 13 07:46:48 treogen smartd[6081]: Device: /dev/sdc, opened Oct 13 07:46:48 treogen [ 99.850000] WARNING: at drivers/ata/libata-core.c:5761 ata_qc_issue() Oct 13 07:46:48 treogen [ 99.850000] Oct 13 07:46:48 treogen [ 99.850000] Call Trace: Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8044431a>] ata_qc_issue+0x4aa/0x540 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff80432e60>] scsi_done+0x0/0x20 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8044ce30>] ata_scsi_pass_thru+0x0/0x2c0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8044a6ea>] ata_scsi_translate+0xfa/0x180 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff80432e60>] scsi_done+0x0/0x20 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8044d84d>] ata_scsi_queuecmd+0x12d/0x210 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff804333d0>] scsi_dispatch_cmd+0x150/0x250 Oct 13 07:46:48 treogen smartd[6081]: Device: /dev/sdc, not found in smartd database. Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff804391f1>] scsi_request_fn+0x1f1/0x360 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8039f362>] blk_execute_rq_nowait+0x62/0xb0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8039f446>] blk_execute_rq+0x96/0x110 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8039f5b1>] get_request_wait+0x21/0x1a0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8022c8ea>] __wake_up_common+0x5a/0x90 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff80438e14>] scsi_execute+0xe4/0x120 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8044cb14>] ata_cmd_ioctl+0x124/0x270 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8044cd67>] ata_scsi_ioctl+0x107/0x1d0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8043424c>] scsi_ioctl+0xbc/0x330 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff803a14f3>] blkdev_driver_ioctl+0x93/0xa0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff803a1766>] blkdev_ioctl+0x266/0x7c0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8022c8ea>] __wake_up_common+0x5a/0x90 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8022c8ea>] __wake_up_common+0x5a/0x90 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8022d543>] __wake_up+0x43/0x70 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff802babba>] invalidate_inode_buffers+0x2a/0x100 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8024a5e0>] bit_waitqueue+0x10/0xd0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff802bd08b>] block_ioctl+0x1b/0x30 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff802a08bf>] do_ioctl+0x2f/0xa0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff802a0b50>] vfs_ioctl+0x220/0x2d0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff802a0c91>] sys_ioctl+0x91/0xb0 Oct 13 07:46:48 treogen [ 99.850000] [<ffffffff8020bbbe>] system_call+0x7e/0x83 Oct 13 07:46:48 treogen [ 99.850000] Oct 13 07:46:48 treogen [ 99.850000] ata3: EH in SWNCQ mode,QC:qc_active 0x3 sactive 0x1 Oct 13 07:46:48 treogen [ 99.850000] ata3: SWNCQ:qc_active 0x1 defer_bits 0x0 last_issue_tag 0x0 Oct 13 07:46:48 treogen [ 99.850000] dhfis 0x1 dmafis 0x0 sdbfis 0x0 Oct 13 07:46:48 treogen [ 99.850000] ata3: ATA_REG 0x51 ERR_REG 0x4 Oct 13 07:46:48 treogen [ 99.850000] ata3: tag : dhfis dmafis sdbfis sacitve Oct 13 07:46:48 treogen [ 99.850000] ata3: tag 0x0: 1 0 0 1 Oct 13 07:46:48 treogen [ 99.850000] ata3.00: exception Emask 0x1 SAct 0x1 SErr 0x0 action 0x6 frozen Oct 13 07:46:48 treogen [ 99.850000] ata3.00: Ata error. fis:0x41 Oct 13 07:46:48 treogen [ 99.850000] ata3.00: cmd 60/30:00:d1:6b:db/00:00:18:00:00/40 tag 0 cdb 0x0 data 24576 in Oct 13 07:46:48 treogen [ 99.850000] res 51/04:00:01:4f:c2/04:00:d1:6b:db/00 Emask 0x1 (device error) Oct 13 07:46:48 treogen [ 99.850000] ata3.00: status: { DRDY ERR } Oct 13 07:46:48 treogen [ 99.850000] ata3.00: error: { ABRT } Oct 13 07:46:48 treogen [ 99.850000] ata3.00: cmd b0/d8:00:01:4f:c2/00:00:00:00:00/00 tag 1 cdb 0x0 data 0 Oct 13 07:46:48 treogen [ 99.850000] res 51/04:00:01:4f:c2/00:00:00:00:00/00 Emask 0x1 (device error) Oct 13 07:46:48 treogen [ 99.850000] ata3.00: status: { DRDY ERR } Oct 13 07:46:48 treogen [ 99.850000] ata3.00: error: { ABRT } Oct 13 07:46:48 treogen [ 99.850000] ata3: hard resetting link Oct 13 07:46:49 treogen [ 100.360000] ata3: SATA link up 3.0 Gbps (SStatus 123 SControl 300) Oct 13 07:46:49 treogen [ 100.510000] ata3.00: configured for UDMA/133 Oct 13 07:46:49 treogen [ 100.510000] ata3: EH complete then the other two WARNINGs again. (drivers/ata/libata-core.c:5752) After that the drive is inaccessible. The last now "good" kernel for this problem is probable 2.6.23-rc8-mm1. That version only had the sata_sil24-bug (ata_sg_is_last). I only booted 2.6.23-rc8-mm2 one time and that one try did not complete bootup. But was neither able to see the complete OOPS or save it. And as I was still trying to find the other bug, I did not investigate more. Torsten - To unsubscribe from this list: send the line "unsubscribe linux-ide" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html