Hi list,
I seem to have a bug with JMicron controller in a Gigabyte GA-N680SLI-DQ6
motherboard.
http://www.gigabyte.com.tw/Support/Motherboard/BIOS_Model.aspx?ProductID=2460
Kernel is 2.6.24.
10 on-board SATA connectors, 2+4*JMicron 20360/20363 + 4*nVidia MCP55
2*200GB disks (System - SW RAID1) on the JMicron controller and
8*500 (Data - SW RAID6) - 4 on the JMicron, 4 on the nVidia controller.
Under heavy load the JMicron controller gets exceptions, then eventually
"hard resetting link".
All 4 disks/connector, one after another. This of course "kills" the RAID
Excerpt from syslog
Feb 9 16:16:32 storage1 kernel: ata2.00: exception Emask 0x0 SAct 0x3ffff
SErr 0x0 action 0x2 frozen
Feb 9 16:16:32 storage1 kernel: ata1.00: exception Emask 0x0 SAct 0x1fffff
SErr 0x0 action 0x2 frozen
Feb 9 16:16:32 storage1 kernel: ata1.00: cmd
61/08:00:73:12:d9/00:00:23:00:00/40 tag 0 ncq 4096 out
Feb 9 16:16:32 storage1 kernel: res
40/00:80:c3:7c:d3/00:01:23:00:00/40 Emask 0x4 (timeout)
Feb 9 16:16:32 storage1 kernel: ata1.00: status: { DRDY }
...
Feb 9 16:16:32 storage1 kernel: ata1.00: cmd
61/80:a0:c3:1f:d9/00:00:23:00:00/40 tag 20 ncq 65536 out
Feb 9 16:16:32 storage1 kernel: res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb 9 16:16:32 storage1 kernel: ata1.00: status: { DRDY }
Feb 9 16:16:32 storage1 kernel: ata1: hard resetting link
Didn't dare to post all attachments, so
full dmesg
lspci -nn
syslog - from the error start
can be downloaded from:
http://www.huweb.hu/maques/tmp/jmicron
I'm lost.
Anyone seen such thing? What could it be? Hardware (MB, chipset, BIOS),
kernel (driver) or what?
Any suggestion? Kernel version to try, dispose hardware or shoot myself in
the head?
Thanks,
Gabor
-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html