On 5/3/2012 3:41 PM, Brian Candler wrote: > On Thu, May 03, 2012 at 07:50:00AM -0500, Stan Hoeppner wrote: >>> Any other suggestions (and of course interpretation of the kernel call >>> tracebacks) would be much appreciated. >> >> Which mainboards are these Brian? Make/model? > > Tyan S5510, with Intel Xeon CPU E31225 @ 3.10GHz > > Upgraded to BIOS 1.05a and iKVM 3.00 > >> Make/model/count of all add in cards? > > 1 x LSI SAS9201–16i > 1 x LSI SAS92118–8i > 1 x Intel X520-DA2 dual 10G NIC > > although the 10G link wasn't being used for the most recent tests. I'm not finding anything down this avenue so far. >> Make/model of PSU? > > Will have to check, I think it may be this one: > http://www.xcase.co.uk/XCASE-Power-Supply-p/psu-dolphin-900..htm > >> Make model of chassis? > > http://www.xcase.co.uk/24-bay-Hotswap-rackmount-chassis-norco-RPC-4224-p/case-xcase-rm424.htm > > The drives are 24 x ST3000DM001 (I was hoping to get low-power Hitachi > drives but they weren't available at the time) Yuk. Zero technical info available on this "house brand" PSU or chassis. That PSU is decidedly consumer oriented. It may be fine, but there seems to be no documentation available to verify its output current or rail config. No pic of the PSU sticker, no manual to be found online. Many of these "house brand" and cheap no-name Chinese PSUs don't put out anywhere near the advertised power, nor necessarily cleanly, which is why the UL sticker is so handy--it doesn't lie as the printing on the box might. No pic of the inside of that chassis, and no manual here either, so I have no way of knowing the backplane architecture. I'd have to guess it's a 6x 4 drive passive backplane setup similar to the Norco chassis. FWIW, I'd never buy a PSU with a "technical description" of: 140mm Red Fan Strong Long Cables Strong outer Casing Quality Internal Components Low Noise - Big Heat Sinks One can spend a few extra bucks (pounds) and get a SuperMicro or Sparkle unit with all the tech specs and QC test data available online, such as: http://www.supermicro.com/products/powersupply/80plus/80plus_pws-865-pq.pdf > However, last night I rebooted one box (the one which wouldn't let me ssh > in) then upgraded it to ubuntu 12.04. It has been running a couple of > concurrent bonnie++ instances for over 24 hours without a hitch. > > So maybe it's the mpt2sas driver which is the difference: > > [dmesg from Ubuntu 11.10] > mpt2sas version 08.100.00.02 > > [dmesg from Ubuntu 12.04] > mpt2sas version 10.100.00.00 Glad to hear you've got one running somewhat stable. Could be a driver problem, but it's pretty rare for a SCSI driver to hard lock a box isn't it? Keep us posted. -- Stan _______________________________________________ xfs mailing list xfs@xxxxxxxxxxx http://oss.sgi.com/mailman/listinfo/xfs