Re: Tyan, RAID-6, and other recent hassles... (long, a bit OT)

Hi,

> Mobo was a Tyan Thunder K8W. (S2885)

Personally, I'm very disappointed with Tyan. Admittedly I only know their
Opteron mainboards, but that's already enough for me never to buy their
mainboards again.

Server B (failover): S2880. As soon as a cable is plugged into one of
the onboard Broadcom NICs, the system crashes. The Broadcom driver doesn't
even need to be loaded; the system may already crash during pre-boot
initialisation, or later during the kernel boot procedure.

Server A (main): The first S2882 board crashed rather soon: during pre-boot
initialisation, in the BIOS setup, or during the boot procedure. Sometimes
it even ran stable, for a few hours. After returning the server, we got
the system back with a new mainboard (if only it hadn't taken them two
months to do this :-( ).
With the new mainboard it ran stable for about 7 months, and we only
experienced some rather strange ext2 errors. It is also interesting to
note that those ext2 partitions were mirrored by drbd, and the errors
could not be observed on the corresponding partitions on Server B.
After replacing one of the software components that required ext2, we
could get rid of those ext2 partitions completely, and everything was
fine until 1.5 weeks ago. The server suddenly crashed, and crashed again
shortly after rebooting. Reiserfsck found quite a lot of problems on the
root partitions and we decided to restore from our backup. For several
reasons we are pretty convinced that those ext2 and reiserfs errors are
due to the onboard SIL3114 controller. It's probably not the RAM, since we
monitor the ECC memory using bluesmoke; furthermore, the much bigger data
partitions connected to the onboard SCSI controller are (so far) not
affected by any of those problems. We have now added an additional
Promise controller.
Unfortunately this didn't solve our crash problem. After updating the
BIOS to the most recent version, the Promise BIOS complained that it
couldn't find any hard disk. Downgrading to the version from last July
solved this problem and also almost solved our crash problem (before the
update, the BIOS was from last March or April). It now only crashes
sometimes during the BIOS initialisation...

Cluster, 16 nodes: S2881 boards. The master server crashed rather often
until we updated to linux-2.6.9 (the old kernel was 2.4.27).


Cheers,
	Bernd


PS: I'm also mailing this to Tyan. I'm curious what they will say about
this report.
