On 09/23/2017 05:29 AM, Eyal Lebedinsky wrote: > On 23/09/17 08:52, Eyal Lebedinsky wrote: >> Thanks Phil, >> >> Interesting. I checked the card and it still looks "as new", no dust. >> It does not have a fan and the heatsink is glued firmly to the processor. >> Does not look like there is anything I can do here. >> >> I added a fan firing external air directly at the card and will test >> again >> later. I really need to understand the source of the problem. If this >> fixes >> it then I will get a replacement controller (the fan is just a hack). >> >>> Phil > > The 'check' completed, something it was not able to do before, so it > seems that > it was an overheating problem. Now that I know, I searched for the > relevant terms > and did find some notes suggesting this card is known to have such an > issue. > > But, why now? I could only guess that the heatsink tape (the only thing > connecting the processor to the heatsink) has aged enough to a low level of > performance. I am not sure that I can fix it, or even remove the > heatsink safely. Chips get old. Primarily due to thermal cycling producing expansion cracking. Silicon and the various deposited metals expand/contract with different coefficients, so temperature changes stress the mating points between semiconductors and interconnect. (Sounds like your problem, as extra cooling helps.) If thermal stress doesn't kill them, then dopant diffusion within the semiconductors eventually will. This occurs even when turned off, but proceeds much faster at elevated temperatures. Dopant concentrations are engineered to not suffer from diffusion effects well beyond the expected life of the components (at rated operating temps), but that doesn't mean there aren't marginal production runs. Which can be hard to discover before parts start failing years later. If that happens after the warranty period, the manufacturer has dodged a big bullet. With a bit of tarnish on their name, of course. > [OT: the following documents how I handled the 'check' results] > Thanks, I feel much better now. I like happy endings. (-: But it's too bad you have to replace it. Phil -- To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html