On 04/14/2011 10:04 PM, Darren Hart wrote:
> On 04/08/2011 05:46 AM, Jean Delvare wrote:
>> The bottom line is that using the W83795 driver in a multi-master I2C
>> setup (and I strongly suspect this is what Supermicro did) is a bad
>> hardware design mistake. This hardware monitoring device wasn't
>> designed with this use case in mind.
>
> Super Micro responded:
> "
> Do you have an extra fan blowing air toward the northbridge
> heatsink. The temperature on northbridage heatsink must less than 75
> degree. Adding an extra fan which will help solve your issue.
> We are not recommend user using lmsensor on X8DTL-IF. It will
> cause system crash due to lmsensor and our IPMI program getting
> information from BIOS at the same time and collide on each other. It
> won't happen immediately but definitely will happen in random time.
> "
>
> It's a bit broken, but it sounds like they are confirming your theory.
>
> As an experiment I removed the CPU2 fan and pointed it directly at the
> Intel 5520 chipset (technically not a Northbridge as it turns out...
> just ignore that intel.com in my email address, it means nothing ;-) and
> while I haven't been able to measure the temp1 reading from the w83795
> driver since my return, the fans no longer ramp up to 4k rpm and the
> chip is cool to the touch.
>
> I'm seeking the recommended solution from Super Micro, failing that,
> I'll have to resort to chassis modding.... I thought that was for the
> overclocking-acrylic-window-neon-lights crowd.... sigh.

This is turning into a support issue for Super Micro, but I thought I'd
post the following for completeness. After trying a different kernel, I
was able to get readings from the w83795 again. I applied the fan to the
chipset until it reached its lowest point (52.5C while idle). I then
positioned the fan away from the chipset and watched the temperature rise
until it reached 84.5C and the fans sped up to > 4000RPM.

FAN 1 | 2401.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 2 |    0.000 | RPM | nr | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 3 | 2401.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 4 | 4356.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 5 | 3969.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000

Given that the system is idle, that Super Micro stated the chipset should
not exceed 75C, and that I have no obstructions in the case and no
expansion boards to add heat, something appears to be wrong.

Here is an annotated log of the experiment, one reading every 10 seconds:

dvhart@rage:~$ while true; do sensors w83795g-i2c-0-2f | grep temp1; sleep 10; done
temp1: +61.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +59.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +57.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +56.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +55.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +54.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +53.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +52.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +52.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +53.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +55.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +56.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +58.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +60.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +61.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +63.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +64.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +65.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +66.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +67.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +68.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +69.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +70.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +71.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +72.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +72.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +73.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +74.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +75.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +75.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +76.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +77.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +77.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +77.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +78.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +78.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +79.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +79.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +79.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +80.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +80.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +80.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +81.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +81.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +81.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +81.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +82.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +82.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +82.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +82.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +82.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +83.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +84.5°C (high = +127.0°C, hyst = +127.0°C)

Fan speed jumped up at this point:

FAN 1 | 2401.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 2 |    0.000 | RPM | nr | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 3 | 2401.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 4 | 4356.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000
FAN 5 | 3969.000 | RPM | ok | 400.000 | 576.000 | 784.000 | 33856.000 | 34225.000 | 34596.000

And stayed at high speed until:

temp1: +79.2°C (high = +127.0°C, hyst = +127.0°C)
temp1: +79.8°C (high = +127.0°C, hyst = +127.0°C)
temp1: +79.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +79.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +80.0°C (high = +127.0°C, hyst = +127.0°C)
temp1: +80.5°C (high = +127.0°C, hyst = +127.0°C)
temp1: +81.0°C (high = +127.0°C, hyst = +127.0°C)

And sped up again here. And so on.

-- 
Darren Hart
Intel Open Source Technology Center
Yocto Project - Linux Kernel
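
For anyone repeating the experiment, the polling loop in the log above could be extended to timestamp each reading and flag those above the 75C figure Super Micro quoted. What follows is only a minimal sketch: the function names (`extract_temp`, `over_limit`, `watch_temp1`) and the `LIMIT` variable are hypothetical and not part of lm-sensors, and the parsing assumes the `temp1: +NN.N` line format shown in the log.

```shell
#!/bin/sh
# Sketch only: helper names are made up for illustration. The chip name
# w83795g-i2c-0-2f and the 75 C figure are taken from the thread.

LIMIT=75   # chipset heatsink limit quoted by Super Micro, in degrees C

# Print the numeric temp1 reading from one line of `sensors` output.
# Assumes a positive reading formatted as "temp1: +NN.N...".
extract_temp() {
    printf '%s\n' "$1" | sed -n 's/^temp1:[[:space:]]*+\([0-9][0-9.]*\).*/\1/p'
}

# Succeed (exit status 0) when reading $1 is strictly above limit $2.
over_limit() {
    awk -v t="$1" -v lim="$2" 'BEGIN { exit !(t + 0 > lim + 0) }'
}

# Poll every 10 seconds, print a timestamp with each reading, and mark
# readings above LIMIT. Runs until interrupted.
watch_temp1() {
    while true; do
        line=$(sensors w83795g-i2c-0-2f | grep temp1)
        t=$(extract_temp "$line")
        if [ -n "$t" ] && over_limit "$t" "$LIMIT"; then
            flag=" OVER ${LIMIT}C"
        else
            flag=""
        fi
        printf '%s %s%s\n' "$(date +%H:%M:%S)" "$t" "$flag"
        sleep 10
    done
}
```

Sourcing the file and calling `watch_temp1` starts the loop. With timestamps in the output it becomes easier to line up the temperature readings with the moments the fans change speed.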