Re: w83795 fan control not working

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




On 04/14/2011 10:04 PM, Darren Hart wrote:
> On 04/08/2011 05:46 AM, Jean Delvare wrote:
>> The bottom line is that using the W83795 driver in a multi-master I2C
>> setup (and I strongly suspect this is what Supermicro did) is a bad
>> hardware design mistake. This hardware monitoring device wasn't
>> designed with this use case in mind.
> 
> Super Micro responded:
> "
>         Do you have an extra fan blowing air toward the northbridge
> heatsink. The temperature on northbridage heatsink must less than 75
> degree. Adding an extra fan which will help solve your issue.
>         We are not recommend user using lmsensor on X8DTL-IF. It will
> cause system crash due to lmsensor and our IPMI program getting
> information from BIOS at the same time and collide on each other. It
> won't happen immediately but definitely will happen in random time.
> "
> 
> It's a bit broken, but it sounds like they are confirming you theory.
> 
> As an experiment I removed the CPU2 fan and pointed it directly at the
> Intel 5520 chipset (technically not a Northbridge as it turns out...
> just ignore that intel.com in my email address, it means nothing ;-) and
> while I haven't been able to measure the temp1 reading from the w83795
> driver since my return, the fans no longer ramp up to 4k rpm and the
> chip is cool to the touch.
> 
> I'm seeking the recommended solution from Super Micro, failing that,
> I'll have to resort to chassis modding.... I thought that was for the
> overclocking-acrylic-window-neon-lights crowd.... sigh.

This is turning into a support issue for Super Micro, but I thought I'd
post the following for completeness.

After trying a different kernel, I was able to get reading from the
w83795 again. I applied the fan to the chipset until it reached it's
lowest point (52.5C while idle). I then positioned the fan away from the
chipset and watched the temperature rise until it reached 84.5C and the
fans sped up to > 4000RPM.

FAN 1            | 2401.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 2            | 0.000      | RPM        | nr    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 3            | 2401.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 4            | 4356.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 5            | 3969.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000


Given that the system is idle, and Super Mictro stated the chipset
should not exceed 75C, and I have no obstructions in the case and no
expansion boards to add heat, something appears to be wrong.

Here is an annotated log of the experiment, one reading every 10 seconds:

dvhart@rage:~$ while true; do sensors w83795g-i2c-0-2f | grep temp1;
sleep 10; done
temp1:       +61.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +59.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +57.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +56.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +55.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +54.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +53.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +52.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +52.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +53.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +55.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +56.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +58.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +60.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +61.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +63.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +64.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +65.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +66.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +67.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +68.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +69.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +70.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +71.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +72.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +72.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +73.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +74.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +75.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +75.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +76.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +77.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +77.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +77.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +78.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +78.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +79.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +79.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +79.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +80.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +80.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +80.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +81.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +81.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +81.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +81.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +82.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +82.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +82.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +82.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +82.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +83.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +84.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)

Fan speed jumped up at this point:

FAN 1            | 2401.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 2            | 0.000      | RPM        | nr    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 3            | 2401.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 4            | 4356.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000
FAN 5            | 3969.000   | RPM        | ok    | 400.000   | 576.000
  | 784.000   | 33856.000 | 34225.000 | 34596.000

And stayed at high speed until:

temp1:       +79.2ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +79.8ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +79.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +79.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +80.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +80.5ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)
temp1:       +81.0ÂC  (high = +127.0ÂC, hyst = +127.0ÂC)

And sped up again here.
And so on.

-- 
Darren Hart
Intel Open Source Technology Center
Yocto Project - Linux Kernel

_______________________________________________
lm-sensors mailing list
lm-sensors@xxxxxxxxxxxxxx
http://lists.lm-sensors.org/mailman/listinfo/lm-sensors



[Index of Archives]     [Linux Kernel]     [Linux Hardware Monitoring]     [Linux USB Devel]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [Yosemite Backpacking]

  Powered by Linux