dme1737 error messages

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Jean,

On 11/6/07, Jean Delvare <khali at linux-fr.org> wrote:
> Hi Juerg,
>
> On Tue, 6 Nov 2007 10:31:22 -0800, Juerg Haefliger wrote:
> > Hi Bobby,
> >
> > On 11/4/07, Borislav Davitkov <davitkov at gmail.com> wrote:
> > > Hi Juerg,
> > >
> > > With acpi diabled, both kind of errors are gone (acpi errors, which is to be
> > > expected, and dme errors). I have another screenshot for you. However this
> > > also disables hyperthreading, and now I have only cpu0, whereas with acpi on
> > > I had cpu0 and cpu1.
> >
> > You can specify acpi=ht to just enable enough ACPI for hyperthreading.
> >
> >
> > > With no hwmonitoring - only acpi errors.
> > > With dme1737 module loaded - acpi errors plus dme errors.
> > > With acpi off - no errors at all.
> >
> > OK, so it's definitely ACPI that interferes with the dme1737 driver.
> > Not good. I'm surprised that removing the fan and thermal modules
> > didn't get rid of the ACPI errors. But then again I'm not an ACPI
> > expert so don't really know how it works and what to expect...
> >
> > I recompiled the DSDT that you sent me and it's definitely broken. You
> > can try to fix it but again I don't know what you can expect from that
> > exercise. I also took a closer look at the DSDT and there's code that
> > accesses the dme1737 all over the place. It mocks around with temp
> > limits and PWM duty cycles and more. It's definitely *not* safe to use
> > the dme1737 driver, it's interfering with ACPI regulating the fans
> > based on the measured temps. Now why ACPI is controlling the fans
> > instead of letting the dme1737 chip take care of it is beyond me...
> > You'd have to ask the smart designers at ASUS :-)
>
> There's nothing fundamentally wrong with this, if the ACPI
> implementation is correct. Why require an OS driver if they can do the
> same in ACPI and it works on all OSes without user interaction?

What I meant by my comment is that I don't understand why they don't
make use of the automatic fan control feature of the dme1737 chip.
That would be much simpler (setting up a couple of registers) and
wouldn't require implementing a closed-loop control algorithm SW. It's
an ASUS board and they do it for other motherboards, so why not on
this one? Maybe it's an HP requirement, I don't know...


> Of course this also means that the user has almost no control over
> what's going on, which is alright if things work well, but becomes a
> problem if they don't. The worst thing is that almost all the time, the
> ACPI stuff only exposes a small subset of the hardware monitoring
> chip's features, which is very frustrating for computer enthusiasts.
>
> > Bottom line, I think you have 3 options:
> > 1) Disable ACPI and use the dme1737 driver. In that case you loose the
> > automatic fan control unless you set it up yourself. Dangerous, you're
> > interfering with the way the machine is designed to work and might fry
> > it.
> > 2) Take it to the ACPI mailing list and try to figure out what exactly
> > ACPI is doing and what needs to be fixed to get rid of the ACPI errors
> > (that still doesn't fix the dme1737/ACPI conflict issue).
> > 3) If you're brave, you can keep using the dme1737 driver, ignore all
> > the erros and hope and pray it doesn't interfere with ACPI in a way
> > that puts the HW at risk.
> >
> > Jean: Do you know where the whole ACPI/hwmon conflict issue is at? Do
> > we know what the problem is, when it occurs, and how to get around it?
>
> The patch I posted (which will be in the next -mm kernel also) attempts
> to detect the conflicts, and the hwmon and i2c (bus) drivers will
> cooperate and refuse to load when a conflict is detected. This approach
> relies on the fact that ACPI is supposed to declare which ports it may
> access at run time. This may yield false negatives (if ACPI doesn't
> declare port usage properly) and false positives (if ACPI declares
> ports that it doesn't actually use.) So we're waiting to see the effect
> of the patch on a large number of systems, if it is acceptable it will
> go upstream.
>
> It would be great if Borislav could try either my patch or the next -mm
> kernel, and report whether the conflict is detected or not.

He claims he tested it and it didn't do anything. Bobby: Can you elaborate?


> What we will do next, I'm not sure. In some cases it might be possible
> to synchronize the hwmon driver accesses with the ACPI accesses, then
> we can get both to cooperate.

How can we achieve cooperation between the two?


> In other cases it probably won't be
> possible and we'll have blacklist the hwmon driver, I fear.

Uh-huh... Based on the motherboard/system type? How does that work?


> There is
> also at least one known case where an "ACPI hwmon driver" can be
> written on top of the ACPI code to expose all the features as regular
> hwmon drivers do. All in all, the outcome will probably be highly
> dependent upon the machine, as every ACPI implementation is different.
>
> I'll keep the list updated when things progress, of course.

Thanks, very much appreciated.

...juerg


> --
> Jean Delvare
>




[Index of Archives]     [Linux Kernel]     [Linux Hardware Monitoring]     [Linux USB Devel]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [Yosemite Backpacking]

  Powered by Linux