Re: General Protection Fault with bcmsensors

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Martin,

On 8/10/05, Yani Ioannou <yani.ioannou at gmail.com> wrote:
> Hi,
> 
> I tested it out with 2.6.13-rc1, but I'll try it out with 2.6.13-rc6
> and get back to you.
> 
> Yani
> 
> On 8/10/05, Martin Drab <drab at kepler.fjfi.cvut.cz> wrote:
> > On Mon, 8 Aug 2005, Yani Ioannou wrote:
> > > On 8/8/05, Martin Drab <drab at kepler.fjfi.cvut.cz> wrote:
> > > > None of the kernel patches work for recent kernels. I'm not entirely sure
> > > > since when exactly it began, but from a certain point those patches on
> > > > the sf.net cased to work properly and began to crash like that.
> > > >
> > > > Recently there were some fixes that needed to be done in order to make
> > > > those patches on sf.net compile (changes in i2c structures), but though it
> > > > compiled, it crashed.
> > > >
> > > > So I thought I'll try the CVS bmcsensors-26. Those compiled without a
> > > > problem (that's probably the patch you are referring to and perhaps
> > > > simillar that I was using), but it crashes the same way as well. So I
> > > > think there may be some other problem introduced somewhere else in recent
> > > > kernels. Tomorrow when I get a chance to safely reboot the server I may
> > > > try to find when it was last working. From what I can briefly see from the
> > > > past kernel logs, it seems to have been working with the kernel 2.6.12.2,
> > > > then I tried the 2.6.13-rc4-git4 and it crashed allready. (Both had to be
> > > > patched to compile and for both I used the same patch.)
> > >
> > > lol..indeed the problem is something that was introduced into the
> > > kernel at 2.6.13-rc1 - a change by myself :-). The sysfs callbacks
> > > need to be updated (you should have got warnings complaining of an
> > > invalid function pointer when you compiled the module) for newer
> > > kernels, I'll fix that now, and commit it to CVS so you can try it out
> > > tomorrow, and I will try to test this out on my own machines tonight
> > > too.
> >
> > I've tested the current CVS again. It seems to be updated, and it no
> > longer gives the compile warnings. But still it produces a crash. although
> > the system isn't frozen, unless the IPMI watchdog is enabled, which runs
> > out and reboots the comp. When I disable it, the system runs on. So,
> > perhaps just the IPMI part crashes and the watchdog is no longer
> > refreshed.
> >
> > Anyway, the crash again occurs when you try to read the sensors
> > values (either directly through sysfs or using sensors) now produces this:
> >
> > ---------------
> > Aug 10 16:58:39 neutron kernel: [  672.694754] Unable to handle kernel paging request at ffffffff89feb91c RIP:
> > Aug 10 16:58:39 neutron kernel: [  672.694942] <ffffffff881e77cc>{:bmcsensors:show_sensor+44}
> > Aug 10 16:58:39 neutron kernel: [  672.719842] PGD 103027 PUD 105027 PMD 0
> > Aug 10 16:58:39 neutron kernel: [  672.719847] Oops: 0000 [1] SMP
> > Aug 10 16:58:39 neutron kernel: [  672.719850] CPU 1
> > Aug 10 16:58:39 neutron kernel: [  672.719852] Modules linked in: ipmi_si ipmi_devintf i2c_ipmi bmcsensors i2c_isa i2c_amd756 nfsd exportfs lockd nfs_acl parport_pc lp parport autofs4 sunrpc powernow_k8 freq_table binfmt_misc dm_mod video thermal processor hotkey fan container button battery ac ipv6 usbkbd usbhid ohci_hcd i2c_amd8111 i2c_core hw_random shpchp tg3 ide_cd cdrom sg usbcore ext3 jbd sd_mod
> > Aug 10 16:58:39 neutron kernel: [  672.719868] Pid: 3364, comm: sensors Not tainted 2.6.13-rc6
> > Aug 10 16:58:39 neutron kernel: [  672.719870] RIP: 0010:[<ffffffff881e77cc>] <ffffffff881e77cc>{:bmcsensors:show_sensor+44}
> > Aug 10 16:58:39 neutron kernel: [  672.719878] RSP: 0018:ffff810076ff9ea8  EFLAGS: 00010216
> > Aug 10 16:58:39 neutron kernel: [  672.719881] RAX: 0000000001dfff88 RBX: ffff810037c65240 RCX: fffffffffffffff4
> > Aug 10 16:58:39 neutron kernel: [  672.719885] RDX: 00000000001ffff8 RSI: ffff810037c65240 RDI: ffffffff881eac00
> > Aug 10 16:58:39 neutron kernel: [  672.719888] RBP: ffff810075f954a0 R08: ffff810040000000 R09: 0000000000000282
> > Aug 10 16:58:39 neutron kernel: [  672.719891] R10: 0000000000000000 R11: ffffffff881e77a0 R12: ffff81007f3f0000
> > Aug 10 16:58:39 neutron kernel: [  672.719895] R13: ffff810076ff9f50 R14: ffffffff80438440 R15: ffffffff881ead00
> > Aug 10 16:58:39 neutron kernel: [  672.719899] FS:  00002aaaaaad9e40(0000) GS:ffffffff80541880(0000) knlGS:0000000061b80bb0
> > Aug 10 16:58:39 neutron kernel: [  672.719902] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> > Aug 10 16:58:39 neutron kernel: [  672.719905] CR2: ffffffff89feb91c CR3: 000000003b70d000 CR4: 00000000000006e0
> > Aug 10 16:58:39 neutron kernel: [  672.719908] Process sensors (pid: 3364, threadinfo ffff810076ff8000, task ffff81003c54c070)
> > Aug 10 16:58:39 neutron kernel: [  672.719910] Stack: ffffffff881ead00 ffff810075f95480 0000000000001000 ffffffff801c48af
> > Aug 10 16:58:39 neutron kernel: [  672.719916]        0000000000000000 00002aaaadb48000 ffff810037c65240 ffff81007df09180
> > Aug 10 16:58:39 neutron kernel: [  672.719921]        0000000000000000 0000000000001000
> > Aug 10 16:58:39 neutron kernel: [  672.719923] Call Trace:<ffffffff801c48af>{sysfs_read_file+175} <ffffffff8018342b>{vfs_read+219}
> > Aug 10 16:58:39 neutron kernel: [  672.719942]        <ffffffff80183bf3>{sys_read+83} <ffffffff8010dcf2>{system_call+126}
> > Aug 10 16:58:40 neutron kernel: [  672.719952]
> > Aug 10 16:58:40 neutron kernel: [  672.719957]
> > Aug 10 16:58:40 neutron kernel: [  672.719958] Code: 0f b6 b8 94 b9 1e 88 48 8d b0 60 b9 1e 88 e8 11 fe ff ff 4c
> > Aug 10 16:58:40 neutron kernel: [  672.719965] RIP <ffffffff881e77cc>{:bmcsensors:show_sensor+44} RSP <ffff810076ff9ea8>
> > Aug 10 16:58:40 neutron kernel: [  672.719972] CR2: ffffffff89feb91c
> > ---------------
> >
> > Is it possible you forgot to update something? Or is it a different bug?
> >
> > Martin
> >
> >
> 

Indeed with 2.6.13-rc6 I have the same problem, but I highly suspect
it has to do with the new hwmon patch. To move bmcsensors to hwmon
though I need a device struct in the IPMI subsystem to hang
device_attributes off and that brings us back to the same place the
re-write (ipmi-sensors) is stuck at right now (see my updated web page
at http://bmcsensors-26.sourceforge.net/).

I have quite a bit of work done on adding the driver model to the IPMI
subsystem, but it is not in any state to be submitted yet, and I won't
have the time to work on it for a week :-(.

Yani




[Index of Archives]     [Linux Kernel]     [Linux Hardware Monitoring]     [Linux USB Devel]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [Yosemite Backpacking]

  Powered by Linux