Re: PCIe error reporting

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, Oct 16, 2017 at 02:23:22PM +0000, David Laight wrote:
> From: Bjorn Helgaas
> > On Wed, Oct 11, 2017 at 04:00:04PM +0000, David Laight wrote:
> > > From: Bjorn Helgaas

> > It would be interesting to know what Windows does about AER on that
> > platform.  I would expect Windows to respect the platform's wishes as
> > expressed by _OSC, so if Windows does AER recovery, there might be a
> > problem in the way Linux uses _OSC.
> 
> I've booted server 2012, difficult to say whether AER gets logged.
> There are some recent WHEA logs - but not from when I was generating
> errors.  The event viewer doesn't decode the data that might say
> what is being reported.

I don't know how to tell what Windows is doing with respect to AER.

> I've 'bodged' the Linux kernel to think that the BIOS gave it control
> of AER (set OSC_PCI_EXPRESS_AER_CONTROL into *mask and
> root->osc_control_set in acpi_pci_osc_control_set()).
> I think this is the earliest place the info is saved.
> This is enough to the 'pcieport ... AER enabled with IRQ nn' messages
> (It is sharing the interrupt with PME).
> 
> I've made sure my card is beneath one of the cpu bridges (the companion
> chip host bridges don't support AER).
> I've also bodged the driver to ioremap() an area larger than one of the
> BARs so I can generated PCIe read and write TLP that are outside the
> BAR ranges.
> Reads set CESta: NonFatalError.
> Writes set UESTA: UnsepReq and save the TLP header.
> No interrupts to aerdrv are generated.
> I can clear the status bits using setpci.
> Should I expect these errors to raise interrupts?

I think that depends on the Root Error Command register.



[Index of Archives]     [DMA Engine]     [Linux Coverity]     [Linux USB]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Greybus]

  Powered by Linux