Re: SGI Octane && Bridge DMA bug

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 08/28/2016 23:33, Joshua Kinard wrote:
> On 08/28/2016 14:06, Florian Fainelli wrote:
>> 2016-08-28 9:58 GMT-07:00 Joshua Kinard <kumba@xxxxxxxxxx>:
>>> On 08/28/2016 08:01, Joshua Kinard wrote:

[snip]

>>
>> Regarding your first question, for all plat_dma_* operations you
>> should be able to inspect the struct device properties and provide the
>> correct implementation based on whether this device is a child of the
>> Bridge IOMMU or not (e.g: looking at dev->parent.name for instance?)
> 
> Stan's original code used to check the struct device *dev arg for !NULL to
> determine if it needed to be cast to struct pci_device for Bridge ops, else,
> regard it as the Impact board.  But when Impact was converted to a platform
> device, that check would no longer work (dev was always set then), so I
> switched it to checking that dev->bus->name was "pci".  I thought that code got
> executed a lot, though, and strcmp() is expensive.  Turns out, the plat_dma_*
> functions are not called very often, so a strcmp() shouldn't be too much of an
> issue.
> 
> 
>> You are right that this only works for addresses that have already
>> been allocated, if you need to make sure that the allocation falls
>> under a particular range as well, which is not taken care of by
>> dma-default.c, either setting an appropriate dma_mask, or providing a
>> custom implementation for dma_ma_ops may be required here.
> 
> OpenBSD's using some "uvm" subsystem that appears to be quite adaptable once
> you set a few parameters, which is what their xbridge driver is doing, but it's
> completely unlike what Linux has.  I can't tell yet if I have to guarantee that
> Bridge DMA allocations have to stay within 0x20000000 and 0x9fffffff (possibly
> subtracting/adding 0x20000000 as needed to deal w/ the physical address
> offset), or if I have to just translate already-allocated addresses to/from
> that range.  If the latter, I should be able to do that w/ dma-coherence.h.
> Else, it'll probably be a custom dma_ops setup.  At least I have Loongson and
> Octeon to look at for examples.
> 
> Luckily, SGI appears to have imported large chunks of the original IRIX PCI
> code into Linux when they were bringing up the Altix platform.  So I've been
> referring back to 2.4.18 and 2.5.70 to see how the "pcibr.c" and "xtalk.c"
> drivers implemented a lot of stuff in IA64.
> 
> Can't find anything specific to the Octane in the Linux code, though.  So I
> can't tell if they had any workarounds in place or not for the Bridge ASIC on
> this platform.  If they did, they probably removed them.
> 

Hrm, so it looks like qla1280 sets a 64-bit DMA mask if BITS_PER_LONG == 64.
I've tried using ZONE_DMA or ZONE_DMA32 (but not both together), with no real
luck so far.  During boot, qla1280 seems to have no issues doing DMA for the
disk probing and other actions.  MD and XFS are the drivers that are triggering
the random Oopses when they try to assemble an array or mount the root partition.

Since ZONE_DMA is for the old 24-bit DMA space (16MB), I think for Octane, I
want ZONE_DMA32, but override MAX_DMA32_PFN to be (1UL << (31 - PAGE_SHIFT)).
Still need to figure out how to handle translating between phys and DMA
addresses to handle the 512MB physical address offset imposed by the system's
design.

I think some of the confusion also arises in that Octane provides three
separate groups of "windows" into Crosstalk space via HEART, its system controller:

  - sixteen 16MB "small" windows, 0x000010000000 - 0x00001f000000
  - sixteen 2GB "medium" windows, 0x000800000000 - 0x000f80000000
  - fifteen 64GB "large" windows, 0x001000000000 - 0x00f000000000

The existing Octane code appears to be picking a "default" window setup by the
firmware, which seems to be the large 64GB windows.  I think some kind of
translation layer would be needed to talk to the HEART to dynamically shift
between the three windows.  Although, not sure why you'd need the smaller
windows at all (64GB is big enough for everyone, right?).

Then you've got the Crossbow (XBOW) that the HEART connects to as widget #8,
and that's how it accesses the other widgets, such as Bridge (widget #f) or
Impact video (widget #c).

Bridge grants you three methods of accessing PCI devices:

  - 64-bit direct-mapped DMA addressing (not affected by 31-bit window bug)
  - 32-bit direct-mapped DMA addressing (affected by 31-bit window bug)
  - 32-bit translated addresses, via a type of built-in IOMMU ("ATE")

The ATE is reportedly rather buggy and OpenBSD seems to avoid using it (only
has 128 "internal" translation entries and cannot be updated while DMA is in
progress).  They go for the 32-bit direct-mapped DMA instead.

On Linux, I've got the Bridge driver using direct-mapped 64-bit DMA for Octane
and Onyx2, and that seems to work OK for Onyx2, regardless of installed memory
(8GB).  Octane is where the problems begin if installed memory is >2GB.  So I
suspect this 31-bit bug is Octane-exclusive.

It would probably help if I understood PCI addressing better.  Still confused
over what a BAR is for, and why qla1280 needs three of them (#0, #1, and #6).
Additionally, if qla1280 can do 64-bit DMA using Bridge's 64-bit direct-mapped
mode, and thus dodge the 31-bit bug, I'm puzzled why it's always MD or XFS that
trigger the Oops.

Do software drivers like MD/XFS do their own DMA, or do they use the DMA
provided by the disk driver?

Goal is to at least get the base I/O devices to work right w/ >2GB RAM,
preferably as 64-bit PCI devices.  I can then go back and look at handling
additional Bridge widgets (such as the PCI shoebox or XIO shoehorn adapters).
PCI devices plugged into the shoebox/horn will probably be pure 32-bit devices,
so I'll have to defeat this 31-bit bug somehow.

-- 
Joshua Kinard
Gentoo/MIPS
kumba@xxxxxxxxxx
6144R/F5C6C943 2015-04-27
177C 1972 1FB8 F254 BAD0 3E72 5C63 F4E3 F5C6 C943

"The past tempts us, the present confuses us, the future frightens us.  And our
lives slip away, moment by moment, lost in that vast, terrible in-between."

--Emperor Turhan, Centauri Republic




[Index of Archives]     [Linux MIPS Home]     [LKML Archive]     [Linux ARM Kernel]     [Linux ARM]     [Linux]     [Git]     [Yosemite News]     [Linux SCSI]     [Linux Hams]

  Powered by Linux