Hello,
I have been working to diagnose some issues with IBM x346 servers
when using the onboard Broadcom network cards. The issues occur with
both the bcm5700 driver provided by IBM and the tg3 driver provided by
Red Hat, Broadcom or the vanilla kernel.
The issue is this: our ICMP polling service sends out several thousand
packets over a period of 5 minutes to several thousand different hosts,
and we can see those packets leave the system. When we do an RSPAN of
the port on the Cisco Catalyst 6509, we see the ICMP packets returning
to the switch port, but on the server we never see those packets with
tcpdump.
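As a cross-check on the tcpdump observation, the kernel's own ICMP
counters can be compared before and after a polling run. The following
is only a minimal sketch: it assumes the standard "Icmp:" header and
value lines of /proc/net/snmp with the OutEchos and InEchoReps fields,
and the sample text and function names are illustrative, not real
output from this server.

```python
def parse_icmp_counters(snmp_text):
    """Return a dict of ICMP counters from /proc/net/snmp-style text.

    The file holds two "Icmp:" lines: field names first, then values.
    """
    header = None
    for line in snmp_text.splitlines():
        if not line.startswith("Icmp:"):
            continue
        fields = line.split()[1:]
        if header is None:
            header = fields  # first Icmp: line carries the field names
        else:
            return dict(zip(header, (int(v) for v in fields)))
    raise ValueError("no complete Icmp section found")


def missing_echo_replies(before, after):
    """Echo requests sent minus echo replies seen between two snapshots."""
    sent = after["OutEchos"] - before["OutEchos"]
    received = after["InEchoReps"] - before["InEchoReps"]
    return sent - received


# Abbreviated, made-up snapshots (the real file has many more fields).
SAMPLE_BEFORE = """\
Icmp: InMsgs InErrors InEchoReps OutMsgs OutEchos
Icmp: 1200 0 1150 1300 1300
"""
SAMPLE_AFTER = """\
Icmp: InMsgs InErrors InEchoReps OutMsgs OutEchos
Icmp: 4100 0 4050 6300 6300
"""

if __name__ == "__main__":
    before = parse_icmp_counters(SAMPLE_BEFORE)
    after = parse_icmp_counters(SAMPLE_AFTER)
    print("echo replies missing:", missing_echo_replies(before, after))
```

On the live system the two snapshots would come from reading
/proc/net/snmp before and after a polling cycle. If the kernel counters
show the same gap as tcpdump, that would suggest the replies are being
lost below the IP stack (NIC, driver or bus) rather than dropped by
the polling application.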
After some research I found that many other people are experiencing
similar problems, and since my previous post many people have written
to ask whether I had a resolution. In one way I have a resolution, but
it has been complicated by a further issue, described below.
The resolution for the original problem was to add an Intel
EtherExpress 100 network card to the system and make that the primary
interface. Since doing that, and with BOTH Broadcom network interfaces
disabled, the problem has gone away. No more lag, no issues at all.
I also updated all of the firmware on the server (BIOS, network
firmware and others) to the latest versions. That reduced the problem
slightly, but it was by no means a complete resolution.
Now that I am using an Intel eepro100 card in the server it is working
great. However, we also have a Veritas NetBackup client running on that
server, which backs up the data across a second network interface. That
second network interface is again an onboard Broadcom, this time the
second onboard interface rather than the first. When both onboard
interfaces are disabled the problem is nonexistent. However, when we
enable the onboard eth2 interface for NetBackup, the system starts to
lag even though the routes are all via the eth0 (eepro100) interface.
From those symptoms I am wondering if there could be some sort of
memory, interrupt or DMA issue on the system that Linux is not
reporting correctly. I am interested to know if anyone else has had
similar issues in the past. Could it also be related to the Broadcom
cards being 64-bit devices rather than 32-bit like the eepro100 card?
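One quick way to look at the interrupt side of this is to group the
devices in lspci -v output by IRQ number. The sketch below is an
assumption-laden helper, not a diagnosis: the function names are
hypothetical, the sample is a trimmed excerpt of the lspci -v listing
at the end of this post, and a shared IRQ on its own does not prove
that sharing is the cause of the lag.

```python
import re
from collections import defaultdict

# A device entry starts with a bus address such as "05:00.0".
DEVICE_RE = re.compile(r"^([0-9a-f]{2}:[0-9a-f]{2}\.[0-7]) ")
IRQ_RE = re.compile(r"\bIRQ (\d+)")


def devices_by_irq(lspci_text):
    """Map each IRQ number to the bus addresses that report it."""
    groups = defaultdict(list)
    current = None
    for raw in lspci_text.splitlines():
        line = raw.strip()
        match = DEVICE_RE.match(line)
        if match:
            current = match.group(1)  # remember which device we are in
            continue
        irq = IRQ_RE.search(line)
        if irq and current is not None:
            groups[int(irq.group(1))].append(current)
    return dict(groups)


def shared_irqs(lspci_text):
    """Keep only the IRQs reported by more than one device."""
    return {irq: devs
            for irq, devs in devices_by_irq(lspci_text).items()
            if len(devs) > 1}


# Trimmed excerpt of the lspci -v output included in this post.
SAMPLE = """\
00:1d.0 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #1
Flags: bus master, medium devsel, latency 0, IRQ 16
03:03.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100]
Flags: bus master, medium devsel, latency 64, IRQ 72
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5721
Flags: bus master, fast devsel, latency 0, IRQ 16
06:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5721
Flags: bus master, fast devsel, latency 0, IRQ 16
"""

if __name__ == "__main__":
    for irq, devs in sorted(shared_irqs(SAMPLE).items()):
        print("IRQ", irq, "shared by", ", ".join(devs))
```

In this excerpt both Broadcom interfaces and the first USB UHCI
controller report IRQ 16, while the eepro100 card has IRQ 72 to itself.
The view in /proc/interrupts on the running kernel would be the more
authoritative place to confirm what is actually shared.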
Here is a copy of my lspci -v output for further hardware
investigation. Any ideas would be great to hear so that I can hopefully
resolve these outstanding issues once and for all.
===SNIP===
lspci -v
00:00.0 Host bridge: Intel Corp. Server Memory Controller Hub (rev 0a)
Subsystem: IBM: Unknown device 02dd
Flags: bus master, fast devsel, latency 0
Memory at <ignored> (32-bit, non-prefetchable) [size=4K]
Capabilities: [40] #09 [4105]
00:00.1 Class ff00: Intel Corp. Memory Controller Hub Error Reporting
Register (rev 0a)
Subsystem: IBM: Unknown device 02dd
Flags: fast devsel
00:02.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port
A0 (rev 0a) (prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=00, secondary=02, subordinate=04, sec-latency=0
I/O behind bridge: 00004000-00004fff
Memory behind bridge: ce000000-cfffffff
Capabilities: [50] Power Management version 2
Capabilities: [58] Message Signalled Interrupts: 64bit-
Queue=0/1 Enable-
Capabilities: [64] #10 [0041]
00:04.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port
B0 (rev 0a) (prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=00, secondary=05, subordinate=05, sec-latency=0
Memory behind bridge: cc000000-cdffffff
Capabilities: [50] Power Management version 2
Capabilities: [58] Message Signalled Interrupts: 64bit-
Queue=0/1 Enable-
Capabilities: [64] #10 [0041]
00:05.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port
B1 (rev 0a) (prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=00, secondary=06, subordinate=06, sec-latency=0
Memory behind bridge: ca000000-cbffffff
Capabilities: [50] Power Management version 2
Capabilities: [58] Message Signalled Interrupts: 64bit-
Queue=0/1 Enable-
Capabilities: [64] #10 [0041]
00:06.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port
C0 (rev 0a) (prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=00, secondary=07, subordinate=09, sec-latency=0
I/O behind bridge: 00005000-0000ffff
Memory behind bridge: c8000000-c9ffffff
Capabilities: [50] Power Management version 2
Capabilities: [58] Message Signalled Interrupts: 64bit-
Queue=0/1 Enable-
Capabilities: [64] #10 [0041]
00:08.0 System peripheral: Intel Corp. Memory Controller Hub Extended
Configuration Registers (rev 0a)
Subsystem: IBM: Unknown device 02dd
Flags: fast devsel
00:1d.0 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #1
(rev 02) (prog-if 00 [UHCI])
Subsystem: IBM: Unknown device 02dd
Flags: bus master, medium devsel, latency 0, IRQ 16
I/O ports at 2200 [size=32]
00:1d.1 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #2
(rev 02) (prog-if 00 [UHCI])
Subsystem: IBM: Unknown device 02dd
Flags: bus master, medium devsel, latency 0, IRQ 19
I/O ports at 2600 [size=32]
00:1d.7 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB2 EHCI
Controller (rev 02) (prog-if 20 [EHCI])
Subsystem: IBM: Unknown device 02dd
Flags: bus master, medium devsel, latency 0, IRQ 23
Memory at f0000000 (32-bit, non-prefetchable) [size=1K]
Capabilities: [50] Power Management version 2
Capabilities: [58] #0a [20a0]
00:1e.0 PCI bridge: Intel Corp. 82801BA/CA/DB/EB/ER Hub interface to PCI
Bridge (rev c2) (prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=00, secondary=01, subordinate=01, sec-latency=32
I/O behind bridge: 00003000-00003fff
Memory behind bridge: de000000-dfffffff
Prefetchable memory behind bridge: d0000000-ddffffff
00:1f.0 ISA bridge: Intel Corp. 82801EB/ER (ICH5/ICH5R) LPC Bridge (rev 02)
Flags: bus master, medium devsel, latency 0
00:1f.1 IDE interface: Intel Corp. 82801EB/ER (ICH5/ICH5R) Ultra ATA 100
Storage Controller (rev 02) (prog-if 8a [Master SecP PriP])
Subsystem: IBM: Unknown device 02dd
Flags: bus master, medium devsel, latency 0, IRQ 17
I/O ports at <ignored>
I/O ports at <ignored>
I/O ports at <ignored>
I/O ports at <ignored>
I/O ports at 0480 [size=16]
Memory at d8000000 (32-bit, non-prefetchable) [size=1K]
00:1f.3 SMBus: Intel Corp. 82801EB/ER (ICH5/ICH5R) SMBus Controller (rev
02)
Subsystem: IBM: Unknown device 02dd
Flags: medium devsel, IRQ 17
I/O ports at 0440 [size=32]
01:06.0 VGA compatible controller: ATI Technologies Inc Radeon RV100 QY
[Radeon 7000/VE] (prog-if 00 [VGA])
Subsystem: IBM: Unknown device 02c8
Flags: bus master, stepping, medium devsel, latency 64, IRQ 20
Memory at d0000000 (32-bit, prefetchable) [size=128M]
I/O ports at 3000 [size=256]
Memory at dfff0000 (32-bit, non-prefetchable) [size=64K]
Expansion ROM at <unassigned> [disabled] [size=128K]
Capabilities: [50] Power Management version 2
02:00.0 PCI bridge: Intel Corp. PCI Bridge Hub A (rev 09) (prog-if 00
[Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=02, secondary=03, subordinate=03, sec-latency=48
I/O behind bridge: 00004000-00004fff
Memory behind bridge: ce000000-cfffffff
Capabilities: [44] #10 [0071]
Capabilities: [5c] Message Signalled Interrupts: 64bit+
Queue=0/0 Enable-
Capabilities: [6c] Power Management version 2
Capabilities: [d8] PCI-X non-bridge device.
02:00.2 PCI bridge: Intel Corp. PCI Bridge Hub B (rev 09) (prog-if 00
[Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=02, secondary=04, subordinate=04, sec-latency=64
Capabilities: [44] #10 [0071]
Capabilities: [5c] Message Signalled Interrupts: 64bit+
Queue=0/0 Enable-
Capabilities: [6c] Power Management version 2
Capabilities: [d8] PCI-X non-bridge device.
03:03.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100]
(rev 0c)
Subsystem: IBM 10/100 Ethernet Server Adapter
Flags: bus master, medium devsel, latency 64, IRQ 72
Memory at cffff000 (32-bit, non-prefetchable) [size=4K]
I/O ports at 4000 [size=64]
Memory at cffc0000 (32-bit, non-prefetchable) [size=128K]
Expansion ROM at <unassigned> [disabled] [size=64K]
Capabilities: [dc] Power Management version 2
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5721
Gigabit Ethernet PCI Express (rev 01)
Subsystem: IBM: Unknown device 02c6
Flags: bus master, fast devsel, latency 0, IRQ 16
Memory at cdff0000 (64-bit, non-prefetchable) [size=64K]
Capabilities: [48] Power Management version 2
Capabilities: [50] Vital Product Data
Capabilities: [58] Message Signalled Interrupts: 64bit+
Queue=0/3 Enable-
Capabilities: [d0] #10 [0001]
06:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5721
Gigabit Ethernet PCI Express (rev 01)
Subsystem: IBM: Unknown device 02c6
Flags: bus master, fast devsel, latency 0, IRQ 16
Memory at cbff0000 (64-bit, non-prefetchable) [size=64K]
Capabilities: [48] Power Management version 2
Capabilities: [50] Vital Product Data
Capabilities: [58] Message Signalled Interrupts: 64bit+
Queue=0/3 Enable-
Capabilities: [d0] #10 [0001]
07:00.0 PCI bridge: Intel Corp. 80332 [Dobson] I/O processor (rev 07)
(prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=07, secondary=08, subordinate=08, sec-latency=64
Memory behind bridge: c8000000-c9ffffff
Capabilities: [44] #10 [0071]
Capabilities: [5c] Message Signalled Interrupts: 64bit+
Queue=0/0 Enable-
Capabilities: [6c] Power Management version 2
Capabilities: [d8] PCI-X non-bridge device.
07:00.2 PCI bridge: Intel Corp. 80332 [Dobson] I/O processor (rev 07)
(prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 0
Bus: primary=07, secondary=09, subordinate=09, sec-latency=64
I/O behind bridge: 00005000-0000ffff
Capabilities: [44] #10 [0071]
Capabilities: [5c] Message Signalled Interrupts: 64bit+
Queue=0/0 Enable-
Capabilities: [6c] Power Management version 2
Capabilities: [d8] PCI-X non-bridge device.
08:0e.0 RAID bus controller: Adaptec ServeRAID Controller (rev 07)
Subsystem: IBM: Unknown device 028e
Flags: bus master, stepping, 66Mhz, medium devsel, latency 64,
IRQ 38
Memory at c9fff000 (32-bit, non-prefetchable) [size=4K]
Expansion ROM at <unassigned> [disabled] [size=512K]
Capabilities: [c0] Power Management version 2
Capabilities: [d0] Message Signalled Interrupts: 64bit+
Queue=0/1 Enable-
Capabilities: [e0] PCI-X non-bridge device.
===SNIP===
Regards,
Jimmy Stewpot.