Re: SGI Atix 4700 Help

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I have this issue with all the Kernels that I have built that are
>=3.2 within Wheezy. I have one working 3.4.49 Kernel that works but
it only has support for 64 CPUs (First Kernel I built when upgrading
Squeeze to Wheezy).  You do raise a good point thought.  I suppose I
could distrust the SGI memory test when the machine powers up and test
all 64 sticks of memory individually.

On Tue, Aug 20, 2013 at 5:30 PM, Émeric MASCHINO
<emeric.maschino@xxxxxxxxx> wrote:
> Hi,
>
> Do you get this kind of error with all Linux kernels, or only >= 3.2?
> It seems to me that you're experiencing memory problem, so I'm
> wondering whether this could be a hardware issue (bad memory DIMM)
> rather than a problem with Linux.
> Maybe Linux kernels >= 3.2 test or stress memory in a way that trigger
> this hardware fault?
>
>      Emeric
>
> 2013/8/21 Barclay Jameson <almightybeeij@xxxxxxxxx>:
>> I had another go at compiling the ia64 Kernel again on the SGI Altix
>> 4700 this time with passing the  O1 flag.
>> It at least gave me some output that might be helpful. Anyone have any ideas?
>>
>> ELILO v3.14 for EFI/IA-64
>> ..
>> Loading \EFI\debian\vmlinuz-3.4.49...Loading Linux... Attempting to
>> relocate kernel...done
>> Loading file \EFI\debian\initrd.img-3.4.49...done
>> [    0.000000] Initializing cgroup subsys cpuset
>> [    0.000000] Initializing cgroup subsys cpu
>> [    0.000000] Linux version 3.4.49 (beeij@debian) (gcc version 4.6.3
>> (Debian 4.6.3-14) ) #10 SMP Tue Aug 20 16:15:16 CDT 2013
>> [    0.000000] EFI v1.10 by INTEL: SALsystab=0x1802c26190 ACPI 2.0=0x1802c26280
>> [    0.000000] booting generic kernel on platform sn2
>> [    0.000000] console [sn_sal0] enabled
>> [    0.000000] ACPI: RSDP 0000001802c26280 00024 (v02    SGI)
>> [    0.000000] ACPI: XSDT 0000001802c2a740 00044 (v01    SGI  XSDTSN2
>> 00010001    ? 0000007C)
>> [    0.000000] ACPI: APIC 0000001802c26af0 0032C (v01    SGI  APICSN2
>> 00010001    ? 00000001)
>> [    0.000000] ACPI: SRAT 0000001802c26e30 006B0 (v01    SGI  SRATSN2
>> 00010001    ? 00000001)
>> [    0.000000] ACPI: SLIT 0000001802c274f0 0012C (v01    SGI  SLITSN2
>> 00010001    ? 00000001)
>> [    0.000000] ACPI: FACP 0000001802c27680 000F4 (v03    SGI  FACPSN2
>> 00030001    ? 00000001)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in Pm1aEventBlock:
>> 32/0 (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in
>> Pm1aControlBlock: 16/0 (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in PmTimerBlock:
>> 32/0 (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in Gpe0Block: 64/0
>> (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: Invalid length for Pm1aEventBlock: 0,
>> using default 32 (20120320/tbfadt-629)
>> [    0.000000] ACPI Warning: Invalid length for Pm1aControlBlock: 0,
>> using default 16 (20120320/tbfadt-629)
>> [    0.000000] ACPI Warning: Invalid length for PmTimerBlock: 0, using
>> default 32 (20120320/tbfadt-629)
>> [    0.000000] ACPI: DSDT 0000001802c29750 00024 (v02    SGI  DSDTSN2
>> 00020001    ? 0000088B)
>> [    0.000000] ACPI: FACS 0000001802c27630 00040
>> [    0.000000] ACPI: Local APIC address c0000000fee00000
>> [    0.000000] 64 CPUs available, 64 CPUs total
>> [    0.000000] Number of logical nodes in system = 16
>> [    0.000000] Number of memory chunks in system = 16
>> [    0.000000] SMP: Allowing 64 CPUs, 0 hotplug CPUs
>> [    0.000000] Initial ramdisk at: 0xe00003daf517e000 (19257232 bytes)
>> [    0.000000] SAL 3.2: SGI SN2 version 1.54
>> [    0.000000] SAL Platform features: ITC_Drift
>> [    0.000000] SAL: AP wakeup using external interrupt vector 0x12
>> [    0.000000] MCA related initialization done
>> [    0.000000] ACPI: RSDP 0000001802c26280 00024 (v02    SGI)
>> [    0.000000] ACPI: XSDT 0000001802c2a740 0007C (v01    SGI  XSDTSN2
>> 00010001    ? 0000007C)
>> [    0.000000] ACPI: APIC 0000001802c26af0 0032C (v01    SGI  APICSN2
>> 00010001    ? 00000001)
>> [    0.000000] ACPI: SRAT 0000001802c26e30 006B0 (v01    SGI  SRATSN2
>> 00010001    ? 00000001)
>> [    0.000000] ACPI: SLIT 0000001802c274f0 0012C (v01    SGI  SLITSN2
>> 00010001    ? 00000001)
>> [    0.000000] ACPI: FACP 0000001802c27680 000F4 (v03    SGI  FACPSN2
>> 00030001    ? 00000001)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in Pm1aEventBlock:
>> 32/0 (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in
>> Pm1aControlBlock: 16/0 (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in PmTimerBlock:
>> 32/0 (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: 32/64X length mismatch in Gpe0Block: 64/0
>> (20120320/tbfadt-548)
>> [    0.000000] ACPI Warning: Invalid length for Pm1aEventBlock: 0,
>> using default 32 (20120320/tbfadt-629)
>> [    0.000000] ACPI Warning: Invalid length for Pm1aControlBlock: 0,
>> using default 16 (20120320/tbfadt-629)
>> [    0.000000] ACPI Warning: Invalid length for PmTimerBlock: 0, using
>> default 32 (20120320/tbfadt-629)
>> [    0.000000] ACPI: DSDT 0000001802c29750 0088B (v02    SGI  DSDTSN2
>> 00020101    ? 0000088B)
>> [    0.000000] ACPI: FACS 0000001802c27630 00040
>> [    0.000000] ACPI: SSDT 0000001802c2a1e0 00095 (v02    SGI  SSDTSN2
>> 00020101    ? 00000095)
>> [    0.000000] ACPI: SSDT 0000001802c2a2f0 000F5 (v02    SGI  SSDTSN2
>> 00020101    ? 000000F5)
>> [    0.000000] ACPI: SSDT 0000001802c2a400 001F2 (v02    SGI  SSDTSN2
>> 00020101    ? 000001F2)
>> [    0.000000] ACPI: SSDT 0000001802c29ff0 00095 (v02    SGI  SSDTSN2
>> 00020101    ? 00000095)
>> [    0.000000] ACPI: SSDT 0000001802c2a610 0007E (v02    SGI  SSDTSN2
>> 00020101    ? 0000007E)
>> [    0.000000] ACPI: SSDT 0000001802c2a7d0 00139 (v02    SGI  SSDTSN2
>> 00020101    ? 00000139)
>> [    0.000000] ACPI: SSDT 0000001802c2a6a0 00090 (v02    SGI  SSDTSN2
>> 00020101    ? 00000090)
>> [    0.000000] SGI SAL version 1.54
>> [    0.000000] Virtual mem_map starts at 0xa0007ffca0600000
>> [    0.000000] Zone PFN ranges:
>> [    0.000000]   DMA      0x00600c00 -> 0x1000000000
>> [    0.000000]   Normal   empty
>> [    0.000000] Movable zone start PFN for each node
>> [    0.000000] Early memory PFN ranges
>> [    0.000000]     0: 0x00600c00 -> 0x0063e000
>> [    0.000000]     0: 0x00680000 -> 0x006bdfff
>> [    0.000000]     1: 0x01600c00 -> 0x0163e000
>> [    0.000000]     1: 0x01680000 -> 0x016be000
>> [    0.000000]     2: 0x02600c00 -> 0x0263e000
>> [    0.000000]     2: 0x02680000 -> 0x026be000
>> [    0.000000]     3: 0x03600c00 -> 0x0363e000
>> [    0.000000]     3: 0x03680000 -> 0x036be000
>> [    0.000000]     4: 0x04600c00 -> 0x0463e000
>> [    0.000000]     4: 0x04680000 -> 0x046be000
>> [    0.000000]     5: 0x05600c00 -> 0x0563e000
>> [    0.000000]     5: 0x05680000 -> 0x056be000
>> [    0.000000]     6: 0x06600c00 -> 0x0663e000
>> [    0.000000]     6: 0x06680000 -> 0x066bdfff
>> [    0.000000]     7: 0x07600c00 -> 0x0763e000
>> [    0.000000]     7: 0x07680000 -> 0x076be000
>> [    0.000000]     8: 0x08600c00 -> 0x0863e000
>> [    0.000000]     8: 0x08680000 -> 0x086be000
>> [    0.000000]     9: 0x09600c00 -> 0x0963e000
>> [    0.000000]     9: 0x09680000 -> 0x096be000
>> [    0.000000]    10: 0x0a600c00 -> 0x0a63e000
>> [    0.000000]    10: 0x0a680000 -> 0x0a6be000
>> [    0.000000]    11: 0x0b600c00 -> 0x0b63e000
>> [    0.000000]    11: 0x0b680000 -> 0x0b6be000
>> [    0.000000]    12: 0x0c600c00 -> 0x0c63e000
>> [    0.000000]    12: 0x0c680000 -> 0x0c6be000
>> [    0.000000]    13: 0x0d600c00 -> 0x0d63e000
>> [    0.000000]    13: 0x0d680000 -> 0x0d6be000
>> [    0.000000]    14: 0x0e600c00 -> 0x0e63e000
>> [    0.000000]    14: 0x0e680000 -> 0x0e6bdfff
>> [    0.000000]    15: 0x0f600c00 -> 0x0f63e000
>> [    0.000000]    15: 0x0f680000 -> 0x0f6bd9ff
>> [    0.000000]    15: 0x0f6bde00 -> 0x0f6bdf56
>> [    0.000000]    15: 0x0f6bdf65 -> 0x0f6bdf84
>> [    0.000000]    15: 0x0f6bdfa0 -> 0x0f6bdfb9
>> [    0.000000] Built 16 zonelists in Node order, mobility grouping on.
>>  Total pages: 8033770
>> [    0.000000] Policy zone: DMA
>> [    0.000000] Kernel command line:
>> BOOT_IMAGE=scsi1:/EFI/debian/vmlinuz-3.4.49 root=/dev/md0  ro
>> [    0.000000] PID hash table entries: 4096 (order: 1, 32768 bytes)
>> [    0.000000] Memory: 128722256k/129185072k available (7956k code,
>> 496464k reserved, 4790k data, 816k init)
>> [    0.000000] SLUB: Genslabs=17, HWalign=128, Order=0-3,
>> MinObjects=0, CPUs=64, Nodes=256
>> [    0.000000] Hierarchical RCU implementation.
>> [    0.000000]     CONFIG_RCU_FANOUT set to non-default value of 32
>> [    0.000000] NR_IRQS:1024
>> [    0.000000] ACPI: Local APIC address c0000000fee00000
>> [    0.000000] register_intr: No IOSAPIC for GSI 52
>> [    0.000000] WARNING: Persistent clock returned invalid value!
>> [    0.000000]          Check your CMOS/BIOS settings.
>> [    0.000000] Console: colour dummy device 80x25
>> [    0.000000] console [ttySG0] enabled
>> [    0.000000] console [ttySG0] enabled
>> [    0.044000] Calibrating delay loop... 3182.59 BogoMIPS (lpj=6365184)
>> [    0.065688] pid_max: default: 65536 minimum: 512
>> [    0.073194] Security Framework initialized
>> [    0.084011] SELinux:  Disabled at boot.
>> [    0.112172] Dentry cache hash table entries: 16777216 (order: 13,
>> 134217728 bytes)
>> [    0.387579] Inode-cache hash table entries: 8388608 (order: 12,
>> 67108864 bytes)
>> [    0.523179] Mount-cache hash table entries: 1024
>> [    0.528217] Initializing cgroup subsys cpuacct
>> [    0.532005] Initializing cgroup subsys devices
>> [    0.544004] Initializing cgroup subsys freezer
>> [    0.560004] Initializing cgroup subsys net_cls
>> [    0.568224] ACPI: Core revision 20120320
>> [    0.581597] Boot processor id 0x0/0x0
>> [    0.040000] Fixed BSP b0 value from CPU 1
>> [    0.659422] Brought up 64 CPUs
>> [    0.660031] Total of 64 processors activated (203685.88 BogoMIPS).
>> [    0.706196] devtmpfs: initialized
>> [    0.724306] DMI not present or invalid.
>> [    0.726680] dummy:
>> [    0.732470] NET: Registered protocol family 16
>> [    0.740147] ACPI: bus type pci registered
>> [    0.752148] ACPI  DSDT OEM Rev 0x20101
>> [    0.782373] bio: create slab <bio-0> at 0
>> [    0.785459] ACPI: Added _OSI(Module Device)
>> [    0.792003] ACPI: Added _OSI(Processor Device)
>> [    0.804004] ACPI: Added _OSI(3.0 _SCP Extensions)
>> [    0.820003] ACPI: Added _OSI(Processor Aggregator Device)
>> [    0.828294] ACPI: SCI (ACPI GSI 52) not registered
>> [    0.845126] ACPI: Interpreter enabled
>> [    0.860003] ACPI: (supports S0)
>> [    0.872000] ACPI: Using platform specific model for interrupt routing
>> [    0.881647] ACPI: No dock devices found.
>> [    0.892058] [Firmware Bug]: ACPI: no secondary bus range in _CRS
>> [    0.904010] ACPI: PCI Root Bridge [P000] (domain 0002 [bus 00-ff])
>> [    0.916043] pci_root PNP0A03:00: host bridge window [mem
>> 0x2010200000-0x20103fffff] (PCI address [0x200000-0x3fffff])
>> [    0.928008] pci_root PNP0A03:00: host bridge window [mem
>> 0x2010400000-0x20105fffff] (PCI address [0x400000-0x5fffff])
>> [    0.944005] pci_root PNP0A03:00: host bridge window [mem
>> 0x2010600000-0x20106fffff] (PCI address [0x600000-0x6fffff])
>> [    0.956005] pci_root PNP0A03:00: host bridge window [mem
>> 0x2180700000-0x21bffeffff] (PCI address [0x700000-0x3ffeffff])
>> [    0.980007] pci_root PNP0A03:00: host bridge window [mem
>> 0x2180000000-0x21800fffff] (PCI address [0x0-0xfffff])
>> [    0.992041] PCI host bridge to bus 0002:00
>> [    1.000008] pci_bus 0002:00: root bus resource [mem
>> 0x2010200000-0x20103fffff] (bus address [0x00200000-0x003fffff])
>> [    1.016005] pci_bus 0002:00: root bus resource [mem
>> 0x2010400000-0x20105fffff] (bus address [0x00400000-0x005fffff])
>> [    1.036006] pci_bus 0002:00: root bus resource [mem
>> 0x2010600000-0x20106fffff] (bus address [0x00600000-0x006fffff])
>> [    1.048006] pci_bus 0002:00: root bus resource [mem
>> 0x2180700000-0x21bffeffff] (bus address [0x00700000-0x3ffeffff])
>> [    1.064005] pci_bus 0002:00: root bus resource [mem
>> 0x2180000000-0x21800fffff] (bus address [0x00000000-0x000fffff])
>> [    1.081959]  pci0002:00: Requesting ACPI _OSC control (0x1d)
>> [    1.092007]  pci0002:00: ACPI _OSC request failed (AE_NOT_FOUND),
>> returned control mask: 0x1d
>> [    1.108002] ACPI _OSC control for PCIe not granted, disabling ASPM
>> [    1.120074] [Firmware Bug]: ACPI: no secondary bus range in _CRS
>> [    1.128007] ACPI: PCI Root Bridge [P001] (domain 0001 [bus 00-ff])
>> [    1.140036] pci_root PNP0A03:01: host bridge window [mem
>> 0x2000200000-0x20003fffff] (PCI address [0x200000-0x3fffff])
>> [    1.156005] pci_root PNP0A03:01: host bridge window [mem
>> 0x2000400000-0x20005fffff] (PCI address [0x400000-0x5fffff])
>> [    1.172005] pci_root PNP0A03:01: host bridge window [mem
>> 0x2000600000-0x20006fffff] (PCI address [0x600000-0x6fffff])
>> [    1.184007] pci_root PNP0A03:01: host bridge window [io
>> 0x1000000-0x10fffff] (PCI address [0x0-0xfffff])
>> [    1.196008] pci_root PNP0A03:01: host bridge window [mem
>> 0x21c0700000-0x21fffeffff] (PCI address [0x700000-0x3ffeffff])
>> [    1.208010] pci_root PNP0A03:01: host bridge window [mem
>> 0x21c0000000-0x21c00fffff] (PCI address [0x0-0xfffff])
>> [    1.220046] PCI host bridge to bus 0001:00
>> [    1.236005] pci_bus 0001:00: root bus resource [mem
>> 0x2000200000-0x20003fffff] (bus address [0x00200000-0x003fffff])
>> [    1.248005] pci_bus 0001:00: root bus resource [mem
>> 0x2000400000-0x20005fffff] (bus address [0x00400000-0x005fffff])
>> [    1.260006] pci_bus 0001:00: root bus resource [mem
>> 0x2000600000-0x20006fffff] (bus address [0x00600000-0x006fffff])
>> [    1.272005] pci_bus 0001:00: root bus resource [io
>> 0x1000000-0x10fffff] (bus address [0x0000-0xfffff])
>> [    1.288005] pci_bus 0001:00: root bus resource [mem
>> 0x21c0700000-0x21fffeffff] (bus address [0x00700000-0x3ffeffff])
>> [    1.304005] pci_bus 0001:00: root bus resource [mem
>> 0x21c0000000-0x21c00fffff] (bus address [0x00000000-0x000fffff])
>> [    1.334427] pci 0001:00:03.0: PCI bridge to [bus 01-01]
>> [    1.336601]  pci0001:00: Requesting ACPI _OSC control (0x1d)
>> [    1.348005]  pci0001:00: ACPI _OSC request failed (AE_NOT_FOUND),
>> returned control mask: 0x1d
>> [    1.364002] ACPI _OSC control for PCIe not granted, disabling ASPM
>> [    1.376640] [Firmware Bug]: ACPI: no secondary bus range in _CRS
>> [    1.388008] ACPI: PCI Root Bridge [P000] (domain 0011 [bus 00-ff])
>> [    1.400031] pci_root PNP0A03:02: host bridge window [mem
>> 0x6200000000-0x67ffffffff] (PCI address [0x0-0x5ffffffff])
>> [    1.412009] pci_root PNP0A03:02: host bridge window [io
>> 0x2000000-0x2ffffff] (PCI address [0x0-0xffffff])
>> [    1.428045] PCI host bridge to bus 0011:00
>> [    1.444005] pci_bus 0011:00: root bus resource [mem
>> 0x6200000000-0x67ffffffff] (bus address [0x00000000-0x5ffffffff])
>> [    1.456007] pci_bus 0011:00: root bus resource [io
>> 0x2000000-0x2ffffff] (bus address [0x0000-0xffffff])
>> [    1.473755] pci 0011:00:01.0: PCI bridge to [bus 01-01]
>> [    1.480452] pci 0011:00:02.0: PCI bridge to [bus 02-02]
>> [    1.492243]  pci0011:00: Requesting ACPI _OSC control (0x1d)
>> [    1.500004]  pci0011:00: ACPI _OSC request failed (AE_NOT_FOUND),
>> returned control mask: 0x1d
>> [    1.516002] ACPI _OSC control for PCIe not granted, disabling ASPM
>> [    1.536571] vgaarb: loaded
>> [    1.544413] Switching to clocksource sn2_rtc
>> [    1.560324] pnp: PnP ACPI init
>> [    1.569151] ACPI: bus type pnp registered
>> [    1.578709] pnp: PnP ACPI: found 3 devices
>> [    1.594314] ACPI: ACPI bus type pnp unregistered
>> [    1.610913] NET: Registered protocol family 2
>> [    1.616848] IP route cache hash table entries: 524288 (order: 8,
>> 4194304 bytes)
>> [    1.631962] TCP established hash table entries: 524288 (order: 9,
>> 8388608 bytes)
>> [    1.656604] TCP bind hash table entries: 65536 (order: 6, 1048576 bytes)
>> [    1.662448] TCP: Hash tables configured (established 524288 bind 65536)
>> [    1.669308] TCP: reno registered
>> [    1.685228] UDP hash table entries: 65536 (order: 7, 2097152 bytes)
>> [    1.704860] UDP-Lite hash table entries: 65536 (order: 7, 2097152 bytes)
>> [    1.718780] NET: Registered protocol family 1
>> [    1.831029] Unpacking initramfs...
>> [    2.559641] Freeing initrd memory: 18784kB freed
>> [    2.562393] perfmon: version 2.0 IRQ 238
>> [    2.569242] perfmon: Montecito PMU detected, 27 PMCs, 35 PMDs, 12
>> counters (47 bits)
>> [    2.609465] perfmon: added sampling format default_format
>> [    2.612017] perfmon_default_smpl: default_format v2.0 registered
>> [    2.941690] audit: initializing netlink socket (disabled)
>> [    2.944540] type=2000 audit(2.940:1): initialized
>> [    3.053549] HugeTLB registered 256 MB page size, pre-allocated 0 pages
>> [    3.060106] VFS: Disk quotas dquot_6.5.2
>> [    3.063376] Dquot-cache hash table entries: 2048 (order 0, 16384 bytes)
>> [    3.077548] msgmni has been set to 32768
>> [    3.085721] Block layer SCSI generic (bsg) driver version 0.4
>> loaded (major 253)
>> [    3.101259] io scheduler noop registered
>> [    3.114227] io scheduler deadline registered
>> [    3.124480] io scheduler cfq registered (default)
>> [    3.138410] input: Power Button as
>> /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0
>> [    3.151399] ACPI: Power Button [PWRF]
>> [    3.164463] input: Sleep Button as
>> /devices/LNXSYSTM:00/LNXSLPBN:00/input/input1
>> [    3.175828] ACPI: Sleep Button [SLPF]
>> [    3.190913] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
>> [    3.206082] sn_console: Console driver init
>> [    3.219494] ttySG0 at I/O 0x0 (irq = 0) is a SGI SN L1
>> [    3.300195] Linux agpgart interface v0.103
>> [    3.303450] mousedev: PS/2 mouse device common for all mice
>> [    3.311532] rtc-efi rtc-efi: rtc core: registered rtc-efi as rtc0
>> [    3.324627] TCP: cubic registered
>> [    3.334577] NET: Registered protocol family 17
>> [    3.344580] Registering the dns_resolver key type
>> [    3.358790] registered taskstats version 1
>> [    3.371272] rtc-efi rtc-efi: setting system clock to 2013-08-20
>> 21:27:57 UTC (1377034077)
>> [    3.381022] Freeing unused kernel memory: 816kB freedLoading, please wait...
>> [    3.420446] kernel unaligned access to 0xe000005a80008014,
>> ip=0xa00000010020df90
>> [    3.423708] Unable to handle kernel paging request at virtual
>> address 8000800000000018
>> [    3.427693] udevd[247]: Oops 8813272891392 [1]
>> [    3.427693] Modules linked in:
>> [    3.427693]
>> [    3.427693] Pid: 247, CPU 17, comm:                udevd
>> [    3.427693] psr : 0000101008522030 ifs : 8000000000000001 ip  :
>> [<a000000100236f40>]    Not tainted (3.4.49)
>> [    3.427693] ip is at mntget+0x20/0xa0
>> [    3.427693] unat: 0000000000000000 pfs : 0000000000000286 rsc :
>> 0000000000000003
>> [    3.427693] rnat: 000000000000003c bsps: 0000000000000038 pr  :
>> 000000000001c299
>> [    3.427693] ldrs: 0000000000000000 ccv : 0000000000000000 fpsr:
>> 0009804c0270033f
>> [    3.427693] csd : 0000000000000000 ssd : 0000000000000000
>> [    3.427693] b0  : a00000010020dfb0 b6  : a000000100309c40 b7  :
>> a0000001000102d0
>> [    3.427693] f6  : 000000000000000000000 f7  : 000000000000000000000
>> [    3.427693] f8  : 000000000000000000000 f9  : 000000000000000000000
>> [    3.427693] f10 : 000000000000000000000 f11 : 000000000000000000000
>> [    3.427693] r1  : a000000100e58960 r2  : 0000000000c80000 r3  :
>> 0000000000000064
>> [    3.427693] r8  : 8000800000000000 r9  : a000000100c05370 r10 :
>> e000009805e4fd98
>> [    3.427693] r11 : e000005a80008000 r12 : e000009805e4fd40 r13 :
>> e000009805e48000
>> [    3.427693] r14 : 0000000000c80064 r15 : 0000001008526030 r16 :
>> 0000000000c80064
>> [    3.427693] r17 : 0000000000000000 r18 : 8000800000000018 r19 :
>> e000009805e4fd98
>> [    3.427693] r20 : 0000000000000000 r21 : e000009805e4fd50 r22 :
>> e000009805e4fdcc
>> [    3.427693] r23 : 0000000000000001 r24 : 0000000000000044 r25 :
>> e0000118031462f8
>> [    3.427693] r26 : fffffffffffc62f8 r27 : fffffffffffc62f8 r28 :
>> e000011803180000
>> [    3.427693] r29 : fffffffffffc62f0 r30 : 0000000000000063 r31 :
>> 0000000000000063
>> [    3.427693]
>> [    3.427693] Call Trace:
>> [    3.427693]  [<a000000100014b00>] show_stack+0x40/0x90
>> [    3.427693]                                 sp=e000009805e4f910
>> bsp=e000009805e49108
>> [    3.427693]  [<a000000100015370>] show_regs+0x7d0/0x900
>> [    3.427693]                                 sp=e000009805e4fae0
>> bsp=e000009805e49098
>> [    3.427693]  [<a00000010003b8c0>] die+0x1c0/0x320
>> [    3.427693]                                 sp=e000009805e4fae0
>> bsp=e000009805e49058
>> [    3.427693]  [<a0000001007c0060>] ia64_do_page_fault+0xa00/0xae0
>> [    3.427693]                                 sp=e000009805e4fae0
>> bsp=e000009805e49008
>> [    3.427693]  [<a00000010000bdc0>] ia64_native_leave_kernel+0x0/0x270
>> [    3.427693]                                 sp=e000009805e4fb70
>> bsp=e000009805e49008
>> [    3.427693]  [<a000000100236f40>] mntget+0x20/0xa0
>> [    3.427693]                                 sp=e000009805e4fd40
>> bsp=e000009805e49000
>> [    3.427693]  [<a00000010020dfb0>] path_get+0x30/0xe0
>> [    3.427693]                                 sp=e000009805e4fd40
>> bsp=e000009805e48fd0
>> [    3.427693]  [<a000000100214860>] path_init+0x7c0/0x880
>> [    3.427693]                                 sp=e000009805e4fd40
>> bsp=e000009805e48f88
>> [    3.427693]  [<a000000100214970>] path_lookupat+0x50/0x1180
>> [    3.427693]                                 sp=e000009805e4fd50
>> bsp=e000009805e48e88
>> [    3.427693]  [<a000000100215ad0>] do_path_lookup+0x30/0x180
>> [    3.427693]                                 sp=e000009805e4fd80
>> bsp=e000009805e48e48
>> [    3.427693]  [<a000000100215c50>] kern_path_create+0x30/0x280
>> [    3.427693]                                 sp=e000009805e4fd80
>> bsp=e000009805e48e08
>> [    3.427693]  [<a000000100219200>] user_path_create+0x60/0xc0
>> [    3.427693]                                 sp=e000009805e4fe20
>> bsp=e000009805e48dc0
>> [    3.427693]  [<a00000010021a9d0>] sys_symlinkat+0x70/0x1e0
>> [    3.427693]                                 sp=e000009805e4fe20
>> bsp=e000009805e48d50
>> [    3.427693]  [<a00000010000bc40>] ia64_ret_from_syscall+0x0/0x20
>> [    3.427693]                                 sp=e000009805e4fe30
>> bsp=e000009805e48d50
>> [    3.427693]  [<a000000000040720>] ia64_ivt+0xffffffff00040720/0x400
>> [    3.427693]                                 sp=e000009805e50000
>> bsp=e000009805e48d50
>> [    3.427693] Disabling lock debugging due to kernel taint
>> [   63.422259] INFO: rcu_sched self-detected stall on CPU { 17}
>> [   63.427883] INFO: rcu_sched detected stalls on CPUs/tasks: { 17}
>> (detected by 51, t=15002 jiffies)
>> [   63.427883] INFO: Stall ended before state dump start
>> [   63.422259]  (t=15010 jiffies)
>> [   63.422259]
>> [   63.422259] Call Trace:
>> [   63.422259]  [<a000000100014b00>] show_stack+0x40/0x90
>> [   63.422259]                                 sp=e000009805e4f690
>> bsp=e000009805e49560
>> [   63.422259]  [<a000000100014b80>] dump_stack+0x30/0x50
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e49548
>> [   63.422259]  [<a000000100146e90>] __rcu_pending+0x1b0/0x9c0
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e494d8
>> [   63.422259]  [<a000000100148520>] rcu_check_callbacks+0x100/0x1a0
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e494b0
>> [   63.422259]  [<a000000100086a40>] update_process_times+0x60/0xc0
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e49480
>> [   63.422259]  [<a00000010003a940>] timer_interrupt+0x1c0/0x300
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e49420
>> [   63.422259]  [<a000000100138460>] handle_irq_event_percpu+0xc0/0x3c0
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e49390
>> [   63.422259]  [<a00000010013fcb0>] handle_percpu_irq+0x110/0x1a0
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e49360
>> [   63.422259]  [<a000000100137590>] generic_handle_irq+0x90/0xc0
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e49340
>> [   63.422259]  [<a000000100013200>] ia64_handle_irq+0x2a0/0x340
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e492b0
>> [   63.422259]  [<a00000010000bdc0>] ia64_native_leave_kernel+0x0/0x270
>> [   63.422259]                                 sp=e000009805e4f860
>> bsp=e000009805e492b0
>> [   63.422259]  [<a0000001002366f0>] vfsmount_lock_local_lock+0xb0/0xe0
>> [   63.422259]                                 sp=e000009805e4fa30
>> bsp=e000009805e492a0
>> [   63.422259]  [<a000000100239720>] mntput_no_expire+0x40/0x380
>> [   63.422259]                                 sp=e000009805e4fa30
>> bsp=e000009805e49250
>> [   63.422259]  [<a000000100239ac0>] mntput+0x60/0x80
>> [   63.422259]                                 sp=e000009805e4fa30
>> bsp=e000009805e49230
>> [   63.422259]  [<a0000001001fb910>] fput+0x510/0x540
>> [   63.422259]                                 sp=e000009805e4fa30
>> bsp=e000009805e491d8
>> [   63.422259]  [<a0000001001a6710>] remove_vma+0xd0/0x160
>> [   63.422259]                                 sp=e000009805e4fa30
>> bsp=e000009805e491b0
>> [   63.422259]  [<a0000001001a9a90>] exit_mmap+0x470/0x500
>> [   63.422259]                                 sp=e000009805e4fa30
>> bsp=e000009805e49170
>> [   63.422259]  [<a000000100065230>] mmput+0x90/0x240
>> [   63.422259]                                 sp=e000009805e4fab0
>> bsp=e000009805e49150
>> [   63.422259]  [<a000000100070490>] exit_mm+0x270/0x2a0
>> [   63.422259]                                 sp=e000009805e4fab0
>> bsp=e000009805e49118
>> [   63.422259]  [<a000000100073f10>] do_exit+0x510/0x1400
>> [   63.422259]                                 sp=e000009805e4fac0
>> bsp=e000009805e49098
>> [   63.422259]  [<a00000010003ba00>] die+0x300/0x320
>> [   63.422259]                                 sp=e000009805e4fae0
>> bsp=e000009805e49058
>> [   63.422259]  [<a0000001007c0060>] ia64_do_page_fault+0xa00/0xae0
>> [   63.422259]                                 sp=e000009805e4fae0
>> bsp=e000009805e49008
>> [   63.422259]  [<a00000010000bdc0>] ia64_native_leave_kernel+0x0/0x270
>> [   63.422259]                                 sp=e000009805e4fb70
>> bsp=e000009805e49008
>> [   63.422259]  [<a000000100236f40>] mntget+0x20/0xa0
>> [   63.422259]                                 sp=e000009805e4fd40
>> bsp=e000009805e49000
>> [   63.422259]  [<a00000010020dfb0>] path_get+0x30/0xe0
>> [   63.422259]                                 sp=e000009805e4fd40
>> bsp=e000009805e48fd0
>> [   63.422259]  [<a000000100214860>] path_init+0x7c0/0x880
>> [   63.422259]                                 sp=e000009805e4fd40
>> bsp=e000009805e48f88
>> [   63.422259]  [<a000000100214970>] path_lookupat+0x50/0x1180
>> [   63.422259]                                 sp=e000009805e4fd50
>> bsp=e000009805e48e88
>> [   63.422259]  [<a000000100215ad0>] do_path_lookup+0x30/0x180
>> [   63.422259]                                 sp=e000009805e4fd80
>> bsp=e000009805e48e48
>> [   63.422259]  [<a000000100215c50>] kern_path_create+0x30/0x280051c21
>> [   63.422259]                                 sp=e000009805e4fd80
>> bsp=e000009805e48e08
>> [   63.422259]  [<a000000100219200>] user_path_create+0x60/0xc0
>> [   63.422259]                                 sp=e000009805e4fe20
>> bsp=e000009805e48dc0
>> [   63.422259]  [<a00000010021a9d0>] sys_symlinkat+0x70/0x1e0
>> [   63.422259]                                 sp=e000009805e4fe20
>> bsp=e000009805e48d50
>> [   63.422259]  [<a00000010000bc40>] ia64_ret_from_syscall+0x0/0x20
>> [   63.422259]                                 sp=e000009805e4fe30
>> bsp=e000009805e48d50
>> [   63.422259]  [<a000000000040720>] ia64_ivt+0xffffffff00040720/0x400
>> [   63.422259]                                 sp=e000009805e50000
>> bsp=e000009805e48d50
>>
>> On Mon, Aug 19, 2013 at 7:05 PM, Barclay Jameson
>> <almightybeeij@xxxxxxxxx> wrote:
>>> I have posted on Nekochan
>>> (http://forums.nekochan.net/viewtopic.php?f=3&t=16727918) asking help
>>> for an error when trying to boot a Kernel compiled >= 3.2 (using
>>> Debian Wheezy).
>>> Here is the error:
>>>
>>> 000 051.21^1#0a: index time stamp         type      component    subcomponent
>>> 000 051.21^1#0a: ----- ------------------ --------- ------------ ------------
>>> 000 051.21^1#0a:     0 0x000000c92deef702 MD_HW     051.21^1#0
>>> Non-existent Memory Address Error
>>> 000 051.21^1#0a:     1 0x000000ce43920f08 PI_HW     051.21^1#0   RRB
>>> Time-out Error
>>> 000 051.21^1#0a:     2 0x000000ce43b16400 PROC_MCA  051.21^1#0a  Bus Check
>>>
>>> A more detailed error is listed below:
>>>
>>> 000 051.21^1#0a:   SH2_EVENT_OCCURRED                      : 0x0000008180000003
>>> 000 051.21^1#0a:    MD Hardware Interrupt Pending
>>> 000 051.21^1#0a:   SH2_FIRST_ERROR                         : 0x0000000000000002
>>> 000 051.21^1#0a:    MD Hardware Interrupt Pending
>>> 000 051.21^1#0a:   SH2_MEM_ERROR_SUMMARY                   : 0x0000007800000002
>>> 000 051.21^1#0a:    Non-existent Memory Address Error
>>> 000 051.21^1#0a:   SH2_MEM_FIRST_ERROR                     : 0x0000000000000002
>>> 000 051.21^1#0a:    MD_HW_INT: Non-existent Memory Address Error
>>> 000 051.21^1#0a:   SH2_MISC_ERR_HDR_UPPER                  : 0x0000000001f00004
>>> 000 051.21^1#0a:     Non-Existant Memory Address Error Header Captured
>>> 000 051.21^1#0a:     Echo: 0x1f
>>> 000 051.21^1#0a:   SH2_MISC_ERR_HDR_LOWER                  : 0x8800010000000000
>>> 000 051.21^1#0a:     Source  : pi chiplet, nasid 0x0
>>> 000 051.21^1#0a:     Command : NCRD, Non-coherent read
>>> 000 051.21^1#0a:     Read Operation
>>> 000 051.21^1#0a:   SH2_MISC_ADRS_ERR_HDR_LOWER_A           : 0x80000001014cf070
>>> 000 051.21^1#0a:     Address <37:0>: 0x1014cf070
>>> 000 051.21^1#0a:     Read Operation
>>> 000 051.21^1#0a:   SH2_MD_HW_TIME_STAMP                    : 0x800000fa22fade06
>>> 000 051.21^1#0a:
>>> 000 051.21^1#0a: PI_HW :051.21^1#0 :RRB Time-out Error
>>> 000 051.21^1#0a:
>>> 000 051.21^1#0a:   SH2_EVENT_OCCURRED                      : 0x0000008180000003
>>> 000 051.21^1#0a:    PI Hardware Interrupt Pending
>>> 000 051.21^1#0a:   SH2_FIRST_ERROR                         : 0x0000000000000002
>>> 000 051.21^1#0a:   SH2_PI_ERROR_SUMMARY                    : 0x0000000000000010
>>> 000 051.21^1#0a:    RRB Time-out Error
>>> 000 051.21^1#0a:   SH2_PI_FIRST_ERROR                     : 0x0000000000000010
>>> 000 051.21^1#0a:    RRB Time-out Error
>>> 000 051.21^1#0a:   SH2_PI_ERROR_DETAIL_1                      :
>>> 0xfe200001014cf071
>>> 000 051.21^1#0a:   SH2_PI_ERROR_DETAIL_2                      :
>>> 0x000000001f0801f1
>>> 000 051.21^1#0a:     Address      : 0x1014cf070
>>> 000 051.21^1#0a:     Table Select : 0x4
>>> 000 051.21^1#0a:     Command      : RESERVED_FE
>>> 000 051.21^1#0a:     IsReal       : 0x1
>>> 000 051.21^1#0a:     RRB Idx      : 0x1f
>>> 000 051.21^1#0a:     WRB Idx      : 0x0
>>> 000 051.21^1#0a:     IRB Idx      : 0x0
>>> 000 051.21^1#0a:     Error Code   : 0x4
>>> 000 051.21^1#0a:     Echo         : 0x1f
>>> 000 051.21^1#0a:     Source       : not available
>>> 000 051.21^1#0a:     Supplemental : 0x0
>>> 000 051.21^1#0a:     AXB Queue    : 0x0
>>> 000 051.21^1#0a:   SH2_PI_HW_TIME_STAMP                    : 0x800000feba7afc05
>>> 000 051.21^1#0a:
>>> 000 051.21^1#0a: PROC_MCA :051.21^1#0a :Bus Check
>>> 000 051.21^1#0a:
>>> 000 051.21^1#0a:   processor lid                     : 0x0000000000000000
>>> 000 051.21^1#0a:     cpu: A nasid: 0x0
>>> 000 051.21^1#0a:   processor state parameter         : 0x20010000fff21120
>>> 000 051.21^1#0a:     rendevous was not attempted
>>> 000 051.21^1#0a:     min state is valid
>>> 000 051.21^1#0a:     not continuable
>>> 000 051.21^1#0a:     machine check is isolated
>>> 000 051.21^1#0a:     more info available
>>> 000 051.21^1#0a:     ip logged is not precise
>>> 000 051.21^1#0a:     min state is not precise
>>> 000 051.21^1#0a:     shared MCA
>>> 000 051.21^1#0a:     bus check
>>> 000 051.21^1#0a:     PAL recovery status:
>>> 000 051.21^1#0a:       error was isolated and contained, continuable
>>> if sw can recover
>>> 000 051.21^1#0a:   processor error map               : 0x0000000001000000
>>> 000 051.21^1#0a:     processor code id: 0
>>> 000 051.21^1#0a:     logical thread id: 0
>>> 000 051.21^1#0a:     processor bus level 1 error
>>> 000 051.21^1#0a:   processor structure: bus
>>> 000 051.21^1#0a:     bus check                         : 0x1880000000800141
>>> 000 051.21^1#0a:       bus transaction size: 1
>>> 000 051.21^1#0a:       external bus error
>>> 000 051.21^1#0a:       transaction type: partial read
>>> 000 051.21^1#0a:       bus error severity: 0
>>> 000 051.21^1#0a:       bus hierarchy: 0
>>> 000 051.21^1#0a:       UCE detected on incoming
>>> 000 051.21^1#0a:       ia64 instruction set
>>> 000 051.21^1#0a:       machine check corrected
>>> 000 051.21^1#0a:       target address valid
>>> 000 051.21^1#0a:     target identifier                 : 0x00000001014cf071
>>>
>>> Anybody have any any pointers to help me solve this problem?
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-ia64" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-ia64" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Kernel]     [Sparc Linux]     [DCCP]     [Linux ARM]     [Yosemite News]     [Linux SCSI]     [Linux x86_64]     [Linux for Ham Radio]

  Powered by Linux