I have this issue with all the Kernels that I have built that are >=3.2 within Wheezy. I have one working 3.4.49 Kernel that works but it only has support for 64 CPUs (First Kernel I built when upgrading Squeeze to Wheezy). You do raise a good point thought. I suppose I could distrust the SGI memory test when the machine powers up and test all 64 sticks of memory individually. On Tue, Aug 20, 2013 at 5:30 PM, Émeric MASCHINO <emeric.maschino@xxxxxxxxx> wrote: > Hi, > > Do you get this kind of error with all Linux kernels, or only >= 3.2? > It seems to me that you're experiencing memory problem, so I'm > wondering whether this could be a hardware issue (bad memory DIMM) > rather than a problem with Linux. > Maybe Linux kernels >= 3.2 test or stress memory in a way that trigger > this hardware fault? > > Emeric > > 2013/8/21 Barclay Jameson <almightybeeij@xxxxxxxxx>: >> I had another go at compiling the ia64 Kernel again on the SGI Altix >> 4700 this time with passing the O1 flag. >> It at least gave me some output that might be helpful. Anyone have any ideas? >> >> ELILO v3.14 for EFI/IA-64 >> .. >> Loading \EFI\debian\vmlinuz-3.4.49...Loading Linux... Attempting to >> relocate kernel...done >> Loading file \EFI\debian\initrd.img-3.4.49...done >> [ 0.000000] Initializing cgroup subsys cpuset >> [ 0.000000] Initializing cgroup subsys cpu >> [ 0.000000] Linux version 3.4.49 (beeij@debian) (gcc version 4.6.3 >> (Debian 4.6.3-14) ) #10 SMP Tue Aug 20 16:15:16 CDT 2013 >> [ 0.000000] EFI v1.10 by INTEL: SALsystab=0x1802c26190 ACPI 2.0=0x1802c26280 >> [ 0.000000] booting generic kernel on platform sn2 >> [ 0.000000] console [sn_sal0] enabled >> [ 0.000000] ACPI: RSDP 0000001802c26280 00024 (v02 SGI) >> [ 0.000000] ACPI: XSDT 0000001802c2a740 00044 (v01 SGI XSDTSN2 >> 00010001 ? 0000007C) >> [ 0.000000] ACPI: APIC 0000001802c26af0 0032C (v01 SGI APICSN2 >> 00010001 ? 00000001) >> [ 0.000000] ACPI: SRAT 0000001802c26e30 006B0 (v01 SGI SRATSN2 >> 00010001 ? 00000001) >> [ 0.000000] ACPI: SLIT 0000001802c274f0 0012C (v01 SGI SLITSN2 >> 00010001 ? 00000001) >> [ 0.000000] ACPI: FACP 0000001802c27680 000F4 (v03 SGI FACPSN2 >> 00030001 ? 00000001) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in Pm1aEventBlock: >> 32/0 (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in >> Pm1aControlBlock: 16/0 (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in PmTimerBlock: >> 32/0 (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in Gpe0Block: 64/0 >> (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: Invalid length for Pm1aEventBlock: 0, >> using default 32 (20120320/tbfadt-629) >> [ 0.000000] ACPI Warning: Invalid length for Pm1aControlBlock: 0, >> using default 16 (20120320/tbfadt-629) >> [ 0.000000] ACPI Warning: Invalid length for PmTimerBlock: 0, using >> default 32 (20120320/tbfadt-629) >> [ 0.000000] ACPI: DSDT 0000001802c29750 00024 (v02 SGI DSDTSN2 >> 00020001 ? 0000088B) >> [ 0.000000] ACPI: FACS 0000001802c27630 00040 >> [ 0.000000] ACPI: Local APIC address c0000000fee00000 >> [ 0.000000] 64 CPUs available, 64 CPUs total >> [ 0.000000] Number of logical nodes in system = 16 >> [ 0.000000] Number of memory chunks in system = 16 >> [ 0.000000] SMP: Allowing 64 CPUs, 0 hotplug CPUs >> [ 0.000000] Initial ramdisk at: 0xe00003daf517e000 (19257232 bytes) >> [ 0.000000] SAL 3.2: SGI SN2 version 1.54 >> [ 0.000000] SAL Platform features: ITC_Drift >> [ 0.000000] SAL: AP wakeup using external interrupt vector 0x12 >> [ 0.000000] MCA related initialization done >> [ 0.000000] ACPI: RSDP 0000001802c26280 00024 (v02 SGI) >> [ 0.000000] ACPI: XSDT 0000001802c2a740 0007C (v01 SGI XSDTSN2 >> 00010001 ? 0000007C) >> [ 0.000000] ACPI: APIC 0000001802c26af0 0032C (v01 SGI APICSN2 >> 00010001 ? 00000001) >> [ 0.000000] ACPI: SRAT 0000001802c26e30 006B0 (v01 SGI SRATSN2 >> 00010001 ? 00000001) >> [ 0.000000] ACPI: SLIT 0000001802c274f0 0012C (v01 SGI SLITSN2 >> 00010001 ? 00000001) >> [ 0.000000] ACPI: FACP 0000001802c27680 000F4 (v03 SGI FACPSN2 >> 00030001 ? 00000001) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in Pm1aEventBlock: >> 32/0 (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in >> Pm1aControlBlock: 16/0 (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in PmTimerBlock: >> 32/0 (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: 32/64X length mismatch in Gpe0Block: 64/0 >> (20120320/tbfadt-548) >> [ 0.000000] ACPI Warning: Invalid length for Pm1aEventBlock: 0, >> using default 32 (20120320/tbfadt-629) >> [ 0.000000] ACPI Warning: Invalid length for Pm1aControlBlock: 0, >> using default 16 (20120320/tbfadt-629) >> [ 0.000000] ACPI Warning: Invalid length for PmTimerBlock: 0, using >> default 32 (20120320/tbfadt-629) >> [ 0.000000] ACPI: DSDT 0000001802c29750 0088B (v02 SGI DSDTSN2 >> 00020101 ? 0000088B) >> [ 0.000000] ACPI: FACS 0000001802c27630 00040 >> [ 0.000000] ACPI: SSDT 0000001802c2a1e0 00095 (v02 SGI SSDTSN2 >> 00020101 ? 00000095) >> [ 0.000000] ACPI: SSDT 0000001802c2a2f0 000F5 (v02 SGI SSDTSN2 >> 00020101 ? 000000F5) >> [ 0.000000] ACPI: SSDT 0000001802c2a400 001F2 (v02 SGI SSDTSN2 >> 00020101 ? 000001F2) >> [ 0.000000] ACPI: SSDT 0000001802c29ff0 00095 (v02 SGI SSDTSN2 >> 00020101 ? 00000095) >> [ 0.000000] ACPI: SSDT 0000001802c2a610 0007E (v02 SGI SSDTSN2 >> 00020101 ? 0000007E) >> [ 0.000000] ACPI: SSDT 0000001802c2a7d0 00139 (v02 SGI SSDTSN2 >> 00020101 ? 00000139) >> [ 0.000000] ACPI: SSDT 0000001802c2a6a0 00090 (v02 SGI SSDTSN2 >> 00020101 ? 00000090) >> [ 0.000000] SGI SAL version 1.54 >> [ 0.000000] Virtual mem_map starts at 0xa0007ffca0600000 >> [ 0.000000] Zone PFN ranges: >> [ 0.000000] DMA 0x00600c00 -> 0x1000000000 >> [ 0.000000] Normal empty >> [ 0.000000] Movable zone start PFN for each node >> [ 0.000000] Early memory PFN ranges >> [ 0.000000] 0: 0x00600c00 -> 0x0063e000 >> [ 0.000000] 0: 0x00680000 -> 0x006bdfff >> [ 0.000000] 1: 0x01600c00 -> 0x0163e000 >> [ 0.000000] 1: 0x01680000 -> 0x016be000 >> [ 0.000000] 2: 0x02600c00 -> 0x0263e000 >> [ 0.000000] 2: 0x02680000 -> 0x026be000 >> [ 0.000000] 3: 0x03600c00 -> 0x0363e000 >> [ 0.000000] 3: 0x03680000 -> 0x036be000 >> [ 0.000000] 4: 0x04600c00 -> 0x0463e000 >> [ 0.000000] 4: 0x04680000 -> 0x046be000 >> [ 0.000000] 5: 0x05600c00 -> 0x0563e000 >> [ 0.000000] 5: 0x05680000 -> 0x056be000 >> [ 0.000000] 6: 0x06600c00 -> 0x0663e000 >> [ 0.000000] 6: 0x06680000 -> 0x066bdfff >> [ 0.000000] 7: 0x07600c00 -> 0x0763e000 >> [ 0.000000] 7: 0x07680000 -> 0x076be000 >> [ 0.000000] 8: 0x08600c00 -> 0x0863e000 >> [ 0.000000] 8: 0x08680000 -> 0x086be000 >> [ 0.000000] 9: 0x09600c00 -> 0x0963e000 >> [ 0.000000] 9: 0x09680000 -> 0x096be000 >> [ 0.000000] 10: 0x0a600c00 -> 0x0a63e000 >> [ 0.000000] 10: 0x0a680000 -> 0x0a6be000 >> [ 0.000000] 11: 0x0b600c00 -> 0x0b63e000 >> [ 0.000000] 11: 0x0b680000 -> 0x0b6be000 >> [ 0.000000] 12: 0x0c600c00 -> 0x0c63e000 >> [ 0.000000] 12: 0x0c680000 -> 0x0c6be000 >> [ 0.000000] 13: 0x0d600c00 -> 0x0d63e000 >> [ 0.000000] 13: 0x0d680000 -> 0x0d6be000 >> [ 0.000000] 14: 0x0e600c00 -> 0x0e63e000 >> [ 0.000000] 14: 0x0e680000 -> 0x0e6bdfff >> [ 0.000000] 15: 0x0f600c00 -> 0x0f63e000 >> [ 0.000000] 15: 0x0f680000 -> 0x0f6bd9ff >> [ 0.000000] 15: 0x0f6bde00 -> 0x0f6bdf56 >> [ 0.000000] 15: 0x0f6bdf65 -> 0x0f6bdf84 >> [ 0.000000] 15: 0x0f6bdfa0 -> 0x0f6bdfb9 >> [ 0.000000] Built 16 zonelists in Node order, mobility grouping on. >> Total pages: 8033770 >> [ 0.000000] Policy zone: DMA >> [ 0.000000] Kernel command line: >> BOOT_IMAGE=scsi1:/EFI/debian/vmlinuz-3.4.49 root=/dev/md0 ro >> [ 0.000000] PID hash table entries: 4096 (order: 1, 32768 bytes) >> [ 0.000000] Memory: 128722256k/129185072k available (7956k code, >> 496464k reserved, 4790k data, 816k init) >> [ 0.000000] SLUB: Genslabs=17, HWalign=128, Order=0-3, >> MinObjects=0, CPUs=64, Nodes=256 >> [ 0.000000] Hierarchical RCU implementation. >> [ 0.000000] CONFIG_RCU_FANOUT set to non-default value of 32 >> [ 0.000000] NR_IRQS:1024 >> [ 0.000000] ACPI: Local APIC address c0000000fee00000 >> [ 0.000000] register_intr: No IOSAPIC for GSI 52 >> [ 0.000000] WARNING: Persistent clock returned invalid value! >> [ 0.000000] Check your CMOS/BIOS settings. >> [ 0.000000] Console: colour dummy device 80x25 >> [ 0.000000] console [ttySG0] enabled >> [ 0.000000] console [ttySG0] enabled >> [ 0.044000] Calibrating delay loop... 3182.59 BogoMIPS (lpj=6365184) >> [ 0.065688] pid_max: default: 65536 minimum: 512 >> [ 0.073194] Security Framework initialized >> [ 0.084011] SELinux: Disabled at boot. >> [ 0.112172] Dentry cache hash table entries: 16777216 (order: 13, >> 134217728 bytes) >> [ 0.387579] Inode-cache hash table entries: 8388608 (order: 12, >> 67108864 bytes) >> [ 0.523179] Mount-cache hash table entries: 1024 >> [ 0.528217] Initializing cgroup subsys cpuacct >> [ 0.532005] Initializing cgroup subsys devices >> [ 0.544004] Initializing cgroup subsys freezer >> [ 0.560004] Initializing cgroup subsys net_cls >> [ 0.568224] ACPI: Core revision 20120320 >> [ 0.581597] Boot processor id 0x0/0x0 >> [ 0.040000] Fixed BSP b0 value from CPU 1 >> [ 0.659422] Brought up 64 CPUs >> [ 0.660031] Total of 64 processors activated (203685.88 BogoMIPS). >> [ 0.706196] devtmpfs: initialized >> [ 0.724306] DMI not present or invalid. >> [ 0.726680] dummy: >> [ 0.732470] NET: Registered protocol family 16 >> [ 0.740147] ACPI: bus type pci registered >> [ 0.752148] ACPI DSDT OEM Rev 0x20101 >> [ 0.782373] bio: create slab <bio-0> at 0 >> [ 0.785459] ACPI: Added _OSI(Module Device) >> [ 0.792003] ACPI: Added _OSI(Processor Device) >> [ 0.804004] ACPI: Added _OSI(3.0 _SCP Extensions) >> [ 0.820003] ACPI: Added _OSI(Processor Aggregator Device) >> [ 0.828294] ACPI: SCI (ACPI GSI 52) not registered >> [ 0.845126] ACPI: Interpreter enabled >> [ 0.860003] ACPI: (supports S0) >> [ 0.872000] ACPI: Using platform specific model for interrupt routing >> [ 0.881647] ACPI: No dock devices found. >> [ 0.892058] [Firmware Bug]: ACPI: no secondary bus range in _CRS >> [ 0.904010] ACPI: PCI Root Bridge [P000] (domain 0002 [bus 00-ff]) >> [ 0.916043] pci_root PNP0A03:00: host bridge window [mem >> 0x2010200000-0x20103fffff] (PCI address [0x200000-0x3fffff]) >> [ 0.928008] pci_root PNP0A03:00: host bridge window [mem >> 0x2010400000-0x20105fffff] (PCI address [0x400000-0x5fffff]) >> [ 0.944005] pci_root PNP0A03:00: host bridge window [mem >> 0x2010600000-0x20106fffff] (PCI address [0x600000-0x6fffff]) >> [ 0.956005] pci_root PNP0A03:00: host bridge window [mem >> 0x2180700000-0x21bffeffff] (PCI address [0x700000-0x3ffeffff]) >> [ 0.980007] pci_root PNP0A03:00: host bridge window [mem >> 0x2180000000-0x21800fffff] (PCI address [0x0-0xfffff]) >> [ 0.992041] PCI host bridge to bus 0002:00 >> [ 1.000008] pci_bus 0002:00: root bus resource [mem >> 0x2010200000-0x20103fffff] (bus address [0x00200000-0x003fffff]) >> [ 1.016005] pci_bus 0002:00: root bus resource [mem >> 0x2010400000-0x20105fffff] (bus address [0x00400000-0x005fffff]) >> [ 1.036006] pci_bus 0002:00: root bus resource [mem >> 0x2010600000-0x20106fffff] (bus address [0x00600000-0x006fffff]) >> [ 1.048006] pci_bus 0002:00: root bus resource [mem >> 0x2180700000-0x21bffeffff] (bus address [0x00700000-0x3ffeffff]) >> [ 1.064005] pci_bus 0002:00: root bus resource [mem >> 0x2180000000-0x21800fffff] (bus address [0x00000000-0x000fffff]) >> [ 1.081959] pci0002:00: Requesting ACPI _OSC control (0x1d) >> [ 1.092007] pci0002:00: ACPI _OSC request failed (AE_NOT_FOUND), >> returned control mask: 0x1d >> [ 1.108002] ACPI _OSC control for PCIe not granted, disabling ASPM >> [ 1.120074] [Firmware Bug]: ACPI: no secondary bus range in _CRS >> [ 1.128007] ACPI: PCI Root Bridge [P001] (domain 0001 [bus 00-ff]) >> [ 1.140036] pci_root PNP0A03:01: host bridge window [mem >> 0x2000200000-0x20003fffff] (PCI address [0x200000-0x3fffff]) >> [ 1.156005] pci_root PNP0A03:01: host bridge window [mem >> 0x2000400000-0x20005fffff] (PCI address [0x400000-0x5fffff]) >> [ 1.172005] pci_root PNP0A03:01: host bridge window [mem >> 0x2000600000-0x20006fffff] (PCI address [0x600000-0x6fffff]) >> [ 1.184007] pci_root PNP0A03:01: host bridge window [io >> 0x1000000-0x10fffff] (PCI address [0x0-0xfffff]) >> [ 1.196008] pci_root PNP0A03:01: host bridge window [mem >> 0x21c0700000-0x21fffeffff] (PCI address [0x700000-0x3ffeffff]) >> [ 1.208010] pci_root PNP0A03:01: host bridge window [mem >> 0x21c0000000-0x21c00fffff] (PCI address [0x0-0xfffff]) >> [ 1.220046] PCI host bridge to bus 0001:00 >> [ 1.236005] pci_bus 0001:00: root bus resource [mem >> 0x2000200000-0x20003fffff] (bus address [0x00200000-0x003fffff]) >> [ 1.248005] pci_bus 0001:00: root bus resource [mem >> 0x2000400000-0x20005fffff] (bus address [0x00400000-0x005fffff]) >> [ 1.260006] pci_bus 0001:00: root bus resource [mem >> 0x2000600000-0x20006fffff] (bus address [0x00600000-0x006fffff]) >> [ 1.272005] pci_bus 0001:00: root bus resource [io >> 0x1000000-0x10fffff] (bus address [0x0000-0xfffff]) >> [ 1.288005] pci_bus 0001:00: root bus resource [mem >> 0x21c0700000-0x21fffeffff] (bus address [0x00700000-0x3ffeffff]) >> [ 1.304005] pci_bus 0001:00: root bus resource [mem >> 0x21c0000000-0x21c00fffff] (bus address [0x00000000-0x000fffff]) >> [ 1.334427] pci 0001:00:03.0: PCI bridge to [bus 01-01] >> [ 1.336601] pci0001:00: Requesting ACPI _OSC control (0x1d) >> [ 1.348005] pci0001:00: ACPI _OSC request failed (AE_NOT_FOUND), >> returned control mask: 0x1d >> [ 1.364002] ACPI _OSC control for PCIe not granted, disabling ASPM >> [ 1.376640] [Firmware Bug]: ACPI: no secondary bus range in _CRS >> [ 1.388008] ACPI: PCI Root Bridge [P000] (domain 0011 [bus 00-ff]) >> [ 1.400031] pci_root PNP0A03:02: host bridge window [mem >> 0x6200000000-0x67ffffffff] (PCI address [0x0-0x5ffffffff]) >> [ 1.412009] pci_root PNP0A03:02: host bridge window [io >> 0x2000000-0x2ffffff] (PCI address [0x0-0xffffff]) >> [ 1.428045] PCI host bridge to bus 0011:00 >> [ 1.444005] pci_bus 0011:00: root bus resource [mem >> 0x6200000000-0x67ffffffff] (bus address [0x00000000-0x5ffffffff]) >> [ 1.456007] pci_bus 0011:00: root bus resource [io >> 0x2000000-0x2ffffff] (bus address [0x0000-0xffffff]) >> [ 1.473755] pci 0011:00:01.0: PCI bridge to [bus 01-01] >> [ 1.480452] pci 0011:00:02.0: PCI bridge to [bus 02-02] >> [ 1.492243] pci0011:00: Requesting ACPI _OSC control (0x1d) >> [ 1.500004] pci0011:00: ACPI _OSC request failed (AE_NOT_FOUND), >> returned control mask: 0x1d >> [ 1.516002] ACPI _OSC control for PCIe not granted, disabling ASPM >> [ 1.536571] vgaarb: loaded >> [ 1.544413] Switching to clocksource sn2_rtc >> [ 1.560324] pnp: PnP ACPI init >> [ 1.569151] ACPI: bus type pnp registered >> [ 1.578709] pnp: PnP ACPI: found 3 devices >> [ 1.594314] ACPI: ACPI bus type pnp unregistered >> [ 1.610913] NET: Registered protocol family 2 >> [ 1.616848] IP route cache hash table entries: 524288 (order: 8, >> 4194304 bytes) >> [ 1.631962] TCP established hash table entries: 524288 (order: 9, >> 8388608 bytes) >> [ 1.656604] TCP bind hash table entries: 65536 (order: 6, 1048576 bytes) >> [ 1.662448] TCP: Hash tables configured (established 524288 bind 65536) >> [ 1.669308] TCP: reno registered >> [ 1.685228] UDP hash table entries: 65536 (order: 7, 2097152 bytes) >> [ 1.704860] UDP-Lite hash table entries: 65536 (order: 7, 2097152 bytes) >> [ 1.718780] NET: Registered protocol family 1 >> [ 1.831029] Unpacking initramfs... >> [ 2.559641] Freeing initrd memory: 18784kB freed >> [ 2.562393] perfmon: version 2.0 IRQ 238 >> [ 2.569242] perfmon: Montecito PMU detected, 27 PMCs, 35 PMDs, 12 >> counters (47 bits) >> [ 2.609465] perfmon: added sampling format default_format >> [ 2.612017] perfmon_default_smpl: default_format v2.0 registered >> [ 2.941690] audit: initializing netlink socket (disabled) >> [ 2.944540] type=2000 audit(2.940:1): initialized >> [ 3.053549] HugeTLB registered 256 MB page size, pre-allocated 0 pages >> [ 3.060106] VFS: Disk quotas dquot_6.5.2 >> [ 3.063376] Dquot-cache hash table entries: 2048 (order 0, 16384 bytes) >> [ 3.077548] msgmni has been set to 32768 >> [ 3.085721] Block layer SCSI generic (bsg) driver version 0.4 >> loaded (major 253) >> [ 3.101259] io scheduler noop registered >> [ 3.114227] io scheduler deadline registered >> [ 3.124480] io scheduler cfq registered (default) >> [ 3.138410] input: Power Button as >> /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0 >> [ 3.151399] ACPI: Power Button [PWRF] >> [ 3.164463] input: Sleep Button as >> /devices/LNXSYSTM:00/LNXSLPBN:00/input/input1 >> [ 3.175828] ACPI: Sleep Button [SLPF] >> [ 3.190913] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled >> [ 3.206082] sn_console: Console driver init >> [ 3.219494] ttySG0 at I/O 0x0 (irq = 0) is a SGI SN L1 >> [ 3.300195] Linux agpgart interface v0.103 >> [ 3.303450] mousedev: PS/2 mouse device common for all mice >> [ 3.311532] rtc-efi rtc-efi: rtc core: registered rtc-efi as rtc0 >> [ 3.324627] TCP: cubic registered >> [ 3.334577] NET: Registered protocol family 17 >> [ 3.344580] Registering the dns_resolver key type >> [ 3.358790] registered taskstats version 1 >> [ 3.371272] rtc-efi rtc-efi: setting system clock to 2013-08-20 >> 21:27:57 UTC (1377034077) >> [ 3.381022] Freeing unused kernel memory: 816kB freedLoading, please wait... >> [ 3.420446] kernel unaligned access to 0xe000005a80008014, >> ip=0xa00000010020df90 >> [ 3.423708] Unable to handle kernel paging request at virtual >> address 8000800000000018 >> [ 3.427693] udevd[247]: Oops 8813272891392 [1] >> [ 3.427693] Modules linked in: >> [ 3.427693] >> [ 3.427693] Pid: 247, CPU 17, comm: udevd >> [ 3.427693] psr : 0000101008522030 ifs : 8000000000000001 ip : >> [<a000000100236f40>] Not tainted (3.4.49) >> [ 3.427693] ip is at mntget+0x20/0xa0 >> [ 3.427693] unat: 0000000000000000 pfs : 0000000000000286 rsc : >> 0000000000000003 >> [ 3.427693] rnat: 000000000000003c bsps: 0000000000000038 pr : >> 000000000001c299 >> [ 3.427693] ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: >> 0009804c0270033f >> [ 3.427693] csd : 0000000000000000 ssd : 0000000000000000 >> [ 3.427693] b0 : a00000010020dfb0 b6 : a000000100309c40 b7 : >> a0000001000102d0 >> [ 3.427693] f6 : 000000000000000000000 f7 : 000000000000000000000 >> [ 3.427693] f8 : 000000000000000000000 f9 : 000000000000000000000 >> [ 3.427693] f10 : 000000000000000000000 f11 : 000000000000000000000 >> [ 3.427693] r1 : a000000100e58960 r2 : 0000000000c80000 r3 : >> 0000000000000064 >> [ 3.427693] r8 : 8000800000000000 r9 : a000000100c05370 r10 : >> e000009805e4fd98 >> [ 3.427693] r11 : e000005a80008000 r12 : e000009805e4fd40 r13 : >> e000009805e48000 >> [ 3.427693] r14 : 0000000000c80064 r15 : 0000001008526030 r16 : >> 0000000000c80064 >> [ 3.427693] r17 : 0000000000000000 r18 : 8000800000000018 r19 : >> e000009805e4fd98 >> [ 3.427693] r20 : 0000000000000000 r21 : e000009805e4fd50 r22 : >> e000009805e4fdcc >> [ 3.427693] r23 : 0000000000000001 r24 : 0000000000000044 r25 : >> e0000118031462f8 >> [ 3.427693] r26 : fffffffffffc62f8 r27 : fffffffffffc62f8 r28 : >> e000011803180000 >> [ 3.427693] r29 : fffffffffffc62f0 r30 : 0000000000000063 r31 : >> 0000000000000063 >> [ 3.427693] >> [ 3.427693] Call Trace: >> [ 3.427693] [<a000000100014b00>] show_stack+0x40/0x90 >> [ 3.427693] sp=e000009805e4f910 >> bsp=e000009805e49108 >> [ 3.427693] [<a000000100015370>] show_regs+0x7d0/0x900 >> [ 3.427693] sp=e000009805e4fae0 >> bsp=e000009805e49098 >> [ 3.427693] [<a00000010003b8c0>] die+0x1c0/0x320 >> [ 3.427693] sp=e000009805e4fae0 >> bsp=e000009805e49058 >> [ 3.427693] [<a0000001007c0060>] ia64_do_page_fault+0xa00/0xae0 >> [ 3.427693] sp=e000009805e4fae0 >> bsp=e000009805e49008 >> [ 3.427693] [<a00000010000bdc0>] ia64_native_leave_kernel+0x0/0x270 >> [ 3.427693] sp=e000009805e4fb70 >> bsp=e000009805e49008 >> [ 3.427693] [<a000000100236f40>] mntget+0x20/0xa0 >> [ 3.427693] sp=e000009805e4fd40 >> bsp=e000009805e49000 >> [ 3.427693] [<a00000010020dfb0>] path_get+0x30/0xe0 >> [ 3.427693] sp=e000009805e4fd40 >> bsp=e000009805e48fd0 >> [ 3.427693] [<a000000100214860>] path_init+0x7c0/0x880 >> [ 3.427693] sp=e000009805e4fd40 >> bsp=e000009805e48f88 >> [ 3.427693] [<a000000100214970>] path_lookupat+0x50/0x1180 >> [ 3.427693] sp=e000009805e4fd50 >> bsp=e000009805e48e88 >> [ 3.427693] [<a000000100215ad0>] do_path_lookup+0x30/0x180 >> [ 3.427693] sp=e000009805e4fd80 >> bsp=e000009805e48e48 >> [ 3.427693] [<a000000100215c50>] kern_path_create+0x30/0x280 >> [ 3.427693] sp=e000009805e4fd80 >> bsp=e000009805e48e08 >> [ 3.427693] [<a000000100219200>] user_path_create+0x60/0xc0 >> [ 3.427693] sp=e000009805e4fe20 >> bsp=e000009805e48dc0 >> [ 3.427693] [<a00000010021a9d0>] sys_symlinkat+0x70/0x1e0 >> [ 3.427693] sp=e000009805e4fe20 >> bsp=e000009805e48d50 >> [ 3.427693] [<a00000010000bc40>] ia64_ret_from_syscall+0x0/0x20 >> [ 3.427693] sp=e000009805e4fe30 >> bsp=e000009805e48d50 >> [ 3.427693] [<a000000000040720>] ia64_ivt+0xffffffff00040720/0x400 >> [ 3.427693] sp=e000009805e50000 >> bsp=e000009805e48d50 >> [ 3.427693] Disabling lock debugging due to kernel taint >> [ 63.422259] INFO: rcu_sched self-detected stall on CPU { 17} >> [ 63.427883] INFO: rcu_sched detected stalls on CPUs/tasks: { 17} >> (detected by 51, t=15002 jiffies) >> [ 63.427883] INFO: Stall ended before state dump start >> [ 63.422259] (t=15010 jiffies) >> [ 63.422259] >> [ 63.422259] Call Trace: >> [ 63.422259] [<a000000100014b00>] show_stack+0x40/0x90 >> [ 63.422259] sp=e000009805e4f690 >> bsp=e000009805e49560 >> [ 63.422259] [<a000000100014b80>] dump_stack+0x30/0x50 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e49548 >> [ 63.422259] [<a000000100146e90>] __rcu_pending+0x1b0/0x9c0 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e494d8 >> [ 63.422259] [<a000000100148520>] rcu_check_callbacks+0x100/0x1a0 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e494b0 >> [ 63.422259] [<a000000100086a40>] update_process_times+0x60/0xc0 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e49480 >> [ 63.422259] [<a00000010003a940>] timer_interrupt+0x1c0/0x300 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e49420 >> [ 63.422259] [<a000000100138460>] handle_irq_event_percpu+0xc0/0x3c0 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e49390 >> [ 63.422259] [<a00000010013fcb0>] handle_percpu_irq+0x110/0x1a0 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e49360 >> [ 63.422259] [<a000000100137590>] generic_handle_irq+0x90/0xc0 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e49340 >> [ 63.422259] [<a000000100013200>] ia64_handle_irq+0x2a0/0x340 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e492b0 >> [ 63.422259] [<a00000010000bdc0>] ia64_native_leave_kernel+0x0/0x270 >> [ 63.422259] sp=e000009805e4f860 >> bsp=e000009805e492b0 >> [ 63.422259] [<a0000001002366f0>] vfsmount_lock_local_lock+0xb0/0xe0 >> [ 63.422259] sp=e000009805e4fa30 >> bsp=e000009805e492a0 >> [ 63.422259] [<a000000100239720>] mntput_no_expire+0x40/0x380 >> [ 63.422259] sp=e000009805e4fa30 >> bsp=e000009805e49250 >> [ 63.422259] [<a000000100239ac0>] mntput+0x60/0x80 >> [ 63.422259] sp=e000009805e4fa30 >> bsp=e000009805e49230 >> [ 63.422259] [<a0000001001fb910>] fput+0x510/0x540 >> [ 63.422259] sp=e000009805e4fa30 >> bsp=e000009805e491d8 >> [ 63.422259] [<a0000001001a6710>] remove_vma+0xd0/0x160 >> [ 63.422259] sp=e000009805e4fa30 >> bsp=e000009805e491b0 >> [ 63.422259] [<a0000001001a9a90>] exit_mmap+0x470/0x500 >> [ 63.422259] sp=e000009805e4fa30 >> bsp=e000009805e49170 >> [ 63.422259] [<a000000100065230>] mmput+0x90/0x240 >> [ 63.422259] sp=e000009805e4fab0 >> bsp=e000009805e49150 >> [ 63.422259] [<a000000100070490>] exit_mm+0x270/0x2a0 >> [ 63.422259] sp=e000009805e4fab0 >> bsp=e000009805e49118 >> [ 63.422259] [<a000000100073f10>] do_exit+0x510/0x1400 >> [ 63.422259] sp=e000009805e4fac0 >> bsp=e000009805e49098 >> [ 63.422259] [<a00000010003ba00>] die+0x300/0x320 >> [ 63.422259] sp=e000009805e4fae0 >> bsp=e000009805e49058 >> [ 63.422259] [<a0000001007c0060>] ia64_do_page_fault+0xa00/0xae0 >> [ 63.422259] sp=e000009805e4fae0 >> bsp=e000009805e49008 >> [ 63.422259] [<a00000010000bdc0>] ia64_native_leave_kernel+0x0/0x270 >> [ 63.422259] sp=e000009805e4fb70 >> bsp=e000009805e49008 >> [ 63.422259] [<a000000100236f40>] mntget+0x20/0xa0 >> [ 63.422259] sp=e000009805e4fd40 >> bsp=e000009805e49000 >> [ 63.422259] [<a00000010020dfb0>] path_get+0x30/0xe0 >> [ 63.422259] sp=e000009805e4fd40 >> bsp=e000009805e48fd0 >> [ 63.422259] [<a000000100214860>] path_init+0x7c0/0x880 >> [ 63.422259] sp=e000009805e4fd40 >> bsp=e000009805e48f88 >> [ 63.422259] [<a000000100214970>] path_lookupat+0x50/0x1180 >> [ 63.422259] sp=e000009805e4fd50 >> bsp=e000009805e48e88 >> [ 63.422259] [<a000000100215ad0>] do_path_lookup+0x30/0x180 >> [ 63.422259] sp=e000009805e4fd80 >> bsp=e000009805e48e48 >> [ 63.422259] [<a000000100215c50>] kern_path_create+0x30/0x280051c21 >> [ 63.422259] sp=e000009805e4fd80 >> bsp=e000009805e48e08 >> [ 63.422259] [<a000000100219200>] user_path_create+0x60/0xc0 >> [ 63.422259] sp=e000009805e4fe20 >> bsp=e000009805e48dc0 >> [ 63.422259] [<a00000010021a9d0>] sys_symlinkat+0x70/0x1e0 >> [ 63.422259] sp=e000009805e4fe20 >> bsp=e000009805e48d50 >> [ 63.422259] [<a00000010000bc40>] ia64_ret_from_syscall+0x0/0x20 >> [ 63.422259] sp=e000009805e4fe30 >> bsp=e000009805e48d50 >> [ 63.422259] [<a000000000040720>] ia64_ivt+0xffffffff00040720/0x400 >> [ 63.422259] sp=e000009805e50000 >> bsp=e000009805e48d50 >> >> On Mon, Aug 19, 2013 at 7:05 PM, Barclay Jameson >> <almightybeeij@xxxxxxxxx> wrote: >>> I have posted on Nekochan >>> (http://forums.nekochan.net/viewtopic.php?f=3&t=16727918) asking help >>> for an error when trying to boot a Kernel compiled >= 3.2 (using >>> Debian Wheezy). >>> Here is the error: >>> >>> 000 051.21^1#0a: index time stamp type component subcomponent >>> 000 051.21^1#0a: ----- ------------------ --------- ------------ ------------ >>> 000 051.21^1#0a: 0 0x000000c92deef702 MD_HW 051.21^1#0 >>> Non-existent Memory Address Error >>> 000 051.21^1#0a: 1 0x000000ce43920f08 PI_HW 051.21^1#0 RRB >>> Time-out Error >>> 000 051.21^1#0a: 2 0x000000ce43b16400 PROC_MCA 051.21^1#0a Bus Check >>> >>> A more detailed error is listed below: >>> >>> 000 051.21^1#0a: SH2_EVENT_OCCURRED : 0x0000008180000003 >>> 000 051.21^1#0a: MD Hardware Interrupt Pending >>> 000 051.21^1#0a: SH2_FIRST_ERROR : 0x0000000000000002 >>> 000 051.21^1#0a: MD Hardware Interrupt Pending >>> 000 051.21^1#0a: SH2_MEM_ERROR_SUMMARY : 0x0000007800000002 >>> 000 051.21^1#0a: Non-existent Memory Address Error >>> 000 051.21^1#0a: SH2_MEM_FIRST_ERROR : 0x0000000000000002 >>> 000 051.21^1#0a: MD_HW_INT: Non-existent Memory Address Error >>> 000 051.21^1#0a: SH2_MISC_ERR_HDR_UPPER : 0x0000000001f00004 >>> 000 051.21^1#0a: Non-Existant Memory Address Error Header Captured >>> 000 051.21^1#0a: Echo: 0x1f >>> 000 051.21^1#0a: SH2_MISC_ERR_HDR_LOWER : 0x8800010000000000 >>> 000 051.21^1#0a: Source : pi chiplet, nasid 0x0 >>> 000 051.21^1#0a: Command : NCRD, Non-coherent read >>> 000 051.21^1#0a: Read Operation >>> 000 051.21^1#0a: SH2_MISC_ADRS_ERR_HDR_LOWER_A : 0x80000001014cf070 >>> 000 051.21^1#0a: Address <37:0>: 0x1014cf070 >>> 000 051.21^1#0a: Read Operation >>> 000 051.21^1#0a: SH2_MD_HW_TIME_STAMP : 0x800000fa22fade06 >>> 000 051.21^1#0a: >>> 000 051.21^1#0a: PI_HW :051.21^1#0 :RRB Time-out Error >>> 000 051.21^1#0a: >>> 000 051.21^1#0a: SH2_EVENT_OCCURRED : 0x0000008180000003 >>> 000 051.21^1#0a: PI Hardware Interrupt Pending >>> 000 051.21^1#0a: SH2_FIRST_ERROR : 0x0000000000000002 >>> 000 051.21^1#0a: SH2_PI_ERROR_SUMMARY : 0x0000000000000010 >>> 000 051.21^1#0a: RRB Time-out Error >>> 000 051.21^1#0a: SH2_PI_FIRST_ERROR : 0x0000000000000010 >>> 000 051.21^1#0a: RRB Time-out Error >>> 000 051.21^1#0a: SH2_PI_ERROR_DETAIL_1 : >>> 0xfe200001014cf071 >>> 000 051.21^1#0a: SH2_PI_ERROR_DETAIL_2 : >>> 0x000000001f0801f1 >>> 000 051.21^1#0a: Address : 0x1014cf070 >>> 000 051.21^1#0a: Table Select : 0x4 >>> 000 051.21^1#0a: Command : RESERVED_FE >>> 000 051.21^1#0a: IsReal : 0x1 >>> 000 051.21^1#0a: RRB Idx : 0x1f >>> 000 051.21^1#0a: WRB Idx : 0x0 >>> 000 051.21^1#0a: IRB Idx : 0x0 >>> 000 051.21^1#0a: Error Code : 0x4 >>> 000 051.21^1#0a: Echo : 0x1f >>> 000 051.21^1#0a: Source : not available >>> 000 051.21^1#0a: Supplemental : 0x0 >>> 000 051.21^1#0a: AXB Queue : 0x0 >>> 000 051.21^1#0a: SH2_PI_HW_TIME_STAMP : 0x800000feba7afc05 >>> 000 051.21^1#0a: >>> 000 051.21^1#0a: PROC_MCA :051.21^1#0a :Bus Check >>> 000 051.21^1#0a: >>> 000 051.21^1#0a: processor lid : 0x0000000000000000 >>> 000 051.21^1#0a: cpu: A nasid: 0x0 >>> 000 051.21^1#0a: processor state parameter : 0x20010000fff21120 >>> 000 051.21^1#0a: rendevous was not attempted >>> 000 051.21^1#0a: min state is valid >>> 000 051.21^1#0a: not continuable >>> 000 051.21^1#0a: machine check is isolated >>> 000 051.21^1#0a: more info available >>> 000 051.21^1#0a: ip logged is not precise >>> 000 051.21^1#0a: min state is not precise >>> 000 051.21^1#0a: shared MCA >>> 000 051.21^1#0a: bus check >>> 000 051.21^1#0a: PAL recovery status: >>> 000 051.21^1#0a: error was isolated and contained, continuable >>> if sw can recover >>> 000 051.21^1#0a: processor error map : 0x0000000001000000 >>> 000 051.21^1#0a: processor code id: 0 >>> 000 051.21^1#0a: logical thread id: 0 >>> 000 051.21^1#0a: processor bus level 1 error >>> 000 051.21^1#0a: processor structure: bus >>> 000 051.21^1#0a: bus check : 0x1880000000800141 >>> 000 051.21^1#0a: bus transaction size: 1 >>> 000 051.21^1#0a: external bus error >>> 000 051.21^1#0a: transaction type: partial read >>> 000 051.21^1#0a: bus error severity: 0 >>> 000 051.21^1#0a: bus hierarchy: 0 >>> 000 051.21^1#0a: UCE detected on incoming >>> 000 051.21^1#0a: ia64 instruction set >>> 000 051.21^1#0a: machine check corrected >>> 000 051.21^1#0a: target address valid >>> 000 051.21^1#0a: target identifier : 0x00000001014cf071 >>> >>> Anybody have any any pointers to help me solve this problem? >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-ia64" in >> the body of a message to majordomo@xxxxxxxxxxxxxxx >> More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-ia64" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html