[Bugme-new] [Bug 11422] New: Kernel freezes hard on Intel systems

bugme-daemon at bugzilla.kernel.org bugme-daemon at bugzilla.kernel.org
Mon Aug 25 04:27:11 PDT 2008


http://bugzilla.kernel.org/show_bug.cgi?id=11422

           Summary: Kernel freezes hard on Intel systems
           Product: Power Management
           Version: 2.5
     KernelVersion: 2.6.27-rc4
          Platform: All
        OS/Version: Linux
              Tree: Mainline
            Status: NEW
          Severity: high
          Priority: P1
         Component: Other
        AssignedTo: power-management_other at kernel-bugs.osdl.org
        ReportedBy: gtdev at spearhead.de


Latest working kernel version: 2.6.22.19
Earliest failing kernel version: 2.6.25 (from debian packages)
Distribution: Debian GNU/Linux etch+testing+unstable
Hardware Environment: Lenovo Thinkpad (Z61m 9453-A11, Core 2 Duo,
  Intel ICH7 chipset, ATI/AMD Radeon Mobility X1400 graphics)
Software Environment:
Problem Description: System freezes hard after a random amount of time

Steps to reproduce:
 * Start system under any kernel from the 2.6.25 series or higher (I had
similar freezes with anything after 2.6.22.19, but I haven't verified that
those were related)
 * Wait

The crash is apparently unrelated to any installed software. It happens with or
without DRM modules loaded, with or without X running, with or without user
interaction.

This is the same as http://kerneltrap.org/node/16521 (which I will reproduce
here).

The freezes result in a complete lockup of the system. No output is generated
on the console, in syslog, or in messages.

    * Magic SysRq is inoperable.
    * I tried a lot of options in kernel hacking, including lock debugging.
That only sped up the time to freeze. NMI watchdog produces output.
    * I built a minimal kernel with all but the essential drivers disabled, so
I rule out issues with sound, network, PCCard, DRI/DRM, and others.
    * It happens with a stock Debian kernel (2.6.25, built for 486 arch) as
well as with custom-built kernels.
    * I tried building with both GCC 4.3 and 4.2.
    * The systems run perfectly fine with older kernels (2.6.21, 2.6.22
series), as well as Windows. memtest86+ doesn't find any issues.
    * "noacpi" is not an option since the laptop won't even boot with that. I
tried disabling stuff like MSI(-X), IRQ balancing, tick-free kernel, all to no
avail.
    * 2.6.26.2 runs fine on a non-SMP AMD system. Affected systems are
dual-core Intels. Setting the "nosmp" option doesn't help.

I have talked to someone else who is stuck to 2.6.21 kernels due to mystery
freezes as well. The codepath in the stack trace below also comes up in a lot
of reports, so maybe this should even be blocking until it is resolved.

The output below is largely the same on all tested kernels, except for precise
offsets.

--- NMI watchdog output (stock 2.6.26.2 kernel) ---
Pid: 0, comm: swapper Not tainted (2.6.26.2-debug #2)
EIP: 0060:[<c0117210>] EFLAGS: 00000097 CPU: 0
EIP is at hpet_rtc_interrupt+0x2e0/0x320
EAX: 00000000 EBX: 00000002 ECX: 00000046 EDX: 00000002
ESI: ffffc8ab EDI: c03f1edc EBP: c03f1ee8 ESP: c03f1e9c
 DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
Process swapper (pid: 0, ti=c03f0000 task=c03c9300 task.ti=c03f0000)
Stack: 03aa5b2e 00000000 f7bc7c00 f8800128 00000000 a61408d3 0061fd6e 00000000
       00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
       f7b87f80 00000000 00000000 c03f1f00 c0159d81 00000000 c03e7080 f7b87f80
Call Trace:
 [<c0159d81>] ? handle_IRQ_event+0x31/0x60
 [<c015af65>] ? handle_edge_irq+0xb5/0x150
 [<c0106c50>] ? do_IRQ+0x40/0x80
 [<c0104783>] ? common_iterrupt+0x23/0x28
 [<c013007b>] ? del_timer_sync+0x1b/0x20
 [<f8858058>] ? acpi_idle_enter_bm+0x2c2/0x344 [processor]
 [<c013f6c6>] ? pm_qos_requirement+0x26/0x30
 [<c0298891>] ? cpuidle_idle_call+0x81/0xc0
 [<c0298810>] ? cpuidle_idle_call+0x0/0xc0
 [<c0102c82>] ? cpu_idle+0x62/0xe0
 [<c0319f6e>] ? rest_init+0x4e/0x60
 =======================
Code: 80 8d 04 46 89 45 d8 89 f8 83 e7 0f c1 f8 04 8d 04 80 8d 04 47 89 45 dc
8b 45 cc 48 89 45 e0 e9 70 fd ff ff 8d b4 26 00 00 00 00 <f3> 90 a1 80 6b 3e c0
29 f0 83 f8 04 76 f2 e9 d2 fe ff ff 90 8d


-- 
Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


More information about the Bugme-new mailing list