[Bugme-new] [Bug 21252] New: unable to handle paging request while using two KDE 4 sessions

bugzilla-daemon at bugzilla.kernel.org bugzilla-daemon at bugzilla.kernel.org
Wed Oct 27 03:24:54 PDT 2010


https://bugzilla.kernel.org/show_bug.cgi?id=21252

           Summary: unable to handle paging request while using two KDE 4
                    sessions
           Product: Memory Management
           Version: 2.5
    Kernel Version: 2.6.36
          Platform: All
        OS/Version: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Page Allocator
        AssignedTo: akpm at linux-foundation.org
        ReportedBy: Martin at Lichtvoll.de
        Regression: No


Created an attachment (id=35202)
 --> (https://bugzilla.kernel.org/attachment.cgi?id=35202)
syslog

Rather unspecific bug title, cause I do not know exactly where to sort this. 

This is with:

martin at shambhala:~> cat /proc/version
Linux version 2.6.36-tp42-gtt-vram-apart-v2-dirty (martin at shambhala) (gcc
version 4.4.5 (Debian 4.4.5-4) ) #1 PREEMPT Thu Oct 21 12:10:49 CEST 2010

I only seen it once so far. This is upstream kernel with a with patch from

Bug 28402 - random radeon/kms/drm related freezes with kernel 2.6.34
https://bugs.freedesktop.org/show_bug.cgi?id=28402#c82

While using one of two KDE sessions on my ThinkPad T42 suddenly the X.org
display was replaced by parts of a backtrace that I now dig out from the
syslog:

Oct 27 09:25:35 shambhala kernel: BUG: unable to handle kernel paging request
at 9a7a7a86
Oct 27 09:25:35 shambhala kernel: IP: [<c10bb66e>] vma_prio_tree_next+0x2e/0x50
Oct 27 09:25:35 shambhala kernel: *pde = 00000000 
Oct 27 09:25:35 shambhala kernel: Oops: 0000 [#1] PREEMPT 
Oct 27 09:25:35 shambhala kernel: last sysfs file:
/sys/devices/pci0000:00/0000:00:1e.0/0000:02:02.0/rf_kill
Oct 27 09:25:35 shambhala kernel: Modules linked in: ufs qnx4 hfsplus hfs minix
jfs ext2 tp_smapi hdaps thinkpad_ec vboxnetadp vboxnetflt vboxdrv snd_usb_audio
snd_hwdep snd_usbmidi_lib sata_sil acpi_cpufreq mperf cpufreq_userspace
cpufreq_stats cpufreq_conservative cpufreq_powersave ppdev lp cn ipv6 microcode
fuse tun usblp ohci_hcd usb_storage usb_libusual nls_iso8859_15 nls_iso8859_1
nls_cp850 ntfs vfat msdos fat reiserfs isofs udf crc_itu_t smbfs pktcdvd uinput
radeon ttm drm_kms_helper drm i2c_algo_bit input_polldev dm_crypt snd_intel8x0m
snd_intel8x0 thinkpad_acpi hwmon snd_ac97_codec ac97_bus snd_pcm_oss
snd_mixer_oss snd_seq_dummy snd_pcm snd_seq_oss snd_seq_midi snd_rawmidi
snd_seq_midi_event snd_seq snd_timer pcmcia ipw2200 libipw snd_seq_device
rtc_cmos yenta_socket cfg80211 rtc_core pcmcia_rsrc snd snd_page_alloc video
joydev intel_agp output pcmcia_core rtc_lib led_class parport_pc evdev agpgart
nvram i2c_i801 rfkill lib80211 parport processor battery ac button dm_mod
raid10 raid456 async_pq async_xor 
Oct 27 09:25:35 shambhala kernel: xor async_memcpy async_raid6_recov raid6_pq
async_tx raid1 raid0 linear md_mod btrfs zlib_deflate crc32c libcrc32c usbhid
uhci_hcd ehci_hcd sg usbcore sr_mod thermal cdrom [last unloaded: hdaps]
Oct 27 09:25:35 shambhala kernel:
Oct 27 09:25:35 shambhala kernel: Pid: 319, comm: kswapd0 Tainted: G        W  
2.6.36-tp42-gtt-vram-apart-v2-dirty #1 2373CXG/2373CXG
Oct 27 09:25:35 shambhala kernel: EIP: 0060:[<c10bb66e>] EFLAGS: 00010282 CPU:
0
Oct 27 09:25:35 shambhala kernel: EIP is at vma_prio_tree_next+0x2e/0x50
Oct 27 09:25:35 shambhala kernel: EAX: 9a7a7a52 EBX: c42ce1b8 ECX: 00000000
EDX: f71e5dfc
Oct 27 09:25:35 shambhala kernel: ESI: c1f127e0 EDI: 00000000 EBP: f71e5dd0
ESP: f71e5dd0
Oct 27 09:25:35 shambhala kernel: DS: 007b ES: 007b FS: 0000 GS: 00e0 SS: 0068
Oct 27 09:25:35 shambhala kernel: Process kswapd0 (pid: 319, ti=f71e4000
task=f7163e80 task.ti=f71e4000)
Oct 27 09:25:35 shambhala kernel: Stack:
Oct 27 09:25:35 shambhala kernel: f71e5e28 c10ca52a f71e5e18 f71e5e7c f71e5e70
00000000 ffffffff 00000001
Oct 27 09:25:35 shambhala kernel: <0> 00000020 00000000 00000001 cce2a918
00000004 00000000 00000000 f4989cd0
Oct 27 09:25:35 shambhala kernel: <0> 00000000 00000000 00000028 c1f127e0
f71e5f64 c1f127f8 f71e5e90 c10b51d3
Oct 27 09:25:35 shambhala kernel: Call Trace:
Oct 27 09:25:35 shambhala kernel: [<c10ca52a>] ? page_referenced+0xaa/0x2d0
Oct 27 09:25:35 shambhala kernel: [<c10b51d3>] ? shrink_active_list+0x183/0x310
Oct 27 09:25:35 shambhala kernel: [<c10b65a3>] ? shrink_zone+0x3c3/0x5b0
Oct 27 09:25:35 shambhala kernel: [<c10dd92e>] ?
__mem_cgroup_largest_soft_limit_node+0x5e/0x150
Oct 27 09:25:35 shambhala kernel: [<c10b6df4>] ? kswapd+0x534/0x790
Oct 27 09:25:35 shambhala kernel: [<c1055460>] ?
autoremove_wake_function+0x0/0x50
Oct 27 09:25:35 shambhala kernel: [<c10b68c0>] ? kswapd+0x0/0x790
Oct 27 09:25:35 shambhala kernel: [<c1054f04>] ? kthread+0x74/0x80
Oct 27 09:25:35 shambhala kernel: [<c1054e90>] ? kthread+0x0/0x80
Oct 27 09:25:35 shambhala kernel: [<c100347e>] ? kernel_thread_helper+0x6/0x10
Oct 27 09:25:35 shambhala kernel: Code: 0f 1f 44 00 00 85 c0 74 29 8b 48 30 85
c9 74 15 8b 40 34 85 c0 74 1b 8b 50 28 0f 18 02 90 5d c3 90 8d 74 26 00 8b 40
28 83 e8 28 <8b> 48 34 85 c9 74 e5 89 d0 e8 a4 5c 1a 00 85 c0 74 0c 83 e8 28 
Oct 27 09:25:35 shambhala kernel: EIP: [<c10bb66e>]
vma_prio_tree_next+0x2e/0x50 SS:ESP 0068:f71e5dd0
Oct 27 09:25:35 shambhala kernel: CR2: 000000009a7a7a86
Oct 27 09:25:35 shambhala kernel: ---[ end trace f13586feeb79aae5 ]---
Oct 27 09:25:35 shambhala kernel: note: kswapd0[319] exited with preempt_count
1

After this I found that the KDE plasma desktop of that session wouldn't respond
to any click any more while other KDE applications on the same desktop and the
other KDE session still work. On looking at ps aux | grep plasma-desktop I
found the process to be in D state. An strace -p on the PID didn't give
anything.

In order to try to get the KDE session back working again, I tried to killed
its only child ksysguard, but this left it in zombie state, and then I did a
killall -u on the username. On this I got:

Oct 27 09:52:49 shambhala kernel: BUG: unable to handle kernel paging request
at 9a7a7a7e
Oct 27 09:52:49 shambhala kernel: IP: [<c10bb811>]
vma_prio_tree_remove+0xd1/0xf0
Oct 27 09:52:49 shambhala kernel: *pde = 00000000 
Oct 27 09:52:49 shambhala kernel: Oops: 0002 [#2] PREEMPT 
Oct 27 09:52:49 shambhala kernel: last sysfs file:
/sys/devices/pci0000:00/0000:00:1e.0/0000:02:02.0/rf_kill
Oct 27 09:52:49 shambhala kernel: Modules linked in: ufs qnx4 hfsplus hfs minix
jfs ext2 tp_smapi hdaps thinkpad_ec
 vboxnetadp vboxnetflt vboxdrv snd_usb_audio snd_hwdep snd_usbmidi_lib sata_sil
acpi_cpufreq mperf cpufreq_userspac
e cpufreq_stats cpufreq_conservative cpufreq_powersave ppdev lp cn ipv6
microcode fuse tun usblp ohci_hcd usb_stora
ge usb_libusual nls_iso8859_15 nls_iso8859_1 nls_cp850 ntfs vfat msdos fat
reiserfs isofs udf crc_itu_t smbfs pktcd
vd uinput radeon ttm drm_kms_helper drm i2c_algo_bit input_polldev dm_crypt
snd_intel8x0m snd_intel8x0 thinkpad_acp
i hwmon snd_ac97_codec ac97_bus snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_pcm
snd_seq_oss snd_seq_midi snd_rawmid
i snd_seq_midi_event snd_seq snd_timer pcmcia ipw2200 libipw snd_seq_device
rtc_cmos yenta_socket cfg80211 rtc_core
 pcmcia_rsrc snd snd_page_alloc video joydev intel_agp output pcmcia_core
rtc_lib led_class parport_pc evdev agpgar
t nvram i2c_i801 rfkill lib80211 parport processor battery ac button dm_mod
raid10 raid456 async_pq async_xor 
Oct 27 09:52:49 shambhala kernel: xor async_memcpy async_raid6_recov raid6_pq
async_tx raid1 raid0 linear md_mod bt
rfs zlib_deflate crc32c libcrc32c usbhid uhci_hcd ehci_hcd sg usbcore sr_mod
thermal cdrom [last unloaded: hdaps]
Oct 27 09:52:49 shambhala kernel:
Oct 27 09:52:49 shambhala kernel: Pid: 13466, comm: akonadi_nepomuk Tainted: G 
    D W   2.6.36-tp42-gtt-vram-apar
t-v2-dirty #1 2373CXG/2373CXG
Oct 27 09:52:49 shambhala kernel: EIP: 0060:[<c10bb811>] EFLAGS: 00210246 CPU:
0
Oct 27 09:52:49 shambhala kernel: EIP is at vma_prio_tree_remove+0xd1/0xf0
Oct 27 09:52:49 shambhala kernel: EAX: c42ce1b8 EBX: 9a7a7a7a ECX: ef61ca78
EDX: c42ce1e0
Oct 27 09:52:49 shambhala kernel: ESI: c42ce1b8 EDI: 00000000 EBP: cb10bd88
ESP: cb10bd78
Oct 27 09:52:49 shambhala kernel: DS: 007b ES: 007b FS: 0000 GS: 00e0 SS: 0068
Oct 27 09:52:49 shambhala kernel: Process akonadi_nepomuk (pid: 13466,
ti=cb10a000 task=c356d780 task.ti=cb10a000)
Oct 27 09:52:49 shambhala kernel: Stack:
Oct 27 09:52:49 shambhala kernel: f4989cd0 00000000 c42ce1b8 b1fbd000 cb10bd94
c10c651b cb10a000 cb10bda0
Oct 27 09:52:49 shambhala kernel: <0> c10c67f3 c42ce160 cb10bdc4 c10c13f3
00000000 b1c00000 00000000 c1560aa4
Oct 27 09:52:49 shambhala kernel: <0> d378bf20 f6bf9380 00000000 cb10bde8
c10c88c6 00000000 cb10bdd8 00000000
Oct 27 09:52:49 shambhala kernel: Call Trace:
Oct 27 09:52:49 shambhala kernel: [<c10c651b>] ?
__remove_shared_vm_struct+0x3b/0x60
Oct 27 09:52:49 shambhala kernel: [<c10c67f3>] ? unlink_file_vma+0x23/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c13f3>] ? free_pgtables+0x53/0xa0
Oct 27 09:52:49 shambhala kernel: [<c10c88c6>] ? exit_mmap+0xd6/0x180
Oct 27 09:52:49 shambhala kernel: [<c10356eb>] ? mmput+0x3b/0xe0
Oct 27 09:52:49 shambhala kernel: [<c103a2bb>] ? exit_mm+0xdb/0xf0
Oct 27 09:52:49 shambhala kernel: [<c103babd>] ? do_exit+0x5bd/0x710
Oct 27 09:52:49 shambhala kernel: [<c1048d64>] ? __dequeue_signal+0x14/0x120
Oct 27 09:52:49 shambhala kernel: [<c1048ea2>] ? dequeue_signal+0x32/0x190
Oct 27 09:52:49 shambhala kernel: [<c103bc43>] ? do_group_exit+0x33/0xa0
Oct 27 09:52:49 shambhala kernel: [<c104b038>] ?
get_signal_to_deliver+0x298/0x4e0
Oct 27 09:52:49 shambhala kernel: [<c108438c>] ?
__delayacct_blkio_end+0x4c/0x50
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10023e0>] ? do_signal+0x70/0xaa0
Oct 27 09:52:49 shambhala kernel: [<c10a7d94>] ? unlock_page+0x44/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c3e76>] ? handle_mm_fault+0x4a6/0x870
Oct 27 09:52:49 shambhala kernel: [<c1023946>] ? do_page_fault+0x1b6/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10e2f3e>] ? vfs_read+0x14e/0x160
Oct 27 09:52:49 shambhala kernel: [<c10e3f70>] ? fget_light+0x90/0xa0
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c1002e65>] ? do_notify_resume+0x55/0x70
Oct 27 09:52:49 shambhala kernel: [<c13ff754>] ? work_notifysig+0x13/0x1b
Oct 27 09:52:49 shambhala kernel: [<c13f0000>] ? packet_sendmsg+0x6f0/0x950
Oct 27 09:52:49 shambhala kernel: Code: eb b3 90 8d b4 26 00 00 00 00 8d 50 28
8b 45 f0 e8 45 5d 1a 00 c7 43 34 00 00 00 00 eb 9a 8d 74 26 00 8b 58 28 8d 50
28 8b 48 2c <89> 4b 04 89 19 89 50 28 89 50 2c eb 80 0f 0b eb fe 8d b4 26 00 
Oct 27 09:52:49 shambhala kernel: EIP: [<c10bb811>]
vma_prio_tree_remove+0xd1/0xf0 SS:ESP 0068:cb10bd78
Oct 27 09:52:49 shambhala kernel: CR2: 000000009a7a7a7e
Oct 27 09:52:49 shambhala kernel: ---[ end trace f13586feeb79aae6 ]---
Oct 27 09:52:49 shambhala kernel: Fixing recursive fault but reboot is needed!
Oct 27 09:52:49 shambhala kernel: BUG: scheduling while atomic:
akonadi_nepomuk/13466/0x00000003
Oct 27 09:52:49 shambhala kernel: Modules linked in: ufs qnx4 hfsplus hfs minix
jfs ext2 tp_smapi hdaps thinkpad_ec vboxnetadp vboxnetflt vboxdrv snd_usb_audio
snd_hwdep snd_usbmidi_lib sata_sil acpi_cpufreq mperf cpufreq_userspace
cpufreq_stats cpufreq_conservative cpufreq_powersave ppdev lp cn ipv6 microcode
fuse tun usblp ohci_hcd usb_storage usb_libusual nls_iso8859_15 nls_iso8859_1
nls_cp850 ntfs vfat msdos fat reiserfs isofs udf crc_itu_t smbfs pktcdvd uinput
radeon ttm drm_kms_helper drm i2c_algo_bit input_polldev dm_crypt snd_intel8x0m
snd_intel8x0 thinkpad_acpi hwmon snd_ac97_codec ac97_bus snd_pcm_oss
snd_mixer_oss snd_seq_dummy snd_pcm snd_seq_oss snd_seq_midi snd_rawmidi
snd_seq_midi_event snd_seq snd_timer pcmcia ipw2200 libipw snd_seq_device
rtc_cmos yenta_socket cfg80211 rtc_core pcmcia_rsrc snd snd_page_alloc video
joydev intel_agp output pcmcia_core rtc_lib led_class parport_pc evdev agpgart
nvram i2c_i801 rfkill lib80211 parport processor battery ac button dm_mod
raid10 raid456 async_pq async_xor 
Oct 27 09:52:49 shambhala kernel: xor async_memcpy async_raid6_recov raid6_pq
async_tx raid1 raid0 linear md_mod btrfs zlib_deflate crc32c libcrc32c usbhid
uhci_hcd ehci_hcd sg usbcore sr_mod thermal cdrom [last unloaded: hdaps]
Oct 27 09:52:49 shambhala kernel: Pid: 13466, comm: akonadi_nepomuk Tainted: G 
    D W   2.6.36-tp42-gtt-vram-apart-v2-dirty #1
Oct 27 09:52:49 shambhala kernel: Call Trace:
Oct 27 09:52:49 shambhala kernel: [<c102ba91>] __schedule_bug+0x61/0x70
Oct 27 09:52:49 shambhala kernel: [<c13fd212>] schedule+0x4d2/0x5d0
Oct 27 09:52:49 shambhala kernel: [<c103dfc4>] ? irq_exit+0x34/0x80
Oct 27 09:52:49 shambhala kernel: [<c103bb10>] do_exit+0x610/0x710
Oct 27 09:52:49 shambhala kernel: [<c1038065>] ? kmsg_dump+0x145/0x150
Oct 27 09:52:49 shambhala kernel: [<c13fcc9c>] ? printk+0x1d/0x21
Oct 27 09:52:49 shambhala kernel: [<c1005f18>] oops_end+0x68/0xa0
Oct 27 09:52:49 shambhala kernel: [<c13fcc9c>] ? printk+0x1d/0x21
Oct 27 09:52:49 shambhala kernel: [<c1023456>] no_context+0xc6/0x160
Oct 27 09:52:49 shambhala kernel: [<c1023587>]
__bad_area_nosemaphore+0x97/0x130
Oct 27 09:52:49 shambhala kernel: [<c1023637>] bad_area_nosemaphore+0x17/0x20
Oct 27 09:52:49 shambhala kernel: [<c1023aee>] do_page_fault+0x35e/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10ad1c2>] ? free_pages_prepare+0x102/0x170
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c13ffa9d>] error_code+0x65/0x6c
Oct 27 09:52:49 shambhala kernel: [<c10bb811>] ? vma_prio_tree_remove+0xd1/0xf0
Oct 27 09:52:49 shambhala kernel: [<c10c651b>]
__remove_shared_vm_struct+0x3b/0x60
Oct 27 09:52:49 shambhala kernel: [<c10c67f3>] unlink_file_vma+0x23/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c13f3>] free_pgtables+0x53/0xa0
Oct 27 09:52:49 shambhala kernel: [<c10c88c6>] exit_mmap+0xd6/0x180
Oct 27 09:52:49 shambhala kernel: [<c10356eb>] mmput+0x3b/0xe0
Oct 27 09:52:49 shambhala kernel: [<c103a2bb>] exit_mm+0xdb/0xf0
Oct 27 09:52:49 shambhala kernel: [<c103babd>] do_exit+0x5bd/0x710
Oct 27 09:52:49 shambhala kernel: [<c1048d64>] ? __dequeue_signal+0x14/0x120
Oct 27 09:52:49 shambhala kernel: [<c1048ea2>] ? dequeue_signal+0x32/0x190
Oct 27 09:52:49 shambhala kernel: [<c103bc43>] do_group_exit+0x33/0xa0
Oct 27 09:52:49 shambhala kernel: [<c104b038>]
get_signal_to_deliver+0x298/0x4e0
Oct 27 09:52:49 shambhala kernel: [<c108438c>] ?
__delayacct_blkio_end+0x4c/0x50
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10023e0>] do_signal+0x70/0xaa0
Oct 27 09:52:49 shambhala kernel: [<c10a7d94>] ? unlock_page+0x44/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c3e76>] ? handle_mm_fault+0x4a6/0x870
Oct 27 09:52:49 shambhala kernel: [<c1023946>] ? do_page_fault+0x1b6/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10e2f3e>] ? vfs_read+0x14e/0x160
Oct 27 09:52:49 shambhala kernel: [<c10e3f70>] ? fget_light+0x90/0xa0
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c1002e65>] do_notify_resume+0x55/0x70
Oct 27 09:52:49 shambhala kernel: [<c13ff754>] work_notifysig+0x13/0x1b
Oct 27 09:52:49 shambhala kernel: [<c13f0000>] ? packet_sendmsg+0x6f0/0x950

On this hint that a reboot was needed I gracefully killed remaining user
processes also from the other KDE session and initiated a reboot. I couldn't
leave the machine in that state longer cause I am currently holding a Linux
Performance Tuning training and need that laptop for work ;).

Sometime prior to reboot I had lots of processes in D state. I'll add a
ps-aux.txt as attachment.

Some things that come to mind:

1) This laptop is using in-kernel-suspend and that quite some hibernation
cycles prior to the bug.

2) I have libcgroup /  cgroup-bin 0.36.2-3 debian packages installed. Currently
with the default configuration except for also activating the memory
controller. 

shambhala:~> grep CGROUP_MEM /boot/config-2.6.36-tp42-gtt-vram-apart-v2-dirty  
CONFIG_CGROUP_MEM_RES_CTLR=y
CONFIG_CGROUP_MEM_RES_CTLR_SWAP=y

For demonstrating purposes I did a mkdir in the cgroup filesystem and added the
current shell as process to it to show the memory usage the memory controller
accounted to it. I did not set any limit.

I'll add the complete syslog and an archive with quite some stuff from /proc
that might turn out to be useful in the course of the day in training breaks.
syslog already has the reboot after the bug happened. The traces are before.
There is also from eSATA PCMCIA related trace before. I am using a eSATA PCMCIA
controller for my external harddisks.

-- 
Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


More information about the Bugme-new mailing list