PANIC: "Oops: 0000 [#1]" (check log for details) - updated kernel test

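I am trying to interpret a system crash that is reproducible but does not happen often. I believe it is related to a third-party driver and an optical scanning device. I am not sure whether the scanner's driver is causing the hang, whether it triggers something in the USB driver that causes the hang, or whether it is something else entirely.

I am not a driver developer, so I am feeling my way through the crash dump while reading the Linux Kernel Crash Book.

I was able to use the kexec tools to set up a debug kernel that generates a vmcore. At the time of the crash the panic reads "Oops: 0000 [#1]" (check log for details), and the log shows the error: unable to handle kernel paging request at virtual address 00100

Can you help me interpret more of the dump, so that I can go back to the driver's developers with the findings? Or can you tell me that it is not the driver at all?

Here is more of the dump. Thanks very much.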

KERNEL: /usr/lib/debug/lib/modules/2.6.18-194.el5debug/vmlinux
DUMPFILE: /root/kernel/vmcore_12032014
CPUS: 4
DATE: Sat Oct 20 16:51:55 2001
UPTIME: 00:03:55
LOAD AVERAGE: 0.79, 0.83, 0.38
TASKS: 128
NODENAME: 3232A
RELEASE: 2.6.18-194.el5debug
VERSION: #1 SMP Fri Apr 2 16:56:41 EDT 2010
MACHINE: i686  (1800 Mhz)
MEMORY: 2 GB
PANIC: "Oops: 0000 [#1]" (check log for details)
PID: 0
COMMAND: "swapper"
TASK: c069d460  (1 of 4)  [THREAD_INFO: c075f000]
CPU: 0
STATE: TASK_RUNNING (PANIC)
crash> bt

PID: 0      TASK: c069d460  CPU: 0   COMMAND: "swapper"
#0 [c080cdb0] crash_kexec at c044a386
#1 [c080cdf4] die at c04065c3
#2 [c080ce24] do_page_fault at c062d1b7
#3 [c080ce5c] error_code (via page_fault) at c0405bc3
    EAX: f7c7de2c  EBX: 000ffaec  ECX: 00000000  EDX: f4282e8c  EBP: f7c665a0
    DS:  007b      ESI: f7c7ddf8  ES:  007b      EDI: c080cea4
    CS:  0060      EIP: c05a2ef2  ERR: ffffffff  EFLAGS: 00010012
#4 [c080ce90] hiddev_send_event at c05a2ef2
#5 [c080cea0] hiddev_report_event at c05a2fcc
#6 [c080cec8] hid_input_report at c05a0fc3
#7 [c080cf1c] hid_irq_in at c05a261d
#8 [c080cf2c] usb_hcd_giveback_urb at c05964b6
#9 [c080cf3c] uhci_giveback_urb at f883f618 [uhci_hcd]
#10 [c080cf58] uhci_scan_schedule at f883fc14 [uhci_hcd]
#11 [c080cfa4] uhci_irq at f884163d [uhci_hcd]
#12 [c080cfc4] usb_hcd_irq at c0596f9b
#13 [c080cfcc] handle_IRQ_event at c0454d5f
#14 [c080cfe4] __do_IRQ at c0454e2d
--- <hard IRQ> ---
#0 [c075ff80] do_IRQ at c040761f
#1 [c075ff98] common_interrupt at c0405a28
    EAX: 00000000  EBX: c075f008  ECX: 00000000  EDX: 00000000  EBP: 00000020
    DS:  007b      ESI: c06440f8  ES:  007b      EDI: ca138790
    CS:  0060      EIP: c0403d0a  ERR: ffffff4e  EFLAGS: 00000246
#2 [c075ffcc] mwait_idle at c0403d0a
#3 [c075ffd4] cpu_idle at c0403cc4
PID: 126    TASK: ca2fc5b0  CPU: 3   COMMAND: "khubd"
 #0 [f7fb2dfc] crash_kexec at c044da51
 #1 [f7fb2e40] die at c0406649
 #2 [f7fb2e70] do_page_fault at c0639449
 #3 [f7fb2eb0] device_del at c057053e
 #4 [f7fb2efc] device_del at c0570552
 #5 [f7fb2f10] device_unregister at c0570669
 #6 [f7fb2f18] usb_remove_ep_files at c05a3dac
 #7 [f7fb2f38] usb_remove_sysfs_dev_files at c05a39d6
 #8 [f7fb2f44] usb_disconnect at c059db59
 #9 [f7fb2f58] hub_thread at c059e663
#10 [f7fb2fcc] kthread at c0439ef6
#11 [f7fb2fe4] kernel_thread_helper at c0405e0d

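Oops: 0000 means the page was not found: the faulting access was a read of a non-present page, made in kernel mode while an interrupt was being handled. There are several functions on the stack, but the crash happened somewhere in hiddev_report_event(), possibly while walking a linked list that contains a bogus member. That is my impression from reading the post.

Thank you, Peter. As you may have seen, we have a tainted kernel with a third-party driver. The crash happens just as we try to read a scan, so do you think this is a problem in the third-party driver, or are we exposing something in the kernel USB driver? I know these are vague questions. If you can give me a little direction for digging deeper, I would really appreciate it.

Hi Antonio. I wouldn't pay too much attention to the "tainted" driver. Among other things, it means the driver may not be open-source software built into the stock kernel. Many, if not most, tainting drivers work fine, and a few other things can taint the kernel as well. My guess is that this bug is caused by poor management of a memory buffer.

Hello Peter L., if you are still there: could you look at the information I edited into my original question? I have updated to the latest released CentOS 5.5 kernel (2.6.18-371.6.1) but still get a kernel panic. This time, however, the panic occurred in the command "khubd" rather than "swapper". I was able to capture the dump and have provided what I hope is the relevant information for pinpointing the problem. Is it a bug in the kernel or something in the driver? Thanks, Antonio

khubd is the USB hub daemon. It is probably not the source of the crash; a buggy USB driver is more likely. By the way, "swapper" is a process that is always running, so it frequently shows up as the current task in crash dumps, but it is almost never at fault. It looks like the device may have been disconnected at the time of the crash. There is not enough information.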
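
The number printed after "Oops:" is the x86 page-fault error code, and each of its low bits says something about the faulting access. The following is a minimal sketch of how that number can be read; the function name and the table are illustrative only and are not part of the kernel or of the crash utility.

```python
# Meaning of each bit in the x86 page-fault error code (the value after "Oops:").
# Each entry is (bit mask, meaning if set, meaning if clear or None).
PF_BITS = [
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set in page table", None),
    (0x10, "instruction fetch", None),
]


def decode_oops_code(code):
    """Return a list of human-readable facts encoded in the error code."""
    facts = []
    for mask, if_set, if_clear in PF_BITS:
        if code & mask:
            facts.append(if_set)
        elif if_clear is not None:
            facts.append(if_clear)
    return facts


# Both oopses in this question report error code 0000.
print(decode_oops_code(0x0000))
# ['page not present', 'read access', 'kernel mode']
```

For 0000 this gives a read, in kernel mode, of an address with no page mapped, which matches the "unable to handle kernel paging request" message in the log.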
crash> log (truncated)

Scan: no error - In command_callback
Scan: In command_callback
Scan:    First buffer received = 0x2a
Scan:    Last packet of command transfer = 42  Command size 42
Scan:    0x2 0x44 0x20 0x54 0x65 0x73 0x74 0x20 0x4d 0x65 0x73 0x73
Scan: Read data present
Scan: IOCTL_PS3_PEAK_MSG
Scan: In scanner_read    Scan: Copied 1 chunks of data to user space
Scan: IOCTL_PS3_SCANNER_COMMAND  Busy slots = 0x0        Total_slots = 0x8       Busy     image slots = 0x0
Scan:    0x2 0x38 0x3 0x4
Scan: In write_callback  status = 0x0
Scan: In image_callback
Scan:    Urb status = 0xffffffb9 Urb length = 0x0
Scan: In image_callback_fun - lost connection to scanner.
Scan: urb set to re-submit
Scan: In command_callback
Scan:    Urb status = 0xffffffb9 Urb length =  0x0
Scan: In command_callback_fun - lost connection to scanner.
Scan: urb set to re-submit
usb 1-6: USB disconnect, address 11
BUG: unable to handle kernel paging request at virtual address 6b6b6b93
 printing eip:
c04bd310
*pde = 00000000
Oops: 0000 [#1]
SMP
last sysfs file: /devices/pci0000:00/0000:00:1d.7/usb1/1-6/bConfigurationValue
Modules linked in: vfat fat pdi_ps3_drv_scanner(U) xt_limit xt_tcpudp xt_state         ip_conntrack nfnetlink iptable_filter ip_tables x_tables loop dm_mirror dm_multipath     scsi_dh video backlight sbs power_meter hwmon i2c_ec dell_wmi wmi button battery asus_acpi     ac lp joydev snd_hda_intel snd_seq_dummy sg snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer parport_pc snd_page_alloc snd_hwdep snd parport tpm_tis tpm soundcore r8169 tpm_bios i2c_i801 serio_raw pcspkr pl2303 usbserial mii i2c_core dm_raid45 dm_message dm_region_hash dm_log dm_mod dm_mem_cache usb_storage ata_piix libata sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
CPU:    3
EIP:    0060:[<c04bd310>]    Tainted: G     --------------------    VLI
EFLAGS: 00010202   (2.6.18-371.6.1.el5debug #1)
EIP is at sysfs_hash_and_remove+0x18/0x103
eax: 6b6b6b6b   ebx: f4be5bf4   ecx: c0570543   edx: 6b6b6b6b
esi: f4be5b0c   edi: f3cd936c   ebp: f3cd93c4   esp: f7fb2ee0
ds: 007b   es: 007b   ss: 0068
Process khubd (pid: 126, ti=f7fb2000 task=ca2fc5b0 task.ti=f7fb2000)
Stack: f4be5c50 6b6b6b6b f764d16c f4be5bf4 f4be5b0c f3cd936c f3cd93c4 c0570557
       f4be5b0c f3cd93a0 f3cd936c f7c8da2c c057066e f7fb2f28 c05a3db1 f7fb2f28
       c06827a6 00000000 305f7065 c06d0030 f3cd936c f3cd93c4 c05a39db f3cd96cc
Call Trace:
 [<c0570557>] device_del+0x5c/0x16b
 [<c057066e>] device_unregister+0x8/0x10
 [<c05a3db1>] usb_remove_ep_files+0x53/0x75
 [<c05a39db>] usb_remove_sysfs_dev_files+0xf/0x6c
 [<c059db5e>] usb_disconnect+0xa7/0xe0
 [<c059e668>] hub_thread+0x327/0x987
 [<c0439ed9>] kthread+0xa1/0xec
 [<c0439fab>] autoremove_wake_function+0x0/0x2d
 [<c059e341>] hub_thread+0x0/0x987
 [<c0439ef8>] kthread+0xc0/0xec
 [<c0439e38>] kthread+0x0/0xec
 [<c0405e0f>] kernel_thread_helper+0x7/0x10
 =======================
Code: 8b 40 24 8b 40 40 c3 8b 40 14 8b 00 c3 8b 40 14 8b 00 c3 55 57 56 53 83 ec 0c 85 c0 89 44 24 04 89 14 24 0f 84 e5 00 00 00 89 c2 <8b> 40 28 85 c0 0f 84 d8 00 00 00 8b 52 70 05 90 00 00 00 89 54
EIP: [<c04bd310>] sysfs_hash_and_remove+0x18/0x103 SS:ESP 0068:f7fb2ee0
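
Two values in the log above are easier to read after a little arithmetic. The sketch below is only an aid for interpreting the numbers; the helper names are made up for illustration. It assumes that the el5debug kernel has slab debugging enabled, in which case freed slab objects are filled with the byte 0x6b (POISON_FREE), and that the scanner driver prints the URB status as an unsigned 32-bit value.

```python
SLAB_POISON_FREE = 0x6B  # byte written over freed slab objects when slab debugging is on


def to_signed32(value):
    """Reinterpret an unsigned 32-bit value as a signed integer."""
    return value - (1 << 32) if value & 0x80000000 else value


def poison_offset(fault_addr, poison_byte=SLAB_POISON_FREE):
    """Return how far fault_addr lies from a 32-bit word filled with poison_byte."""
    poisoned_word = int.from_bytes(bytes([poison_byte] * 4), "big")
    return fault_addr - poisoned_word


fault_addr = 0x6B6B6B93   # "virtual address 6b6b6b93" from the BUG line
urb_status = 0xFFFFFFB9   # "Urb status" printed by the scanner driver

print(hex(poison_offset(fault_addr)))   # 0x28
print(to_signed32(urb_status))          # -71
```

The offset 0x28 matches the faulting instruction marked in the Code: line, `<8b> 40 28`, which is `mov 0x28(%eax),%eax`, with eax holding 6b6b6b6b. In other words, sysfs_hash_and_remove dereferenced a pointer that had been read out of already-freed memory, which is consistent with the guess about buffer mismanagement and with the device being disconnected, although it does not show which code freed the object. The URB status -71 is -EPROTO on Linux, an error the USB core commonly reports when a device stops responding, for example when it is unplugged.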