[Crash-utility] bt command does not show stack traces of some CPUs.
Takao Indoh
indou.takao at jp.fujitsu.com
Mon Nov 21 10:54:53 UTC 2005
Hi, I found a problem in crash-4.0-2.12.
Summary:
The bt command does not show stack traces for some CPUs.
Condition:
This problem happens only on ia64 machines.
Both of the following conditions must hold to reproduce it.
1) Diskdump is executed via OS_INIT.
2) The machine has more than 8 CPUs.
Details:
When I ran the bt command on a vmcore that was created
on a 32-CPU machine, bt did not show stack traces for some CPUs.
Please see the attached file (bt_failed.txt). Stack traces for CPU0 to
CPU7 are shown normally, but stack traces for CPU8 to CPU31 are not.
(Please don't worry about the message "unwind: bsp (xxxxxxxxx) out of
range". This is a problem with our platform.)
Cause:
I found a bug in ia64.c, at lines 2679-2680:
    ms->ia64_init_stack_size = get_array_length("ia64_init_stack",
            NULL, 0);
get_array_length() is used here to get the length of the OS_INIT stack,
and the result is stored in ms->ia64_init_stack_size. However, the value
that get_array_length() returns is the number of array elements, not the
stack length in bytes, because "ia64_init_stack" is declared like this:
    u64 ia64_init_stack[NR_CPUS*KERNEL_STACK_SIZE/8];
Therefore, the correct length of the stack in bytes is:
    get_array_length("ia64_init_stack", NULL, 0) * sizeof(u64)
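To make the factor-of-8 difference concrete, here is a small standalone sketch in C. It is not crash code. The values of NR_CPUS and KERNEL_STACK_SIZE and the helper names (init_stack_elements, init_stack_bytes, cpus_covered) are assumptions made up for illustration; only the array declaration follows the kernel declaration quoted above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed values for illustration; the real ones depend on the kernel config. */
#define NR_CPUS            64
#define KERNEL_STACK_SIZE  32768   /* bytes per CPU */

typedef uint64_t u64;

/* Same shape as the kernel declaration quoted above. */
static u64 ia64_init_stack[NR_CPUS * KERNEL_STACK_SIZE / 8];

/* Element count: what an array-length query reports for this symbol. */
static size_t init_stack_elements(void)
{
    return sizeof(ia64_init_stack) / sizeof(ia64_init_stack[0]);
}

/* Size in bytes: the element count scaled by the element size. */
static size_t init_stack_bytes(void)
{
    return init_stack_elements() * sizeof(u64);
}

/* How many per-CPU stacks fit into a region of region_bytes bytes. */
static size_t cpus_covered(size_t region_bytes, size_t stack_bytes_per_cpu)
{
    assert(stack_bytes_per_cpu > 0);
    return region_bytes / stack_bytes_per_cpu;
}
```

With these assumed values, treating the element count as a byte length covers only 8 of the 64 per-CPU stacks, while the byte size covers all 64. The ratio is sizeof(u64) = 8 whatever the real configuration values are. Whether NR_CPUS is actually 64 on the affected kernel is not something I have confirmed.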
I don't know the best way to fix this, but the attached patch
(ia64.c.patch) seems to correct the problem.
Another attached patch (test.patch) also seems to fix the problem,
but I don't know which is better.
Regards,
Takao Indoh
-------------- next part --------------
KERNEL: /usr/lib/debug/lib/modules/2.6.9-22.EL.guest/vmlinux
DUMPFILE: ./vmcore
CPUS: 32
DATE: Sat Nov 12 12:56:07 2005
UPTIME: 00:04:25
LOAD AVERAGE: 1.84, 1.51, 0.66
TASKS: 294
NODENAME: build.fujitsu.com
RELEASE: 2.6.9-22.EL.guest
VERSION: #1 SMP Tue Nov 8 17:16:52 JST 2005
MACHINE: ia64 (1600 Mhz)
MEMORY: 1023.5 GB
PANIC: ""
PID: 0
COMMAND: "swapper"
TASK: e00001c004080000 (1 of 32) [THREAD_INFO: e00001c004080dc0]
CPU: 5
STATE: TASK_RUNNING (INIT)
crash> bt -a
PID: 0 TASK: a0000001006c4000 CPU: 0 COMMAND: "swapper"
#0 [BSP:a0000001006c5018] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047c7530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120010 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e0000140209c0330 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: a0000001006cbe20 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: a0000001006cbd80 R13: a0000001006c4000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126010
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 000000000013bc00 R25: 000000007faabd00
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffe9b74ed80000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000be7bcccccc837a100 F11: 1003e0000000000000000
#1 [BSP:a0000001006c4fc0] v at e00000007fd20300
#2 [BSP:a0000001006c4f70] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xa0000001006c4b80) out of range [0xa0000001006c4dc0-0xa0000001006c5018]
PID: 0 TASK: e000024004100000 CPU: 1 COMMAND: "swapper"
#0 [BSP:e000024004100f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047cf530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120010 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e000020020270018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a000000100015c40 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e000024004107e30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e000024004107d90 R13: e000024004100000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126010
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 00000000003dbc00 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffe6cfd6e00000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 10009835ffffffd66ea00 F11: 1003e0000000000000000
#1 [BSP:e000024004100ee0] v at e00000007fd20300
#2 [BSP:e000024004100e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe000024004100aa0) out of range [0xe000024004100dc0-0xe000024004100f30]
PID: 0 TASK: e000014004070000 CPU: 2 COMMAND: "swapper"
#0 [BSP:e000014004070f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047d7530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120010 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e000020020618018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e000014004077e30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e000014004077d90 R13: e000014004070000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126010
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 00000000003f7c00 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffe9c03e760000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000bf308ccccc7fe7040 F11: 1003e0000000000000000
#1 [BSP:e000014004070ee0] v at e00000007fd20300
#2 [BSP:e000014004070e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe000014004070aa0) out of range [0xe000014004070dc0-0xe000014004070f30]
PID: 0 TASK: e000024004108000 CPU: 3 COMMAND: "swapper"
#0 [BSP:e000024004108f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047df530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120030 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e000024020638018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e00002400410fe30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e00002400410fd90 R13: e000024004108000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126030
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 00000000003fe400 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffeac196b60000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000cf4bbfffffb290840 F11: 1003e0000000000000000
#1 [BSP:e000024004108ee0] v at e00000007fd20300
#2 [BSP:e000024004108e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe000024004108aa0) out of range [0xe000024004108dc0-0xe000024004108f30]
PID: 0 TASK: e00000017c8b0000 CPU: 4 COMMAND: "swapper"
#0 [BSP:e00000017c8b0f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047e7530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120030 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e000028020608018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e00000017c8b7e30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e00000017c8b7d90 R13: e00000017c8b0000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126030
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 00000000003bfc00 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffe9aafb020000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000bd82733332eececc0 F11: 1003e0000000000000000
#1 [BSP:e00000017c8b0ee0] v at e00000007fd20300
#2 [BSP:e00000017c8b0e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe00000017c8b0aa0) out of range [0xe00000017c8b0dc0-0xe00000017c8b0f30]
PID: 0 TASK: e00001c004080000 CPU: 5 COMMAND: "swapper"
#0 [BSP:e00001c004081120] start_disk_dump at a000000200699ca0
#1 [BSP:e00001c004081100] try_crashdump at a0000001000ab710
#2 [BSP:e00001c0040810d0] start_dump_osinit at a00000010004cae0
#3 [BSP:e00001c0040810a8] unw_init_running at a00000010000faf0
#4 [BSP:e00001c004080f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047ef530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120010 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e00000017c9a0018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e00001c004087e30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e00001c004087d90 R13: e00001c004080000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126010
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 00000000003c2c00 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffe9c5f0760000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000bfa3bfffffb0d3040 F11: 1003e0000000000000000
#5 [BSP:e00001c004080ee0] v at e00000007fd20300
#6 [BSP:e00001c004080e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe00001c004080aa0) out of range [0xe00001c004080dc0-0xe00001c004080f30]
PID: 0 TASK: e00002c004158000 CPU: 6 COMMAND: "swapper"
#0 [BSP:e00002c004158f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047f7530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120030 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e000014020628018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e00002c00415fe30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e00002c00415fd90 R13: e00002c004158000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126030
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 000000000029bc00 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffe9b8399e0000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000be8e5999994fe8f40 F11: 1003e0000000000000000
#1 [BSP:e00002c004158ee0] v at e00000007fd20300
#2 [BSP:e00002c004158e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe00002c004158aa0) out of range [0xe00002c004158dc0-0xe00002c004158f30]
PID: 0 TASK: e000018004088000 CPU: 7 COMMAND: "swapper"
#0 [BSP:e000018004088f30] ia64_init_handler at a00000010004f4c0
EFRAME: e0000000047ff530
B0: a000000100015f60 CR_IIP: e00000007fd20300
CR_IPSR: 0000101008120010 CR_IFS: 000000000000050a
AR_PFS: 000000000000050f AR_RSC: 0000000000000000
AR_UNAT: 0000000000000000 AR_RNAT: 0000000000000000
AR_CCV: e000018020618018 AR_FPSR: 0009804c8a70433f
LOADRS: 0000000000000000 AR_BSPSTORE: 0000000000000000
B6: a0000001000e54e0 B7: e00000007fd08010
PR: 80000000ff6d1125 R1: a00000010099d150
R2: e00001800408fe30 R3: 000000000000050f
R8: 0000000000000000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000
R12: e00001800408fd90 R13: e000018004088000
R14: e00000007fd20300 R15: 800000000000050a
R16: a000000100015f80 R17: 0000101008126010
R18: 0000000000000004 R19: 80000000ff6d1125
R20: 88c0001c00000000 R21: 0000000000000007
R22: 0000000000000000 R23: 0000000000000609
R24: 00000000003ffc00 R25: 80000000ffed93d0
R26: 0000000000000000 R27: 0000000000000000
R28: 0000000000000000 R29: 0000000000000000
R30: 0000000000000000 R31: 0000000000000000
F6: 0fffbccccccccc8c00000 F7: 0ffead19c9a0000000000
F8: 000000000000000000000 F9: 100068000000000000000
F10: 1000d847eccccca2e0ae0 F11: 1003e0000000000000000
#1 [BSP:e000018004088ee0] v at e00000007fd20300
#2 [BSP:e000018004088e90] ia64_pal_call_static at a000000100015f60
bt: unwind: bsp (0xe000018004088aa0) out of range [0xe000018004088dc0-0xe000018004088f30]
PID: 0 TASK: e000024004118000 CPU: 8 COMMAND: "swapper"
PID: 0 TASK: e000014020528000 CPU: 9 COMMAND: "swapper"
PID: 0 TASK: e000020004108000 CPU: 10 COMMAND: "swapper"
PID: 0 TASK: e00002c004108000 CPU: 11 COMMAND: "swapper"
PID: 0 TASK: e00001c0202d8000 CPU: 12 COMMAND: "swapper"
PID: 0 TASK: e000028004108000 CPU: 13 COMMAND: "swapper"
PID: 0 TASK: e000014020538000 CPU: 14 COMMAND: "swapper"
PID: 0 TASK: e000024004130000 CPU: 15 COMMAND: "swapper"
PID: 0 TASK: e00000017c8d0000 CPU: 16 COMMAND: "swapper"
PID: 0 TASK: e00001c020548000 CPU: 17 COMMAND: "swapper"
PID: 0 TASK: e00002c004120000 CPU: 18 COMMAND: "swapper"
PID: 0 TASK: e00001c020550000 CPU: 19 COMMAND: "swapper"
PID: 0 TASK: e000028004118000 CPU: 20 COMMAND: "swapper"
PID: 0 TASK: e00001c020558000 CPU: 21 COMMAND: "swapper"
PID: 0 TASK: e000028004120000 CPU: 22 COMMAND: "swapper"
PID: 0 TASK: e000014020558000 CPU: 23 COMMAND: "swapper"
PID: 0 TASK: e000024020570000 CPU: 24 COMMAND: "swapper"
PID: 0 TASK: e00000017c8f0000 CPU: 25 COMMAND: "swapper"
PID: 0 TASK: e00001c020568000 CPU: 26 COMMAND: "swapper"
PID: 0 TASK: e00002c0040c0000 CPU: 27 COMMAND: "swapper"
PID: 0 TASK: e000018020560000 CPU: 28 COMMAND: "swapper"
PID: 0 TASK: e000024020580000 CPU: 29 COMMAND: "swapper"
PID: 0 TASK: e000014020570000 CPU: 30 COMMAND: "swapper"
PID: 0 TASK: e000020020560000 CPU: 31 COMMAND: "swapper"
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ia64.c.patch
Type: application/octet-stream
Size: 1888 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/crash-utility/attachments/20051121/3dab9773/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.patch
Type: application/octet-stream
Size: 709 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/crash-utility/attachments/20051121/3dab9773/attachment-0001.obj>