We have Win7 ( actually WES7) running on a PC with Asus motherboard, together with our PCIe board.
When we shut down the system, about once in 500 times, we get a HW unrecoverable error BSOD, probably caused by improper handling of PCIe communication by our board.
The dump + analysis below ends up in a L2 cache error - each time on a random CPU core.
We already more or less gave up finding the root cause of the crash.
However, when this BSOD occurs, the PC will NOT restart. It remains in the BSOD, even when we check "Automatic restart" in the System failure control panel.
Are there any advanced settings that will force a reboot after BSOD, even when caused by an unrecoverable HW error ?
Best regards,
- Bernard Willaert
Barco - Healthcare Division
Belgium
CACHE PROBLEM: GCACHEL2_ERR_ERR
Microsoft (R) Windows Debugger Version 6.12.0002.633 AMD64
Copyright (c) Microsoft Corporation. All rights reserved.
Loading Dump File [F:\Compositor\101415-6224-01.dmp]
Mini Kernel Dump File: Only registers and stack trace are available
Symbol search path is: SRV*d:\localsymbols*http://msdl.microsoft.com/download/symbols;D:\Personal\Projects\Compositor\software\drivers\DriverInstall\AVStream;D:\Personal\Projects\Compositor\software\drivers\DriverInstall\Bus;D:\Personal\Projects\Compositor\software\drivers\DriverInstall\Network
Executable search path is:
Windows 7 Kernel Version 7601 (Service Pack 1) MP (8 procs) Free x64
Product: WinNt, suite: TerminalServer EmbeddedNT SingleUserTS
Built by: 7601.18933.amd64fre.win7sp1_gdr.150715-0600
Machine Name:
Kernel base = 0xfffff80002e01000 PsLoadedModuleList = 0xfffff80003048730
Debug session time: Tue Oct 13 18:23:37.252 2015 (UTC + 2:00)
System Uptime: 0 days 0:01:22.064
Loading Kernel Symbols
...............................................................
................................................................
....................
Loading User Symbols
Loading unloaded module list
...
*******************************************************************************
*
Bugcheck Analysis *
*
*******************************************************************************
Use !analyze -v to get detailed debugging information.
BugCheck 124,
{0, fffffa800ea51028, be200000, c110a}
Probably caused by : hardware
Followup: MachineOwner
6: kd> !analyze -v
*******************************************************************************
*
Bugcheck Analysis *
*
*******************************************************************************
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: fffffa800ea51028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000be200000, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000000c110a, Low order 32-bits of the MCi_STATUS value.
Debugging Details:
BUGCHECK_STR: 0x124_GenuineIntel
CUSTOMER_CRASH_COUNT: 1
DEFAULT_BUCKET_ID: VISTA_DRIVER_FAULT
PROCESS_NAME: System
CURRENT_IRQL: f
STACK_TEXT:
fffff88003103b58 0000000000000000 : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : nt!KeBugCheckEx
STACK_COMMAND: kb
FOLLOWUP_NAME: MachineOwner
MODULE_NAME: hardware
IMAGE_NAME: hardware
DEBUG_FLR_IMAGE_TIMESTAMP: 0
FAILURE_BUCKET_ID: X64_0x124_GenuineIntel_PROCESSOR_CACHE
BUCKET_ID: X64_0x124_GenuineIntel_PROCESSOR_CACHE
Followup: MachineOwner
6: kd> !errrec fffffa800ea51028
Common Platform Error Record @ fffffa800ea51028
Record Id : 01d105d35366bf82
Severity : Fatal (1)
Length : 928
Creator : Microsoft
Notify Type : Machine Check Exception
Timestamp : 10/13/2015 16:23:37
Flags : 0x00000000
===============================================================================
Section 0 : Processor Generic
Descriptor @ fffffa800ea510a8
Section @ fffffa800ea51180
Offset : 344
Length : 192
Flags : 0x00000001 Primary
Severity : Fatal
Proc. Type : x86/x64
Instr. Set : x64
Error Type : Cache error
Operation : Generic
Flags : 0x00
Level : 2
CPU Version : 0x00000000000306e4
Processor ID : 0x0000000000000006
===============================================================================
Section 1 : x86/x64 Processor Specific
Descriptor @ fffffa800ea510f0
Section @ fffffa800ea51240
Offset : 536
Length : 128
Flags : 0x00000000
Severity : Fatal
Local APIC Id : 0x0000000000000006
CPU Id : e4 06 03 00 00 08 20 06 - bf e3 be 7f ff fb eb bf
00 00 00 00 00 00 00 00 - 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 - 00 00 00 00 00 00 00 00
Proc. Info 0 @ fffffa800ea51240
===============================================================================
Section 2 : x86/x64 MCA
Descriptor @ fffffa800ea51138
Section @ fffffa800ea512c0
Offset : 664
Length : 264
Flags : 0x00000000
Severity : Fatal
Error : GCACHEL2_ERR_ERR (Proc 6 Bank 19)
Status : 0xbe200000000c110a
Address : 0x00000000e0100000
Misc. : 0x9cfc781600802086
6: kd> !errrec fffffa800ea510a8
Common Platform Error Record @ fffffa800ea510a8
Signature : *** INVALID ***
Revision : 0.192
Record Id : 1d6e2b24fa535bb9
Severity : Fatal (1)
Length : 1272661940
Creator :
{00000000-0000-0000-1802-000080000000}
Notify Type :
{00000201-0000-0000-b0a0-3edc44a19747}
Platform Id :
{00000000-0000-0000-0000-000000000000}
Platform Id : {00000001-0000-0000-0000-000000000000}
Flags : 0x00000000
6: kd> !errrec fffffa800ea510f0
Common Platform Error Record @ fffffa800ea510f0
Signature : *** INVALID ***
Revision : 0.128
Record Id : e8f7c35c5e56339c
Severity : Recoverable (0)
Length : 1201119556
Creator : {00000000-0000-0000-9802-000008010000}
Notify Type : {00000201-0000-0000-011d-1e8af9425745}
Flags : 0x00000000
6: kd> !errrec fffffa800ea51138
Common Platform Error Record @ fffffa800ea51138
Signature : *** INVALID ***
Revision : 1.8
Record Id : 0000000000000000
Severity : Recoverable (0)
Length : 1163346681
Creator : {00000000-0000-0000-7f01-000000000000}
Notify Type : {00010200-0200-0000-e406-030000000000}
Platform Id : {00000000-0000-0000-0000-000000000000}
Flags : 0x00000000
6: kd> !errrec fffffa800ea51180
Common Platform Error Record @ fffffa800ea51180
Signature : *** INVALID ***
Revision : 0.0
Record Id : 0000000000000000
Severity : Invalid (512)
Length : 0
Creator :
{00000000-0000-0000-0000-000000000000}
Notify Type : {00000000-0000-0000-0000-000000000000}
Platform Id :
{00000000-0000-0000-0000-000000000000}
Flags : 0x00000000
===============================================================================
Section 0 :
{00000000-0000-0000-0600-000000000000}