BSOD(WHEA_UNCORRECTABLE_ERROR (124)) when reboot

Hi

I get BSOD when reboot, please give me a clue, thank you

Windows 8 Kernel Version 9200 MP (96 procs) Free x64
Product: Server, suite: TerminalServer DataCenter SingleUserTS
Built by: 9200.17166.amd64fre.win8_gdr.141031-1551

*******************************************************************************
* *
* Bugcheck Analysis *
* *
*******************************************************************************

WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000005, Generic Error
Arg2: fffffab002573028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 0000000000000000
Arg4: 0000000000000000

Debugging Details:

DUMP_FILE_ATTRIBUTES: 0xc
Insufficient Dumpfile Size
Kernel Generated Triage Dump

BUGCHECK_STR: 0x124_GenuineIntel

CUSTOMER_CRASH_COUNT: 1

DEFAULT_BUCKET_ID: WIN8_DRIVER_FAULT_SERVER

PROCESS_NAME: System

CURRENT_IRQL: f

ANALYSIS_VERSION: 6.3.9600.17298 (debuggers(dbg).141024-1500) amd64fre

STACK_TEXT:
fffff880042b2d48 0000000000000000 : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : nt!KeBugCheckEx

STACK_COMMAND: kb

FOLLOWUP_NAME: MachineOwner

MODULE_NAME: GenuineIntel

IMAGE_NAME: GenuineIntel

DEBUG_FLR_IMAGE_TIMESTAMP: 0

IMAGE_VERSION:

FAILURE_BUCKET_ID: 0x124_GenuineIntel_PCIEXPRESS

BUCKET_ID: 0x124_GenuineIntel_PCIEXPRESS

ANALYSIS_SOURCE: KM

FAILURE_ID_HASH_STRING: km:0x124_genuineintel_pciexpress

FAILURE_ID_HASH: {0aebd40d-d4c8-456d-b299-4fede7e8bf0c}

Followup: MachineOwner

  • EventData

ErrorSource 8
FRUId {00000000-0000-0000-0000-000000000000}
FRUText
ValidBits 0xcf
PortType 4
Version 0x110
Command 0x4010
Status 0x540
Bus 0x0
Device 0x0
Function 0x0
Segment 0x0
SecondaryBus 0x0
Slot 0x0
VendorID 0x****
DeviceID 0x****
ClassCode 0x180
DeviceSerialNumber 0x0
BridgeControl 0x0
BridgeStatus 0x0
UncorrectableErrorStatus 0x0
CorrectableErrorStatus 0x0
HeaderLog 00000000000000000000000000000000
Length 408
RawData 435045521002FFFFFFFF01000100000002000000980100000422080014050F140000000000000000000000000000000000000000000000000000000000000000BDC407CF89B7184EB3C41F732CB5713167A4623E40AB9A40A698F362D464B38F04A5D8CA9092D001000000004552000000000001000000000000000000000000C8000000D0000000010200000100000054E995D9C1BB0F43AD91B44DCB3C6F3500000000000000000000000000000000010000000000000000000000000000000000000000000000CF00000000000000040000001001000010404005000000005F1C3005800100000000000000000000000000000000000000000000010510110000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000

Did some board go to D3 while there was unfinished DMA transaction? Do you quiesce DMA when you put a board to D3?

I don’t have any details about what’s going wrong, but I can tell you that " PortType 4" is PCIe Root Port.

Having the error statuses both be zero is interesting.

Peter
OSR
@OSRDrivers

thanks for reply

could you tell me what the following mean?

===============================================================================
Common Platform Error Record @ fffffab00258b028

Record Id : 01d0941584fc45f5
Severity : Fatal (1)
Length : 408
Creator : Microsoft
Notify Type : Generic
Timestamp : 5/21/2015 22:30:12 (UTC)
Flags : 0x00000000

===============================================================================
Section 0 : PCI Express

Descriptor @ fffffab00258b0a8
Section @ fffffab00258b0f0
Offset : 200
Length : 208
Flags : 0x00000001 Primary
Severity : Fatal

Unable to load image pci.sys, Win32 error 0n2
*** WARNING: Unable to verify timestamp for pci.sys
Port Type : Root Port
Version : 1.16
Command/Status: 0x4010/0x0540
Device Id :
VenId:DevId : ****:****
Class code : 000180
Function No : 0x00
Device No : 0x00
Segment : 0x0000
Primary Bus : 0x00
Second. Bus : 0x00
Slot : 0x0000
Express Capability Information @ fffffab00258b124
Device Caps : 00000000 Role-Based Error Reporting: 0
Device Ctl : 0000 ur fe nf ce
Dev Status : 0000 ur fe nf ce
Root Ctl : 0000 fs nfs cs

AER Information @ fffffab00258b160
Uncorrectable Error Status : 00000000 ur ecrc mtlp rof uc ca cto fcp ptlp sd dlp und
Uncorrectable Error Mask : 00000000 ur ecrc mtlp rof uc ca cto fcp ptlp sd dlp und
Uncorrectable Error Severity : 00000000 ur ecrc mtlp rof uc ca cto fcp ptlp sd dlp und
Correctable Error Status : 00000000 adv rtto rnro dllp tlp re
Correctable Error Mask : 00000000 adv rtto rnro dllp tlp re
Caps & Control : 00000000 ecrcchken ecrcchkcap ecrcgenen ecrcgencap fep
Header Log : 00000000 00000000 00000000 00000000
Root Error Command : 00000000 fen nfen cen
Root Error Status : 00000000 MSG# 00 fer nfer fuf mur ur mcr cer
Correctable Error Source ID : 00,00,00
Correctable Error Source ID : 00,00,00