I've been following this thread/forums for quite a while but never got around registering. Anyway, here I am and sadly my first post is about a whole lot of problems I have had with everyones favorite motherboard.
I have serious problems with two new workstations I set up based on the ASUS X99-E WS USB 3.1 with Intel Xeon E5 v4 CPUs and I'm really out of ideas whats wrong here.
ASUS X99-E WS USB 3.1, latest BIOS v3201
Intel Xeon E5 2687W v4 (Broadwell-EP, 12c)
Samsung DDR4 regECC 16GB (M393A2K40BB1-CRC) x4 64GB total
Nvidia Quadro M5000
Intel 750 PCIe 1.2TB
Seasonic Platinum 1200W
OS: Windows 10 Enterprise 2016 LTSB (v1607)
When I finished the first build I ran a few tests to confirm everything is working:
HCI Memtest Pro - 500% passed
ASUS Realbench - 4 hours passed
SPECviewperf 12.1 - full run passed
A day later I suddenly got display driver crashes with the Quadro card and random reboots when heavy graphics tasks were running, like Unigine Heaven. The Quadro card has been running in a different system and is healthy.
My first thought was a PSU problem so I tried a different one, a Corsair AX1500i from my gaming PC, with the same result. I also installed one of my GTX1080 cards and that one also suffered of random crashes.
Shortly after these events I got complete lockups and bluescreens: WHEA_UNCORRECTABLE_ERROR.
I have never seen that kind of bluescreen with a Xeon CPU, my only conclusion here was the motherboard must be defect.
The Xeon CPU and the X99-E WS 3.1 were send in for RMA and I was told the CPU has been tested and is healthy, the board is still being processed.
Now to build #2, identical parts:
With that one I didn't even get to run stability tests.
During installation of Windows 10 I got a bluescreen: CRITICAL_PROCESS_DIED. After rebooting the Intel 750 SSD was no longer recognized by the BIOS. I tried different PCIe slots and a different 750 SSD without any luck, this board would no longer detect the Intel 750 SSD. The GPU in slot 1 worked fine.
After a lot of power on/off cycles the SSD was eventually detected again in PCIe slot 3. I managed to finish another Windows 10 installation and had the system up and running for a few hours before the same problem happened again, bluescreen and the SSD was gone. Note: Both Intel 750 SSDs work perfect in my gaming PC (X99, 5960X, RVE).
And as if this wasn't enough the board also started to say there are SMART errors with my connected SATA HDD, a Seagate Enterprise Capacity v4 6TB. There are no SMART errors, the HDD is healthy. Conclusion: Both PCIe slots and SATA slots defect, or are they?
I'm out of ideas at this point, is the quality control at ASUS this bad that I bought 2 defect boards?
Or is this a compability problem and Broadwell-EP or Broadwell in general is not working well with first gen X99 boards despite the BIOS update?
The Xeon E5 2687W v4 and the Samsung M393A2K40BB1-CRC memory is on the ASUS QVL list it should be fully supported.
I had posted this on hardforum.com first but this thread here on overclock.net might be the better place to seek for help after all.
Any help is much appreciated.