Overclock.net › Forums › Graphics Cards › NVIDIA › Computer has crashed twice... How to find problem?
New Posts  All Forums:Forum Nav:

Computer has crashed twice... How to find problem?

post #1 of 8
Thread Starter 
Just curious if there is a tool somewhere I can use to find problems with a GPU?

The other night when playing Skyrim my PC completely locked up and forced me to hard restart... The same thing occurred just now when I attempted to run Crysis 2 with the high res textures and DX11.

I am not sure what is causing this however since I am able to run SWTOR for hours at a time with no issue... When a freeze occurs I don't get any kind of error screen, no BSOD, my PC doesn't shut down or anything... My PC just becomes completely frozen and it has only been these two times... I just want to know if there is anything I can do now to identify the problem before something breaks...

Thanks

I am assuming it is GPU related since the Skyrim lock up occurred while using a high res texture pack there as well... After disabling that texture pack I was able to run Skyrim a further 2 hours with no issues... And like I said, other games are running fine, I have played several hours of both SWTOR and Company of Heroes since all of this started happening...

I just don't know what is causing this or how to check... My temps have been fine as well, ~89C max for both cards...

EDIT: Oh and the lock ups occurred within minutes of beginning gameplay... In Skyrim, I loaded my save, walked around for about 10 minutes... And in Crysis I reached the point where you stand up with your suit for the first time... And there is no build up to a freeze... No hampered gameplay, massive increase in temps, slowdowns, etc... It goes from perfectly normal function at locked 60fps to frozen solid...
Edited by Squeets - 1/31/12 at 4:05pm
Computer
(13 items)
 
  
CPUMotherboardGraphicsRAM
i5 2500k @ Stock EVGA Z75 EVGA GTX 680 @ Stock 16GB Corsair Vengeance 1600 
Hard DriveOSMonitorPower
60GB Vertex 2, 2x256GB Vertex 4 RAID0, 2x1TB WD... Windows 8 Pro 64-bit 28" LED Seasonic X-850 
Case
Corsair 600T White 
  hide details  
Reply
Computer
(13 items)
 
  
CPUMotherboardGraphicsRAM
i5 2500k @ Stock EVGA Z75 EVGA GTX 680 @ Stock 16GB Corsair Vengeance 1600 
Hard DriveOSMonitorPower
60GB Vertex 2, 2x256GB Vertex 4 RAID0, 2x1TB WD... Windows 8 Pro 64-bit 28" LED Seasonic X-850 
Case
Corsair 600T White 
  hide details  
Reply
post #2 of 8
89c on two cards adds up, so it's possible you may be overheating. I'd recommend for diagnostic purposes you remove the side panel of your case. Also, I recommend completely uninstalling and reinstalling your video card drivers, as they may be a software issue as well.
post #3 of 8
Thread Starter 
Quote:
Originally Posted by E-Peen View Post

89c on two cards adds up, so it's possible you may be overheating. I'd recommend for diagnostic purposes you remove the side panel of your case. Also, I recommend completely uninstalling and reinstalling your video card drivers, as they may be a software issue as well.

But these cards have always run this hot I even made a thread a while back about lowering temps like that and people told me it just comes with the territory of 470s, they are hot by default...
Computer
(13 items)
 
  
CPUMotherboardGraphicsRAM
i5 2500k @ Stock EVGA Z75 EVGA GTX 680 @ Stock 16GB Corsair Vengeance 1600 
Hard DriveOSMonitorPower
60GB Vertex 2, 2x256GB Vertex 4 RAID0, 2x1TB WD... Windows 8 Pro 64-bit 28" LED Seasonic X-850 
Case
Corsair 600T White 
  hide details  
Reply
Computer
(13 items)
 
  
CPUMotherboardGraphicsRAM
i5 2500k @ Stock EVGA Z75 EVGA GTX 680 @ Stock 16GB Corsair Vengeance 1600 
Hard DriveOSMonitorPower
60GB Vertex 2, 2x256GB Vertex 4 RAID0, 2x1TB WD... Windows 8 Pro 64-bit 28" LED Seasonic X-850 
Case
Corsair 600T White 
  hide details  
Reply
post #4 of 8
I don't own that card specifically, but I would say that is absolutely not the default temperatures for a GPU, that is unacceptable in SLI. I would even personally say 89c even on a single card is pretty high. So you have to think, 89c on one card, and then 89c on another card, add those together, PLUS your other components and how hot they are, and your case is closed and only has a few fans, your PC is pretty much an oven.

So like I said, just to eliminate overheating, you might as well remove your side panel and monitor your temperatures, that way if they're lower and you're still crashing, we know that's not the issue.
post #5 of 8
Thread Starter 
Quote:
Originally Posted by E-Peen View Post

I don't own that card specifically, but I would say that is absolutely not the default temperatures for a GPU, that is unacceptable in SLI. I would even personally say 89c even on a single card is pretty high. So you have to think, 89c on one card, and then 89c on another card, add those together, PLUS your other components and how hot they are, and your case is closed and only has a few fans, your PC is pretty much an oven.
So like I said, just to eliminate overheating, you might as well remove your side panel and monitor your temperatures, that way if they're lower and you're still crashing, we know that's not the issue.

Just an update to this... I have both done what you said and gone one step further... I have taken the side off and continued to receive the issue and then I used both of my 470s by themselves one by one... Same issue...

Image of the error I received once when my computer didn't completely lock up...
http://i.imgur.com/dI2lw.png

I just don't understand it... My cards have been completely fine for months, sure a little hot, but fine... Now for the last few weeks it has been crashes galore on games... And I don't get it because sometimes they work fine...
For example:
I raided on SWTOR for 6 hours this week and played it a total of probably 10 hours... (2 5 hour sessions)... 5 hours of continuous play (with both cards at 88C)... NO ISSUES AT ALL.
Get on Crysis 2, CRASHES WITHIN MINUTES. (with both cards at 85C)
Get on Skyrim, CRASHES IN ABOUT 30 MINUTES. (with both cards at 78C)
Get on Dawn of War 2: Retribution, DRIVER ERROR IN ABOUT 10 MINUTES. (both cards INDIVIDUALLY at 72C and 73C BOTH crash on their own)...
And on Minecraft it just randomly gets errors... Sometimes I get an error at 10 minutes, sometimes 4 hours...

I have tried both the current 285 drivers and the beta drivers... Done clean installs, etc...

I don't know what to do at this point... Normally I would think this was the GPU being damaged or something, but like I said, it happens with both of them individually too (what are the chances of both GPUs coming down after months of flawless working with the same issue at the same time) and sometimes it doesn't happen at all with some games... I have NEVER had a crash, artifacts, or anything with SWTOR at all...

Any suggestions as to what I can do now?
Computer
(13 items)
 
  
CPUMotherboardGraphicsRAM
i5 2500k @ Stock EVGA Z75 EVGA GTX 680 @ Stock 16GB Corsair Vengeance 1600 
Hard DriveOSMonitorPower
60GB Vertex 2, 2x256GB Vertex 4 RAID0, 2x1TB WD... Windows 8 Pro 64-bit 28" LED Seasonic X-850 
Case
Corsair 600T White 
  hide details  
Reply
Computer
(13 items)
 
  
CPUMotherboardGraphicsRAM
i5 2500k @ Stock EVGA Z75 EVGA GTX 680 @ Stock 16GB Corsair Vengeance 1600 
Hard DriveOSMonitorPower
60GB Vertex 2, 2x256GB Vertex 4 RAID0, 2x1TB WD... Windows 8 Pro 64-bit 28" LED Seasonic X-850 
Case
Corsair 600T White 
  hide details  
Reply
post #6 of 8
I don't have any suggestions right now, but just wanted to clear up any possible confusion introduced by E-peen. 89C is perfectly acceptable for a 470, their max temperature is specified at 105C by Nvidia so you are right at the 10-12C buffer zone that I like to keep. I won't say that temperatures are absolutely not your problem, but I will say that at 89C they should not be your problem.
Main Rig
(16 items)
 
  
CPUMotherboardGraphicsRAM
Intel i7 2700k ASUS P8P67 WS Revolution EVGA 980 Ti SC+ Samsung 4x4GB DDR3 1866MHz 
Hard DriveHard DriveOptical DriveCooling
Samsung 850 Evo 1TB Samsung Spinpoint F4 2TB Samsung BD Combo Noctua NH-D14 
OSMonitorPowerCase
Windows 10 64 bit Asus PG279Q Kingwin Lazer Platinum 1000W Silverstone Raven RV03 
  hide details  
Reply
Main Rig
(16 items)
 
  
CPUMotherboardGraphicsRAM
Intel i7 2700k ASUS P8P67 WS Revolution EVGA 980 Ti SC+ Samsung 4x4GB DDR3 1866MHz 
Hard DriveHard DriveOptical DriveCooling
Samsung 850 Evo 1TB Samsung Spinpoint F4 2TB Samsung BD Combo Noctua NH-D14 
OSMonitorPowerCase
Windows 10 64 bit Asus PG279Q Kingwin Lazer Platinum 1000W Silverstone Raven RV03 
  hide details  
Reply
post #7 of 8

http://i.imgur.com/dI2lw.png is related to TDR.

 

Very possible related to PSU. Reseat the both cards, check all PCIe power cables from PSU to the cards. Or try different PSU.

 

Another possibility is Hardware Acceleration GPUs during game play. Turn it off in FlashPlayer, Youtube and in WMP, VLC, IE, FF, Chrome, etc. in advance settings.

 

 

Capture.JPG

 

 

 


Edited by Ken1649 - 2/9/12 at 6:07am
ケンジ
(16 items)
 
  
CPUCPUMotherboardGraphics
i7-3960X i7-980X GA-X79-UD7 / GA-X58A-UD9 7970 + 465 
GraphicsRAMRAMHard Drive
2x 6990 + 580 16GB 2133 MHz 24GB 2000 MHz 120GB x 2 RAID0 
Hard DriveHard DriveCoolingCooling
120GB x 4 RAID0 Raptor 300GB x 2 RAID0 CPU-370 CPU-370 
CoolingCoolingOSPower
VID-AR699 EK FB X58A-UD9 Win 7 Ultimate x64 1500W / 1250W+1200W 
  hide details  
Reply
ケンジ
(16 items)
 
  
CPUCPUMotherboardGraphics
i7-3960X i7-980X GA-X79-UD7 / GA-X58A-UD9 7970 + 465 
GraphicsRAMRAMHard Drive
2x 6990 + 580 16GB 2133 MHz 24GB 2000 MHz 120GB x 2 RAID0 
Hard DriveHard DriveCoolingCooling
120GB x 4 RAID0 Raptor 300GB x 2 RAID0 CPU-370 CPU-370 
CoolingCoolingOSPower
VID-AR699 EK FB X58A-UD9 Win 7 Ultimate x64 1500W / 1250W+1200W 
  hide details  
Reply
post #8 of 8
Quote:
Originally Posted by Squeets View Post

...
I don't know what to do at this point... Normally I would think this was the GPU being damaged or something, but like I said, it happens with both of them individually too (what are the chances of both GPUs coming down after months of flawless working with the same issue at the same time) and sometimes it doesn't happen at all with some games... I have NEVER had a crash, artifacts, or anything with SWTOR at all...
Any suggestions as to what I can do now?


Its highly unlikely both gpu's fail at the same time. It can usually something else. Check your 12v first you can do that with gpuz and both gpus installed while stress testing. That way it is considered, LOAD conditions. It should not go below 11.7 volts.

If it were me I would start stress testing w/ furmark, or msi krombuster (same thing). Do both gpus at the same time. If you can get it to error, pull the bottom gpu.

Then test separate. If you tested both cards and both throw the error, I'd start testing ram sticks.

msi.com

techpowerup.com/gpuz

nvidia.com/tdr errors

Edited by DrFPS - 2/9/12 at 2:42pm
MyCleanPC
(13 items)
 
  
CPUMotherboardGraphicsRAM
i7 4770k ASUS MAXIMUS VI EVGA master blaster Corsair Vengence  
Hard DriveOptical DriveOSMonitor
OCZ SSD raid0 samsung Win 7 Samsung 
PowerCase
Enermax rev 1050 Stacker 832 
  hide details  
Reply
MyCleanPC
(13 items)
 
  
CPUMotherboardGraphicsRAM
i7 4770k ASUS MAXIMUS VI EVGA master blaster Corsair Vengence  
Hard DriveOptical DriveOSMonitor
OCZ SSD raid0 samsung Win 7 Samsung 
PowerCase
Enermax rev 1050 Stacker 832 
  hide details  
Reply
New Posts  All Forums:Forum Nav:
  Return Home
  Back to Forum: NVIDIA
Overclock.net › Forums › Graphics Cards › NVIDIA › Computer has crashed twice... How to find problem?