980ti memory or vrm overheating - Overclock.net - An Overclocking Community

Forum Jump: 

980ti memory or vrm overheating

Reply
 
Thread Tools
post #1 of 22 (permalink) Old 11-27-2018, 01:48 AM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
980ti memory or vrm overheating

EVGA 980ti rebooting from overheated VRM or Memory. Fixable?

I use a Raijintek morpheus cooler on my EVGA 980ti (06G-P4-4995-KR) for the past 3 years. GPU core temp was always 50-55c while gaming, pretty cool, but recently it started hard rebooting suddenly. Discovered after troubleshooting for a long time the VRM or memory is overheating.

The 980ti doesn't have VRM temperature monitor so I placed a thermal probe on top of the GPU plate, right above one of the VRM or memory chips. Idles around 25-30c, but around 50c it would reboot, usually takes 2-3hrs of gaming, pretty sure the chip underneath is actually hotter than that. Read VRM often are rated at 100-120c. Not sure about memory.

If I limit GPU power to 60% using msi afterburner the probe temp will be around 42c, it won't reboot, could play all day. However, my 980ti performance has been reduced to that of a 970 or worse.

Has my GPU degraded beyond hope or repair? What are my options? Will replacing the thermal pad in between the plates help?
Sugita2Junko is offline  
Sponsored Links
Advertisement
 
post #2 of 22 (permalink) Old 11-27-2018, 02:45 AM
AMD OC'ing Enthusiast
 
mattliston's Avatar
 
Join Date: Nov 2008
Location: Wisconsin, USA
Posts: 2,524
Rep: 104 (Unique: 82)
dont use power limit to reduce vrm temps. That simply makes the card struggle to stay stable even more flipping back and forth with voltages and frequencies.


Use Afterburner to reduce clocks and voltage directly.



Id say a good negative 300mhz core offset with a big chunk of voltage reduction would be better than lowering the power limit. If it was a VRM temperature issue, it will "solve" the issue right then and there.


This does not rule out a component failure though, simply a temperature issue.
mattliston is offline  
post #3 of 22 (permalink) Old 11-27-2018, 03:41 AM
 
Desolutional's Avatar
 
Join Date: May 2015
Location: United Kingdom
Posts: 2,433
Rep: 163 (Unique: 100)
Have you tried running Furmark on it yet? Let that run for a few hours, it will max out the power target, might help you crash earlier than 2-3 hours.

CPU: i7-5820K @ 4.3Ghz (-0.08V/1.33V Adaptive = 1.25V); Cache: 3.9GHz (1.20V); VCCIN: 1.85V @ LLC 6 12hr x265 (AVX2) Stable
CPU: i7-6950X @ 4.0Ghz (-0.08V Offset = 1.22V); Cache: 3.2GHz (1.06V); VCCIN: 1.90V @ LLC 5 12hr x265 (AVX2) Stable
RAM: 64GB @ 3000MHz (14-14-12-31 1T, 1.35V) 12hr GSAT Stable
GPU: EVGA Hybrid GTX 1070 @ 2076MHz Core, 8.6GHz VRAM, Stock Voltage 24hr [email protected] Stable


Desolutional is offline  
Sponsored Links
Advertisement
 
post #4 of 22 (permalink) Old 11-27-2018, 12:25 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by mattliston View Post
dont use power limit to reduce vrm temps. That simply makes the card struggle to stay stable even more flipping back and forth with voltages and frequencies.


Use Afterburner to reduce clocks and voltage directly.



Id say a good negative 300mhz core offset with a big chunk of voltage reduction would be better than lowering the power limit. If it was a VRM temperature issue, it will "solve" the issue right then and there.


This does not rule out a component failure though, simply a temperature issue.
Max I can lower for core clock is -90 and memory clock -201. Core voltage the slider is at the min, seems I can only add voltage.

What component failure as in PSU or Mobo? I've replaced the PSU thinking it was at fault but turns out didn't fix it.
Sugita2Junko is offline  
post #5 of 22 (permalink) Old 11-27-2018, 12:27 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by Desolutional View Post
Have you tried running Furmark on it yet? Let that run for a few hours, it will max out the power target, might help you crash earlier than 2-3 hours.
Furmark last about 5 seconds if it has already rebooted prior and still hot.

From normal/cold state it would die around 15-20m
Sugita2Junko is offline  
post #6 of 22 (permalink) Old 11-27-2018, 12:36 PM
9 Cans of Ravioli
 
The Pook's Avatar
 
Join Date: May 2006
Posts: 16,568
Rep: 1071 (Unique: 629)
Don't use Furmark and stop suggesting people use it. It serves 0 purpose and most cards will cripple themselves to a lower P state if it detects it's running. Just run a benchmark if you want to test things out.

@Sugita2Junko : Where did you put the temp probe on? You said the "VRM" but the VRM consists of chokes/coils/inductors/capacitors/MOSFETs/voltage regulators and not all of them really put out that much heat or need a heatsink. The only one that should have one are the mosfets.

Intel i9 9900K Coffee Lake @ X8 5309mhz (+47.5%)
AMD FX-8320 Vishera @ X4 5022mhz (+43.5%)
Intel i5 4690K Haswell @ X4 5013mhz (+43.2%)
AMD X4 960T Zosma @ X6 4870mhz (+62.3%)
Intel i7 6700 Skylake @ X4 4709mhz (+38.5%)
Intel i5 6400 Skylake @ X4 4588mhz (+69.9%)
2 OP pls nerf
(24 items)
NUC U
(10 items)
CPU
i9 9900K
Motherboard
AsRock Z390 Taichi
GPU
MSI GTX 1080 Ti
RAM
G.Skill DDR4 4133 32GB
Hard Drive
Mushkin Pilot NVMe 1TB
Hard Drive
HP EX950 NVMe 2TB
Hard Drive
Samsung 860 Evo 1TB
Hard Drive
Sandisk Ultra II 480GB SSD
Hard Drive
Toshiba X300 5TB
Hard Drive
Western Digital EZRZ 3TB
Power Supply
Seasonic Focus Plus Gold 750w
Cooling
Corsair H115i Pro
Case
Fractal Design Meshify S2
Operating System
Windows 10 Education x64
Monitor
Acer XF270HU
Monitor
Dell E2311H
Keyboard
CoolerMaster MasterKeys Pro S
Mouse
Logitech G502
Audio
SMSL SA50 + FiiO E10K
Audio
Micca RB42
Audio
Polk Audio PSW111
Audio
Philips Fidelio X2
Audio
Philips SHP9500S
Audio
Audio Technica ATH-M50X
CPU
i3 8109U
Motherboard
Intel NUC8i3BEH
GPU
Iris Plus 655
RAM
Kingston 8GB DDR4 2400
Hard Drive
Dierka K1 120GB
Hard Drive
ADATA SU800 128GB M.2
Operating System
Windows 10 Professional
Operating System
MX Linux
Keyboard
Rii RKM709 Wireless
Mouse
Logitech M510
CPU
i7 860
Motherboard
Advantech AIMB 280 ITX
GPU
EVGA GTX 950 2GB
RAM
Kingston 2GB DDR3 1333
Hard Drive
Seagate Momentus 2.5" 500GB
Hard Drive
Western Digital Blue 2.5" 320GB
Power Supply
Silverstone SFX-L Gold 500w
Cooling
Arctic Alpine 11 GT 2
Case
Silverstone Milo Z ML07B
Operating System
Windows XP x86 SP3
Keyboard
Velocifire TKL01 Outemu Browns
Mouse
Logitech M512
▲ hide details ▲


The Pook is offline  
post #7 of 22 (permalink) Old 11-27-2018, 12:42 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by The Pook View Post
Don't use Furmark and stop suggesting people use it. It serves 0 purpose and most cards will cripple themselves to a lower P state if it detects it's running. Just run a benchmark if you want to test things out.

@Sugita2Junko : Where did you put the temp probe on? You said the "VRM" but the VRM consists of chokes/coils/inductors/capacitors/MOSFETs/voltage regulators and not all of them really put out that much heat or need a heatsink. The only one that should have one are the mosfets.
http://images.hardwarecanucks.com/im...X-980-TI-1.PNG

It has a back & front plate covering. I just put the probe on the front plate which I felt was hottest, so hot I can't leave my finger for more than a sec. In between the expose mosfet (square things?) & gpu core.
Sugita2Junko is offline  
post #8 of 22 (permalink) Old 11-27-2018, 12:50 PM
9 Cans of Ravioli
 
The Pook's Avatar
 
Join Date: May 2006
Posts: 16,568
Rep: 1071 (Unique: 629)
The bigger gray square "things" in that picture are chokes and they don't need a heatsink.

The partly melted thing in this picture is a mosfet and what you should be checking the temps of. If it doesn't have a heatsink, stop using the card until you get some on it.
Attached Thumbnails
Click image for larger version

Name:	IMG_1184.jpg
Views:	28
Size:	2.51 MB
ID:	234528  


Intel i9 9900K Coffee Lake @ X8 5309mhz (+47.5%)
AMD FX-8320 Vishera @ X4 5022mhz (+43.5%)
Intel i5 4690K Haswell @ X4 5013mhz (+43.2%)
AMD X4 960T Zosma @ X6 4870mhz (+62.3%)
Intel i7 6700 Skylake @ X4 4709mhz (+38.5%)
Intel i5 6400 Skylake @ X4 4588mhz (+69.9%)
2 OP pls nerf
(24 items)
NUC U
(10 items)
CPU
i9 9900K
Motherboard
AsRock Z390 Taichi
GPU
MSI GTX 1080 Ti
RAM
G.Skill DDR4 4133 32GB
Hard Drive
Mushkin Pilot NVMe 1TB
Hard Drive
HP EX950 NVMe 2TB
Hard Drive
Samsung 860 Evo 1TB
Hard Drive
Sandisk Ultra II 480GB SSD
Hard Drive
Toshiba X300 5TB
Hard Drive
Western Digital EZRZ 3TB
Power Supply
Seasonic Focus Plus Gold 750w
Cooling
Corsair H115i Pro
Case
Fractal Design Meshify S2
Operating System
Windows 10 Education x64
Monitor
Acer XF270HU
Monitor
Dell E2311H
Keyboard
CoolerMaster MasterKeys Pro S
Mouse
Logitech G502
Audio
SMSL SA50 + FiiO E10K
Audio
Micca RB42
Audio
Polk Audio PSW111
Audio
Philips Fidelio X2
Audio
Philips SHP9500S
Audio
Audio Technica ATH-M50X
CPU
i3 8109U
Motherboard
Intel NUC8i3BEH
GPU
Iris Plus 655
RAM
Kingston 8GB DDR4 2400
Hard Drive
Dierka K1 120GB
Hard Drive
ADATA SU800 128GB M.2
Operating System
Windows 10 Professional
Operating System
MX Linux
Keyboard
Rii RKM709 Wireless
Mouse
Logitech M510
CPU
i7 860
Motherboard
Advantech AIMB 280 ITX
GPU
EVGA GTX 950 2GB
RAM
Kingston 2GB DDR3 1333
Hard Drive
Seagate Momentus 2.5" 500GB
Hard Drive
Western Digital Blue 2.5" 320GB
Power Supply
Silverstone SFX-L Gold 500w
Cooling
Arctic Alpine 11 GT 2
Case
Silverstone Milo Z ML07B
Operating System
Windows XP x86 SP3
Keyboard
Velocifire TKL01 Outemu Browns
Mouse
Logitech M512
▲ hide details ▲


The Pook is offline  
post #9 of 22 (permalink) Old 11-27-2018, 01:07 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by The Pook View Post
The bigger gray square "things" in that picture are chokes and they don't need a heatsink.

The partly melted thing in this picture is a mosfet and what you should be checking the temps of. If it doesn't have a heatsink, stop using the card until you get some on it.
https://i.imgur.com/GmS9JK7.jpg

These are the areas too hot to touch for me. I circled it green. I only touched the plate. Probe location is in between there in red circle.

https://www.fudzilla.com/images/stor...ti_sc_pcb1.jpg

Those square chips surrounding the GPU core are VRAM?

https://qph.fs.quoracdn.net/main-qim...838524ae8118-c

The VRM when I touched it was scorching hot too. Didn't put a probe on it to know how hot.
Sugita2Junko is offline  
post #10 of 22 (permalink) Old 11-27-2018, 01:39 PM
New to Overclock.net
 
8051's Avatar
 
Join Date: Apr 2014
Posts: 2,781
Rep: 22 (Unique: 16)
The green areas look like where the VRAM is located.

What kind of fan do you have blowing down on the VRM side of the card? You might want to consider putting a more powerful shrouded fan over that location.
8051 is offline  
Reply

Quick Reply
Message:
Options

Register Now

In order to be able to post messages on the Overclock.net - An Overclocking Community forums, you must first register.
Please enter your desired user name, your email address and other required details in the form below.
User Name:
If you do not want to register, fill this field only and the name will be used as user name for your post.
Password
Please enter a password for your user account. Note that passwords are case-sensitive.
Password:
Confirm Password:
Email Address
Please enter a valid email address for yourself.
Email Address:

Log-in



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Show Printable Version Show Printable Version
Email this Page Email this Page


Forum Jump: 

Posting Rules  
You may post new threads
You may post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off