980ti memory or vrm overheating - Page 2 - Overclock.net - An Overclocking Community

Forum Jump: 

980ti memory or vrm overheating

Reply
 
Thread Tools
post #11 of 22 (permalink) Old 11-27-2018, 02:01 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by 8051 View Post
The green areas look like where the VRAM is located.

What kind of fan do you have blowing down on the VRM side of the card? You might want to consider putting a more powerful shrouded fan over that location.
NF-A12x25 running at 2000rpm, previously I had NF-F12 1500rpm

Wonder if the thermal pads under the plate degraded?

Last edited by Sugita2Junko; 11-27-2018 at 02:13 PM.
Sugita2Junko is offline  
Sponsored Links
Advertisement
 
post #12 of 22 (permalink) Old 11-27-2018, 02:24 PM
 
Desolutional's Avatar
 
Join Date: May 2015
Location: United Kingdom
Posts: 2,433
Rep: 162 (Unique: 100)
Quote: Originally Posted by The Pook View Post
Don't use Furmark and stop suggesting people use it. It serves 0 purpose and most cards will cripple themselves to a lower P state if it detects it's running. Just run a benchmark if you want to test things out.
Furmark will max out power consumption at P0 on most recent modern GPUs without dropping down a state. This is the best way to test if there is an issue with power delivery, e.g. underpowered PSU. It's also very useful for VRAM artifact testing. Considering the OPs card failed 5 seconds into the test on a warm boot implies something is wrong, and by reducing core offset and VRAM offset, they can eliminate those from the equation.

Caveat, it is less useful for general stability testing however - synthetic demos and games will be better for that.

@Sugita2Junko , those green sections are the definitely the VRAM modules. Disassemble the backplate and card and ensure that the thermal pads are mating with the VRAM modules, when removing them they should have rectangular shaped indentations if they have been mounted. Unstable VRAM can cause kernel panic and driver lockup on the 980 Ti, especially Hynix memory, the fact that it occurs after a few hours means something is heating up to steady state, unstable core clock would result in a watchdog timeout, not a hard reboot.

CPU: i7-5820K @ 4.3Ghz (-0.08V/1.33V Adaptive = 1.25V); Cache: 3.9GHz (1.20V); VCCIN: 1.85V @ LLC 6 12hr x265 (AVX2) Stable
CPU: i7-6950X @ 4.0Ghz (-0.08V Offset = 1.22V); Cache: 3.2GHz (1.06V); VCCIN: 1.90V @ LLC 5 12hr x265 (AVX2) Stable
RAM: 64GB @ 3000MHz (14-14-12-31 1T, 1.35V) 12hr GSAT Stable
GPU: EVGA Hybrid GTX 1070 @ 2076MHz Core, 8.6GHz VRAM, Stock Voltage 24hr [email protected] Stable



Last edited by Desolutional; 11-27-2018 at 02:32 PM.
Desolutional is offline  
post #13 of 22 (permalink) Old 11-27-2018, 02:49 PM
9 Cans of Ravioli
 
The Pook's Avatar
 
Join Date: May 2006
Posts: 16,363
Rep: 1035 (Unique: 619)
Quote: Originally Posted by Desolutional View Post
It's also very useful for VRAM artifact testing.

You can find your vRAM OC in less than a minute with Unigine Heaven, that's not a reason to use Furmark. Start the test in windowed mode at 720p/1080p, pause somewhere during the test on a scene of your choice, increase your vRAM OC until you either get artifacts or performance drops (the scene will still render @ xxx FPS while it's paused) and then back off until you don't have artifacts or performance goes back up. You don't need Furmark to OC quickly, it's a useless program that does nothing outside making a ton of heat for no reason - pretty much why it's nicknamed "power virus."

Intel i9 9900K Coffee Lake @ X8 5309mhz (+47.5%)
AMD FX-8320 Vishera @ X4 5022mhz (+43.5%)
Intel i5 4690K Haswell @ X4 5013mhz (+43.2%)
AMD X4 960T Zosma @ X6 4870mhz (+62.3%)
Intel i7 6700 Skylake @ X4 4709mhz (+38.5%)
Intel i5 6400 Skylake @ X4 4588mhz (+69.9%)
2 OP pls nerf
(24 items)
NUC U
(10 items)
CPU
i9 9900K
Motherboard
AsRock Z390 Taichi
GPU
MSI GTX 1080 Ti
RAM
G.Skill DDR4 4133 32GB
Hard Drive
Mushkin Pilot NVMe 1TB
Hard Drive
HP EX950 NVMe 2TB
Hard Drive
Samsung 860 Evo 1TB
Hard Drive
Sandisk Ultra II 480GB SSD
Hard Drive
Toshiba X300 5TB
Hard Drive
Western Digital EZRZ 3TB
Power Supply
Seasonic Focus Plus Gold 750w
Cooling
Corsair H115i Pro
Case
Fractal Design Meshify S2
Operating System
Windows 10 Education x64
Monitor
Acer XF270HU
Monitor
Dell E2311H
Keyboard
CoolerMaster MasterKeys Pro S
Mouse
Logitech G502
Audio
SMSL SA50 + FiiO E10K
Audio
Micca RB42
Audio
Polk Audio PSW111
Audio
Philips Fidelio X2
Audio
Philips SHP9500S
Audio
Audio Technica ATH-M50X
CPU
i3 8109U
Motherboard
Intel NUC8i3BEH
GPU
Iris Plus 655
RAM
Kingston 8GB DDR4 2400
Hard Drive
Dierka K1 120GB
Hard Drive
ADATA SU800 128GB M.2
Operating System
Windows 10 Professional
Operating System
MX Linux
Keyboard
Rii RKM709 Wireless
Mouse
Logitech M510
CPU
i7 860
Motherboard
Advantech AIMB 280 ITX
GPU
EVGA GTX 950 2GB
RAM
Kingston 2GB DDR3 1333
Hard Drive
Seagate Momentus 2.5" 500GB
Hard Drive
Western Digital Blue 2.5" 320GB
Power Supply
Silverstone SFX-L Gold 500w
Cooling
Arctic Alpine 11 GT 2
Case
Silverstone Milo Z ML07B
Operating System
Windows XP x86 SP3
Keyboard
Velocifire TKL01 Outemu Browns
Mouse
Logitech M512
▲ hide details ▲


The Pook is offline  
Sponsored Links
Advertisement
 
post #14 of 22 (permalink) Old 11-27-2018, 07:35 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by Desolutional View Post
Furmark will max out power consumption at P0 on most recent modern GPUs without dropping down a state. This is the best way to test if there is an issue with power delivery, e.g. underpowered PSU. It's also very useful for VRAM artifact testing. Considering the OPs card failed 5 seconds into the test on a warm boot implies something is wrong, and by reducing core offset and VRAM offset, they can eliminate those from the equation.

Caveat, it is less useful for general stability testing however - synthetic demos and games will be better for that.

@Sugita2Junko , those green sections are the definitely the VRAM modules. Disassemble the backplate and card and ensure that the thermal pads are mating with the VRAM modules, when removing them they should have rectangular shaped indentations if they have been mounted. Unstable VRAM can cause kernel panic and driver lockup on the 980 Ti, especially Hynix memory, the fact that it occurs after a few hours means something is heating up to steady state, unstable core clock would result in a watchdog timeout, not a hard reboot.
I can run furmark a long time before it reboots. But using realbench it reboots with in 15minutes from cold boot. Once it reboots running realbench immediately following would instantly reboot in a few seconds, but furmark can last a few minutes.

Don't see any artifacts or issues while gaming. No bluescreen or kernel panic or driver lockup. It just hard reboots once hot enough or something. Not sure if it is VRAM or VRM or mosfet or what but something is overheating. All I know is GPU core is well below thermal limit usually 50-60c gaming.

Going to see my EVGA will do an RMA on my 980ti. It is 2 weeks past the 3yr warranty cut off mark. If not, ill order some thermal pad and maybe stick on extra aluminum heatsink on the plate to help cool.
Sugita2Junko is offline  
post #15 of 22 (permalink) Old 11-27-2018, 08:21 PM
New to Overclock.net
 
8051's Avatar
 
Join Date: Apr 2014
Posts: 2,773
Rep: 22 (Unique: 16)
Do you ever see any perfcaps before it reboots? I wonder if it could be your PSU and some sort of over current protection kicking in? PSU's heat up too and if they heat up enough that can affect their output.
8051 is offline  
post #16 of 22 (permalink) Old 11-28-2018, 12:26 AM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by 8051 View Post
Do you ever see any perfcaps before it reboots? I wonder if it could be your PSU and some sort of over current protection kicking in? PSU's heat up too and if they heat up enough that can affect their output.
I play Overwatch and cap the FPS to 162, so the GPU utilization is usually 60-80%, rarely maxing out. At first I also thought it was a PSU issue. Swapped out a brand new one and still reboots. While trying to take the GPU out to re-seat I discovered it was freaking hot. Too hot to hold.

Installed a thermal probe on the front/backplate which covers the VRAM etc and found out it usually reboots when temp reach 50C. I tried stress testing again but this time fanning it hard to cool a little and it didn't reboot. Lowered the power limit to 60% and my games stopped rebooting.

Last edited by Sugita2Junko; 11-28-2018 at 12:34 AM.
Sugita2Junko is offline  
post #17 of 22 (permalink) Old 11-28-2018, 12:51 PM
New to Overclock.net
 
8051's Avatar
 
Join Date: Apr 2014
Posts: 2,773
Rep: 22 (Unique: 16)
Quote: Originally Posted by Sugita2Junko View Post
I play Overwatch and cap the FPS to 162, so the GPU utilization is usually 60-80%, rarely maxing out. At first I also thought it was a PSU issue. Swapped out a brand new one and still reboots. While trying to take the GPU out to re-seat I discovered it was freaking hot. Too hot to hold.

Installed a thermal probe on the front/backplate which covers the VRAM etc and found out it usually reboots when temp reach 50C. I tried stress testing again but this time fanning it hard to cool a little and it didn't reboot. Lowered the power limit to 60% and my games stopped rebooting.
It sounds like you definitely need more cooling. Maybe you could try attaching an even higher powered fan or fans to the heatsink. Personally, I have a noctua NFA14-ippc3000 and a SanAce 127x38mm attached to my GPU's heatsink and both are shrouded.
8051 is offline  
post #18 of 22 (permalink) Old 11-29-2018, 07:44 AM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by 8051 View Post
It sounds like you definitely need more cooling. Maybe you could try attaching an even higher powered fan or fans to the heatsink. Personally, I have a noctua NFA14-ippc3000 and a SanAce 127x38mm attached to my GPU's heatsink and both are shrouded.
It has been fine for 2 years with dual NF-F12 @ 1500rpm which is pretty strong, much stronger than the stock fans. Not sure if GPU component just degraded, generating more heat than before or something faulty. EVGA is actually sending me a RMA replacement despite being out of warranty shy of 2 weeks.
Sugita2Junko is offline  
post #19 of 22 (permalink) Old 11-29-2018, 11:44 AM
New to Overclock.net
 
8051's Avatar
 
Join Date: Apr 2014
Posts: 2,773
Rep: 22 (Unique: 16)
Quote: Originally Posted by Sugita2Junko View Post
It has been fine for 2 years with dual NF-F12 @ 1500rpm which is pretty strong, much stronger than the stock fans. Not sure if GPU component just degraded, generating more heat than before or something faulty. EVGA is actually sending me a RMA replacement despite being out of warranty shy of 2 weeks.
That's great. How are they sending you a replacement 980Ti though? They can't have any of those in stock anymore can they?
8051 is offline  
post #20 of 22 (permalink) Old 11-29-2018, 12:03 PM - Thread Starter
New to Overclock.net
 
Sugita2Junko's Avatar
 
Join Date: Sep 2012
Posts: 89
Rep: 2 (Unique: 2)
Quote: Originally Posted by 8051 View Post
That's great. How are they sending you a replacement 980Ti though? They can't have any of those in stock anymore can they?
No clue what I am getting, but people on reddit reported getting either a refurb 980ti, 1070ti or 1080. Guess it is whatever they have available and fixed.
Sugita2Junko is offline  
Reply

Quick Reply
Message:
Options

Register Now

In order to be able to post messages on the Overclock.net - An Overclocking Community forums, you must first register.
Please enter your desired user name, your email address and other required details in the form below.
User Name:
If you do not want to register, fill this field only and the name will be used as user name for your post.
Password
Please enter a password for your user account. Note that passwords are case-sensitive.
Password:
Confirm Password:
Email Address
Please enter a valid email address for yourself.
Email Address:

Log-in



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Show Printable Version Show Printable Version
Email this Page Email this Page


Forum Jump: 

Posting Rules  
You may post new threads
You may post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off