Overclock.net › Forums › Graphics Cards › NVIDIA › [Proclockers] **FAKE,LOCK ME** GTC 2014 nvidia keynote info 20nm maxwell preview + r340 drivers
New Posts  All Forums:Forum Nav:

[Proclockers] **FAKE,LOCK ME** GTC 2014 nvidia keynote info 20nm maxwell preview + r340 drivers - Page 6

post #51 of 81
That would be the funniest thing ever. 780ti kingpin for physx. You should definitely do that and post pics up, whenever GM200/204 hits of course. thumb.gif
post #52 of 81
Quote:
Originally Posted by mcg75 View Post

Why would you feel sorry for them? That's exactly the way it was last generation too.

The GTX 680 obliterated the 580 on release despite the 680 not being big Kepler.

seems pretty obvious to me

580 was the big Fermi chip (or wasn't it the 480?) that launched 18 months before 680
680 was about 20-25% faster at the same pricepoint

there's lots of "if's" but let's just assume for the sake of comparsion

let's say 880 comes out in September....with performance of a Titan Black with 4GB VRAM $599 or less
780Ti is still $100 more and Titan Black $300 more all within less than 9 month time frame

don't you see how drastically different it would be compared to the 580-> 680 launch?
     
CPUMotherboardGraphicsGraphics
Intel Core i5-3570K @ 4.2GHz Gigabyte Z77X-UD5H-WB EVGA GTX 670 FTW 2GB @ 1.3/7.1GHz 1.21V EVGA GTX 670 FTW 2GB @ 1.3/7.1GHz 1.21V 
RAMHard DriveHard DriveOptical Drive
Samsung Green DDR3 4X4GB @ 2.1GHz Samsung 850 Pro 2TB Samsung SpinPoint F4 EcoGreen 2TB HDD LG GH24NS72 DVD Rewriter 
CoolingOSMonitorMonitor
Swiftech H240-X Liquid Cooler Microsoft Windows 10 Pro 64-bit Dell Ultrasharp U2312HM E-IPS Dell Ultrasharp U2311H E-IPS 
MonitorKeyboardPowerCase
Dell Ultrasharp U2311H E-IPS Logitech G15 Gaming Keyboard Rev 2 Seasonic X-750 80+ Gold 750W Rev 3 Corsair Carbide 500R Arctic White 
MouseMouse PadAudioAudio
Alienware TactX Gaming Mouse 5000DPI SteelSeries QcK Gaming Mousepad Edifier M3300SF 2.1 Speaker Set 54W RMS Logitech G930 7.1 Wireless Gaming Headset 
AudioAudioOtherOther
Sennheiser HD 650 Open-Ear Headphones Creative Sound Blaster E5 High-Res USB DAC/ Por... Logitech F710 Wireless Gamepad  Thrustmaster T-Flight Hotas X Flight Stick 
CPUGraphicsRAMHard Drive
Intel Core m7-6Y75 vPro Intel Graphics HD 515 DDR3L-1600MHz 8GB Samsung 960 Pro 1TB  
OSMonitorOther
Windows 10 Pro x64 Sharp IGZO 4K UHD touch Dell x Wacom active stylus 
  hide details  
Reply
     
CPUMotherboardGraphicsGraphics
Intel Core i5-3570K @ 4.2GHz Gigabyte Z77X-UD5H-WB EVGA GTX 670 FTW 2GB @ 1.3/7.1GHz 1.21V EVGA GTX 670 FTW 2GB @ 1.3/7.1GHz 1.21V 
RAMHard DriveHard DriveOptical Drive
Samsung Green DDR3 4X4GB @ 2.1GHz Samsung 850 Pro 2TB Samsung SpinPoint F4 EcoGreen 2TB HDD LG GH24NS72 DVD Rewriter 
CoolingOSMonitorMonitor
Swiftech H240-X Liquid Cooler Microsoft Windows 10 Pro 64-bit Dell Ultrasharp U2312HM E-IPS Dell Ultrasharp U2311H E-IPS 
MonitorKeyboardPowerCase
Dell Ultrasharp U2311H E-IPS Logitech G15 Gaming Keyboard Rev 2 Seasonic X-750 80+ Gold 750W Rev 3 Corsair Carbide 500R Arctic White 
MouseMouse PadAudioAudio
Alienware TactX Gaming Mouse 5000DPI SteelSeries QcK Gaming Mousepad Edifier M3300SF 2.1 Speaker Set 54W RMS Logitech G930 7.1 Wireless Gaming Headset 
AudioAudioOtherOther
Sennheiser HD 650 Open-Ear Headphones Creative Sound Blaster E5 High-Res USB DAC/ Por... Logitech F710 Wireless Gamepad  Thrustmaster T-Flight Hotas X Flight Stick 
CPUGraphicsRAMHard Drive
Intel Core m7-6Y75 vPro Intel Graphics HD 515 DDR3L-1600MHz 8GB Samsung 960 Pro 1TB  
OSMonitorOther
Windows 10 Pro x64 Sharp IGZO 4K UHD touch Dell x Wacom active stylus 
  hide details  
Reply
post #53 of 81
Quote:
Originally Posted by fateswarm View Post

OMG you know what that means?

AMD soon to announce something.

You missed it already.

This is AMD's trolling GTC 2014. http://www.overclock.net/t/1473985/you-are-invited-to-amd-s-news-conference-in-sunnyvale-ca-march-26

----

More on topic, I hope to see block diagrams for big Maxwell

GM107 is only one GPC (5 SMM)

I want to see something like this:
Warning: Spoiler! (Click to show)
1280 Maxwell CUDA, 256-bit memory bus, 80 TMU, 32 ROP , 120W TDP but actual ~80-100W power draw

midrange GM206 / enthusiast GM204 , if 2 GPCs

GTX 750 Ti is 21.1Gpixels/s, 52.8 GTexels/s, 1,690 GFlops when at 1320Mhz
So double everything and you get 42.2 Gpixels/s, 105.6 GTexels/s, 3,380Gflops
* 31.65 GPixels/s more likely if GM206 , with 24 ROPs ala GTX 660

For reference:
GTX 760 @ 1110Mhz = 26.6 GPixel/s , 107 GTexel/s , 2,557 GFLOPS
R9 280@933Mhz = 29.9 GPixel/s , 104 GTexel/s , 3,344 GFLOPS

Texturing would be about the same but about 30% more compute (like R9 280) and pixel pushing is about a GTX 780

If GM204 has 3 GPCs (1920 Maxwell CUDA) with the same number SMM per GPC , clocked at 1320Mhz
32 ROPs: 42.2 Gpixels/s, 158.4 GTexels/s, 5,070GFlops
48 ROPs: 63.3 Gpixels/s, 158.4 GTexels/s, 5,070 GFlops

For reference:
GTX 780 @1006Mhz = 48.3 GPixel/s , 193 GTexel/s , 4,636 GFLOPS
GTX 770 @1202Mhz = 38.5 GPixel/s, 154 GTexel/s , 3,693 GFLOPS
R9 290 @947Mhz = 60.6 GPixel/s, 152 GTexel/s , 4,849 GFLOPS
R9 280X @1100Mhz= 35.2 GPixel/s, 141 GTexel/s, 4,506 GFLOPS

GM200 = ??? , it's not going to be the same setup for sure since it will likely have double precision at a higher rate than 1/32

Texture Fill Rate = (# of TMUs) x (Core Clock)
Pixel Fill Rate = (# of ROPs) x (Core Clock)
FLOPs=cores x Core clock x FLOPs per cycle

Edited by AlphaC - 3/24/14 at 7:43pm
Workstation stuff
(407 photos)
SpecViewperf 12.0.1
(158 photos)
 
Reply
Workstation stuff
(407 photos)
SpecViewperf 12.0.1
(158 photos)
 
Reply
post #54 of 81
I'm getting a 1440p display soon.

I was eyeing a 780 or a 780 Ti, but I'm holding off.

I might just keep one of my 570's for PhysX when I replace it with an 880, assuming we get the 880's this year.
post #55 of 81
Thread Starter 
Quote:
Originally Posted by AlphaC View Post

You missed it already.

This is AMD's trolling GTC 2014. http://www.overclock.net/t/1473985/you-are-invited-to-amd-s-news-conference-in-sunnyvale-ca-march-26

----

More on topic, I hope to see block diagrams for big Maxwell

GM107 is only one GPC (5 SMM)

I want to see something like this:
Warning: Spoiler! (Click to show)
1280 Maxwell CUDA, 256-bit memory bus, 80 TMU, 32 ROP , 120W TDP but actual ~80-100W power draw

Here's my speculation from mid-Feb (original post: http://hardforum.com/showthread.php?t=1806694 )



Click the spoiler! Detailed speculation about 20nm Maxwell (Click to show)
I'm gonna go off the deep end here with a detailed guesswork at what we might see with GM200 at 20nm. THE RESULTS AND MATH IS THEORETICAL ONLY; THIS IS NOT BASED ON ANY SECRETLY-KNOWN INFO FOR UNRELEASED PRODUCTS.


This slide says the case for Maxwell's first generation is reportedly, for the same performance level we should be seeing half the wattage used even on 28nm, and that per-core we are going to see a 35% gain (by their claims, at this point of course). So a card that would take Kepler 300w can be done in a 150w or less envelope is their claim, on 28nm even. That leaves a huge amount of extra power to continue stuffing in more cores and increasing clocks. Now add in that the Big Maxwell is going to be (GM200/204) on 20nm, and that improves power efficiency and die size usable even further. Now add in that this is Maxwell FIRST generation, for the GM107, and that GM200/204 are SECOND generation by all known info.... and we have a recipe for easily doubling the performance at least, if their claims are anywhere near true.

So, time for some math based on the theoretical and what is a real product launching in a couple of days (GTX 750 Ti).

GM107 has 5 SMM units containing 128 cuda cores, in each GPC. One GPC is what GM107 is using. It uses 60 watts for performance that is only beaten out by a GTX 660 by about 12% per the leaked benches. A GTX 660 has 960 Kepler cores, while a GM107 has 640 Maxwell cores. A GTX 660 uses a TDP rating of 140 watts. See an interesting number here?

960 is 50% more cores than the 640 ones Maxwell is using for the 750Ti. Now, it is around 12% slower there.... see the magic number close by? Nvidia claims a 35% performance increase PER CORE and additionally that the cores there will use only around half the power overall in TDP. Now extrapolate on some napkin math: the GM107 only has a 128-bit bus. That means the bandwidth efficiency is greatly improved because a GTX 650 Ti Boost is a huge amount above a GTX 650 Ti (192-bit vs. 128-bit bus widths and basically the same otherwise).

So it's safe to say that with 50% more bandwidth and 50% more cores, a GTX 660 only performing ~12% higher than a Maxwell part with slightly higher clocks is incredibly impressive. Now let's scale! We know at 28nm lithography that GM107 is around 148 square millimeters for the die size. Let's COMPLETELY IGNORE 20NM for a second here on the power savings and size! Forget about it for a minute. It would be extremely easy to see, since even on 28nm the reticule size is around 570mm2, that they could use 3 GPC units on 28nm taking around 420-430mm2 with this imaginary chip that would have 15 SMM units. 15 SMM units times 128 per unit would mean 1920 Maxwell cores.

Pretend their scheduler is great and the performance scales well with core count and clocks, and that they kept the idea of triple everything there in this hypothetical, non-existent card that is an illustration only. So we'd have a 384 bit bus with 1920 Maxwell cores, probably 7ghz memory speed of GDDR5 like Kepler does at least, and a TDP that fits inside of 200 watts. Now let's say that you only get about 75% scaling from core count here, which is reasonable even though Kepler scales pretty linearly, but it's a new architecture with Maxwell, so let's make the safe assumption. So a GTX 660 performs 12% better than a GM107 with 640 cores. Triple the core count there with our rough napkin math again with everything else and you would have a card performing around the same as GK110 fully unlocked by that theory-crafting, at least, and it has better potential for higher clocks thanks to the lower power usage.

However, in reality, we know they are going 20nm. This allows for even more power savings. This also allows for many more transistors per square mm on the die. So pretend they want to go for a 520mm2 chip on 20nm, keep costs down a tad for Big Maxwell and improve yields per wafer. According to released documents such as this: http://www.cadence.com/rl/Resources/overview/20nm_qa.pdf we can expect to see transistor counts possible of 8-12 billion. GK110 is 7.1 billion transistors. Using that as a point of reference, let's scale 28nm GK110 to 28nm Maxwell GM107: we need less memory controllers and pad space, so we can safely come up with a number in the neighborhood, considering the 148mm2 die size compared to GK110's 551mm2 size. At 20nm, you will be able to fit upwards of 11-12 billion transistors for a high-end part. For 28nm the 148mm2 die size indicates roughly 1.7-1.8 billion transistors with the 128-bit bus.

So now we have a decent number here: we know that Maxwell at 28nm in GM107 form is taking about 2 billion transistors to perform at a level about 89% as fast as a GK106. Let's use this as a base for the next part of this thought exercise smile.gif.

So 2 billion.... it's safe to say they could fit 7 GPC's at 20nm easily since 20nm should provide roughly a 2x density shrink in die size used per transistor, very easily, and we wouldn't need to duplicate memory controllers beyond 3x of the 28nm design's if we went for a 384-bit bus. At 20nm let's say they went for a 384-bit bus, to feed 7 GPC's worth of cores since the architecture is more bandwidth-efficient than Kepler, clearly by far. That'd give us 4480 Maxwell cores which already are much more bandwidth-efficient, so it probably would be plenty well-fed on that end of things.

Power-wise we'd be looking at an envelope of, 250 watts since Maxwell GM107 at 28nm takes a full-card power of 60w for 640 cores. That means probably around 45w for the GPU itself, allowing 10w for the GDDR5 and 5w for the fan and other circuitry. The move to 20nm will improve power efficiency drastically, due to the shorter gate lengths. So it's fair to say they could fit seven of those inside of that envelope, more than easily.

Again, let's go for a linear scaling factor of approximately 75% for the core count improvements... so we have a GTX 660 GK106 card we will use and compare to a GK110 780 Ti. A GTX 660 at 1080p is able to pull about 50% of the performance of a fully-unlocked, 780 Ti (source: http://www.techpowerup.com/reviews/NVIDIA/GeForce_GTX_780_Ti/27.html).

Now we know each SMM will provide around 90% of that performance, and revision 2.0 (second gen) on 20nm will probably be closer to a 100% figure. So let's conservatively say each SMM-based GPC results in a real-world performance of a GTX 660. In other words, around half of a 780 Ti. Now let's conservatively also say we only see the benefit of 75% of the cores when scaling it to 7 GPC's or 35 SMM's. 35 SMM's would be, as you recall 4480 maxwell cores. Multiply the performance of the 50% card by 7 and we'd have 350% (or 2.5x faster than) of the performance of a full GK110. However, let's now apply the 75% rough rule and we come up with a much more reasonable 262.5% of the performance, or 2.6x as fast as (1.6x faster than) a GTX 780 Ti.

My predictions, therefore, are that we will see a Big Maxwell on 20nm with at minimum twice the performance of a GTX 780 Ti, and by current rumors it is due this year. Add in that Maxwell will be able to clock higher at 20nm (I based those power figures off of the numbers above which were of a card at 28nm with a 1085mhz GPU clock, and realistically they can probably fit a stock core speed of 1100-1150mhz of this big a chip in then. Thus my predictions are we would see a 450-460mm2 die size for this hypothetical card, at 20nm, with a 250-260w TDP rating and thanks to the 2x transistor density, approximately 9.2b transistors.

Napkin math, for sure, but worth thinking about, eh?
The Sig Rig
(16 items)
 
  
CPUMotherboardGraphicsRAM
i7 2600K @ 4.xghz MSI Z68A-GD65 (G3) MSI GTX 970 Gaming SLI @ 1504mhz core/8020mhz m... 16GB DDR3 2x 8gb @ 1866mhz C8 
Hard DriveHard DriveOptical DriveCooling
240GB Sandisk Extreme A few 2tb hdd's. DVDRW Corsair H75 CLC on the CPU 
OSMonitorKeyboardPower
Windows 8.1 Pro 64-bit ACER B326hk 4k 60hz 3840x2160 IPS sst display Coolermaster Cherry MX Brown switch mechanical ... Corsair TX 950w 
CaseMouseMouse PadAudio
Fractal Design Define R3 Black Pearl Logitech MX518 EverGlide Titan "Medium" Soundblaster Z PCIE 
  hide details  
Reply
The Sig Rig
(16 items)
 
  
CPUMotherboardGraphicsRAM
i7 2600K @ 4.xghz MSI Z68A-GD65 (G3) MSI GTX 970 Gaming SLI @ 1504mhz core/8020mhz m... 16GB DDR3 2x 8gb @ 1866mhz C8 
Hard DriveHard DriveOptical DriveCooling
240GB Sandisk Extreme A few 2tb hdd's. DVDRW Corsair H75 CLC on the CPU 
OSMonitorKeyboardPower
Windows 8.1 Pro 64-bit ACER B326hk 4k 60hz 3840x2160 IPS sst display Coolermaster Cherry MX Brown switch mechanical ... Corsair TX 950w 
CaseMouseMouse PadAudio
Fractal Design Define R3 Black Pearl Logitech MX518 EverGlide Titan "Medium" Soundblaster Z PCIE 
  hide details  
Reply
post #56 of 81
My body is ready as HELL!
post #57 of 81
Can't wait for the livestream. Though I'm pretty sure that it's mostly going to be grid, mobile, tegra, shield, streaming, g-sync and actually very little proper info that anyone really cares about...
Quote:
Originally Posted by twitchyzero View Post

i feel sorry for those that just bought Titan/TitanBlack/780/780Ti

my guess is GTX 880 is a GM10x small die that has 780Ti performance for $499 while drawing something ridiculous like under 150W

I don't see why you'd feel sorry for GK110 owners... Most of us have had these cards for ages now. Hell, some of us Titan owners have been sitting on our cards for over 13 months now. That's as long as the difference between the GTX 580 and the 7970...
 
Benching
(17 items)
 
 
CPUMotherboardGraphicsRAM
[i7 5960X @ 4.8GHz] [Rampage V Extreme] [Titan 1400MHz (1500MHz bench)] [Various] 
Hard DriveCoolingCoolingCooling
[250GB 840EVO +2x SpinpointF3 1TB RAID0] [LD PC-V2 SS Phase Change] [XSPC X2O 750 pump/res] [Monsta 360 full copper + EK XT 360 + XT 240] 
MonitorPowerCaseAudio
[Crossover 27Q LED-P 1440p+ASUS 1200p+LG 1080p] [Corsair AX1200] [Dimastech Easy v3.0] [Sennheiser HD558s] 
CPUCPUMotherboardGraphics
FX 8320, FX 8350, Phenom II x2 555BE i7 3930K, i7 860, i7 4770K, 68x Celeron D CVF, commando, 2x RIVE, Z87X-OC Asus 4870x2, Sapphire 4870 
GraphicsGraphicsGraphicsGraphics
2x 5870, 5850, 5830, 5770 2x 3870x2, 3870 GTX Titan, GTX 480, GTX 590 GTX 285, GTX 260, 4x 9800GT, 8800GTX 
RAMHard DriveCoolingCooling
4x4GB vengeance, 2x4GB predatorX, 2x1GB OCZ DDR2 Intel X25-M 80GB LD PC-V2 SS Phase Change OCN Marksman 
CoolingCoolingOSPower
2x old tek slims (GPU) Various watercooling stuff win7, winxp AX1200 
Case
test bench / cardboard box 
  hide details  
Reply
 
Benching
(17 items)
 
 
CPUMotherboardGraphicsRAM
[i7 5960X @ 4.8GHz] [Rampage V Extreme] [Titan 1400MHz (1500MHz bench)] [Various] 
Hard DriveCoolingCoolingCooling
[250GB 840EVO +2x SpinpointF3 1TB RAID0] [LD PC-V2 SS Phase Change] [XSPC X2O 750 pump/res] [Monsta 360 full copper + EK XT 360 + XT 240] 
MonitorPowerCaseAudio
[Crossover 27Q LED-P 1440p+ASUS 1200p+LG 1080p] [Corsair AX1200] [Dimastech Easy v3.0] [Sennheiser HD558s] 
CPUCPUMotherboardGraphics
FX 8320, FX 8350, Phenom II x2 555BE i7 3930K, i7 860, i7 4770K, 68x Celeron D CVF, commando, 2x RIVE, Z87X-OC Asus 4870x2, Sapphire 4870 
GraphicsGraphicsGraphicsGraphics
2x 5870, 5850, 5830, 5770 2x 3870x2, 3870 GTX Titan, GTX 480, GTX 590 GTX 285, GTX 260, 4x 9800GT, 8800GTX 
RAMHard DriveCoolingCooling
4x4GB vengeance, 2x4GB predatorX, 2x1GB OCZ DDR2 Intel X25-M 80GB LD PC-V2 SS Phase Change OCN Marksman 
CoolingCoolingOSPower
2x old tek slims (GPU) Various watercooling stuff win7, winxp AX1200 
Case
test bench / cardboard box 
  hide details  
Reply
post #58 of 81
GoldenTiger , the reason why memory bandwidth is not as much of a factor is the cache size has been raised a lot.

L2 is 2560KB instead of 256KB on GK107

GK106 = 384 KB
GK104 = 512KB
GK110 = 1536KB (1.5MB)

Pitcairn / R9 270X = 512KB
Tahiti / R9 280X = 768KB
Hawaii / R9 290 = 1MB
Edited by AlphaC - 3/24/14 at 8:57pm
Workstation stuff
(407 photos)
SpecViewperf 12.0.1
(158 photos)
 
Reply
Workstation stuff
(407 photos)
SpecViewperf 12.0.1
(158 photos)
 
Reply
post #59 of 81
:wheee:Guess who's excited? Just about anyone here.
post #60 of 81
Want to buy Denver please. That is the only way i'll be touching maxwell.
Quote:
Originally Posted by twitchyzero View Post

Warning: Spoiler! (Click to show)
seems pretty obvious to me

580 was the big Fermi chip (or wasn't it the 480?) that launched 18 months before 680
680 was about 20-25% faster at the same pricepoint

there's lots of "if's" but let's just assume for the sake of comparsion

let's say 880 comes out in September....with performance of a Titan Black with 4GB VRAM $599 or less
780Ti is still $100 more and Titan Black $300 more all within less than 9 month time frame

don't you see how drastically different it would be compared to the 580-> 680 launch?

I have loved every day of being a titan owner for the last 14 months. The maxwell performance jump could be even larger than we think, if Denver is on board... Though something tells me Denver may not show until closer to DX12 now.
IT'S WHITE
(21 items)
 
 
CPUMotherboardGraphicsGraphics
4930k Rampage iv extreme Black Edition GTX Titan  GTX Titan 
GraphicsRAMHard DriveHard Drive
GTX Titan GskillZ Trident X Samsung EVO samsung 840 evo 
Optical DriveCoolingCoolingOS
Asus External Blue Ray drive 4x 480x60mm Swiftech MCP35x2 Win 8.1.2 
MonitorMonitorMonitorPower
AOC AOC AOC EVGA G2 1300w 
CaseMouseMouse PadAudio
STH10 RAT7 what is this?  Vali / Modi :(  
Other
Corsair HX850 
  hide details  
Reply
IT'S WHITE
(21 items)
 
 
CPUMotherboardGraphicsGraphics
4930k Rampage iv extreme Black Edition GTX Titan  GTX Titan 
GraphicsRAMHard DriveHard Drive
GTX Titan GskillZ Trident X Samsung EVO samsung 840 evo 
Optical DriveCoolingCoolingOS
Asus External Blue Ray drive 4x 480x60mm Swiftech MCP35x2 Win 8.1.2 
MonitorMonitorMonitorPower
AOC AOC AOC EVGA G2 1300w 
CaseMouseMouse PadAudio
STH10 RAT7 what is this?  Vali / Modi :(  
Other
Corsair HX850 
  hide details  
Reply
New Posts  All Forums:Forum Nav:
  Return Home
  Back to Forum: NVIDIA
Overclock.net › Forums › Graphics Cards › NVIDIA › [Proclockers] **FAKE,LOCK ME** GTC 2014 nvidia keynote info 20nm maxwell preview + r340 drivers