[Tom's Hardware] Breakthrough DL Training Algorithm on Intel Xeon CPU System Outperforms Volta GPU By 3.5x - Overclock.net - An Overclocking Community
post #1 of 4 - 03-05-2020, 12:22 PM - Thread Starter (WannaBeOCer)
sudo apt install sl
 
Join Date: Dec 2009
Posts: 6,361
Rep: 188 (Unique: 130)

Source: https://www.tomshardware.com/news/br...ta-gpus-by-35x

Quote:
Updated 11:00am PT: Corrected the article to reflect that the tests were conducted with a single V100 GPU.

Original Article:

Computer scientists from Rice University, in collaboration with Intel Labs, have announced a breakthrough new deep learning algorithm – called SLIDE – that trains AI models faster on CPUs than traditional algorithms on GPUs. For some types of computation, this effectively moves the performance crown of fastest chip for training to CPUs.

In particular, the researchers benchmarked a system with 44 “Xeon-class cores” against a $100,000 system with eight Nvidia Volta V100 GPUs with tensor cores, although they only used one V100 for the tests. The Xeon system completed the task in one hour using SLIDE, compared to 3.5 hours for a single Volta V100 with a TensorFlow implementation. The researchers also noted that the algorithm may be further optimized as it competes against a mature (software and hardware) platform. For example, it did not yet use Intel's DLBoost acceleration.

Processors are going to be in much more demand than they already are.
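
The headline 3.5x figure follows directly from the two wall-clock times quoted from the article. A minimal sketch of that arithmetic in Python (the helper name `speedup` is made up for illustration and does not come from the article or the researchers):

```python
def speedup(baseline_hours: float, new_hours: float) -> float:
    """Return how many times faster the new run is than the baseline run."""
    if new_hours <= 0:
        raise ValueError("new_hours must be positive")
    return baseline_hours / new_hours


# Times quoted in the article:
# 3.5 hours on a single V100 (TensorFlow) vs 1 hour on the Xeon system (SLIDE).
v100_hours = 3.5
xeon_hours = 1.0

print(f"SLIDE on the Xeon system is {speedup(v100_hours, xeon_hours):.1f}x faster")
```

Note that this compares one V100 against the whole 44-core Xeon system, per the article's correction.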

Silent (20 items)
CPU: Core i9 9900K... CoffeeTime! @ 5.1Ghz w/ 1.36v
Motherboard: Maximus VIII Formula
GPU: Titan RTX @ 2100Mhz/2075Mhz
RAM: TeamGroup Xtreem 32GB 3200Mhz CL15
Hard Drive: HP EX950 2TB
Hard Drive: Samsung 850 Evo 1TB
Hard Drive: Samsung 850 Evo 1TB
Power Supply: EVGA SuperNova 1200w P2
Cooling: EK Supremacy Full Copper Clean
Cooling: XSPC D5 Photon v2
Cooling: Black Ice Gen 2 GTX360 x2
Cooling: EK-Vector Radeon VII - Copper + Plexi
Case: Thermaltake Core X5 Tempered Glass Edition
Operating System: Clear Linux
Monitor: Acer XF270HUA
Keyboard: Cherry MX Board 6.0
Mouse: Logitech G600
Mouse: Alugraphics GamerArt
Audio: Definitive Technology Incline
Audio: SMSL M8A

Last edited by WannaBeOCer; 03-05-2020 at 01:00 PM.
post #2 of 4 - 03-05-2020, 12:52 PM (speed_demon)
What goes here?

Join Date: Nov 2006
Location: Wisconsin
Posts: 2,243
Rep: 120 (Unique: 88)
Neat. Wonder if AMD has a similar competing technology.

Quis custodiet ipsos custodes? (Who will guard the guards themselves?)
Desktop (11 items)
CPU: Celeron G3930 eXtra Slow Edition
Motherboard: Gigabyte Z270 Gaming K5
GPU: MSI R9 390
RAM: 12GB DDR4-3000
Hard Drive: 120GB HP M700 SSD
Hard Drive: Samsung Spinpoint M9T 2TB
Hard Drive: Seagate Barracuda 750GB
Power Supply: Corsair CX750M
Case: Fractal Define S
Keyboard: Alienware W/Cherry MX Switches
Mouse: Logitech G5 Gen.1

post #3 of 4 - 03-05-2020, 12:58 PM - Thread Starter (WannaBeOCer)
sudo apt install sl
 
Join Date: Dec 2009
Posts: 6,361
Rep: 188 (Unique: 130)
Quote: Originally Posted by speed_demon View Post
Neat. Wonder if AMD has a similar competing technology.
This algorithm should also work on AMD's processors, since the article states they aren't using DLBoost yet. ASICs are great for inference, but until this breakthrough GPUs were the standard for training.

Quote:
Optimizations
The researchers say that there are further performance improvements left as they have “just scratched the surface”. To that end, they say that they have not used vectorization – such as AVX SIMD instructions – including Intel’s DLBoost acceleration and claimed “there are a lot of other tricks we could still use to make this even faster.”

post #4 of 4 - 03-05-2020, 01:27 PM (DNMock)
New to Overclock.net

Join Date: Jul 2014
Location: Dallas
Posts: 3,708
Rep: 174 (Unique: 128)
Interesting and really cool to see. I wonder what the power consumption difference is, though. As a guess, I would say the 44-core Intel chip is pulling twice as much power as the Tesla, but who knows.
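
One way to frame that question: energy used is average power times run time, so a system that draws more power can still use less energy if it finishes sooner. A rough sketch in Python, where the 2x power ratio is only the guess from this post (not a measurement), the run times are the ones quoted from the article, and `relative_energy` is a made-up helper name:

```python
def relative_energy(power_ratio: float, time_ratio: float) -> float:
    """Energy of system A relative to system B.

    power_ratio: average power of A divided by average power of B.
    time_ratio:  run time of A divided by run time of B.
    Energy = power * time, so the two ratios simply multiply.
    """
    return power_ratio * time_ratio


# Assumption (a guess, not measured): the Xeon system draws 2x the power of the V100.
cpu_vs_gpu_power = 2.0
# From the article: 1 hour on the Xeon system vs 3.5 hours on one V100.
cpu_vs_gpu_time = 1.0 / 3.5

ratio = relative_energy(cpu_vs_gpu_power, cpu_vs_gpu_time)
print(f"CPU run uses about {ratio:.2f}x the energy of the GPU run")
```

Under that guess the CPU run would still come out at a bit over half the energy of the GPU run. A real comparison would need measured power draw for both systems, including the host machine the V100 sits in.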

