Overclock.net - Overclocking.net
     
 
Home Gallery Reviews Blogs Register Today's Posts Mark Forums Read Members List


Go Back   Overclock.net - Overclocking.net > Benchmarks > Benchmarking Software and Discussion

Reply
 
LinkBack Thread Tools
Old 09-04-09   #1 (permalink)
Commodore 64
 
wuttz's Avatar
 
amd ati

Join Date: Mar 2009
Location: Pearland, Texas
Posts: 989

Rep: 81 wuttz is acknowledged by some
Unique Rep: 66
Trader Rating: 6
Default SuperPi 1m/32m? Nope!

did a little experiment today on the fastest software i've seen to calculate pi values to 1m/32m digits.

first off, credit goes to amdzone.com(abinstein), where i learned about this.



i've not seen any windows pi calculator come close to this kind of performance. this is using the gmp-chudnovsky.c program from: http://gmplib.org/

my scores(from image above)
1m pi digits total time = ~1sec(HWBOT WR=6.75secs w/ LN2)
32m pi digits total time = ~72secs(HWBOT WR=375.30secs w/ LN2)

my settings are all at stock (DDRAM=1066, 7-7-7)with my sig rig.

09/05/2009
Now why do I say SuperPi is useless?

SuperPi is an antiquated benchmark that assesses core performance using an antiquated fpu x87 instruction set. It is a grossly inefficient computer program that uses floating point units to calculate pi and "evaluate" processor cache performance. It's not by any means a "real world" reflection of todays modern cpu's microarch capabilities. It's like running a horse(fpu x87 instructions) down today's paved interstate(the cpu hardware), when all the while a car(newer simd's) are available.

SuperPi fails to take advantage of newer instruction sets, and hence also fails to evaluate modern-day hardware capabilities that newer processors support and are designed with. The FPU X87 instruction set was developed for an external co-processor dedicated to computing mathematical tasks and worked with the 80286/80386 microprocessors. Now if you havent heard of the 286/386, then you know SuperPi is that old.

How many real world applications today still use the FPU x87 instruction set? Moreover, FPU x87 legacy support is still integrated in todays microprocessors since it uses less than 5% of the microprocessor die. The headache in legacy support is not worth the cost of removing it from the die, so it stays there. However, why dont we benchers use a better benchmark tool that uses at least more than 5% of the processor die(and updated code)? Or are we just too involved in saying, hey 5% of my processor runs really really well!



Lastly, with SuperPi being closed source you can't inspect the code for any vendor optimizations, so it is inherently suspect. To make matters worse, it is mostly compiled on vendor-biased compilers (which is all we have in a Windows world).

Now my point is, maybe we have focused too much on the hardware and continually do hardware upgrades that we forget the fact we are using a 15 year old outdated benchmark to evaluate said state-of-the-art equipment? Using SuperPi to "benchmark" processors is really just shoving garbage, age old, code down the latest processor that you have. Unless the typical computer user today still likes to use Windows 3.1, then I can understand the significance of SuperPi. But alas, 'tis time to move on. It's the Windows 7 era already.

Shouldn't state-of-the-art hardware go hand in hand with cutting edge benchmarking software?

Or is the pursuit of performance really just on the hardware side?

[Will be updated as I learn more about CPU architectures and software.]

System: sys.info
CPU
| AMD 955BE |
Motherboard
| 790FX-GD70 |
Memory
| 2X2GB OCZ3 DDR3-2000 |
Graphics Card
| 4870X2 |
Hard Drive
| 7200.11 500GB/32MB |
Power Supply
| PCP&C 750W |
CPU cooling
| Prolimatech Megahalem |
OS
| Windows7 x64 |
Monitor
| Syncmaster 2343BWX |

Last edited by wuttz : 09-05-09 at 03:57 AM
wuttz is offline wuttz's Gallery   Reply With Quote
Old 09-04-09   #2 (permalink)
4.0 GHz
 
aaronmonto's Avatar
 
intel nvidia

Join Date: Nov 2008
Location: Northwest US
Posts: 1,093
Blog Entries: 1

Rep: 97 aaronmonto is acknowledged by some
Unique Rep: 76
Hardware Reviews: 6
Trader Rating: 0
Default

Is the Phenom II still 35% slower than C2D's at the same clock speed in this benchie?
__________________
Quote:
Originally Posted by sinner491 View Post
hey thanks for your time but dats it??? dats all i need to make a computer?
Quote:
Originally Posted by Chucklez View Post
What do you mean? I have read a lot of reviews and a lot of them have come out as i5 (2.66GHz) on par with Phenom II (3.4GHz). Even including those apps and such where Intel clearly has a huge advantage.

System: Catasor II - Revo Evo ;D
CPU
Core i5 750 @ 4.2
Motherboard
Gigabyte P55-UD3R
Memory
4GB Trident DDR3 2000 @ 2100
Graphics Card
ASUS HD 4830
Hard Drive
2 x 1TB F3
Sound Card
Onboard
Power Supply
OCZ ModXStream 700W
Case
CM ATCS 840 Silver
CPU cooling
Corsair H50
GPU cooling
Stock
OS
Windows 7 Ultimate x64
Monitor
SAMSUNG 2333SW 1080P
aaronmonto is offline   Reply With Quote
Old 09-04-09   #3 (permalink)
Commodore 64
 
wuttz's Avatar
 
amd ati

Join Date: Mar 2009
Location: Pearland, Texas
Posts: 989

Rep: 81 wuttz is acknowledged by some
Unique Rep: 66
Trader Rating: 6
Default

Quote:
Originally Posted by aaronmonto View Post
Is the Phenom II still 35% slower than C2D's at the same clock speed in this benchie?
http://gmplib.org/pi-with-gmp.html

theres times listed there, although its not comprehensive. if we have intel benchers interested, i'd appreciate your input.

System: sys.info
CPU
| AMD 955BE |
Motherboard
| 790FX-GD70 |
Memory
| 2X2GB OCZ3 DDR3-2000 |
Graphics Card
| 4870X2 |
Hard Drive
| 7200.11 500GB/32MB |
Power Supply
| PCP&C 750W |
CPU cooling
| Prolimatech Megahalem |
OS
| Windows7 x64 |
Monitor
| Syncmaster 2343BWX |
wuttz is offline wuttz's Gallery   Reply With Quote
Old 09-04-09   #4 (permalink)
2 + 2 = 5
 
DuckieHo's Avatar
 
intel nvidia

Join Date: Nov 2006
Location: In a Chair.
Posts: 34,845

Rep: 4162 DuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guruDuckieHo is a guru
Unique Rep: 1902
Trader Rating: 56
Default

Quote:
Originally Posted by wuttz View Post
did a little experiment today on the fastest software i've seen to calculate pi values to 1m/32m digits.

first off, credit goes to amdzone.com(abinstein), where i learned about this.



i've not seen any windows pi calculator come close to this kind of performance. this is using the gmp-chudnovsky.c program from: http://gmplib.org/

my scores(from image above)
1m pi digits total time = ~1sec(HWBOT WR=6.75secs w/ LN2)
32m pi digits total time = ~72secs(HWBOT WR=375.30secs w/ LN2)

my settings are all at stock (DDRAM=1066, 7-7-7)with my sig rig.


now my point is, maybe we have focused too much on the hardware and continually do hardware upgrades that we forget the fact we are using a 15 year old outdated benchmark to evaluate said state-of-the-art equipment?

shouldn't state-of-the-art hardware go hand in hand with cutting edge benchmarking software?

is the pursuit of performance really just on the hardware side?
A benchmark's purpose is NOT get the best score. It is to evaluate performance of different hardware. SuperPi testings floating point and cache performance. I don't believe it is multi-threaded. If you want even faster PI scores, there are certain methods that can be broken apart into threads. Those would yield faster times.

i.e. You don't benchmark a car's 1/4 mi time on downhill slope. Yeah, it would reduce the times for all cars but that's not the point.


There are a number of different ways to caculating PI.
The one used in that benchmarks is this: http://en.wikipedia.org/wiki/Chudnovsky_algorithm

Here's at least another dozen methods: http://en.wikipedia.org/wiki/Numeric...ions_of_%CF%80
__________________
To answer most of your questions: (1) a fridge cannot cool a PC (2) 64-bit OS for over 3.4GB (3) If a PCIe card fits, it should work (4) Resolution, not screen size (5) If you have a question, it is not news (6) Report, not respond to Spam (7) Single-Rail/Non-Modular PSUs are not always better than Multi-Rail/Modular


System: Three Dead Mobos in a Year
CPU
Q6600 (3.4GHz)
Motherboard
EVGA 780i
Memory
2x2GB OCZ Reaper 1096MHz
Graphics Card
GTX260 55nm
Hard Drive
PERC 6/i: 3xRAID0 7200.12 500GB
Sound Card
X-Fi XtremeMusic
Power Supply
Corsair 620HX
Case
Li Lian PC-V2100 [10x120mm fans]
CPU cooling
FuZion V2 + Quad-Heatercore
GPU cooling
EK Block + DDC-3.2
OS
Vista Ultimate 64
Monitor
Samsung 226BW "C" + Sceptre 19"
DuckieHo is online now Overclocked Account DuckieHo's Gallery   Reply With Quote
Old 09-04-09   #5 (permalink)
4.0 GHz
 
aaronmonto's Avatar
 
intel nvidia

Join Date: Nov 2008
Location: Northwest US
Posts: 1,093
Blog Entries: 1

Rep: 97 aaronmonto is acknowledged by some
Unique Rep: 76
Hardware Reviews: 6
Trader Rating: 0
Default

Quote:
Originally Posted by DuckieHo View Post
A benchmark's purpose is NOT get the best score. It is to evaluate performance of different hardware. SuperPi testings floating point and cache performance. I don't believe it is multi-threaded. If you want even faster PI scores, there are certain methods that can be broken apart into threads. Those would yield faster times.

i.e. You don't benchmark a car's 1/4 mi time on downhill slope. Yeah, it would reduce the times for all cars but that's not the point.


There are a number of different ways to caculating PI.
The one used in that benchmarks is this: http://en.wikipedia.org/wiki/Chudnovsky_algorithm

Here's at least another dozen methods: http://en.wikipedia.org/wiki/Numeric...ions_of_%CF%80
You just explained the world to me... The Phenom II's L3 is slower than my RAM. No wonder.
__________________
Quote:
Originally Posted by sinner491 View Post
hey thanks for your time but dats it??? dats all i need to make a computer?
Quote:
Originally Posted by Chucklez View Post
What do you mean? I have read a lot of reviews and a lot of them have come out as i5 (2.66GHz) on par with Phenom II (3.4GHz). Even including those apps and such where Intel clearly has a huge advantage.

System: Catasor II - Revo Evo ;D
CPU
Core i5 750 @ 4.2
Motherboard
Gigabyte P55-UD3R
Memory
4GB Trident DDR3 2000 @ 2100
Graphics Card
ASUS HD 4830
Hard Drive
2 x 1TB F3
Sound Card
Onboard
Power Supply
OCZ ModXStream 700W
Case
CM ATCS 840 Silver
CPU cooling
Corsair H50
GPU cooling
Stock
OS
Windows 7 Ultimate x64
Monitor
SAMSUNG 2333SW 1080P
aaronmonto is offline   Reply With Quote
Old 09-04-09   #6 (permalink)
Commodore 64
 
wuttz's Avatar
 
amd ati

Join Date: Mar 2009
Location: Pearland, Texas
Posts: 989

Rep: 81 wuttz is acknowledged by some
Unique Rep: 66
Trader Rating: 6
Default

Quote:
Originally Posted by DuckieHo View Post
i.e. You don't benchmark a car's 1/4 mi time on downhill slope. Yeah, it would reduce the times for all cars but that's not the point.
i believe the better example would be having a "horse" as the benchmark software(superpi), and running that horse down a paved interstate. is that "horse" really "evaluating the true performance of the paved interstate?"

also, gmp-chudnovsky.c is an open source program with the code open for inspection, just to be sure no vendor-optimized code gets dropped in.

on top of that, it is compiled on a vendor-neutral opensource compiler, gcc.

can you say the same for superpi?

System: sys.info
CPU
| AMD 955BE |
Motherboard
| 790FX-GD70 |
Memory
| 2X2GB OCZ3 DDR3-2000 |
Graphics Card
| 4870X2 |
Hard Drive
| 7200.11 500GB/32MB |
Power Supply
| PCP&C 750W |
CPU cooling
| Prolimatech Megahalem |
OS
| Windows7 x64 |
Monitor
| Syncmaster 2343BWX |

Last edited by wuttz : 09-04-09 at 01:30 PM
wuttz is offline wuttz's Gallery   Reply With Quote
Old 09-05-09   #7 (permalink)
Commodore 64
 
wuttz's Avatar
 
amd ati

Join Date: Mar 2009
Location: Pearland, Texas
Posts: 989

Rep: 81 wuttz is acknowledged by some
Unique Rep: 66
Trader Rating: 6
Default

did a comparison run using y-cruncher, a windows pi calculator client that makes use of the same chudnovsky compute algorithm i used under linux.

first, a core-for-core comparison between the two operating systems.



the linux bench was faster by 5 seconds in computing 32m pi values.

now, turning on multi-threading for y-cruncher..



it finishes the task in ~21.3secs!

these programs clearly make use of more efficient code and up-to-date instruction sets that modern computer architectures have. so i'd say these pi calculators reflect a better assessment of a cpu's computing abilities, instead of superpi.

System: sys.info
CPU
| AMD 955BE |
Motherboard
| 790FX-GD70 |
Memory
| 2X2GB OCZ3 DDR3-2000 |
Graphics Card
| 4870X2 |
Hard Drive
| 7200.11 500GB/32MB |
Power Supply
| PCP&C 750W |
CPU cooling
| Prolimatech Megahalem |
OS
| Windows7 x64 |
Monitor
| Syncmaster 2343BWX |
wuttz is offline wuttz's Gallery   Reply With Quote
Old 09-07-09   #8 (permalink)
Mobo Master
 
Starbuck5000's Avatar
 
intel nvidia

Join Date: May 2008
Location: England
Posts: 1,010

Rep: 70 Starbuck5000 is acknowledged by some
Unique Rep: 63
Trader Rating: 4
Default

Like what Duckie said.

Competative benchmarking isn't really about a showing the most accurate reflection, since when have sythenic benches really be accurate to real world performance anyway? If that was the case then we would only be benching Vantage.

Really competative benchmarking is more about the tweaks and the skill in tuning the system. PI is a benchmark that responds well to this. SuperPI is a single threaded benchmark. If you wanna see it added to hwbot make this same post on their forums
__________________
Electrical Engineering and Computer Science Student (Joint Bsc Hons)
STUDENT FINANCE HURRY UP!!!

Has far to many components for a 20 year old, and while you're at it forgive my spelling, the keyboard is dyslexic

Quote:
Originally Posted by Tator Tot View Post
Ah the dickery.

Sadly enough, I don't like being a dick in game. It's much more fun flaunting your junk IRL.

System: 24/7 rig, now with less powah than the bench
CPU
W3540
Motherboard
Evga X58 LE
Memory
3x2GB OCZ Golds
Graphics Card
Evga GTX295
Hard Drive
Perc 5i; 4 WDC3200AAKS's Raid 5, 1TB Caviar Green
Sound Card
Asus Xonar D2X
Power Supply
1000W Thermaltake modular
Case
Lian Li PC-A71
CPU cooling
EK-Supreme/BIX420
GPU cooling
EK full cover
OS
Windows 7 Pro - Retail MSDN-AA
Monitor
20.1" LG
Starbuck5000 is offline   Reply With Quote
Old 09-14-09   #9 (permalink)
I got nothin'...
 
BenBrown's Avatar
 
amd ati

Join Date: Apr 2008
Location: Tulsa, OK
Posts: 2,374
Blog Entries: 8

Hardware Reviews: 7
Trader Rating: 15
Default

I tend to agree more with wuttz. Standards need to be updated every once in a while. This should definitely be posted on HWBot.rg.
__________________
"You make a living by what you get. You make a life by what you give." --Winston Churchill

3DMark Vantage - 19,552 -- 3DMark06 - 25,066 -- 3DMark05 - 31,688 -- 3DMark03 - 114,287 -- CPU-Z - 4800MHz

** Stuff For Sale **
**The Brown Blogger**
Phenom I vs. Phenom II - Clock for Clock & Phenom II Overclocking Results

System: WATER DRAGON
CPU
Phenom X4 965 BE (CACYC AC 0931DPMW)
Motherboard
MSI 790FX-GD70
Memory
G.SKILL Ripjaws 4GB(2 x 2GB) DDR3 1600
Graphics Card
3x VisionTek HD 4870
Hard Drive
2x WD 500GB HD
Sound Card
On Board
Power Supply
Ultra X3 1000W
Case
COOLER MASTER Centurion 590
CPU cooling
Swiftech Apogee GTZ (Custom H2O)
GPU cooling
Stock Cooler
OS
Windows 7 x64 Ultimate
Monitor
2x HP w2007
5 Million+ Folding at Home points
BenBrown is offline Overclocked Account BenBrown's Gallery   Reply With Quote
Old 09-18-09   #10 (permalink)
Solder Slinger
 
CL3P20's Avatar
 
intel nvidia

Join Date: Apr 2006
Location: sacramento, CA
Posts: 8,627

Folding Team Rank: 39
Trader Rating: 12
Default

I dont know what to think...I am getting .541sec on the 1mil Pi calc... ....more testing will need to be done on the nature of the tuning that most affects the outcome of the score [other than CPU speed].

System: 965-terror
CPU
e8400 ES- 533*7.5, 1.32v
Motherboard
P5B-DLX- ICHv and MCHv mod
Memory
2x1 Patriot Viper, 533*2
Graphics Card
9800GTX- 800/1944/1212 + 8800GS/Physx
Hard Drive
1x storage..2x bench HDD's
Sound Card
none
Power Supply
a hot an overworked TX650
Case
CyberDruid Tech station...sick
CPU cooling
ArticFreezer
GPU cooling
stock
OS
xp64-lite: gaming/xp-lite: bench'ng
Monitor
HPf2105 21"ws
4 Million+ Folding at Home points
CL3P20 is offline I fold for Overclock.net Overclocked Account CL3P20's Gallery   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools



All times are GMT -5. The time now is 03:29 AM.


Overclock.net is a Carbon Neutral Site Creative Commons License

Terms of Service / Forum Rules | Privacy Policy | DMCA Info | Advertising | Become an Official Vendor
Copyright © 2009 Shogun Interactive Development. Most rights reserved.
Page generated in 0.17857 seconds with 8 queries