Overclock.net › Forums › Overclockers Care › Overclock.net Folding@Home Team › Folding@Home Guides and Tutorials › [Info] To SMP N or Not SMP N - A Quick Observation
New Posts  All Forums:Forum Nav:

[Info] To SMP N or Not SMP N - A Quick Observation

post #1 of 4
Thread Starter 
Hello everyone,

In lieu of asking many folding questions tonight, it piqued my curiousity finally to see if I was really being as efficient as I could be with my new system and folding. As such, despite me simply letting FaH run as-is and not really give it a second thought, I finally took out some time to compile a little stat breakdown and realized that there are some good things I can do regarding the -SMP flag / switch.

Right now I'm running Windows 7 64-bit with the SR-2 system in my sig. (2x e5620 @ 2.4 stock w/Megahalems / 12 GB 1600 RAM, 1 EVGA GTX 580SC). What was very surprising with all this suggesting recently for me to figure out how to run -bigadv WUs, it also simply encouraged me to do some testing of my own. I hope you guys find the info useful.

Firstly I found a great thead here: http://foldingforum.org/viewtopic.php?f=55&t=15088 as well as one or two very good -bigWU threads on OCN. Thanks to Zodac, Deeeebs and others for their insight as well.

---------------

I wanted to test and see simply how the settings of CPU Idle VS Low, GPU Working VS Not, and using the -SMP flag VS an -SMP flag with a thread / core indicator would affect both the TPF (time per frame of a 1% increment) as well as the GPU itself in getting more work done sooner. Here are my results:

All stats tested with Project #6041 a3 SMP Core and Project #6801 GPU SMP Core. All work data save wudata_0x.dat and wuinfo_0x.dat was wiped from the working folder before each core run to keep the tests consistent.

-------------

SMP with no core flag simply means indicating the -SMP flag by itself in the configuration settings.
SMP -15 means that the SMP core uses all processing coresand threads (in my case except one) to allow the GPU to use a core for it's processing offload needs.
Core Priority Low or Idle is the setting in the configuration to account for if other programs are overriding FaH's processing but it can also affect how the CPU SMP and GPU clients affect each other when working concurrently.
Core Load is simply the load shown in Windows Task Manager.
GPU SMP is simply my shorthand for the GPGPU3 Core.

-------------

SMP with -15 and core priority to Low w/GPU working =
7m 53 sec TPF / Core Load @ 96-98%

SMP with no core flag and core priority to Low w/GPU working =
9m 14 sec TPF / Core Load @ 100% continuous

SMP with -15 and core priority to Low w/GPU not working =
7m 48 sec TPF / Core Load @ 93% continuous

SMP with no core flag and core priority to Low w/GPU not working =
7m 17 sec TPF / Core Load @ 100% continuous

-------------
How this all affects the GPU by itself...

GPU SMP with CPU SMP -15
~7,000 iter/sec / 1,250 ns/day

GPU SMP with CPU SMP no core flag
~6,000 iter/sec / 1,200 ns/day

-------------

SMP with -15 and core priority to Idle w/GPU working =
7m 53 sec TPF / Core Load @ 96-97%

SMP with no core flag and core priority to Idle w/GPU working =
9m 49 sec TPF / Core Load @ 100% continuous

SMP with -15 and core priority to Idle w/GPU not working =
7m 50 sec TPF / Core Load @ 93-97%

SMP with no core flag and core priority to Idle w/GPU not working =
7m 27 sec TPF / Core Load @ 100% continuous

-------------

As you can see, there are only a few wrong settings that conflict with each other and elevate the time needed per %. While one would think that by folding with their GPU and simply setting the -SMP flag would allow the system and program to automatically make the best choices, it would instead appear that giving (in my case) one extra core or thread for the GPU very much offsets the time needed for both the CPU and GPU to do their work. By forcing a particular -SMP core count, not only does it give the system a core to process any CPU load required by the GPU (and speeds it up slightly), it also seperates the CPU / GPU loads. In other words, the GPU load butting into a SMP core load of 100% forces the CPUs to take extra time to work on whatever the GPU needs done and in fact, slows down the CPUs significantly, as can be seen by nearly two minutes extra *per* TPF!

Of course, the most shocking thing is that I ran the "slow" method for almost my entire folding endeavors - it was only curiousity that encouraged me to really test it out and learn that yes, there are in fact, better ways of running two mixed clients.

I hope this helps and gives everyone more PPD if you weren't using the correct flags or switches already! Thanks.
Melchior SR-2
(15 items)
 
  
CPUMotherboardGraphicsRAM
2x Intel E5645 @ 3.6ghz EVGA SR-2 EVGA GTX Titan SC 36GB Mushkin DDR3 1600 
Hard DriveOptical DriveCoolingOS
512GB Samsung SSD / 300GB Seagate / 2 TB WD Cav... LG Blu-Ray Burner / 2x LG Millenniata DVD+/-RW Twin Corsair H80 Pumps + 3 YL Medium Fans Windows 7 Pro 64-bit 
MonitorKeyboardPowerCase
Dell 27" Ultrasharp U2711 Dell Quietkey Generic EVGA SR-2 PSU Lian-Li PC P80 
MouseMouse PadOther
Dell Optical Generic None Sometimes, 2x Radeon 5970s for compute needs 
  hide details  
Reply
Melchior SR-2
(15 items)
 
  
CPUMotherboardGraphicsRAM
2x Intel E5645 @ 3.6ghz EVGA SR-2 EVGA GTX Titan SC 36GB Mushkin DDR3 1600 
Hard DriveOptical DriveCoolingOS
512GB Samsung SSD / 300GB Seagate / 2 TB WD Cav... LG Blu-Ray Burner / 2x LG Millenniata DVD+/-RW Twin Corsair H80 Pumps + 3 YL Medium Fans Windows 7 Pro 64-bit 
MonitorKeyboardPowerCase
Dell 27" Ultrasharp U2711 Dell Quietkey Generic EVGA SR-2 PSU Lian-Li PC P80 
MouseMouse PadOther
Dell Optical Generic None Sometimes, 2x Radeon 5970s for compute needs 
  hide details  
Reply
post #2 of 4
Approved for good info.

Even though I hate threads beign here without pictures.
Megadoomer
(14 items)
 
Family Computer
(13 items)
 
 
CPUMotherboardGraphicsRAM
Phenom II X6 1090T @ 4.0Ghz ASUS M4A89GTD PRO Sparkle GTS 450 2x4GB G-Skill Sniper 
Hard DriveCoolingOSMonitor
Samsung F1 1TB CM Hyper 212+ Windows 7 Professional x64 Samsung T220 
KeyboardPowerCaseMouse
Logitech MX3000 Laser CM 1000M HAF 922 Logitech VX Revolution 
CPUMotherboardGraphicsRAM
Q6600 Asus PN5-D 750i Evga GTS 250 2x2GB Crucial Ballistix 
Hard DriveOptical DriveOSMonitor
750GB Hitachi Samsung Super Writemaster Windows 7 Professional x64 19" Dell 
PowerCase
Corsair 450VX Antec 900 
  hide details  
Reply
Megadoomer
(14 items)
 
Family Computer
(13 items)
 
 
CPUMotherboardGraphicsRAM
Phenom II X6 1090T @ 4.0Ghz ASUS M4A89GTD PRO Sparkle GTS 450 2x4GB G-Skill Sniper 
Hard DriveCoolingOSMonitor
Samsung F1 1TB CM Hyper 212+ Windows 7 Professional x64 Samsung T220 
KeyboardPowerCaseMouse
Logitech MX3000 Laser CM 1000M HAF 922 Logitech VX Revolution 
CPUMotherboardGraphicsRAM
Q6600 Asus PN5-D 750i Evga GTS 250 2x2GB Crucial Ballistix 
Hard DriveOptical DriveOSMonitor
750GB Hitachi Samsung Super Writemaster Windows 7 Professional x64 19" Dell 
PowerCase
Corsair 450VX Antec 900 
  hide details  
Reply
post #3 of 4
This is great info.... I have been using SMP on my CPU for about a month now... I actually stopped folding on my CPU because it just seemed to be not worth it. I was getting TPF's of 20+min on my cpu... I just starting folding again on my cpu about 3 days ago... and in 3 days only submitted 2 wu's....

came across this post and well added a "3" after -smp... and guess what my TPF has been cut by more than half... it went from 20+min TPF to about 7min TPF... all I can say is thanks for the infor REP+ for you

*edit
hmmm... well i could just be the WU i just got... the dreaded 6701 is pretty intense... guess i'll see when i do get another one... either way great info... nothings wrong with shaving off a few seconds... or even minutes in my case
Oneone Uapo
(19 items)
 
  
CPUMotherboardGraphicsRAM
i7 2600k Gigabyte P67A-UD7-B3 Tri-Fire HD 6950 G-Skill RipjawsX 4x2GB 
Hard DriveHard DriveHard DriveHard Drive
Crucial M4 64GB Western Digital Raptor Western Digital Western Digital 
CoolingCoolingCoolingOS
Triple Koolance VID697X Waterblocks Koolance Ram-33 Waterblocks Koolance CPU-370 Waterblock Windows 7 Pro 64bit 
MonitorPowerCaseMouse
3x Acer H203H Corsair HX1050 Mountain Mods Ext. Acc. Logitech G700 
  hide details  
Reply
Oneone Uapo
(19 items)
 
  
CPUMotherboardGraphicsRAM
i7 2600k Gigabyte P67A-UD7-B3 Tri-Fire HD 6950 G-Skill RipjawsX 4x2GB 
Hard DriveHard DriveHard DriveHard Drive
Crucial M4 64GB Western Digital Raptor Western Digital Western Digital 
CoolingCoolingCoolingOS
Triple Koolance VID697X Waterblocks Koolance Ram-33 Waterblocks Koolance CPU-370 Waterblock Windows 7 Pro 64bit 
MonitorPowerCaseMouse
3x Acer H203H Corsair HX1050 Mountain Mods Ext. Acc. Logitech G700 
  hide details  
Reply
post #4 of 4
Thread Starter 
Glad to know it helped you out!
Melchior SR-2
(15 items)
 
  
CPUMotherboardGraphicsRAM
2x Intel E5645 @ 3.6ghz EVGA SR-2 EVGA GTX Titan SC 36GB Mushkin DDR3 1600 
Hard DriveOptical DriveCoolingOS
512GB Samsung SSD / 300GB Seagate / 2 TB WD Cav... LG Blu-Ray Burner / 2x LG Millenniata DVD+/-RW Twin Corsair H80 Pumps + 3 YL Medium Fans Windows 7 Pro 64-bit 
MonitorKeyboardPowerCase
Dell 27" Ultrasharp U2711 Dell Quietkey Generic EVGA SR-2 PSU Lian-Li PC P80 
MouseMouse PadOther
Dell Optical Generic None Sometimes, 2x Radeon 5970s for compute needs 
  hide details  
Reply
Melchior SR-2
(15 items)
 
  
CPUMotherboardGraphicsRAM
2x Intel E5645 @ 3.6ghz EVGA SR-2 EVGA GTX Titan SC 36GB Mushkin DDR3 1600 
Hard DriveOptical DriveCoolingOS
512GB Samsung SSD / 300GB Seagate / 2 TB WD Cav... LG Blu-Ray Burner / 2x LG Millenniata DVD+/-RW Twin Corsair H80 Pumps + 3 YL Medium Fans Windows 7 Pro 64-bit 
MonitorKeyboardPowerCase
Dell 27" Ultrasharp U2711 Dell Quietkey Generic EVGA SR-2 PSU Lian-Li PC P80 
MouseMouse PadOther
Dell Optical Generic None Sometimes, 2x Radeon 5970s for compute needs 
  hide details  
Reply
New Posts  All Forums:Forum Nav:
  Return Home
Overclock.net › Forums › Overclockers Care › Overclock.net Folding@Home Team › Folding@Home Guides and Tutorials › [Info] To SMP N or Not SMP N - A Quick Observation