Folding@Home UNSTABLE_MACHINE - Overclock.net - An Overclocking Community

Forum Jump: 

[email protected] UNSTABLE_MACHINE

 
Thread Tools
post #1 of 3 (permalink) Old 12-31-2011, 07:59 PM - Thread Starter
New to Overclock.net
 
Senorpie7's Avatar
 
Join Date: Dec 2011
Location: Chicago IL
Posts: 406
Rep: 12 (Unique: 10)
So I am using the: "GPU3 (required for Fermi) no-nonsense console client" on my rig.
It works the charm almost all the time, until a little while ago I have noticed the client spat this at me:






--- Opening Log file [January 1 03:13:48 UTC]


# Windows GPU Console Edition #################################################
###############################################################################

[email protected] Client Version 6.41r2

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\TURBO\Desktop\Gpu
Executable: C:\Users\TURBO\Desktop\Gpu\[email protected]


[03:13:48] - Ask before connecting: No
[03:13:48] - User name: senorpie7 (Team 37726)
[03:13:48] - User ID: 4C81BB12022B2795
[03:13:48] - Machine ID: 9
[03:13:48]
[03:13:48] Gpu type=3 species=21.
[03:13:48] Work directory not found. Creating...
[03:13:48] Could not open work queue, generating new queue...
[03:13:48] - Preparing to get new work unit...
[03:13:48] Cleaning up work directory
[03:13:48] + Attempting to get work packet
[03:13:48] Gpu type=3 species=21.
[03:13:48] - Connecting to assignment server
[03:13:49] - Successful: assigned to (171.64.65.105).
[03:13:49] + News From [email protected]: Welcome to [email protected]
[03:13:49] Loaded queue successfully.
[03:13:49] Gpu type=3 species=21.
[03:13:50] + Closed connections
[03:13:50]
[03:13:50] + Processing work unit
[03:13:50] Core required: FahCore_15.exe
[03:13:50] Core not found.
[03:13:50] - Core is not present or corrupted.
[03:13:50] - Attempting to download new core...
[03:13:50] + Downloading new core: FahCore_15.exe
[03:13:50] + 10240 bytes downloaded
[03:13:50] + 20480 bytes downloaded
[03:13:50] + 30720 bytes downloaded
[03:13:50] + 40960 bytes downloaded
[03:13:50] + 51200 bytes downloaded
[03:13:50] + 61440 bytes downloaded
[03:13:51] + 71680 bytes downloaded
[03:13:51] + 81920 bytes downloaded
[03:13:51] + 92160 bytes downloaded
[03:13:51] + 102400 bytes downloaded
[03:13:51] + 112640 bytes downloaded
[03:13:51] + 122880 bytes downloaded
[03:13:51] + 133120 bytes downloaded
[03:13:51] + 143360 bytes downloaded
[03:13:51] + 153600 bytes downloaded
[03:13:51] + 163840 bytes downloaded
[03:13:51] + 174080 bytes downloaded
[03:13:51] + 184320 bytes downloaded
[03:13:51] + 194560 bytes downloaded
[03:13:51] + 204800 bytes downloaded
[03:13:51] + 215040 bytes downloaded
[03:13:51] + 225280 bytes downloaded
[03:13:51] + 235520 bytes downloaded
[03:13:51] + 245760 bytes downloaded
[03:13:51] + 256000 bytes downloaded
[03:13:51] + 266240 bytes downloaded
[03:13:51] + 276480 bytes downloaded
[03:13:51] + 286720 bytes downloaded
[03:13:51] + 296960 bytes downloaded
[03:13:51] + 307200 bytes downloaded
[03:13:51] + 317440 bytes downloaded
[03:13:51] + 327680 bytes downloaded
[03:13:51] + 337920 bytes downloaded
[03:13:51] + 348160 bytes downloaded
[03:13:51] + 358400 bytes downloaded
[03:13:51] + 368640 bytes downloaded
[03:13:51] + 378880 bytes downloaded
[03:13:51] + 389120 bytes downloaded
[03:13:51] + 399360 bytes downloaded
[03:13:51] + 409600 bytes downloaded
[03:13:52] + 419840 bytes downloaded
[03:13:52] + 430080 bytes downloaded
[03:13:52] + 440320 bytes downloaded
[03:13:52] + 450560 bytes downloaded
[03:13:52] + 460800 bytes downloaded
[03:13:52] + 471040 bytes downloaded
[03:13:52] + 481280 bytes downloaded
[03:13:52] + 491520 bytes downloaded
[03:13:52] + 501760 bytes downloaded
[03:13:52] + 512000 bytes downloaded
[03:13:52] + 522240 bytes downloaded
[03:13:52] + 532480 bytes downloaded
[03:13:52] + 542720 bytes downloaded
[03:13:52] + 552960 bytes downloaded
[03:13:52] + 563200 bytes downloaded
[03:13:52] + 573440 bytes downloaded
[03:13:52] + 583680 bytes downloaded
[03:13:52] + 593920 bytes downloaded
[03:13:52] + 604160 bytes downloaded
[03:13:52] + 614400 bytes downloaded
[03:13:52] + 624640 bytes downloaded
[03:13:52] + 634880 bytes downloaded
[03:13:52] + 645120 bytes downloaded
[03:13:52] + 655360 bytes downloaded
[03:13:52] + 665600 bytes downloaded
[03:13:52] + 675840 bytes downloaded
[03:13:52] + 686080 bytes downloaded
[03:13:52] + 696320 bytes downloaded
[03:13:52] + 706560 bytes downloaded
[03:13:52] + 716800 bytes downloaded
[03:13:52] + 727040 bytes downloaded
[03:13:52] + 737280 bytes downloaded
[03:13:52] + 747520 bytes downloaded
[03:13:52] + 757760 bytes downloaded
[03:13:52] + 768000 bytes downloaded
[03:13:53] + 778240 bytes downloaded
[03:13:53] + 788480 bytes downloaded
[03:13:53] + 798720 bytes downloaded
[03:13:53] + 808960 bytes downloaded
[03:13:53] + 819200 bytes downloaded
[03:13:53] + 829440 bytes downloaded
[03:13:53] + 839680 bytes downloaded
[03:13:53] + 849920 bytes downloaded
[03:13:53] + 860160 bytes downloaded
[03:13:53] + 870400 bytes downloaded
[03:13:53] + 880640 bytes downloaded
[03:13:53] + 890880 bytes downloaded
[03:13:53] + 901120 bytes downloaded
[03:13:53] + 911360 bytes downloaded
[03:13:53] + 921600 bytes downloaded
[03:13:53] + 931840 bytes downloaded
[03:13:53] + 942080 bytes downloaded
[03:13:53] + 952320 bytes downloaded
[03:13:53] + 962560 bytes downloaded
[03:13:53] + 972800 bytes downloaded
[03:13:53] + 983040 bytes downloaded
[03:13:53] + 993280 bytes downloaded
[03:13:53] + 1003520 bytes downloaded
[03:13:53] + 1013760 bytes downloaded
[03:13:53] + 1024000 bytes downloaded
[03:13:53] + 1034240 bytes downloaded
[03:13:53] + 1044480 bytes downloaded
[03:13:53] + 1054720 bytes downloaded
[03:13:53] + 1064960 bytes downloaded
[03:13:53] + 1075200 bytes downloaded
[03:13:53] + 1085440 bytes downloaded
[03:13:53] + 1095680 bytes downloaded
[03:13:53] + 1105920 bytes downloaded
[03:13:53] + 1116160 bytes downloaded
[03:13:54] + 1126400 bytes downloaded
[03:13:54] + 1136640 bytes downloaded
[03:13:54] + 1146880 bytes downloaded
[03:13:54] + 1157120 bytes downloaded
[03:13:54] + 1167360 bytes downloaded
[03:13:54] + 1177600 bytes downloaded
[03:13:54] + 1187840 bytes downloaded
[03:13:54] + 1198080 bytes downloaded
[03:13:54] + 1208320 bytes downloaded
[03:13:54] + 1218560 bytes downloaded
[03:13:54] + 1228800 bytes downloaded
[03:13:54] + 1239040 bytes downloaded
[03:13:54] + 1249280 bytes downloaded
[03:13:54] + 1259520 bytes downloaded
[03:13:54] + 1269760 bytes downloaded
[03:13:54] + 1280000 bytes downloaded
[03:13:54] + 1290240 bytes downloaded
[03:13:54] + 1300480 bytes downloaded
[03:13:54] + 1310720 bytes downloaded
[03:13:54] + 1320960 bytes downloaded
[03:13:54] + 1331200 bytes downloaded
[03:13:54] + 1341440 bytes downloaded
[03:13:54] + 1351680 bytes downloaded
[03:13:54] + 1361920 bytes downloaded
[03:13:54] + 1372160 bytes downloaded
[03:13:54] + 1382400 bytes downloaded
[03:13:54] + 1392640 bytes downloaded
[03:13:54] + 1402880 bytes downloaded
[03:13:54] + 1413120 bytes downloaded
[03:13:54] + 1423360 bytes downloaded
[03:13:54] + 1433600 bytes downloaded
[03:13:54] + 1443840 bytes downloaded
[03:13:54] + 1454080 bytes downloaded
[03:13:54] + 1464320 bytes downloaded
[03:13:54] + 1474560 bytes downloaded
[03:13:55] + 1484800 bytes downloaded
[03:13:55] + 1495040 bytes downloaded
[03:13:55] + 1505280 bytes downloaded
[03:13:55] + 1515520 bytes downloaded
[03:13:55] + 1525760 bytes downloaded
[03:13:55] + 1536000 bytes downloaded
[03:13:55] + 1537937 bytes downloaded
[03:13:55] Verifying core Core_15.fah...
[03:13:55] Signature is VALID
[03:13:55]
[03:13:55] Trying to unzip core FahCore_15.exe
[03:13:55] Decompressed FahCore_15.exe (4615168 bytes) successfully
[03:14:00] + Core successfully engaged
[03:14:05]
[03:14:05] + Processing work unit
[03:14:05] Core required: FahCore_15.exe
[03:14:05] Core found.
[03:14:05] Working on queue slot 01 [January 1 03:14:05 UTC]
[03:14:05] + Working ...
[03:14:06]
[03:14:06] *
*
[03:14:06] [email protected] GPU Core
[03:14:06] Version 2.20 (Tue Aug 2 12:06:37 PDT 2011)
[03:14:06] Build host SimbiosNvdWin7
[03:14:06] Board Type NVIDIA/CUDA
[03:14:06] Core 15
[03:14:06]
[03:14:06] Window's signal control handler registered.
[03:14:06] Preparing to commence simulation
[03:14:06] - Looking at optimizations...
[03:14:06] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[03:14:06] - Created dyn
[03:14:06] - Files status OK
[03:14:06] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:14:06] - Expanded 124396 -> 501826 (decompressed 403.4 percent)
[03:14:06] Called DecompressByteArray: compressed_data_size=124396 data_size=501826, decompressed_data_size=501826 diff=0
[03:14:06] - Digital signature verified
[03:14:06]
[03:14:06] Project: 7622 (Run 128, Clone 0, Gen 10)
[03:14:06]
[03:14:06] Assembly optimizations on if available.
[03:14:06] Entering M.D.
[03:14:08] Tpr hash work/wudata_01.tpr: 2912143410 3409615516 94164752 1173386794 1596545895
[03:14:08] calling fah_main gpuDeviceId=0
[03:14:08] Working on Protein
[03:14:08] Client config found, loading data.
[03:14:08] Starting GUI Server
[03:15:20] Setting checkpoint frequency: 400000
[03:15:20] Completed 3 out of 40000000 steps (0%).
[03:27:48] Completed 400000 out of 40000000 steps (1%).
[03:27:50] mdrun_gpu returned 52
[03:27:50] NANs detected on GPU
[03:27:50]
[03:27:50] [email protected] Core Shutdown: UNSTABLE_MACHINE
[03:27:53] CoreStatus = 7A (122)
[03:27:53] Sending work to server
[03:27:53] Project: 7622 (Run 128, Clone 0, Gen 10)
[03:27:53] - Read packet limit of 540015616... Set to 524286976.
[03:27:53] - Error: Could not get length of results file work/wuresults_01.dat
[03:27:53] - Error: Could not read unit 01 file. Removing from queue.
[03:27:53] - Preparing to get new work unit...
[03:27:53] Cleaning up work directory
[03:27:53] + Attempting to get work packet
[03:27:53] Gpu type=3 species=21.
[03:27:53] - Connecting to assignment server
[03:27:54] - Successful: assigned to (171.64.65.105).
[03:27:54] + News From [email protected]: Welcome to [email protected]
[03:27:54] Loaded queue successfully.
[03:27:54] Gpu type=3 species=21.
[03:27:55] + Closed connections
[03:28:00]
[03:28:00] + Processing work unit
[03:28:00] Core required: FahCore_15.exe
[03:28:00] Core found.
[03:28:00] Working on queue slot 02 [January 1 03:28:00 UTC]
[03:28:00] + Working ...
[03:28:00]
[03:28:00] *
*

Yeah, am I doing something wrong here? I have tried everything from reinstalling to deleting the queue.dat, the FAHlog.txt, Unitinfo.txt, and the FahCore_15. Please help me out! So stuck it sucks!

“I’m sorry, but having a DB9 on the drive and not driving it is a bit like having Keira Knightley in your bed and sleeping on the couch". ~Jeremy Clarkson
92% of teens have moved on to rap. If you are part of the 8% who still listen to real music, copy and paste this. rolleyes.gif

If I am misspelling or saying things that do not make sense on a post: I am either on an android device, or I have found my Guinness.
smil3dbd4e4c2e742.gif


ninja.gifStock Cooler Club ninja.gif


Senorpie7 is offline  
Sponsored Links
Advertisement
 
post #2 of 3 (permalink) Old 01-11-2012, 02:51 AM
New to Overclock.net
 
blkhwk20k's Avatar
 
Join Date: Feb 2011
Location: Colorado
Posts: 257
Rep: 14 (Unique: 9)
Quote:
Originally Posted by Senorpie7 View Post

[03:14:06] Project: 7622 (Run 128, Clone 0, Gen 10)

[03:27:50] NANs detected on GPU
[03:27:50]
[03:27:50] [email protected] Core Shutdown: UNSTABLE_MACHINE

Advanced method WU's stress a GPU to the max. When it spits out an unstable machine error it is mostly an unstable overclock on the card. What seems like a stable overclock while playing a game, when folding it might not be stable enough. I would bump voltage up a notch until either you run into the max voltage for your card or temps get too high. Once you hit that limit and it is still throwing errors, start reducing your overclock by 5Mhz until it finishes -advmethod WU's.


blkhwk20k is offline  
post #3 of 3 (permalink) Old 01-11-2012, 02:56 AM
Iconoclast
 
Blameless's Avatar
 
Join Date: Feb 2008
Posts: 30,035
Rep: 3132 (Unique: 1869)
You should test stability before you start folding on a piece of hardware, not with [email protected]

Anyway, as blkhwk20k states, gaming is not enough to determine GPU stabitliy. You really need long runs of dedicated stress tests, in addition to other testing, to be sure the card is performing accurate calculations. Even then backing off another 5-10MHz may be prudent.

...rightful liberty is unobstructed action according to our will within limits drawn around us by the equal rights of others. I do not add 'within the limits of the law,' because law is often but the tyrant's will, and always so when it violates the right of an individual. -- Thomas Jefferson
Blameless is offline  
Reply

Quick Reply
Message:
Options

Register Now

In order to be able to post messages on the Overclock.net - An Overclocking Community forums, you must first register.
Please enter your desired user name, your email address and other required details in the form below.
User Name:
If you do not want to register, fill this field only and the name will be used as user name for your post.
Password
Please enter a password for your user account. Note that passwords are case-sensitive.
Password:
Confirm Password:
Email Address
Please enter a valid email address for yourself.
Email Address:

Log-in



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Show Printable Version Show Printable Version
Email this Page Email this Page


Forum Jump: 

Posting Rules  
You may post new threads
You may post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off