Overclock.net › Forums › Overclockers Care › Overclock.net Folding@Home Team › Server reports problem with Unit
New Posts  All Forums:Forum Nav:

Server reports problem with Unit

post #1 of 5
Thread Starter 
Hi,

So, I got into folding yesterday and so far I've kept a GTS 450 and a 250 folding non-stop ever since. Problem is that once a work unit is completed and sent back to the server, the server reports a problem. This has happened to 3 WU's so far. The only thing I can think of is that I've changed the machine ID's a couple times while the GPU client was running. The only WU's that have been submitted successfully were the ones that were folding during the night.

Its a real hard problem to diagnose because you have to wait until a whole work unit is completed, so if anyone has any insight into this, it would be greatly appreciated. Is it possible that the machine ID that downloaded the data has to the same as the ID that uploads it back to the server?

Also, this has happened on one CPU core so far - which I've also changed the machine ID on.
Icarus
(19 items)
 
   
CPUMotherboardRAMRAM
i5 3570k Asrock H77-m Pro4 Corsair Vengeance Patriot 
RAMHard DriveHard DriveHard Drive
Kingston HyperX Blu Western Digital Red 3x 3TB Intel 310 160GB Western Digital Green 2x 2TB 
OSMonitorPowerCase
Server 2008 R2 Sony KDL55HX750 Corsair Builder Series 400W Silverstone Grandia GD06B 
  hide details  
Reply
Icarus
(19 items)
 
   
CPUMotherboardRAMRAM
i5 3570k Asrock H77-m Pro4 Corsair Vengeance Patriot 
RAMHard DriveHard DriveHard Drive
Kingston HyperX Blu Western Digital Red 3x 3TB Intel 310 160GB Western Digital Green 2x 2TB 
OSMonitorPowerCase
Server 2008 R2 Sony KDL55HX750 Corsair Builder Series 400W Silverstone Grandia GD06B 
  hide details  
Reply
post #2 of 5
Post a log in [code] tags with this server error you are getting.
>.<
(13 items)
 
  
CPUMotherboardGraphicsRAM
i7 930 4.0ghz 1.27v EVGA E758 3-Way (black/gray) Evga GTX 480 / Evga 9800gtx+ (physx&folding) Corsair Dominator 6gb 1600 8-8-8-24 
Hard DriveOSMonitorPower
x25-M SSD 80gb + 1TB F3 + 2x2TB WD Green Win 7 64bit Viewsonic 20" + Samsung 40" Corsair 1000w 
Case
Haf 932 (modded) - Now caseless 
  hide details  
Reply
>.<
(13 items)
 
  
CPUMotherboardGraphicsRAM
i7 930 4.0ghz 1.27v EVGA E758 3-Way (black/gray) Evga GTX 480 / Evga 9800gtx+ (physx&folding) Corsair Dominator 6gb 1600 8-8-8-24 
Hard DriveOSMonitorPower
x25-M SSD 80gb + 1TB F3 + 2x2TB WD Green Win 7 64bit Viewsonic 20" + Samsung 40" Corsair 1000w 
Case
Haf 932 (modded) - Now caseless 
  hide details  
Reply
post #3 of 5
Thread Starter 
Code:
Launch directory: U:\\Folding\\Folding@home-gpu


[17:18:47] - Ask before connecting: No
[17:18:47] - User name: shinigamibob (Team 37726)
[17:18:47] - User ID: ****************
[17:18:47] - Machine ID: 2
[17:18:47] 
[17:18:47] Gpu type=3 species=30.
[17:18:47] Loaded queue successfully.
[17:18:47] Initialization complete
[17:18:47] 
[17:18:47] + Processing work unit
[17:18:47] Core required: FahCore_15.exe
[17:18:47] Core found.
[17:18:47] Working on queue slot 02 [January 24 17:18:47 UTC]
[17:18:47] + Working ...
[17:18:47] 
[17:18:47] *------------------------------*
[17:18:47] Folding@Home GPU Core
[17:18:47] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[17:18:47] 
[17:18:47] Build host: SimbiosNvdWin7
[17:18:47] Board Type: NVIDIA/CUDA
[17:18:47] Core      : x=15
[17:18:47]  Window's signal control handler registered.
[17:18:47] Preparing to commence simulation
[17:18:47] - Ensuring status. Please wait.
[17:18:57] - Looking at optimizations...
[17:18:57] - Working with standard loops on this execution.
[17:18:57] - Previous termination of core was improper.
[17:18:57] - Files status OK
[17:18:57] sizeof(CORE_PACKET_HDR) = 512 file=<>
[17:18:57] - Expanded 41531 -> 162639 (decompressed 391.6 percent)
[17:18:57] Called DecompressByteArray: compressed_data_size=41531 data_size=162639, decompressed_data_size=162639 diff=0
[17:18:57] - Digital signature verified
[17:18:57] 
[17:18:57] Project: 6805 (Run 3824, Clone 0, Gen 28)
[17:18:57] 
[17:18:57] Entering M.D.
[17:18:59] Will resume from checkpoint file work/wudata_02.ckp
[17:18:59] Tpr hash work/wudata_02.tpr:  145020334 2534098507 1177771334 2862661440 2606763529
[17:18:59] Working on ALZHEIMER'S DISEASE AMYLOID
[17:18:59] Client config found, loading data.
[17:18:59] Starting GUI Server
[17:19:00] Resuming from checkpoint
[17:19:00] fcCheckPointResume: retreived and current tpr file hash:
[17:19:00]    0    145020334    145020334
[17:19:00]    1   2534098507   2534098507
[17:19:00]    2   1177771334   1177771334
[17:19:00]    3   2862661440   2862661440
[17:19:00]    4   2606763529   2606763529
[17:19:00] fcCheckPointResume: file hashes same.
[17:19:00] fcCheckPointResume: state restored.
[17:19:00] fcCheckPointResume: name work/wudata_02.log Verified work/wudata_02.log
[17:19:00] fcCheckPointResume: name work/wudata_02.trr Verified work/wudata_02.trr
[17:19:00] fcCheckPointResume: name work/wudata_02.xtc Verified work/wudata_02.xtc
[17:19:00] fcCheckPointResume: name work/wudata_02.edr Verified work/wudata_02.edr
[17:19:00] fcCheckPointResume: state restored 2
[17:19:00] Resumed from checkpoint
[17:19:00] Setting checkpoint frequency: 500000
[17:19:00] Completed  34500001 out of 50000000 steps (69%).
...
...
[18:16:04] Completed  49499999 out of 50000000 steps (99%).
[18:17:57] Completed  49999999 out of 50000000 steps (100%).
[18:17:57] Finished fah_main
[18:17:57] 
[18:17:57] Successful run
[18:17:57] DynamicWrapper: Finished Work Unit: sleep=10000
[18:18:07] Reserved 2339484 bytes for xtc file; Cosm status=0
[18:18:07] Allocated 2339484 bytes for xtc file
[18:18:07] - Reading up to 2339484 from "work/wudata_02.xtc": Read 2339484
[18:18:07] Read 2339484 bytes from xtc file; available packet space=784090980
[18:18:07] xtc file hash check passed.
[18:18:07] Reserved 72360 72360 784090980 bytes for arc file=<work/wudata_02.trr> Cosm status=0
[18:18:07] Allocated 72360 bytes for arc file
[18:18:07] - Reading up to 72360 from "work/wudata_02.trr": Read 72360
[18:18:07] Read 72360 bytes from arc file; available packet space=784018620
[18:18:07] trr file hash check passed.
[18:18:07] Allocated 544 bytes for edr file
[18:18:07] Read bedfile
[18:18:07] edr file hash check passed.
[18:18:07] Allocated 121517 bytes for logfile
[18:18:07] Read logfile
[18:18:07] GuardedRun: success in DynamicWrapper
[18:18:07] GuardedRun: done
[18:18:07] Run: GuardedRun completed.
[18:18:07] + Opened results file
[18:18:07] - Writing 2534417 bytes of core data to disk...
[18:18:08] Done: 2533905 -> 2374966 (compressed to 93.7 percent)
[18:18:08]   ... Done.
[18:18:08] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[18:18:08] Shutting down core 
[18:18:08] 
[18:18:08] Folding@home Core Shutdown: FINISHED_UNIT
[18:18:12] CoreStatus = 64 (100)
[18:18:12] Sending work to server
[18:18:12] Project: 6805 (Run 3824, Clone 0, Gen 28)


[18:18:12] + Attempting to send results [January 24 18:18:12 UTC]
[18:18:12] Gpu type=3 species=30.
[18:18:34] - Server reports problem with unit.
[18:18:34] - Preparing to get new work unit...
[18:18:34] Cleaning up work directory
[18:18:34] + Attempting to get work packet
[18:18:34] Gpu type=3 species=30.
[18:18:34] - Connecting to assignment server
[18:18:35] - Successful: assigned to (171.64.65.64).
[18:18:35] + News From Folding@Home: Welcome to Folding@Home
[18:18:35] Loaded queue successfully.
[18:18:35] Gpu type=3 species=30.
[18:18:36] + Closed connections
[18:18:36] 
[18:18:36] + Processing work unit
Icarus
(19 items)
 
   
CPUMotherboardRAMRAM
i5 3570k Asrock H77-m Pro4 Corsair Vengeance Patriot 
RAMHard DriveHard DriveHard Drive
Kingston HyperX Blu Western Digital Red 3x 3TB Intel 310 160GB Western Digital Green 2x 2TB 
OSMonitorPowerCase
Server 2008 R2 Sony KDL55HX750 Corsair Builder Series 400W Silverstone Grandia GD06B 
  hide details  
Reply
Icarus
(19 items)
 
   
CPUMotherboardRAMRAM
i5 3570k Asrock H77-m Pro4 Corsair Vengeance Patriot 
RAMHard DriveHard DriveHard Drive
Kingston HyperX Blu Western Digital Red 3x 3TB Intel 310 160GB Western Digital Green 2x 2TB 
OSMonitorPowerCase
Server 2008 R2 Sony KDL55HX750 Corsair Builder Series 400W Silverstone Grandia GD06B 
  hide details  
Reply
post #4 of 5
Think your initial assumptions are correct...

Have a read through that, but I gather that both MachineID and UserID that send the WU need to match what they were when the WU was downloaded

http://foldingforum.org/viewtopic.ph...cc27c&start=15
post #5 of 5
Thread Starter 
Damn it, this sucks. I have absolutely no idea what machine ID's I used for all of the clients. I guess the best way is to wait till all the current WU are completed, then change each of the 6 clients individually, then start a completely new WU for each. Hopefully the 6 that are in progress right now will be sent without a problem.

EDIT: I think I might be able to solve this problem by checking what HFM.net reports as the current machine ID, then change it to match that on the clients. I believe HFM reports the machine ID that downloaded the WU, so if I change it back to whatever it reports, I believe it will work properly. I'll keep you guys updated on this.

EDIT 2: What happens if 2 clients on the same computer have the same machine ID? It also looks like they are working on the exact same WU. Is that bad? sorry, i'm very new to this.
Edited by shinigamibob - 1/24/11 at 12:50pm
Icarus
(19 items)
 
   
CPUMotherboardRAMRAM
i5 3570k Asrock H77-m Pro4 Corsair Vengeance Patriot 
RAMHard DriveHard DriveHard Drive
Kingston HyperX Blu Western Digital Red 3x 3TB Intel 310 160GB Western Digital Green 2x 2TB 
OSMonitorPowerCase
Server 2008 R2 Sony KDL55HX750 Corsair Builder Series 400W Silverstone Grandia GD06B 
  hide details  
Reply
Icarus
(19 items)
 
   
CPUMotherboardRAMRAM
i5 3570k Asrock H77-m Pro4 Corsair Vengeance Patriot 
RAMHard DriveHard DriveHard Drive
Kingston HyperX Blu Western Digital Red 3x 3TB Intel 310 160GB Western Digital Green 2x 2TB 
OSMonitorPowerCase
Server 2008 R2 Sony KDL55HX750 Corsair Builder Series 400W Silverstone Grandia GD06B 
  hide details  
Reply
New Posts  All Forums:Forum Nav:
  Return Home
  Back to Forum: Overclock.net Folding@Home Team
Overclock.net › Forums › Overclockers Care › Overclock.net Folding@Home Team › Server reports problem with Unit