
-bigadv crashing upon download, *SOLVED - bad WU* - Page 2

post #11 of 16
Thread Starter 
I'll try to add more clarification, as I can't try this folder trick quite yet. I'm 84% complete with a 6701 SMP unit, so I won't be done for another 2 hours.

The first log file that I copied is my biggest issue. When the client downloads a -bigadv unit, everything works for a moment, and then a Windows error pops up saying that FahCore_a3 has stopped working. When I close that error and reset the client, I get stuck in the second log that I posted, where things just loop.

So, the main issue is working out why the client initially crashes, which apparently is a memory error. However, I've never seen this memory error before, which is why I am so confused. I've been folding -bigadv on and off for over a year, and I've never had an issue. This is the first time my clients have shown any sign of a problem with my memory. Granted, I just pushed from 3.4 to 4.2 GHz and now I'm at 4.0 GHz, but even so, I do not understand how my memory has all of a sudden stopped functioning properly. :\

This push was only in the last 2 weeks, but if you look at my EOC stats page, you will see that I have successfully completed 7 bigadv units since the beginning of January, which is when I upped my OC. I was also folding them in November and December at 3.4 GHz, and the bigadv units make up the majority of the points there as well.

EDIT: For further clarification, this "loop" is only occurring with bigadv units. As soon as I switch over to SMP, everything works just dandy. For example, last night, after I thought I had fixed it, I re-enabled bigadv and let my client finish the SMP unit it was working on. While I was watching a movie on my computer (quite typical, never had issues before), I saw the Windows error pop up. I just checked the log, and this is what happened last night around midnight when my SMP unit finished:

Code:
[04:31:36] Completed 500000 out of 500000 steps  (100%)
[04:31:37] DynamicWrapper: Finished Work Unit: sleep=10000
[04:31:47] 
[04:31:47] Finished Work Unit:
[04:31:47] - Reading up to 3699984 from "work/wudata_01.trr": Read 3699984
[04:31:47] trr file hash check passed.
[04:31:47] edr file hash check passed.
[04:31:47] logfile size: 63571
[04:31:47] Leaving Run
[04:31:51] - Writing 3799107 bytes of core data to disk...
[04:31:51]   ... Done.
[04:31:51] - Shutting down core
[04:31:51] 
[04:31:51] Folding@home Core Shutdown: FINISHED_UNIT
[04:31:55] CoreStatus = 64 (100)
[04:31:55] Sending work to server
[04:31:55] Project: 6058 (Run 0, Clone 19, Gen 214)


[04:31:55] + Attempting to send results [January 30 04:31:55 UTC]
[04:32:28] + Results successfully sent
[04:32:28] Thank you for your contribution to Folding@Home.
[04:32:28] + Number of Units Completed: 14

[04:32:32] - Preparing to get new work unit...
[04:32:32] Cleaning up work directory
[04:32:32] + Attempting to get work packet
[04:32:32] Passkey found
[04:32:32] - Connecting to assignment server
[04:32:33] - Successful: assigned to (130.237.232.141).
[04:32:33] + News From Folding@Home: Welcome to Folding@Home
[04:32:33] Loaded queue successfully.
[04:32:34] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[04:32:41] + Attempting to get work packet
[04:32:41] Passkey found
[04:32:41] - Connecting to assignment server
[04:32:41] - Successful: assigned to (130.237.232.141).
[04:32:41] + News From Folding@Home: Welcome to Folding@Home
[04:32:42] Loaded queue successfully.
[04:32:43] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[04:32:54] + Attempting to get work packet
[04:32:54] Passkey found
[04:32:54] - Connecting to assignment server
[04:32:55] - Successful: assigned to (130.237.232.141).
[04:32:55] + News From Folding@Home: Welcome to Folding@Home
[04:32:55] Loaded queue successfully.
[04:32:56] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[04:33:30] + Attempting to get work packet
[04:33:30] Passkey found
[04:33:30] - Connecting to assignment server
[04:33:30] - Successful: assigned to (130.237.232.141).
[04:33:30] + News From Folding@Home: Welcome to Folding@Home
[04:33:30] Loaded queue successfully.
[04:33:32] - Attempt #4  to get work failed, and no other work to do.
Waiting before retry.
[04:34:20] + Attempting to get work packet
[04:34:20] Passkey found
[04:34:20] - Connecting to assignment server
[04:34:20] - Successful: assigned to (130.237.232.141).
[04:34:20] + News From Folding@Home: Welcome to Folding@Home
[04:34:20] Loaded queue successfully.
[04:34:22] - Attempt #5  to get work failed, and no other work to do.
Waiting before retry.
[04:35:44] + Attempting to get work packet
[04:35:44] Passkey found
[04:35:44] - Connecting to assignment server
[04:35:45] - Successful: assigned to (130.237.232.141).
[04:35:45] + News From Folding@Home: Welcome to Folding@Home
[04:35:45] Loaded queue successfully.
[04:35:46] - Attempt #6  to get work failed, and no other work to do.
Waiting before retry.
[04:38:37] + Attempting to get work packet
[04:38:37] Passkey found
[04:38:37] - Connecting to assignment server
[04:38:37] - Successful: assigned to (130.237.232.141).
[04:38:37] + News From Folding@Home: Welcome to Folding@Home
[04:38:37] Loaded queue successfully.
[04:38:39] - Attempt #7  to get work failed, and no other work to do.
Waiting before retry.
[04:44:02] + Attempting to get work packet
[04:44:02] Passkey found
[04:44:02] - Connecting to assignment server
[04:44:03] - Successful: assigned to (130.237.232.141).
[04:44:03] + News From Folding@Home: Welcome to Folding@Home
[04:44:03] Loaded queue successfully.
[04:46:17] + Closed connections
[04:46:17] 
[04:46:17] + Processing work unit
[04:46:17] Core required: FahCore_a3.exe
[04:46:17] Core found.
[04:46:17] Working on queue slot 02 [January 30 04:46:17 UTC]
[04:46:17] + Working ...
[04:46:17] 
[04:46:17] *------------------------------*
[04:46:17] Folding@Home Gromacs SMP Core
[04:46:17] Version 2.22 (Mar 12, 2010)
[04:46:17] 
[04:46:17] Preparing to commence simulation
[04:46:17] - Looking at optimizations...
[04:46:17] - Created dyn
[04:46:17] - Files status OK
[04:46:22] - Expanded 24858837 -> 30796293 (decompressed 123.8 percent)
[04:46:22] Called DecompressByteArray: compressed_data_size=24858837 data_size=30796293, decompressed_data_size=30796293 diff=0
[04:46:22] - Digital signature verified
[04:46:22] 
[04:46:22] Project: 6900 (Run 43, Clone 23, Gen 1)
[04:46:22] 
[04:46:22] Assembly optimizations on if available.
[04:46:22] Entering M.D.
[04:46:31] Completed 0 out of 250000 steps  (0%)
[04:46:33] Gromacs cannot continue further.
[04:46:33] Going to send back what have done -- stepsTotalG=250000
[04:46:33] Work fraction=-1.#IND steps=250000.
[04:47:00] logfile size=1818 infoLength=1818 edr=0 trr=23
[04:47:00] logfile size: 1818 info=1818 bed=0 hdr=23
[04:47:00] - Writing 2354 bytes of core data to disk...
[04:47:03] CoreStatus = C0000005 (-1073741819)
[04:47:03] Client-core communications error: ERROR 0xc0000005
[04:47:03] Deleting current work unit & continuing...
[04:47:19] - Preparing to get new work unit...
[04:47:19] Cleaning up work directory
[04:47:19] + Attempting to get work packet
[04:47:19] Passkey found
[04:47:19] - Connecting to assignment server
[04:47:20] - Successful: assigned to (130.237.232.141).
[04:47:20] + News From Folding@Home: Welcome to Folding@Home
[04:47:20] Loaded queue successfully.

Folding@Home Client Shutdown at user request.

Folding@Home Client Shutdown.
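
For reference, the two numbers on each CoreStatus line in the log above are the same value written two ways: hexadecimal first, then signed 32-bit decimal in parentheses. 0xC0000005 is the Windows status code for an access violation (STATUS_ACCESS_VIOLATION), which means a process touched memory it was not allowed to, and not necessarily that the RAM itself is faulty. A minimal Python sketch of the conversion, with a function name made up for illustration:

```python
def core_status_to_hex(signed_value: int) -> str:
    """Reinterpret a signed 32-bit decimal status as unsigned hex digits.

    Hypothetical helper, not part of the Folding@Home client.
    """
    # Masking with 0xFFFFFFFF maps a negative 32-bit value to its
    # two's-complement unsigned equivalent.
    return format(signed_value & 0xFFFFFFFF, "X")


# Values taken from the two CoreStatus lines in the log above.
print(core_status_to_hex(-1073741819))  # the crashed bigadv unit
print(core_status_to_hex(100))          # the finished SMP unit
```

The first call gives C0000005 and the second gives 64, matching what the client printed.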

Edited by stu. - 1/30/11 at 9:13am
 
F@H (14 items)
CPU: Core i7 920 @ 4.0ghz | Motherboard: EVGA x58 132-BL-E758-A1 | Graphics: 2x EVGA 460 1 GB | RAM: G.SKILL 3 x 2 GB
Hard Drive: Mushkin 40 GB SSD / WD Black 1 TB | OS: Windows 7 Home Premium 64-bit | Monitor: DCLCD 20.1" | Keyboard: Logitech G15
Power: Antec TP 750 | Case: Antec 900 | Mouse: Logitech G5
CPU: i7 2700k | Motherboard: ASUS Maximus Gene-Z z68 | RAM: G.Skill 2133mhz | Cooling: Noctua NH-D14
OS: Ubuntu 10.10 | Power: BFG 650 | Case: Silverstone TJ08-E
post #12 of 16
I understand.

What I am trying to tell you is where your hidden, user-specific stuff is: files that you have not done anything with. This is why just copying over the "Folding" folder sometimes does not work.

This screenshot shows you where the other stuff is, including the current activity. This is where things can get screwed up if there is an unexpected shutdown. If you are stuck in a download status somewhere, this is the other stuff that you sometimes have to clean out to get back on track.

-TG

Edited by Tleilaxu Ghola - 1/30/11 at 9:46am
Gom Jabbar (13 items)
CPU: i7 930 @ 4.20 GHz 1.29v HT
post #13 of 16
Thread Starter 
I'll get back to you with what happens in about 1.5 hours or so.
 
post #14 of 16
I noticed in your logs that the WU was the same both times: 6900 (Run 43, Clone 23, Gen 1). That made me suspect a bad WU, so I did a search at the Stanford folding forums and found that you were not the only one having problems with that particular WU. It has been marked as a bad WU, so there is most likely nothing wrong with your memory or anything else. http://foldingforum.org/viewtopic.ph...5c6a38d062045f
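
The same check can be done mechanically. Below is a rough Python sketch, not an official tool, that scans client log text for work units whose core exited with something other than the status printed after FINISHED_UNIT in the log earlier in this thread (64). The function name is hypothetical, and the second Project/CoreStatus pair in the sample is invented to stand in for the earlier log, which is not reproduced here:

```python
import re
from collections import Counter

PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \(")
FINISHED = "64"  # assumption: taken from the FINISHED_UNIT lines in the posted log


def failed_work_units(log_text):
    """Count core failures per (project, run, clone, gen) tuple."""
    failures = Counter()
    current = None  # most recent work unit seen before a CoreStatus line
    for line in log_text.splitlines():
        match = PROJECT_RE.search(line)
        if match:
            current = tuple(int(g) for g in match.groups())
            continue
        match = STATUS_RE.search(line)
        if match and current is not None and match.group(1).upper() != FINISHED:
            failures[current] += 1
    return failures


# First pair copied from the posted log; second pair is illustrative only.
sample = """\
[04:46:22] Project: 6900 (Run 43, Clone 23, Gen 1)
[04:47:03] CoreStatus = C0000005 (-1073741819)
[05:01:10] Project: 6900 (Run 43, Clone 23, Gen 1)
[05:01:50] CoreStatus = C0000005 (-1073741819)
"""

for wu, count in failed_work_units(sample).items():
    if count > 1:
        print(wu, "failed", count, "times")
```

A work unit that fails more than once with the same Run/Clone/Gen is a hint to look for other reports of that WU before blaming the hardware.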

See what happens after your current WU finishes. It should not download the same WU again, because it has been marked as bad at Stanford. If it does, delete everything in the folder except the client and the .cfg file and see what happens; that should take you out of the loop and download a different WU.
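
If you would rather script that cleanup than do it by hand, here is a minimal Python sketch. It is only an illustration: the function name, the client file name and the other file names in the demo are made up, and it defaults to a dry run so nothing is deleted until you ask for it. Stop the client before running anything like this against a real folder.

```python
import shutil
import tempfile
from pathlib import Path


def clean_folding_dir(folder, client_name, dry_run=True):
    """Remove everything in `folder` except `client_name` and *.cfg files.

    Returns the names that were (or, in a dry run, would be) removed.
    """
    removed = []
    for entry in Path(folder).iterdir():
        if entry.name == client_name or entry.suffix.lower() == ".cfg":
            continue  # keep the client and its configuration
        removed.append(entry.name)
        if not dry_run:
            if entry.is_dir() and not entry.is_symlink():
                shutil.rmtree(entry)
            else:
                entry.unlink()
    return sorted(removed)


# Demo on a throwaway directory; all file names here are hypothetical.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for name in ("client.exe", "client.cfg", "queue.dat", "log.txt"):
        (root / name).write_text("")
    (root / "work").mkdir()

    print(clean_folding_dir(root, "client.exe"))           # dry run, lists only
    clean_folding_dir(root, "client.exe", dry_run=False)   # actually deletes
    print(sorted(p.name for p in root.iterdir()))          # what is left
```

Note that this keeps only the one file you name plus any .cfg file, so the core executable is removed as well, in line with "delete everything except the client and the .cfg file".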
Toy (13 items)
CPU: I7 980X 4.4Ghz | Motherboard: P6T SE | Graphics: ASUS ENGTX 460 | RAM: Super Talent DDR3 2000
Hard Drive: 2-64GB SSD-0 Raid 4-Seagate 500GB (2TB) 0-Raid | OS: Ubuntu 10.10 | Power: CoolMax 1200W | Case: Thermaltake Kandalf LCS
post #15 of 16
Thread Starter 
Quote:
Originally Posted by Grandpa_01
I noticed in your logs that the WU was the same both times: 6900 (Run 43, Clone 23, Gen 1). That made me suspect a bad WU, so I did a search at the Stanford folding forums and found that you were not the only one having problems with that particular WU. It has been marked as a bad WU, so there is most likely nothing wrong with your memory or anything else. http://foldingforum.org/viewtopic.ph...5c6a38d062045f

See what happens after your current WU finishes. It should not download the same WU again, because it has been marked as bad at Stanford. If it does, delete everything in the folder except the client and the .cfg file and see what happens; that should take you out of the loop and download a different WU.

Wow! I would never have thought to look into something like that, thanks!
 
post #16 of 16
Quote:
Originally Posted by Grandpa_01
I noticed in your logs that the WU was the same both times: 6900 (Run 43, Clone 23, Gen 1). That made me suspect a bad WU, so I did a search at the Stanford folding forums and found that you were not the only one having problems with that particular WU. It has been marked as a bad WU, so there is most likely nothing wrong with your memory or anything else. http://foldingforum.org/viewtopic.ph...5c6a38d062045f

See what happens after your current WU finishes. It should not download the same WU again, because it has been marked as bad at Stanford. If it does, delete everything in the folder except the client and the .cfg file and see what happens; that should take you out of the loop and download a different WU.

Great answer! That's a nice catch. +1

-TG