Having this problem again today when adding a new host. now it won't let me try and get new tasks because I reached my daily quota of 40 for the day because of this MD5 error.
New WUs/data files that were changed lately to blame?
... New WUs/data files that were changed lately to blame?
Unlikely I think. There are no other reports I'm aware of. I would expect to see other reports if it was happening to all new installs. Last time, the problem was not caused by bad data or tasks - just a bad stored value for the checksum.
I'm also a bit surprised that with a GPU plus 32 CPUs, there is a daily limit of only 40.
Have you compared the file on the new host to the same file on one of your other hosts? Have you checked the MD5sum stored in client_state.xml on each? The previous occurrence was a bad MD5sum stored in the state file. Do you have a bad MD5sum on the new host? It should be the same on all hosts. If you have, can you try editing the state file to change the bad value into the correct value? That was the workaround used last time.
If all of those things turn out negative, I have seen examples of checksum failures being caused by bad RAM. Since it's a new host, are you confident the RAM is good?
I dug through some of last year's posts and seem to have it working. Here's what I did:
1. Exit BOINC manager, making sure all work has stopped.
2. Open your BOINC program files and look for "client_state.xml" (Mine was in C > Program Data > BOINC)
3. Open the file "client_state.xml" with a simple text editor like Notepad.
4. Search for the line:
<file>
<name>hsgamma_FGRP5_XXX.exe</name>
The "XXX" part doesn't matter, it's just version number info. I did a Ctrl+F search for occurrences of "hsgamma" and found the right line almost immediately.
5. JUST BEFORE that ^ line you just found, paste in all this:
6. Save your newly edited "client_state.xml" file, overwriting the old one. Make sure it does NOT save as a Text Document (*.txt), use the All Files (*.*) dropdown option. Also make sure it DOES save with ANSI coding (which seems pretty standard).
7. Restart BOINC
Thanks to everyone who had to work this out for themselves last time. Hope it helps somebody else!
Sorry for the trouble, and thanks for the notification.
This is a result of a race condition between the two FGRP workunit generators. This should be fixed now (i.e. not happen again). Also too the broken md5sums were fixed in the database, i.e. new tasks should be sent out with the correct checksum even if they are instantiated from old workunits.
The problem has been fixed
)
The problem has been fixed for now, i.e. newly generated workunits and tasks generated from old workunits should have the correct md5 checksum now.
The root cause of the problem has been identified and measurements are taken to prevent the same from happening in the future.
BM
But I do have the same
)
But I do have the same promblem on a newly buit [HOST#12619532]
Moreover I have MD5 problem with lots of other files, like FGRP OpenCL NVidia-1k.exe.
Will try to change BOINC version. Let's see what happens.
Having this problem again
)
Having this problem again today when adding a new host. now it won't let me try and get new tasks because I reached my daily quota of 40 for the day because of this MD5 error.
New WUs/data files that were changed lately to blame?
Please fix (again)...thanks!
bluestang wrote:... New
)
Unlikely I think. There are no other reports I'm aware of. I would expect to see other reports if it was happening to all new installs. Last time, the problem was not caused by bad data or tasks - just a bad stored value for the checksum.
I'm also a bit surprised that with a GPU plus 32 CPUs, there is a daily limit of only 40.
Have you compared the file on the new host to the same file on one of your other hosts? Have you checked the MD5sum stored in client_state.xml on each? The previous occurrence was a bad MD5sum stored in the state file. Do you have a bad MD5sum on the new host? It should be the same on all hosts. If you have, can you try editing the state file to change the bad value into the correct value? That was the workaround used last time.
If all of those things turn out negative, I have seen examples of checksum failures being caused by bad RAM. Since it's a new host, are you confident the RAM is good?
Cheers,
Gary.
Dear experts, I got
)
Dear experts,
I got repeatedly a checksum error trying to download new work unites.
See here:
28.09.2018 09:06:24 | Einstein@Home | Started download of LATeah0040F.dat 28.09.2018 09:06:24 | Einstein@Home | Started download of JPLEPH.405
28.09.2018 09:06:34 | Einstein@Home | Finished download of LATeah0040F.dat 28.09.2018 09:06:40 | Einstein@Home | work fetch suspended by user 28.09.2018 09:06:45 | Einstein@Home | task LATeah0040F_1192.0_114550_0.0_0 suspended by user 28.09.2018 09:06:46 | Einstein@Home | Finished download of JPLEPH.405 28.09.2018 09:06:46 | Einstein@Home | [error] MD5 check failed for JPLEPH.405 28.09.2018 09:06:46 | Einstein@Home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got d6ce12bacd2a81a56423f5f238ba84eb 28.09.2018 09:06:46 | Einstein@Home | [error] Checksum or signature error for JPLEPH.405 28.09.2018 09:06:47 | Einstein@Home | task LATeah0040F_1192.0_114234_0.0_0 suspended by user 28.09.2018 09:06:48 | Einstein@Home | task LATeah0040F_1192.0_114155_0.0_0 suspended by user
Up to now I got 53 aborted tasks:
...
I have also been getting
)
I have also been getting download errors on all my WU's here the past couple days. :(
Task 789416778
Stderr output
Well, there is evidently an
)
Well, there is evidently an issue again with this as it's not just me.
Hopefully it will be fixed soon as I plan on adding more power for Einstein.
I dug through some of last
)
I dug through some of last year's posts and seem to have it working. Here's what I did:
1. Exit BOINC manager, making sure all work has stopped.
2. Open your BOINC program files and look for "client_state.xml" (Mine was in C > Program Data > BOINC)
3. Open the file "client_state.xml" with a simple text editor like Notepad.
4. Search for the line:
5. JUST BEFORE that ^ line you just found, paste in all this:
6. Save your newly edited "client_state.xml" file, overwriting the old one. Make sure it does NOT save as a Text Document (*.txt), use the All Files (*.*) dropdown option. Also make sure it DOES save with ANSI coding (which seems pretty standard).
7. Restart BOINC
Thanks to everyone who had to work this out for themselves last time. Hope it helps somebody else!
I've also got this error on a
)
I've also got this error on a new installation.
Sorry for the trouble, and
)
Sorry for the trouble, and thanks for the notification.
This is a result of a race condition between the two FGRP workunit generators. This should be fixed now (i.e. not happen again). Also too the broken md5sums were fixed in the database, i.e. new tasks should be sent out with the correct checksum even if they are instantiated from old workunits.
BM