Lost a task -- related to Internet going down

Message boards : Number crunching : Lost a task -- related to Internet going down
Message board moderation

To post messages, you must log in.

AuthorMessage
rhb

Send message
Joined: 1 Jul 12
Posts: 7
Credit: 531,806
RAC: 0
United States
Message 470 - Posted: 28 Jul 2012, 0:07:31 UTC

I lost task yafu_C111_1343367317_16_0, too many restarts due to lack of heartbeat.

http://yafu.dyndns.org/yafu/result.php?resultid=807093

The internet went down at the time it happened. I have had similar problems, though it usually leads to an error 11 (SIGSEGV). If you have any ideas why boinc would restart the task and the heartbeat would continue to not be present, I might be able to debug this (by deliberately disconnecting the internet, if necessary).

The router is usually still working fine when I have problems, but the cable modem loses its upstream connection. In some cases, neither BoincTasks nor the Boinc manager can contact the client when it happens. I suspect the OS still thinks the internet is up, and boinc gets in trouble trying to use DNS or something else that times out.

I can bring this up on Boinc's forum, but thought I would ask your opinion first.
ID: 470 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile yoyo_rkn
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 22 Aug 11
Posts: 736
Credit: 17,612,101
RAC: 51
Germany
Message 471 - Posted: 28 Jul 2012, 12:35:51 UTC

As far as I know the heartbeat problem is a well known problem of Boinc. You should discuss this with the Boinc developer.
yoyo
ID: 471 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Lost a task -- related to Internet going down




Datenschutz / Privacy Copyright © 2011-2024 Rechenkraft.net e.V. & yoyo