Message boards : Theory Application : Checkpoints

boboviz

Joined: 24 Oct 19
Posts: 276
Credit: 783,415
RAC: 2,464
Message 8688 - Posted: 8 Apr 2025, 8:32:33 UTC

I know the WUs are not that long, but a checkpoint system would be welcome.
Laurence CERN
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1150
Credit: 342,328
RAC: 4
Message 8689 - Posted: 8 Apr 2025, 13:59:41 UTC - in response to Message 8688.  

Is this because the docker apps start from scratch?
boboviz

Joined: 24 Oct 19
Posts: 276
Credit: 783,415
RAC: 2,464
Message 8690 - Posted: 8 Apr 2025, 18:52:19 UTC - in response to Message 8689.  

In reply to Laurence CERN's message of 8 Apr 2025:
Is this because the docker apps start from scratch?


I noticed that, after a reboot, my WUs restarted from 0%.
boboviz

Joined: 24 Oct 19
Posts: 276
Credit: 783,415
RAC: 2,464
Message 8839 - Posted: 19 Jun 2025, 12:33:32 UTC - in response to Message 8690.  

Any news about checkpoints?
Yesterday, after more than 9 hours of crunching, I had to shut down my PC.

This is the log:
command output:
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
52e0e4ab0be7 localhost/boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18:latest /bin/sh -c ./entr... 22 hours ago Up 9 hours boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0
EOM
running docker command: stats --no-stream --format "{{.CPUPerc}} {{.MemUsage}}" boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0
command output:
104.52% 215.8MB / 7.964GB
EOM
running docker command: ps --all -f "name=boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0"
command output:
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
52e0e4ab0be7 localhost/boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18:latest /bin/sh -c ./entr... 22 hours ago Up 9 hours boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0
EOM
running docker command: stats --no-stream --format "{{.CPUPerc}} {{.MemUsage}}" boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0
command output:
104.52% 215.8MB / 7.964GB
EOM
got quit/abort from client
running docker command: stop boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0
command output:
time="2025-06-19T07:42:05+02:00" level=warning msg="StopSignal SIGTERM failed to stop container boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0 in 10 seconds, resorting to SIGKILL"
boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2922-4894335-18_0
EOM
</stderr_txt>
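
The final warning in the log says the container received SIGTERM, did not exit within 10 seconds, and was then killed with SIGKILL, so anything held only in memory was lost. Purely as an illustration of the general technique (this is not the Theory application's code; the checkpoint file layout, the "next_event" counter, and every function name below are hypothetical), a job could trap SIGTERM, write a small checkpoint file atomically, and pick up from it on the next start:

```python
import json
import os
import signal
import tempfile
import threading

# Set when a stop has been requested; the work loop polls it.
stop = threading.Event()


def request_stop(signum, frame):
    """Signal handler: only record the request, do the real work in the loop."""
    stop.set()


def install_sigterm_handler():
    """Call once from the main thread before starting the work loop."""
    signal.signal(signal.SIGTERM, request_stop)


def save_checkpoint(path, state):
    """Write to a temp file in the same directory, then rename over the target,
    so a kill in the middle of a write never leaves a half-written checkpoint."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as handle:
        json.dump(state, handle)
        handle.flush()
        os.fsync(handle.fileno())
    os.replace(tmp_path, path)


def load_checkpoint(path):
    """Return the saved state, or a fresh state if no checkpoint exists yet."""
    try:
        with open(path) as handle:
            return json.load(handle)
    except FileNotFoundError:
        return {"next_event": 0}


def run(path, total_events, work, checkpoint_every=100):
    """Process events from the last checkpoint onward; stop early on request."""
    i = load_checkpoint(path)["next_event"]
    while i < total_events and not stop.is_set():
        work(i)  # one unit of (hypothetical) work
        i += 1
        if i % checkpoint_every == 0:
            save_checkpoint(path, {"next_event": i})
    # Final save covers both normal completion and an early stop.
    save_checkpoint(path, {"next_event": i})
    return i
```

In this sketch the handler only sets a flag, so the loop finishes the current unit of work and saves before exiting; that only helps if one unit of work plus the save fits inside the stop timeout, which is why the periodic save every `checkpoint_every` events is there as a fallback.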
Laurence CERN
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1150
Credit: 342,328
RAC: 4
Message 8860 - Posted: 1 Jul 2025, 10:17:14 UTC - in response to Message 8839.  

Checkpoints should be supported. It is something that will need to be working before we release to production.
boboviz

Joined: 24 Oct 19
Posts: 276
Credit: 783,415
RAC: 2,464
Message 8865 - Posted: 3 Jul 2025, 6:53:25 UTC - in response to Message 8860.  

In reply to Laurence CERN's message of 1 Jul 2025:
Checkpoints should be supported. It is something that will need to be working before we release to production.


Great news that you're working on it!
[AF>Le_Pommier] Jerome_C2005

Joined: 17 Mar 15
Posts: 106
Credit: 1,038,379
RAC: 432
Message 8926 - Posted: 18 Jul 2025, 14:26:24 UTC

I don't know if it is related to checkpoint management, but with the VBox app I have groups of tasks that fail together, more or less whenever I stop and then restart my Mac.

7 last night, 8 the day before, and again on the 13th...

©2025 CERN