Message boards : ATLAS Application : ATLAS vbox v.1.14
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7591 - Posted: 17 Jul 2022, 16:30:07 UTC - in response to Message 7590.  

No, the last dev-Theory with the postponed error is 30 hours ago.
Since then all postponed error tasks were ATLAS, but in fact it were 'only' two incidences, cause the others in queue "Ready to start" follow like lemmings.
ATLAS however starts much more frequently a new task; about every 40 minutes and since the error always occur during the VM-setup ATLAS will be more affected.

I'm using only 1 vboxwrapper for the development system: version 26205 (renamed to 26204 so as not to have to edit the client.xml)
ID: 7591 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 7592 - Posted: 18 Jul 2022, 4:37:12 UTC - in response to Message 7591.  
Last modified: 18 Jul 2022, 5:28:37 UTC

Crystal,
can you give us the link for vboxwrapper205?
Theory:
VBoxService 5.2.6 r120293 (verbosity: 0) linux.amd64 (Jan 15 2018 14:51:00) release log
Atlas:
VBoxService 5.2.32 r132073 (verbosity: 0) linux.amd64 (Jul 12 2019 10:32:28) release log
What when in production vboxwrapper from Theory or Atlas is a conflict with this new wrapper?
CMS:
2022-07-17 07:32:50 (11316): Guest Log: 00:00:00.002474 main 5.2.6 r120293 started. Verbose level = 0
ID: 7592 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7593 - Posted: 18 Jul 2022, 5:24:00 UTC - in response to Message 7592.  
Last modified: 18 Jul 2022, 5:28:48 UTC

Version 26205 is a pre-release and not (yet) supported by the LHC@home development project,
so it's not up to me to distribute this unsupported version.
But to inform others, I post my remarks here and since my machine is not hidden, you may study the resullts.

btw: I just changed the mix to 2 dev-Theory's and 1 dual core dev-ATLAS.
ID: 7593 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 7594 - Posted: 18 Jul 2022, 5:30:36 UTC - in response to Message 7593.  

Thanks Crystal,
have made a correction of my message before yours.
Can it be a conflict with 5.2.32 vboxwrapper from Atlas in Production?
ID: 7594 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7595 - Posted: 18 Jul 2022, 5:44:01 UTC - in response to Message 7594.  

Can it be a conflict with 5.2.32 vboxwrapper from Atlas in Production?
With 5.2.32 you probably mean the VBox service inside the Linux VM. That's not a wrapper.
As far as I know BOINC's VBoxwrapper is not aware of what's going on inside a VM.
The only connection is sharing a directory on the host for input/output files.
The rest is monitoring and controlling the state of the Virtual Machine.
ID: 7595 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7597 - Posted: 19 Jul 2022, 9:25:34 UTC

I restarted the test with the combi: 1 production-Theory task, 1 dev-Theory task and 1 dual core dev-ATLAS task on my laptop.
These combination was the reason of the postponed error before.
It seems that the VirtualBox universal unique identifier (UUID) of the 2 Theory VMs from development and production were the same.
This could have lead to the virtualbox errors, causing BOINC to postpone a task.
I changed the UUID of the production Theory VM and all tasks returned since July 18th after 16:30 UTC are part of the new combi-test.

ATLAS tasks of development
Theory tasks of development
Theory tasks of LHC@home production
ID: 7597 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 7598 - Posted: 19 Jul 2022, 12:18:07 UTC - in response to Message 7597.  
Last modified: 19 Jul 2022, 12:19:24 UTC

With wrapper204 had in -dev postponed, when Atlas in Production AND Atlas in -dev are running together.
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3099440
ID: 7598 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7599 - Posted: 19 Jul 2022, 16:48:22 UTC

My combi-test was unsuccessful after running almost a whole day without errors.

After a dev Theory finished, all tasks in queue got the postponed status.

One hour later the dev-ATLAS tasks suffered the same fate.

19-Jul-2022 16:47:19 [lhcathome-dev] Computation for task Theory_2390-1117494-270_0 finished
19-Jul-2022 16:47:19 [lhcathome-dev] Starting task Theory_2390-1095455-270_0
19-Jul-2022 16:47:21 [lhcathome-dev] Started upload of Theory_2390-1117494-270_0_r2143229471_result
19-Jul-2022 16:47:23 [lhcathome-dev] Finished upload of Theory_2390-1117494-270_0_r2143229471_result
19-Jul-2022 16:48:03 [lhcathome-dev] Task Theory_2390-1095455-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:48:04 [lhcathome-dev] Starting task Theory_2390-1119226-270_0
19-Jul-2022 16:48:48 [lhcathome-dev] Task Theory_2390-1119226-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:48:48 [lhcathome-dev] Starting task Theory_2390-1091736-270_0
19-Jul-2022 16:49:32 [lhcathome-dev] Task Theory_2390-1091736-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:49:32 [lhcathome-dev] Starting task Theory_2390-1102201-270_0
19-Jul-2022 16:50:16 [lhcathome-dev] Task Theory_2390-1102201-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:50:16 [lhcathome-dev] Starting task Theory_2390-1150787-270_0
19-Jul-2022 16:51:00 [lhcathome-dev] Task Theory_2390-1150787-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:51:00 [lhcathome-dev] Starting task Theory_2390-1114113-270_0
19-Jul-2022 16:51:45 [lhcathome-dev] Task Theory_2390-1114113-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.

19-Jul-2022 17:13:15 [lhcathome-dev] Computation for task rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0 finished
19-Jul-2022 17:13:15 [lhcathome-dev] Starting task MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0
19-Jul-2022 17:13:17 [lhcathome-dev] Started upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_result
19-Jul-2022 17:13:17 [lhcathome-dev] Started upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_hits
19-Jul-2022 17:13:20 [lhcathome-dev] Finished upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_result
19-Jul-2022 17:13:20 [lhcathome-dev] Finished upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_hits

19-Jul-2022 17:19:42 [LHC@home] Computation for task Theory_2390-1087953-270_0 finished
19-Jul-2022 17:19:42 [LHC@home] Starting task Theory_2390-1104258-271_0
19-Jul-2022 17:19:44 [LHC@home] Started upload of Theory_2390-1087953-270_0_r1252526142_result
19-Jul-2022 17:19:46 [LHC@home] Finished upload of Theory_2390-1087953-270_0_r1252526142_result

19-Jul-2022 17:45:50 [lhcathome-dev] Computation for task MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0 finished
19-Jul-2022 17:45:50 [lhcathome-dev] Starting task IylNDm7F1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmgzHKDmeWH7em_0
19-Jul-2022 17:45:52 [lhcathome-dev] Started upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_result
19-Jul-2022 17:45:52 [lhcathome-dev] Started upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_hits
19-Jul-2022 17:45:56 [lhcathome-dev] Finished upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_result
19-Jul-2022 17:45:59 [lhcathome-dev] Finished upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_hits
19-Jul-2022 17:46:34 [lhcathome-dev] Task IylNDm7F1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmgzHKDmeWH7em_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:46:34 [lhcathome-dev] Starting task wEfLDmlw1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmhzHKDmy9NpQn_0
19-Jul-2022 17:47:18 [lhcathome-dev] Task wEfLDmlw1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmhzHKDmy9NpQn_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:47:18 [lhcathome-dev] Starting task EznNDmyX2X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmizHKDmnW9jUn_0
19-Jul-2022 17:48:03 [lhcathome-dev] Task EznNDmyX2X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmizHKDmnW9jUn_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:48:03 [lhcathome-dev] Starting task sP9MDmXE3X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmjzHKDmM4CERm_0
19-Jul-2022 17:48:47 [lhcathome-dev] Task sP9MDmXE3X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmjzHKDmM4CERm_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
ID: 7599 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 7600 - Posted: 20 Jul 2022, 3:59:00 UTC - in response to Message 7599.  
Last modified: 20 Jul 2022, 4:01:11 UTC

ATLAS_vbox_job_1.14.xml:
no line <pf_host_port>7859</pf_host_port>
as in Theory_2022_06_14.xml
What, when this port is in use?

This lines are missing in Atlas.xml
<heartbeat_filename>heartbeat</heartbeat_filename>
<minimum_heartbeat_interval>1200</minimum_heartbeat_interval>
What is the default, when missing?
ID: 7600 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7601 - Posted: 20 Jul 2022, 5:30:32 UTC

This time only dev-Theory's got postponed. The ATLAS tasks finished, but the client got no new tasks because of the suspended (postponed) Theory's

20-Jul-2022 00:39:02 [lhcathome-dev] Computation for task dwlNDmNb7X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmzzHKDmCYXBLn_0 finished
20-Jul-2022 00:39:02 [lhcathome-dev] Starting task Bx1MDmRd7X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm0zHKDmciSyrn_0

20-Jul-2022 00:45:03 [lhcathome-dev] Computation for task Theory_2390-1103537-270_0 finished
20-Jul-2022 00:45:03 [lhcathome-dev] Starting task Theory_2390-1096414-270_0
20-Jul-2022 00:45:47 [lhcathome-dev] Task Theory_2390-1096414-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:45:47 [lhcathome-dev] Starting task Theory_2390-1083266-270_0
20-Jul-2022 00:46:31 [lhcathome-dev] Task Theory_2390-1083266-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:46:31 [lhcathome-dev] Starting task Theory_2390-1096300-270_0
20-Jul-2022 00:47:15 [lhcathome-dev] Task Theory_2390-1096300-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:47:15 [lhcathome-dev] Starting task Theory_2390-1105817-270_0
20-Jul-2022 00:48:00 [lhcathome-dev] Task Theory_2390-1105817-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:48:00 [lhcathome-dev] Starting task Theory_2390-1131075-270_0
20-Jul-2022 00:48:44 [lhcathome-dev] Task Theory_2390-1131075-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:48:44 [lhcathome-dev] Starting task Theory_2390-1109083-270_0
20-Jul-2022 00:49:28 [lhcathome-dev] Task Theory_2390-1109083-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:49:28 [lhcathome-dev] Starting task Theory_2390-1096903-270_0
20-Jul-2022 00:50:12 [lhcathome-dev] Task Theory_2390-1096903-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.

20-Jul-2022 01:11:51 [lhcathome-dev] Computation for task Bx1MDmRd7X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm0zHKDmciSyrn_0 finished
20-Jul-2022 01:11:51 [lhcathome-dev] Starting task EH9NDmuJ8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm1zHKDm0LeGDn_0

20-Jul-2022 01:49:35 [lhcathome-dev] Computation for task EH9NDmuJ8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm1zHKDm0LeGDn_0 finished
20-Jul-2022 01:49:35 [lhcathome-dev] Starting task ZByKDm2t8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm2zHKDmV4cZom_0

20-Jul-2022 02:22:41 [lhcathome-dev] Computation for task ZByKDm2t8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm2zHKDmV4cZom_0 finished
20-Jul-2022 02:22:41 [lhcathome-dev] Starting task ZNhLDmh59X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm4zHKDmnFVmXm_0

20-Jul-2022 02:37:39 [lhcathome-dev] Result 4haMDmcwAY1n7Olcko1bjSoqABFKDmABFKDmKFbSDm5zHKDm00Du5n_0 is no longer usable

20-Jul-2022 02:56:21 [lhcathome-dev] Computation for task ZNhLDmh59X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm4zHKDmnFVmXm_0 finished
ID: 7601 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 7602 - Posted: 20 Jul 2022, 7:02:05 UTC

I will continue testing.
I started from scratch and for now only 1 dev-Theory together with 1 production-Theory.
ID: 7602 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 153
Credit: 319,905
RAC: 512
Message 7627 - Posted: 25 Jul 2022, 21:38:10 UTC

Any roadmap to implement new vbox wrapper to LHC@Home official project??
ID: 7627 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3

Message boards : ATLAS Application : ATLAS vbox v.1.14


©2024 CERN