ATLAS vbox v.1.14
Author | Message |
---|---|
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
No, the last dev-Theory with the postponed error was 30 hours ago. Since then all tasks with the postponed error were ATLAS, but in fact there were 'only' two incidents, because the other tasks in the queue marked "Ready to start" follow like lemmings. ATLAS, however, starts a new task much more frequently, about every 40 minutes, and since the error always occurs during VM setup, ATLAS is more affected. I'm using only 1 vboxwrapper for the development system: version 26205 (renamed to 26204 so as not to have to edit the client.xml) |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 0 |
Crystal, can you give us the link for vboxwrapper205? Theory: VBoxService 5.2.6 r120293 (verbosity: 0) linux.amd64 (Jan 15 2018 14:51:00) release log. Atlas: VBoxService 5.2.32 r132073 (verbosity: 0) linux.amd64 (Jul 12 2019 10:32:28) release log. What happens if the production vboxwrapper from Theory or Atlas conflicts with this new wrapper? CMS: 2022-07-17 07:32:50 (11316): Guest Log: 00:00:00.002474 main 5.2.6 r120293 started. Verbose level = 0 |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
Version 26205 is a pre-release and not (yet) supported by the LHC@home development project, so it's not up to me to distribute this unsupported version. But to inform others, I post my remarks here, and since my machine is not hidden, you may study the results. Btw: I just changed the mix to 2 dev-Theory tasks and 1 dual-core dev-ATLAS task. |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 0 |
Thanks Crystal, I have corrected my message before yours. Could there be a conflict with the 5.2.32 vboxwrapper from Atlas in production? |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
"Can it be a conflict with 5.2.32 vboxwrapper from Atlas in Production?" With 5.2.32 you probably mean the VBox service inside the Linux VM. That's not a wrapper. As far as I know, BOINC's vboxwrapper is not aware of what's going on inside a VM. The only connection is a shared directory on the host for input/output files. Apart from that, the wrapper only monitors and controls the state of the virtual machine. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
I restarted the test with the combi: 1 production-Theory task, 1 dev-Theory task and 1 dual-core dev-ATLAS task on my laptop. This combination was the cause of the postponed error before. It seems that the VirtualBox universally unique identifier (UUID) of the 2 Theory VMs from development and production was the same. This could have led to the VirtualBox errors, causing BOINC to postpone a task. I changed the UUID of the production Theory VM, and all tasks returned since July 18th after 16:30 UTC are part of the new combi-test. Task lists: ATLAS tasks of development; Theory tasks of development; Theory tasks of LHC@home production. |
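The UUID clash suspected above can be illustrated with a small check. This is only a sketch: the VM names and UUID values below are invented for illustration, and how the UUIDs are collected from VirtualBox is left out on purpose.

```python
from collections import defaultdict


def find_duplicate_uuids(vm_uuids):
    """Return {uuid: [vm names]} for every UUID used by more than one VM."""
    by_uuid = defaultdict(list)
    for name, uuid in vm_uuids.items():
        # Normalise so that case or stray whitespace does not hide a clash.
        by_uuid[uuid.strip().lower()].append(name)
    return {u: sorted(names) for u, names in by_uuid.items() if len(names) > 1}


# Hypothetical inventory: names and UUIDs are made up for this example.
inventory = {
    "production-Theory": "11111111-2222-3333-4444-555555555555",
    "dev-Theory": "11111111-2222-3333-4444-555555555555",
    "dev-ATLAS": "aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
}

duplicates = find_duplicate_uuids(inventory)
print(duplicates)
```

An empty result would mean every VM in the inventory has its own UUID. A non-empty result lists the VMs that share one, which is the situation described for the two Theory VMs.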
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 0 |
With wrapper 204 I had postponed tasks in -dev when Atlas in production AND Atlas in -dev were running together. https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3099440 |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
My combi-test was unsuccessful after running almost a whole day without errors. After a dev Theory finished, all tasks in queue got the postponed status. One hour later the dev-ATLAS tasks suffered the same fate.
19-Jul-2022 16:47:19 [lhcathome-dev] Computation for task Theory_2390-1117494-270_0 finished
19-Jul-2022 16:47:19 [lhcathome-dev] Starting task Theory_2390-1095455-270_0
19-Jul-2022 16:47:21 [lhcathome-dev] Started upload of Theory_2390-1117494-270_0_r2143229471_result
19-Jul-2022 16:47:23 [lhcathome-dev] Finished upload of Theory_2390-1117494-270_0_r2143229471_result
19-Jul-2022 16:48:03 [lhcathome-dev] Task Theory_2390-1095455-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:48:04 [lhcathome-dev] Starting task Theory_2390-1119226-270_0
19-Jul-2022 16:48:48 [lhcathome-dev] Task Theory_2390-1119226-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:48:48 [lhcathome-dev] Starting task Theory_2390-1091736-270_0
19-Jul-2022 16:49:32 [lhcathome-dev] Task Theory_2390-1091736-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:49:32 [lhcathome-dev] Starting task Theory_2390-1102201-270_0
19-Jul-2022 16:50:16 [lhcathome-dev] Task Theory_2390-1102201-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:50:16 [lhcathome-dev] Starting task Theory_2390-1150787-270_0
19-Jul-2022 16:51:00 [lhcathome-dev] Task Theory_2390-1150787-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:51:00 [lhcathome-dev] Starting task Theory_2390-1114113-270_0
19-Jul-2022 16:51:45 [lhcathome-dev] Task Theory_2390-1114113-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:13:15 [lhcathome-dev] Computation for task rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0 finished
19-Jul-2022 17:13:15 [lhcathome-dev] Starting task MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0
19-Jul-2022 17:13:17 [lhcathome-dev] Started upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_result
19-Jul-2022 17:13:17 [lhcathome-dev] Started upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_hits
19-Jul-2022 17:13:20 [lhcathome-dev] Finished upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_result
19-Jul-2022 17:13:20 [lhcathome-dev] Finished upload of rXsMDmTvyX1n7Olcko1bjSoqABFKDmABFKDmKFbSDmdzHKDmpihctn_0_r632147121_ATLAS_hits
19-Jul-2022 17:19:42 [LHC@home] Computation for task Theory_2390-1087953-270_0 finished
19-Jul-2022 17:19:42 [LHC@home] Starting task Theory_2390-1104258-271_0
19-Jul-2022 17:19:44 [LHC@home] Started upload of Theory_2390-1087953-270_0_r1252526142_result
19-Jul-2022 17:19:46 [LHC@home] Finished upload of Theory_2390-1087953-270_0_r1252526142_result
19-Jul-2022 17:45:50 [lhcathome-dev] Computation for task MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0 finished
19-Jul-2022 17:45:50 [lhcathome-dev] Starting task IylNDm7F1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmgzHKDmeWH7em_0
19-Jul-2022 17:45:52 [lhcathome-dev] Started upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_result
19-Jul-2022 17:45:52 [lhcathome-dev] Started upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_hits
19-Jul-2022 17:45:56 [lhcathome-dev] Finished upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_result
19-Jul-2022 17:45:59 [lhcathome-dev] Finished upload of MfALDmYD0X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmfzHKDmYDh7Mm_0_r459839238_ATLAS_hits
19-Jul-2022 17:46:34 [lhcathome-dev] Task IylNDm7F1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmgzHKDmeWH7em_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:46:34 [lhcathome-dev] Starting task wEfLDmlw1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmhzHKDmy9NpQn_0
19-Jul-2022 17:47:18 [lhcathome-dev] Task wEfLDmlw1X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmhzHKDmy9NpQn_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:47:18 [lhcathome-dev] Starting task EznNDmyX2X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmizHKDmnW9jUn_0
19-Jul-2022 17:48:03 [lhcathome-dev] Task EznNDmyX2X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmizHKDmnW9jUn_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 17:48:03 [lhcathome-dev] Starting task sP9MDmXE3X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmjzHKDmM4CERm_0
19-Jul-2022 17:48:47 [lhcathome-dev] Task sP9MDmXE3X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmjzHKDmM4CERm_0 postponed for 86400 seconds: VM environment needs to be cleaned up. |
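In the log above, each postponement follows its "Starting task" entry by roughly 44 to 45 seconds. A rough way to pull that out of an event log is sketched below. The regular expressions are assumptions fitted to the lines quoted in this thread, not an official BOINC log format, and the month parsing assumes an English locale.

```python
import re
from datetime import datetime

# Patterns fitted to the log lines quoted in this thread (an assumption).
LINE = re.compile(r"^(\d{2}-\w{3}-\d{4} \d{2}:\d{2}:\d{2}) \[([^\]]+)\] (.*)$")
START = re.compile(r"^Starting task (\S+)$")
POSTPONED = re.compile(r"^Task (\S+) postponed for (\d+) seconds: (.*)$")


def postponed_tasks(log_text):
    """Return (project, task, seconds_after_start, postponed_for) tuples."""
    started = {}
    results = []
    for raw in log_text.splitlines():
        m = LINE.match(raw.strip())
        if not m:
            continue
        # %b expects English month abbreviations such as "Jul".
        when = datetime.strptime(m.group(1), "%d-%b-%Y %H:%M:%S")
        project, message = m.group(2), m.group(3)
        s = START.match(message)
        if s:
            started[s.group(1)] = when
            continue
        p = POSTPONED.match(message)
        if p and p.group(1) in started:
            delay = int((when - started[p.group(1)]).total_seconds())
            results.append((project, p.group(1), delay, int(p.group(2))))
    return results


SAMPLE = """\
19-Jul-2022 16:47:19 [lhcathome-dev] Starting task Theory_2390-1095455-270_0
19-Jul-2022 16:48:03 [lhcathome-dev] Task Theory_2390-1095455-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
19-Jul-2022 16:48:04 [lhcathome-dev] Starting task Theory_2390-1119226-270_0
19-Jul-2022 16:48:48 [lhcathome-dev] Task Theory_2390-1119226-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
"""

rows = postponed_tasks(SAMPLE)
for row in rows:
    print(row)
```

A short and nearly constant delay like this would be consistent with the earlier remark that the error always occurs during VM setup, though the log alone does not prove the cause.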
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 0 |
ATLAS_vbox_job_1.14.xml has no line <pf_host_port>7859</pf_host_port> as in Theory_2022_06_14.xml. What happens when this port is in use? These lines are also missing in Atlas.xml: <heartbeat_filename>heartbeat</heartbeat_filename> <minimum_heartbeat_interval>1200</minimum_heartbeat_interval> What is the default when they are missing? |
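A comparison like the one above can be scripted. The sketch below only reports which of the three elements named in this post are absent from a job file. The `<vbox_job>` root element and the two XML fragments are assumptions made for the example, not copies of the real ATLAS or Theory files, and the sketch says nothing about which defaults vboxwrapper applies.

```python
import xml.etree.ElementTree as ET

# The three element names mentioned in the post above.
EXPECTED = ("pf_host_port", "heartbeat_filename", "minimum_heartbeat_interval")


def missing_elements(xml_text, expected=EXPECTED):
    """Return the expected element names that do not occur in the XML."""
    root = ET.fromstring(xml_text)
    present = {element.tag for element in root.iter()}
    return [tag for tag in expected if tag not in present]


# Hypothetical fragments; the real job files contain more settings.
theory_like = """<vbox_job>
  <pf_host_port>7859</pf_host_port>
  <heartbeat_filename>heartbeat</heartbeat_filename>
  <minimum_heartbeat_interval>1200</minimum_heartbeat_interval>
</vbox_job>"""
atlas_like = "<vbox_job/>"

print(missing_elements(theory_like))
print(missing_elements(atlas_like))
```

To check real files, read each one with `open(path).read()` and pass the text to `missing_elements`.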
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
This time only dev-Theory tasks got postponed. The ATLAS tasks finished, but the client got no new tasks because of the suspended (postponed) Theory tasks.
20-Jul-2022 00:39:02 [lhcathome-dev] Computation for task dwlNDmNb7X1n7Olcko1bjSoqABFKDmABFKDmKFbSDmzzHKDmCYXBLn_0 finished
20-Jul-2022 00:39:02 [lhcathome-dev] Starting task Bx1MDmRd7X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm0zHKDmciSyrn_0
20-Jul-2022 00:45:03 [lhcathome-dev] Computation for task Theory_2390-1103537-270_0 finished
20-Jul-2022 00:45:03 [lhcathome-dev] Starting task Theory_2390-1096414-270_0
20-Jul-2022 00:45:47 [lhcathome-dev] Task Theory_2390-1096414-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:45:47 [lhcathome-dev] Starting task Theory_2390-1083266-270_0
20-Jul-2022 00:46:31 [lhcathome-dev] Task Theory_2390-1083266-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:46:31 [lhcathome-dev] Starting task Theory_2390-1096300-270_0
20-Jul-2022 00:47:15 [lhcathome-dev] Task Theory_2390-1096300-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:47:15 [lhcathome-dev] Starting task Theory_2390-1105817-270_0
20-Jul-2022 00:48:00 [lhcathome-dev] Task Theory_2390-1105817-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:48:00 [lhcathome-dev] Starting task Theory_2390-1131075-270_0
20-Jul-2022 00:48:44 [lhcathome-dev] Task Theory_2390-1131075-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:48:44 [lhcathome-dev] Starting task Theory_2390-1109083-270_0
20-Jul-2022 00:49:28 [lhcathome-dev] Task Theory_2390-1109083-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 00:49:28 [lhcathome-dev] Starting task Theory_2390-1096903-270_0
20-Jul-2022 00:50:12 [lhcathome-dev] Task Theory_2390-1096903-270_0 postponed for 86400 seconds: VM environment needs to be cleaned up.
20-Jul-2022 01:11:51 [lhcathome-dev] Computation for task Bx1MDmRd7X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm0zHKDmciSyrn_0 finished
20-Jul-2022 01:11:51 [lhcathome-dev] Starting task EH9NDmuJ8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm1zHKDm0LeGDn_0
20-Jul-2022 01:49:35 [lhcathome-dev] Computation for task EH9NDmuJ8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm1zHKDm0LeGDn_0 finished
20-Jul-2022 01:49:35 [lhcathome-dev] Starting task ZByKDm2t8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm2zHKDmV4cZom_0
20-Jul-2022 02:22:41 [lhcathome-dev] Computation for task ZByKDm2t8X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm2zHKDmV4cZom_0 finished
20-Jul-2022 02:22:41 [lhcathome-dev] Starting task ZNhLDmh59X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm4zHKDmnFVmXm_0
20-Jul-2022 02:37:39 [lhcathome-dev] Result 4haMDmcwAY1n7Olcko1bjSoqABFKDmABFKDmKFbSDm5zHKDm00Du5n_0 is no longer usable
20-Jul-2022 02:56:21 [lhcathome-dev] Computation for task ZNhLDmh59X1n7Olcko1bjSoqABFKDmABFKDmKFbSDm4zHKDmnFVmXm_0 finished |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 862,257 RAC: 15 |
I will continue testing. I started from scratch and for now run only 1 dev-Theory together with 1 production-Theory. |
Send message Joined: 24 Oct 19 Posts: 171 Credit: 543,238 RAC: 149 |
Is there any roadmap for rolling out the new vbox wrapper to the official LHC@home project? |
©2024 CERN