Message boards : News : Multi-core jobs available for CMS@Home-dev
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,888,015
RAC: 1,314
Message 8346 - Posted: 25 Mar 2024, 17:09:12 UTC

We are currently testing multi-core jobs for CMS@Home. Note that these will only run in -dev as the main project does not currently allow you to select multi-core VMs. We currently have 2-core and 4-core tasks in the queue, so please try selecting 4-core in your machine preferences, and let us know how it works.
ID: 8346 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 670
Credit: 1,828,052
RAC: 3,628
Message 8347 - Posted: 25 Mar 2024, 17:56:10 UTC - in response to Message 8346.  
Last modified: 25 Mar 2024, 17:58:56 UTC

duplicate, sorry.
ID: 8347 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 670
Credit: 1,828,052
RAC: 3,628
Message 8348 - Posted: 25 Mar 2024, 17:57:31 UTC - in response to Message 8346.  

ID: 8348 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8349 - Posted: 25 Mar 2024, 18:55:47 UTC

What is the process cmsExternalGene doing?
Now I see 4 of those processses on my 4-core VM using up to 100% cpu each.
Do they the event processing and if yes: Does it mean I'm processing 4 jobs concurrently.
That would not make much sense.
ID: 8349 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8350 - Posted: 25 Mar 2024, 18:56:16 UTC
Last modified: 25 Mar 2024, 18:57:32 UTC

dupe
ID: 8350 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8351 - Posted: 25 Mar 2024, 18:56:36 UTC - in response to Message 8346.  
Last modified: 25 Mar 2024, 18:57:44 UTC

tripled
ID: 8351 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8353 - Posted: 26 Mar 2024, 8:15:14 UTC - in response to Message 8349.  

Still running after 13.5 hours the first 4 jobs:



On slow but a bit faster than this machine, it could happen, that the second 4 jobs are been killed by the 18 hours deadline.
Why not run 1 single job 4 times faster like you do with the dual core tasks. 1 cmsRun using 200% cpu and twice as fast.
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3311582
ID: 8353 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8354 - Posted: 26 Mar 2024, 15:30:34 UTC - in response to Message 8353.  

Still running after 13.5 hours the first 4 jobs:
...
...
On slow but a bit faster than this machine, it could happen, that the second 4 jobs are been killed by the 18 hours deadline.
Why not run 1 single job 4 times faster like you do with the dual core tasks. 1 cmsRun using 200% cpu and twice as fast.
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3311582



Just in time:

Run time 17 hours 32 min 57 sec
Only 4 concurrently running jobs.
ID: 8354 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 670
Credit: 1,828,052
RAC: 3,628
Message 8355 - Posted: 26 Mar 2024, 15:50:19 UTC
Last modified: 26 Mar 2024, 15:55:52 UTC

Crystal,
seeing the same: 4 cmsExternalGene... 153 minutes hours on 4-Cores!
ID: 8355 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8357 - Posted: 26 Mar 2024, 17:27:55 UTC - in response to Message 8355.  

Crystal,
seeing the same: 4 cmsExternalGene... 153 minutes hours on 4-Cores!
Your task https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3311748 seems to be killed by the 18 hours deadline.

Your result ends with:
2024-03-26 16:45:34 (33976): Status Report: Elapsed Time: '60000.000000'
2024-03-26 16:45:34 (33976): Status Report: CPU Time: '216839.343750'
2024-03-26 18:05:37 (33976): Powering off VM.
2024-03-26 18:05:38 (33976): Successfully stopped VM.
2024-03-26 18:05:38 (33976): Deregistering VM. (boinc_9f8cb45e4f80768f, slot#27)
2024-03-26 18:05:38 (33976): Removing network bandwidth throttle group from VM.
2024-03-26 18:05:38 (33976): Removing VM from VirtualBox.


Mine with:
2024-03-26 13:06:15 (1020): Guest Log: [INFO] glidein exited with return value 0.
2024-03-26 13:06:15 (1020): Guest Log: [INFO] Shutting Down.
2024-03-26 13:06:15 (1020): VM Completion File Detected.
2024-03-26 13:06:15 (1020): VM Completion Message: glidein exited with return value 0.
.
2024-03-26 13:06:15 (1020): Powering off VM.
2024-03-26 13:06:16 (1020): Successfully stopped VM.
2024-03-26 13:06:16 (1020): Deregistering VM. (boinc_15fe7a25adba3060, slot#0)
2024-03-26 13:06:16 (1020): Removing network bandwidth throttle group from VM.
2024-03-26 13:06:16 (1020): Removing VM from VirtualBox.
ID: 8357 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 670
Credit: 1,828,052
RAC: 3,628
Message 8358 - Posted: 26 Mar 2024, 17:52:31 UTC - in response to Message 8357.  

This was luck ;-))
Computer ID 4639
Laufzeit 18 Stunden 0 min. 54 sek.
CPU Zeit 2 Tage 17 Stunden 22 min. 43 sek.
Prüfungsstatus Gültig
Punkte 3,721.09
ID: 8358 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1184
Credit: 821,086
RAC: 730
Message 8359 - Posted: 26 Mar 2024, 18:14:35 UTC - in response to Message 8358.  

This was luck ;-))
No luck for CMS, cause the running cms jobs did not return a result.
Your task was valid BOINC-wise, but a valid cms-job should end with: VM Completion Message: glidein exited with return value 0.
0 means no error. Your job ended by a shutdown given by vboxwrapper after 18 hours wall clock time.
ID: 8359 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Fardringle

Send message
Joined: 31 Jul 22
Posts: 11
Credit: 3,546,719
RAC: 10,776
Message 8417 - Posted: 22 Apr 2024, 2:48:12 UTC - in response to Message 8346.  

We are currently testing multi-core jobs for CMS@Home. Note that these will only run in -dev as the main project does not currently allow you to select multi-core VMs. We currently have 2-core and 4-core tasks in the queue, so please try selecting 4-core in your machine preferences, and let us know how it works.


It has been almost a full month since these tasks were released and I have been running two computers (an i7-4790 CPU and an i7-8650U CPU) with nothing except a single 4-thread task on each of them the whole time, and one other computer (a Xeon E5-2699 v3 CPU) with one 4-thread LHC task running along with a bunch of other work from different BOINC projects, and my task history says that there have only been a couple of failures that whole time. And I'm pretty sure those failures were my own fault from forcing the computer to reboot without letting BOINC stop the task properly first.
ID: 8417 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : News : Multi-core jobs available for CMS@Home-dev


©2024 CERN