Message boards : Theory Application : Multiple wus
Message board moderation

To post messages, you must log in.

AuthorMessage
boboviz

Send message
Joined: 24 Oct 19
Posts: 226
Credit: 623,724
RAC: 481
Message 8770 - Posted: 24 Apr 2025, 9:12:04 UTC

I have no problems when i crunch 3/4 wus at the same time.
I try to run more wus (7) and i have this error after 10 minutes:

(0x1) - exit code 1 (0x1)</message>
<stderr_txt>
docker_wrapper config:
workdir: /boinc_slot_dir
use GPU: no
create args: --cap-add=SYS_ADMIN --device /dev/fuse
verbose: 1
Using podman
running docker command: ps --all --filter "name=boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4470857-685_0"
command output:
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMESEOMcreating container boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4470857-685_0
running docker command: images
command output:
REPOSITORY TAG IMAGE ID CREATED SIZEdocker.io/library/almalinux 9 df3270cc8bc8 6 weeks ago 217 MBEOMbuilding image
running docker command: build . -t boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4470857-685 -f Dockerfile
read_from_pipe() error: timeout
build_image() failed: -182

</stderr_txt>


Is an hw "limitation" of my pc? Maybe the usage of ram or the speed of SSD??
ID: 8770 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 750
Credit: 3,170,889
RAC: 32,113
Message 8771 - Posted: 24 Apr 2025, 9:26:03 UTC - in response to Message 8770.  

In reply to boboviz's message of 24 Apr 2025:
Is an hw "limitation" of my pc? Maybe the usage of ram or the speed of SSD??

No, get also only 4 Task.
ID: 8771 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 226
Credit: 623,724
RAC: 481
Message 8772 - Posted: 24 Apr 2025, 12:08:45 UTC - in response to Message 8771.  

In reply to maeax's message of 24 Apr 2025:
In reply to boboviz's message of 24 Apr 2025:
Is an hw "limitation" of my pc? Maybe the usage of ram or the speed of SSD??

No, get also only 4 Task.


It's a pity.
My 16 cores could be useful at full load
ID: 8772 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1243
Credit: 966,851
RAC: 569
Message 8773 - Posted: 24 Apr 2025, 12:41:22 UTC - in response to Message 8772.  

In reply to boboviz's message of 24 Apr 2025:It's a pity.
My 16 cores could be useful at full load
I'm running the VBox-version with 10 tasks concurrently, No issues; very rarely I get the error: "VM Heartbeat file specified, but missing."
In the past I even tested 20 at the same time, where we were testing the multi-attach vdi.
Do you have set in your project preferences the Max # CPUs to "No limit"?
ID: 8773 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 226
Credit: 623,724
RAC: 481
Message 8774 - Posted: 24 Apr 2025, 13:26:31 UTC - in response to Message 8773.  

In reply to Crystal Pellet's message of 24 Apr 2025:
Do you have set in your project preferences the Max # CPUs to "No limit"?


Yes.

But the problem is not the download, is crunching all the wus at the same time (without errors).
I will re-try when i have some times to control the machine during the simulations.
ID: 8774 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
maeax

Send message
Joined: 22 Apr 16
Posts: 750
Credit: 3,170,889
RAC: 32,113
Message 8775 - Posted: 24 Apr 2025, 13:46:01 UTC - in response to Message 8773.  

2. PC Win11pro with docker running now:
https://lhcathomedev.cern.ch/lhcathome-dev/results.php?hostid=3452
Yes, four Tasks parallel. Change in prefs is not possible.
ID: 8775 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1243
Credit: 966,851
RAC: 569
Message 8776 - Posted: 24 Apr 2025, 14:40:37 UTC - in response to Message 8775.  
Last modified: 24 Apr 2025, 14:40:57 UTC

Just for testing:
ID            NAME                                                                  CPU %       MEM USAGE / LIMIT  MEM %       NET IO             BLOCK IO    PIDS        CPU TIME       AVG CPU %
a5a2119c7d0f  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4473612-690_0  39.27%      227.9MB / 8.289GB  2.75%       562.8MB / 5.689MB  0B / 0B     83          1m0.788767s    18.78%
c6608f21dc38  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4561573-689_0  0.03%       206.9MB / 8.289GB  2.50%       485.9MB / 5.3MB    0B / 0B     83          1m1.1624757s   18.92%
05bea7804f2e  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4504155-689_0  100.27%     180.8MB / 8.289GB  2.18%       320.3MB / 3.679MB  0B / 0B     84          1m26.6811817s  26.84%
a1e26d646fc8  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4470857-685_1  19.39%      217MB / 8.289GB    2.62%       494.3MB / 5.772MB  0B / 0B     86          1m4.5329454s   20.04%
6a622d7b5be1  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4590618-690_0  28.15%      175.8MB / 8.289GB  2.12%       272.3MB / 2.01MB   0B / 0B     76          17.7426657s    18.05%
9494a4b3e2db  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4495993-685_1  30.53%      148.3MB / 8.289GB  1.79%       90.51MB / 819.2kB  0B / 0B     76          11.1517809s    16.03%
On a 4-core machine with 2 BOINC instances.
So it seems not to be a docker limit.
ID: 8776 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1243
Credit: 966,851
RAC: 569
Message 8777 - Posted: 24 Apr 2025, 19:03:43 UTC - in response to Message 8776.  

No problem to have 6 tasks running concurrently on 1 BOINC instance:
ID            NAME                                                                  CPU %       MEM USAGE / LIMIT  MEM %       NET IO             BLOCK IO    PIDS        CPU TIME      AVG CPU %
7e6af1577817  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4533999-689_0  1.16%       220MB / 8.289GB    2.65%       545MB / 3.576MB    0B / 0B     82          1m11.266852s  26.94%
0eb91d2604da  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4509348-690_0  109.97%     181MB / 8.289GB    2.18%       319.1MB / 2.233MB  0B / 0B     81          2m0.9254772s  45.72%
e2d2f4df770a  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4561581-690_0  47.96%      199.4MB / 8.289GB  2.41%       470.8MB / 3.25MB   0B / 0B     85          1m9.5455715s  29.21%
9d8b8859ad1a  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4483392-690_0  0.01%       171.7MB / 8.289GB  2.07%       319.1MB / 2.283MB  0B / 0B     79          1m2.5948458s  37.37%
07812aa94a0d  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4456326-690_0  0.01%       176.7MB / 8.289GB  2.13%       284.9MB / 1.512MB  0B / 0B     75          19.8581014s   25.87%
a8282cc6f34c  boinc__lhcathomedev.cern.ch_lhcathome-dev__theory_2848-4514541-691_0  0.01%       148.7MB / 8.289GB  1.79%       92.79MB / 451.9kB  0B / 0B     75          10.5042185s   18.09%
ID: 8777 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 226
Credit: 623,724
RAC: 481
Message 8780 - Posted: 25 Apr 2025, 20:21:40 UTC

Now it works, 15 wus on the same time run correctly on my Windows pc.
I've changed nothing...
ID: 8780 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Theory Application : Multiple wus


©2025 CERN