1) Message boards : Theory Application : Suspend/Resume (Message 8402)
Posted 8 Apr 2024 by rilian
Post:
Probably this is the source of issue

Starting BOINC client version 7.18.1 for x86_64-pc-linux-gnu
This a development version of BOINC and may not function properly


as i could not figure out proper way to auto start it

I'll try to use other OS and will write later if i see any new issue
2) Message boards : Theory Application : Suspend/Resume (Message 8401)
Posted 8 Apr 2024 by rilian
Post:
there is a log line
- the user running this application is a member of the 'boinc' group


i noticed boinc does not auto start after reboot and i always start it as `boinc --daemon` which means it is started as my user

maybe that's the issue ?

i will find a way to auto start it
3) Message boards : Theory Application : Suspend/Resume (Message 8400)
Posted 8 Apr 2024 by rilian
Post:
I updated CVMFS, no more warnings

I did sudo systemctl edit boinc-client.service
### Anything between here and the comment below will become the new contents of the file

[Service]
ProtectHome=no
ProtectSystem=full
ProtectControlGroups=no

### Lines below this comment will be discarded


and rebooted

$ systemctl --no-pager show boinc-client |grep -i protect
ProtectClock=no
ProtectKernelTunables=no
ProtectKernelModules=no
ProtectKernelLogs=no
ProtectControlGroups=no
ProtectHome=no
ProtectSystem=full
ProtectProc=default
ProtectHostname=no


, but again same issue, here is the fresh task https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3316927

runc run failed: fchown fd 7: operation not permitted

* fchown() changes the ownership of the file referred to by the open file descriptor fd.



should i add any of
#NoNewPrivileges=true
#ProtectKernelModules=true
#ProtectKernelTunables=true
#RestrictRealtime=true
#RestrictAddressFamilies=AF_INET AF_INET6 AF_UNIX
#RestrictNamespaces=true
#PrivateUsers=true
#CapabilityBoundingSet=
#MemoryDenyWriteExecute=true
#PrivateTmp=true  #Block X11 idle detection

?
4) Message boards : Theory Application : Suspend/Resume (Message 8398)
Posted 8 Apr 2024 by rilian
Post:
$ systemctl --no-pager show boinc-client |grep -i protect
ProtectClock=no
ProtectKernelTunables=no
ProtectKernelModules=no
ProtectKernelLogs=no
ProtectControlGroups=yes
ProtectHome=yes
ProtectSystem=strict
ProtectProc=default
ProtectHostname=no


i have latest boinc provided by Ubuntu, i do not know if it is intended for linux or not
# apt-get install boinc-client
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
boinc-client is already the newest version (7.18.1+dfsg-4).


yes Ubuntu 22.04 has sudo 1.9.9 so i had to compile sudo from sources. Later version was available but i installed 1.9.10 as requested by LHC app logs


cranky-0.1.4: [INFO] Can't find '/etc/cvmfs/domain.d/cern.ch.local'.
cranky-0.1.4: [INFO] Can't find '/etc/cvmfs/config.d/cvmfs-config.cern.ch.local'.

i think you refer to this when say "Your CVMFS configuration does not follow the latest suggestions (see your logs)"

i could not find documentation what should be done with these 2 files

thanks for the help
5) Message boards : Theory Application : Suspend/Resume (Message 8396)
Posted 8 Apr 2024 by rilian
Post:
@computezrmle i installed 50-lhcathome_boinc_theory_native as instructed in other thread https://lhcathomedev.cern.ch/lhcathome-dev/forum_thread.php?id=633&postid=8162#8162

$ sudo cat /etc/sudoers.d/50-lhcathome_boinc_theory_native
# save this file as '/etc/sudoers.d/50-lhcathome_boinc_theory_native'
# ownership must be 'root:root' and access rights must be '-r--r-----'
# '@includedir /etc/sudoers.d' must be enabled in /etc/sudoers

# regular expressions are enclosed between '^' and '$'
# this is supported since sudo version 1.9.10
# for more information read 'man sudoers'

# the regex patterns given here must match the command arguments in the calling script
# missing/additional arguments or an argument order not in sync causes a command to be rejected

# the commands are permitted for the local group 'boinc'
# ensure the calling user is a member of that group


Cmnd_Alias LHCATHOMEBOINC_01 = /usr/bin/cat ^/etc/sudoers.d/50-lhcathome_boinc_theory_native$
Cmnd_Alias LHCATHOMEBOINC_02 = /usr/bin/systemctl ^(freeze|thaw) Theory_[-a-zA-Z0-9_]+\.scope$
Cmnd_Alias LHCATHOMEBOINC_03 = /usr/bin/systemd-run ^--scope -u [a-zA-Z0-9_-]+ -p BindsTo=[a-zA-Z0-9_\.@-]+ -p After=[a-zA-Z0-9_\.@-]+ --slice-inherit --uid=[a-zA-Z0-9_-]+ --gid=boinc --same-dir -q -G /[a-zA-Z0-9_\./-]+/(runc|runc\.new|runc\.old) --root state run -b cernvm [a-zA-Z0-9_-]+$

%boinc     ALL = (ALL) NOPASSWD: LHCATHOMEBOINC_01, LHCATHOMEBOINC_02, LHCATHOMEBOINC_03


Now all my tasks fail due to error like here https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3316642



01:09:09 UTC +00:00 2024-04-08: cranky-0.1.4: [INFO] Starting runc container.
01:09:09 UTC +00:00 2024-04-08: cranky-0.1.4: [INFO] To get some details on systemd level run
01:09:09 UTC +00:00 2024-04-08: cranky-0.1.4: [INFO] systemctl status Theory_2743-2787161-48_0.scope
01:09:09 UTC +00:00 2024-04-08: cranky-0.1.4: [INFO] mcplots runspec: boinc pp jets 7000 80,-,1360 - pythia8 8.301 tune-1 100000 48
01:09:09 UTC +00:00 2024-04-08: cranky-0.1.4: [INFO] ----,^^^^,<<<~_____---,^^^,<<~____--,^^,<~__;_
time="2024-04-08T01:09:09Z" level=error msg="runc run failed: fchown fd 7: operation not permitted"
6) Message boards : Theory Application : Suspend/Resume (Message 8393)
Posted 7 Apr 2024 by rilian
Post:
Please make your computers visible for other volunteers here:
https://lhcathomedev.cern.ch/lhcathome-dev/prefs.php?subset=project

i made
eventually after the other multi-core project tasks were calculated, this one was finished with error

https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3316482
7) Message boards : Theory Application : Suspend/Resume (Message 8390)
Posted 5 Apr 2024 by rilian
Post:
on ubuntu 22.04
my computer also works with MilkyWay@home 8-CPU tasks and i noticed when LHC@home-dev tasks become suspended, they still use lots of CPU

On my 8-cpu machine load average goes to 15+

grep from boinccmd --get_tasks:

   project URL: https://lhcathomedev.cern.ch/lhcathome-dev/
   active_task_state: SUSPENDED
   current CPU time: 23275.890000
   fraction done: 0.837751
   project URL: https://milkyway.cs.rpi.edu/milkyway/
   active_task_state: EXECUTING
   current CPU time: 1141.166000
   fraction done: 0.437930


grep from htop (2nd column is CPU usage)
S  0.0  0.3  4:20.04 ├─ boinc --daemon
R 256.  0.1 24:30.12 │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h histogr
R 35.2  0.1  3:12.10 │  │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
R 31.9  0.1  3:11.36 │  │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
R 35.9  0.1  3:10.74 │  │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
R 35.2  0.1  3:09.77 │  │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
R 32.5  0.1  3:09.30 │  │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
R 35.9  0.1  3:09.25 │  │  ├─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
S  0.0  0.1  0:00.08 │  │  └─ ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.83_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h hist
S  0.0  0.0  0:20.49 │  ├─ ../../projects/lhcathomedev.cern.ch_lhcathome-dev/wrapper_2019_03_02_x86_64-linux
S  0.0  0.0  0:03.09 │  │  ├─ ../../projects/lhcathomedev.cern.ch_lhcathome-dev/wrapper_2019_03_02_x86_64-linux
S  0.0  0.0  0:00.05 │  │  └─ /bin/bash ../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.1.4
S  0.0  0.0  0:00.02 │  │     └─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.01 │  │        ├─ /bin/bash ./job
S  0.0  0.0  0:10.25 │  │        │  └─ /bin/bash ./runRivet.sh boinc pp z1j 8000 - - sherpa 2.2.9 default 5000 43
S  0.0  0.0  0:00.00 │  │        │     ├─ /bin/bash ./runRivet.sh boinc pp z1j 8000 - - sherpa 2.2.9 default 5000 43
S  0.0  0.0  0:00.01 │  │        │     │  └─ /bin/bash ./rungen.sh boinc pp z1j 8000 - - sherpa 2.2.9 default 5000 43 /shared/tmp/tmp.ArGknGQnH2/gene
R 98.9  0.2  6h29:54 │  │        │     │     └─ /cvmfs/sft.cern.ch/lcg/releases/LCG_96/MCGenerators/sherpa/2.2.9/x86_64-centos7-gcc8-opt/bin/Sherpa -
S  0.0  0.0  0:00.00 │  │        │     ├─ /bin/bash ./runRivet.sh boinc pp z1j 8000 - - sherpa 2.2.9 default 5000 43
S  0.0  0.0  0:00.58 │  │        │     │  └─ /shared/rivetvm/rivetvm.exe -a ATLAS_2019_I1744201 -i /shared/tmp/tmp.ArGknGQnH2/generator.hepmc -o /sha
S  0.0  0.0  0:00.00 │  │        │     └─ sleep 3
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        ├─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2
S  0.0  0.0  0:00.00 │  │        └─ /cvmfs/grid.cern.ch/vc/containers/runc.new --root state run -b cernvm Theory_2743-2857775-43_2


i'm not sure if it somehow influences to quality of results or not, but just wanted to mention this



©2024 CERN