1) Message boards : Theory Application : Windows Version (Message 5994)
Posted 20 Feb 2019 by captainjack
Post:
Just got this error code:

upload failure: <file_xfer_error>
  <file_name>Theory_2279-791249-20_0_r485709403_result</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
2) Message boards : Theory Application : Theory native - stderr.txt entries (Message 5908)
Posted 14 Feb 2019 by captainjack
Post:
It may help to install xmllint.

Ubuntu 18.10 says "Unable to locate package xmllint"
What next?
3) Message boards : Theory Application : Theory native - stderr.txt entries (Message 5905)
Posted 14 Feb 2019 by captainjack
Post:
Just got a couple of these:

08:23:36 (8122): wrapper (7.7.26015): starting
08:23:36 (8122): wrapper: running ../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17 ()
../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 26: xmllint: command not found
14:23:36 2019-02-14: cranky-0.0.17: [INFO] Detected  App
14:23:36 2019-02-14: cranky-0.0.17: [INFO] Checking CVMFS.
../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 44: [@]: bad substitution
14:23:36 2019-02-14: cranky-0.0.17: [INFO] Checking runc.
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Creating the filesystem.
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Using /cvmfs/cernvm-prod.cern.ch/cvm3
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Updating config.json.
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Running Container 'runc'.
../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 26: xmllint: command not found
/cvmfs/grid.cern.ch/vc/containers/runc: "run" requires exactly 1 argument(s)
14:23:55 2019-02-14: cranky-0.0.17: [ERROR] Container 'runc' failed.
08:23:55 (8122): cranky exited; CPU time 0.079445
08:23:55 (8122): app exit status: 0xce
08:23:55 (8122): called boinc_finish(195)
4) Message boards : Theory Application : Native Setup Linux (Message 5874)
Posted 12 Feb 2019 by captainjack
Post:
lhcathome has had for a while atlas native tasks. A configuration file for atlas native cvmfs is located in /etc/cvmfs/default.local.

If a volunteer is already running atlas native tasks and decides to add the capability to run theory native tasks, when they enter this command

sudo wget https://lhcathomedev.cern.ch/lhcathome-dev/download/default.local -O /etc/cvmfs/default.local

it will overlay the atlas cvmfs configuration file at that location.
5) Message boards : Theory Application : New Native App - Linux Only (Message 5871)
Posted 12 Feb 2019 by captainjack
Post:
Earlier I wrote:
I managed to snag another one of these tasks this morning. It has been running for 58 minutes. Even though it is allocated 2 CPUs, it looks like it is only using 1 CPU.

Please let me know if you need more information.


Laurence responded:

This may be related to it starting two processes.


The latest attempt:

The machine has a 6 core 12 thread CPU. Each thread should show ~8% of total CPU usage according to the System Monitor.

The machine now has a test Theory task running that is allocated 4 CPUs (threads). According to the System Monitor, the processes that I can see that look like they are associated with the test Theory task are rivetvm.exe using 5% of the total CPU and pythia8.exe using 4% of the total CPU. The task has been running for 25+ minutes. Even though it is allocated 4 threads, it looks like it is really only using 1 thread.

Am I missing something?
6) Message boards : Theory Application : New Native App - Linux Only (Message 5845)
Posted 11 Feb 2019 by captainjack
Post:
m wrote:
.... trying to get an app_config to get this host to only download one task at a time, but I can't even get that right now..... says "missing start tag"

An app_config will not control the number of tasks that get downloaded, it controls the number of tasks that run concurrently. If you want to limit the number of tasks that get downloaded, use the "Max # Jobs" setting in your project preferences.

If you still want to use an app_config for another purpose, post it here and maybe someone else here can help debug.
7) Message boards : Theory Application : New Native App - Linux Only (Message 5832)
Posted 11 Feb 2019 by captainjack
Post:
I managed to snag another one of these tasks this morning. It has been running for 58 minutes. Even though it is allocated 2 CPUs, it looks like it is only using 1 CPU.

Please let me know if you need more information.
8) Message boards : CMS Application : No Tasks (Message 5666)
Posted 17 Nov 2018 by captainjack
Post:
Got this error message on a CMS test task this morning.

11/17/2018 7:24:24 AM | lhcathome-dev | Aborting task CMS_2887042_1542275487.030765_0: exceeded disk limit: 8435.33MB > 7629.39MB

Task had been suspended overnight for a PC shutdown. Task aborted after startup this morning.

Please let me know if you need more information.
9) Message boards : Sixtrack Application : The Sixtrack Application (Message 5615)
Posted 7 Nov 2018 by captainjack
Post:
More error messages that might help with problem resolution:

Wed 07 Nov 2018 10:47:48 AM CST | lhcathome-dev | [error] garbage_collect(); still have active task for acked result Sixtrack_1538966_1540999458.954204_1579_1; state 9
Wed 07 Nov 2018 10:47:49 AM CST | lhcathome-dev | [error] garbage_collect(); still have active task for acked result Sixtrack_1538966_1540999458.954204_1579_1; state 5
Wed 07 Nov 2018 10:47:49 AM CST | lhcathome-dev | Output file Sixtrack_1538966_1540999458.954204_1579_1_r674300233_0 for task Sixtrack_1538966_1540999458.954204_1579_1 absent


Just in case you need it, computer id = 3601.

Please let me know if you need more info.
10) Message boards : Sixtrack Application : The Sixtrack Application (Message 5609)
Posted 6 Nov 2018 by captainjack
Post:
Me too, I also have 45 aborted tasks that are each taking up 328MiB of memory. All of 16GB memory is currently being used and 1 GB of the SWAP file is being used. System currently has no active sixtrack test tasks showing in BOINC. Time for a reboot and stop running sixtrack test tasks.

By the way, does anybody know what we are testing with all the sixtrack tasks? Would love to know if we are supposed to be watching for anything specific.
11) Message boards : Theory Application : New version v3.04 (Message 4945)
Posted 27 May 2017 by captainjack
Post:
Still getting v3.02. I tried resetting the project, no luck. Then I removed and re-added the project. Still getting v3.02.
12) Message boards : CMS Application : Dip? (Message 4383)
Posted 1 Dec 2016 by captainjack
Post:
Ivan asked:

Actually... Can you please post your app_config.xml so that the more-experienced volunteers can have a chance to critique it? Thanks.


I was not using an app_config.xml.

Ivan also asked in a recent thread:

If you don't have an app_config.xml, does BOINC download a new one for you?


BOINC did not download an app_config.xml for me.

I just tried another CMS task and got the same result. Some relevant messages below:

<message>
The filename or extension is too long.
(0xce) - exit code 206 (0xce)
</message>

2016-12-01 07:43:34 (10148): Setting Memory Size for VM. (2048MB)
2016-12-01 07:43:34 (10148): Setting CPU Count for VM. (3)

2016-12-01 07:45:42 (10148): Guest Log: [INFO] CMS application starting. Check log files.
2016-12-01 07:45:42 (10148): Guest Log: [DEBUG] HTCondor ping
2016-12-01 07:45:52 (10148): Guest Log: [DEBUG] 0
2016-12-01 07:56:53 (10148): Guest Log: [ERROR] Condor exited after 673s without running a job.
2016-12-01 07:56:53 (10148): Guest Log: [INFO] Shutting Down

Please let me know if I can provide more info.
13) Message boards : CMS Application : Dip? (Message 4366)
Posted 30 Nov 2016 by captainjack
Post:
I just tried one of the multi-thread tasks and got this:

2016-11-30 09:02:18 (15296): Guest Log: [DEBUG] HTCondor ping
2016-11-30 09:02:28 (15296): Guest Log: [DEBUG] 0
2016-11-30 09:12:59 (15296): Guest Log: [ERROR] Condor exited after 637s without running a job.
2016-11-30 09:12:59 (15296): Guest Log: [INFO] Shutting Down.
14) Message boards : Number crunching : issue of the day (Message 1482)
Posted 18 Nov 2015 by captainjack
Post:
Bill Michael said:

CMS "Waiting to run (Scheduler wait: Please update/recompile VirtualBox Kernal Drivers.)"


If you install DKMS (Dynamic Kernel Management System) before you install VirtualBox, you shouldn't need to recomiple VirtualBox after a kernel update.

Ivan, the same principle applies for Nvidia drivers. If you install DKMS before you install the Nvidia drivers, you shouldn't have to re-install the Nvidia drivers after a kernel update. The only time I have to re-install Nvidia drivers after a kernel update is when I am running a pre-release (alpha or beta) version of the Ubuntu operating system.

I don't have any experience with XeonPhi drivers, but the same principle might apply. It would certainly be worth a test.

Hope that helps.



©2024 CERN