Message boards : Theory Application : Theory native - stderr.txt entries
Message board moderation

To post messages, you must log in.

AuthorMessage
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 28 Jul 16
Posts: 481
Credit: 394,720
RAC: 2
Message 5880 - Posted: 13 Feb 2019, 9:23:45 UTC

I suggest to copy the first line from .../slots/x/cernvm/shared/runRivet.log to stderr.txt.
This would be helpful to identify the scientific app in case of an error.

Example from the currently running task:
===> [runRivet] Wed Feb 13 09:09:33 UTC 2019 [boinc pp mb-inelastic 2360 - - pythia6 6.424 default 100000 16]
ID: 5880 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 334,882
RAC: 39
Message 5896 - Posted: 14 Feb 2019, 10:50:27 UTC - in response to Message 5880.  

I suggest to copy the first line from .../slots/x/cernvm/shared/runRivet.log to stderr.txt.
This would be helpful to identify the scientific app in case of an error.

Example from the currently running task:
===> [runRivet] Wed Feb 13 09:09:33 UTC 2019 [boinc pp mb-inelastic 2360 - - pythia6 6.424 default 100000 16]


This should be done in the latest version but I have to see a successful task first to confirm.
ID: 5896 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 334,882
RAC: 39
Message 5904 - Posted: 14 Feb 2019, 13:34:40 UTC - in response to Message 5896.  

I suggest to copy the first line from .../slots/x/cernvm/shared/runRivet.log to stderr.txt.
This would be helpful to identify the scientific app in case of an error.

Example from the currently running task:
===> [runRivet] Wed Feb 13 09:09:33 UTC 2019 [boinc pp mb-inelastic 2360 - - pythia6 6.424 default 100000 16]


This should be done in the latest version but I have to see a successful task first to confirm.


It didn't work. I have pushed out a new version (cranky-0.0.17) which I hope fixes this.
ID: 5904 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
captainjack

Send message
Joined: 18 Aug 15
Posts: 14
Credit: 125,335
RAC: 0
Message 5905 - Posted: 14 Feb 2019, 14:27:49 UTC

Just got a couple of these:

08:23:36 (8122): wrapper (7.7.26015): starting
08:23:36 (8122): wrapper: running ../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17 ()
../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 26: xmllint: command not found
14:23:36 2019-02-14: cranky-0.0.17: [INFO] Detected  App
14:23:36 2019-02-14: cranky-0.0.17: [INFO] Checking CVMFS.
../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 44: [@]: bad substitution
14:23:36 2019-02-14: cranky-0.0.17: [INFO] Checking runc.
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Creating the filesystem.
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Using /cvmfs/cernvm-prod.cern.ch/cvm3
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Updating config.json.
14:23:55 2019-02-14: cranky-0.0.17: [INFO] Running Container 'runc'.
../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 26: xmllint: command not found
/cvmfs/grid.cern.ch/vc/containers/runc: "run" requires exactly 1 argument(s)
14:23:55 2019-02-14: cranky-0.0.17: [ERROR] Container 'runc' failed.
08:23:55 (8122): cranky exited; CPU time 0.079445
08:23:55 (8122): app exit status: 0xce
08:23:55 (8122): called boinc_finish(195)
ID: 5905 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 28 Jul 16
Posts: 481
Credit: 394,720
RAC: 2
Message 5906 - Posted: 14 Feb 2019, 14:37:21 UTC - in response to Message 5905.  

Just got a couple of these:

../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.17: line 26: xmllint: command not found

It may help to install xmllint.
See:
https://lhcathomedev.cern.ch/lhcathome-dev/forum_thread.php?id=446&postid=5893
ID: 5906 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
captainjack

Send message
Joined: 18 Aug 15
Posts: 14
Credit: 125,335
RAC: 0
Message 5908 - Posted: 14 Feb 2019, 15:59:20 UTC

It may help to install xmllint.

Ubuntu 18.10 says "Unable to locate package xmllint"
What next?
ID: 5908 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 334,882
RAC: 39
Message 5909 - Posted: 14 Feb 2019, 16:13:45 UTC - in response to Message 5908.  

It may help to install xmllint.

Ubuntu 18.10 says "Unable to locate package xmllint"
What next?

I will try to solve this issue later today. Was hoping that this would be installed by default.
ID: 5909 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1188
Credit: 854,498
RAC: 83
Message 5910 - Posted: 14 Feb 2019, 17:30:51 UTC - in response to Message 5908.  

It may help to install xmllint.

Ubuntu 18.10 says "Unable to locate package xmllint"
What next?

Installed it with
apt-get install libxml2-utils
ID: 5910 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 28 Jul 16
Posts: 481
Credit: 394,720
RAC: 2
Message 5911 - Posted: 14 Feb 2019, 17:31:14 UTC - in response to Message 5908.  

It may help to install xmllint.

Ubuntu 18.10 says "Unable to locate package xmllint"
What next?

It's included in "libxml2-utils" or "bash-completion".

See:
https://packages.ubuntu.com/search?lang=en&suite=xenial&arch=any&searchon=contents&keywords=xmllint
ID: 5911 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1188
Credit: 854,498
RAC: 83
Message 5912 - Posted: 15 Feb 2019, 6:23:54 UTC - in response to Message 5904.  

I suggest to copy the first line from .../slots/x/cernvm/shared/runRivet.log to stderr.txt.
This would be helpful to identify the scientific app in case of an error.

Example from the currently running task:
===> [runRivet] Wed Feb 13 09:09:33 UTC 2019 [boinc pp mb-inelastic 2360 - - pythia6 6.424 default 100000 16]


This should be done in the latest version but I have to see a successful task first to confirm.


It didn't work. I have pushed out a new version (cranky-0.0.17) which I hope fixes this.

Your hope was honored: https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2752565

[snip]
19:30:43 2019-02-14: cranky-0.0.17: [INFO] Preparing output.
===> [runRivet] Thu Feb 14 17:24:25 UTC 2019 [boinc pp jets 7000 25,-,100 - herwig7 7.0.3 UE-MMHT 100000 16]
20:30:44 (30726): cranky exited; CPU time 7584.431972
20:30:44 (30726): called boinc_finish(0)

[/snip]

I see time differences. It probably means that the job info is not added directly after the start of the job,
what could cause 'no job-info' when a task starves early.
ID: 5912 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 334,882
RAC: 39
Message 5913 - Posted: 15 Feb 2019, 7:59:25 UTC - in response to Message 5912.  

I see time differences. It probably means that the job info is not added directly after the start of the job,

Correct. I can change this.
what could cause 'no job-info' when a task starves early.

I don’t understand what you mean.
ID: 5913 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1188
Credit: 854,498
RAC: 83
Message 5914 - Posted: 15 Feb 2019, 8:53:21 UTC - in response to Message 5913.  
Last modified: 15 Feb 2019, 8:55:10 UTC

what could cause 'no job-info' when a task starves early.

I don’t understand what you mean.

When you change adding the job-info to std output directly after the start it's present when a job runs into an error.
ID: 5914 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 334,882
RAC: 39
Message 5915 - Posted: 15 Feb 2019, 11:24:46 UTC - in response to Message 5913.  

I see time differences. It probably means that the job info is not added directly after the start of the job,

Correct. I can change this.

Done
ID: 5915 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 334,882
RAC: 39
Message 5916 - Posted: 15 Feb 2019, 11:28:14 UTC - in response to Message 5914.  

what could cause 'no job-info' when a task starves early.

I don’t understand what you mean.

When you change adding the job-info to std output directly after the start it's present when a job runs into an error.

It is not so straight forward as the output goes to the runRivet.log and this only exists some time after the job starts. I can revisit this later when we need to debug these kinds of errors.
ID: 5916 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Theory Application : Theory native - stderr.txt entries


©2024 CERN