New Experimental ATLAS Application
Author | Message |
---|---|
Send message Joined: 15 Apr 16 Posts: 3 Credit: 8,855 RAC: 0 |
I've completed 3 tasks today, but all of them finished with the message "App is not supported. Shutting down!". I have attached the log of the longest one; the other two ran for just 5-10 min with the same end message. http://lhcathomedev.cern.ch/vLHCathome-dev/results.php?userid=374 2016-04-21 11:03:34 (6456): Setting checkpoint interval to 600 seconds. (Higher value of (Preference: 60 seconds) or (Vbox_job.xml: 600 seconds)) |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
"[ERROR] App is not supported. Shutting down!" Ignore this message. I don't know why we get it, but it is cosmetic. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
"Ignore this message, I don't know why we get this message but it is cosmetic." Tasks shutting down after 10 min is not. |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
That is very true! We were out of jobs. Have just submitted some more. Hopefully we can automate this soon and have constant job pressure. The error message issue has been identified and a fix should be there in an hour when new tasks are started. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Thanks, Laurence. Is there any way we can see if JOBS are available? We can see BOINC tasks on the SSP. |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Not for now. If the task finishes successfully after about 10 minutes with the message "Normal DAEMON_SHUTDOWN encountered", this suggests that we are out of jobs. However, as the results are currently going to /dev/null, we should not put too many cycles into this. The only purpose here is to identify issues and improve the application. |
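The rule of thumb above can be written down as a small check. This is only a sketch: the function name, the 15-minute cutoff, and the idea of passing in the log text and run time by hand are assumptions made for illustration, not part of the ATLAS application or BOINC.

```python
# Marker text quoted in the post above.
SHUTDOWN_MARKER = "Normal DAEMON_SHUTDOWN encountered"


def looks_out_of_jobs(log_text: str, runtime_seconds: float,
                      cutoff_seconds: float = 15 * 60) -> bool:
    """Guess whether a task ended because no jobs were available.

    Returns True only if the clean-shutdown marker is in the log AND the
    task ran for no longer than the (assumed) cutoff.
    """
    ended_cleanly = SHUTDOWN_MARKER in log_text
    ended_quickly = runtime_seconds <= cutoff_seconds
    return ended_cleanly and ended_quickly
```

A task that runs for hours and then logs the same marker would not match, which fits the description of a normal job finishing.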
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Console F2: no output. Consoles F3, F4, F5 (and F6) have output. However, there is no progress display, and the info in F4 and F5 seems to contain no useful (real-time) information. No running.log in "Show graphics". |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
I got this error: stderr.log:
|
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Cannot start new task. It is stuck at requesting credentials. Server issue? |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Server fell over: http://lhcathomedev.cern.ch/vLHCathome-dev/forum_thread.php?id=203&postid=2985#2985 |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Console F1: boot screen OK. Credentials seem to work, but tasks are shutting down after 7 min. NO JOBS? http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=156950 |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
You are right! Out of jobs. More submitted. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Seems to be working. Thanks. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Jobs are quite large. Upload size about 140MB. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Running job output should appear here. Output at console F2 and "running.log" |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Each output file is about 20MB. Where do you get the figure of 140MB? |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
I waited until a job was close to finishing and monitored the transfer with Process Explorer. It took about 20 min on a 1 Mbit/s upload. EDIT: jobs run for about 3h on my machine. |
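As a rough cross-check of the 140MB figure, the numbers in the post above can be plugged into a short calculation. This is a back-of-the-envelope sketch that assumes the 1 Mbit/s link was fully used for the whole 20 minutes and ignores protocol overhead; the function name is made up for illustration.

```python
def upload_size_mb(link_mbit_per_s: float, minutes: float) -> float:
    """Upper bound on data sent over a saturated link, in megabytes (10^6 bytes)."""
    bits_sent = link_mbit_per_s * 1_000_000 * minutes * 60
    return bits_sent / 8 / 1_000_000


size_mb = upload_size_mb(1, 20)   # 1 Mbit/s for 20 minutes
files_of_20_mb = size_mb / 20     # how many 20 MB output files that would be

print(size_mb)         # 150.0
print(files_of_20_mb)  # 7.5
```

An upper bound of 150 MB is consistent with an observed upload of about 140MB. With output files of about 20MB each, that would correspond to roughly seven files, although the thread does not confirm how many files a task uploads.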
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
At the end of the job, you should see in Console 5 (stderr.log) a gfal-copy command. After that has run, it should show the bandwidth experienced. Bandwidth: xxx |
Send message Joined: 20 May 15 Posts: 217 Credit: 6,193,119 RAC: 487 |
"At the end of the job, you should see in Console 5 (stderr.log) a gfal-copy command. After that has run, it should show the bandwidth experienced." I just happened to look whilst it was doing a gfal-copy. It did put up the headers for various columns to describe the copying, but it then scrolled up so fast as it moved on to the next job/process that I didn't get to read it. Can't find it in any of the logs yet. How many jobs are being run at a time? |
Send message Joined: 20 May 15 Posts: 217 Credit: 6,193,119 RAC: 487 |
I was looking at F5 with everything scrolling up fast! F4 is still showing that the bandwidth was 510497 (23,866,170 bytes). Not sure how many jobs it has done, but it has been running for over 5 hours now. |
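The two numbers reported on F4 can be related with a quick calculation. The thread does not state the unit of the bandwidth figure, so the sketch below assumes it is bytes per second; if the unit is something else, the resulting time does not apply. The function name is hypothetical.

```python
def transfer_seconds(size_bytes: int, bandwidth_bytes_per_s: float) -> float:
    """Time to move size_bytes at a constant rate (ASSUMED to be bytes per second)."""
    return size_bytes / bandwidth_bytes_per_s


size_bytes = 23_866_170   # file size shown on console F4
bandwidth = 510_497       # bandwidth shown on console F4, unit assumed

print(round(size_bytes / 1_000_000, 1))                     # size in MB
print(round(transfer_seconds(size_bytes, bandwidth), 1))    # seconds, under the assumption
```

The size works out to just under 24 MB, which is in line with the "about 20MB" per output file mentioned earlier in the thread.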