Message boards :
Theory Application :
Task not starting
Message board moderation
Author | Message |
---|---|
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
I am getting the message: Could not source logging functions from /cvmfs/grid.cern..../bin/logging_functions Task is not doing anything (no CPU usage), no console windows working, show graphics-page not found. |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Thanks for the report, it has already been fix, just waiting for the cache to update. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
I got 5 Theory's and the first one got a job and is still running. The other 4 were blowing by the wind, running 1 to 2 minutes. Extracts from the result logs: 2016-04-22 16:32:03 (9776): Guest Log: [INFO] Mounting the shared directory 2016-04-22 16:32:03 (9776): Guest Log: [INFO] Shared directory mounted, enabling vboxmonitor 2016-04-22 16:32:03 (9776): VM Completion File Detected. 2016-04-22 16:32:03 (9776): VM Completion Message: 1 2016-04-22 16:33:37 (8472): Guest Log: [INFO] Mounting the shared directory 2016-04-22 16:33:37 (8472): Guest Log: [INFO] Shared directory mounted, enabling vboxmonitor 2016-04-22 16:33:37 (8472): Guest Log: [INFO] Reading volunteer information 2016-04-22 16:33:37 (8472): Guest Log: [INFO] Volunteer: () Host: 2016-04-22 16:33:37 (8472): Guest Log: [ERROR] BOINC_USERID is not an integer. Shuting down! 2016-04-22 16:33:37 (8472): Guest Log: [INFO] VMID: c7b1ab21-6a52-4050-9335-489372ba8b3d 2016-04-22 16:33:37 (8472): Guest Log: [ERROR] BOINC_USERID is not set. 2016-04-22 16:33:37 (8472): Guest Log: [ERROR] The x509 proxy creation failed. 2016-04-22 16:33:37 (8472): Guest Log: [INFO] application starting. Check log files. 2016-04-22 16:33:37 (8472): Guest Log: [ERROR] App is not supported. Shutting down! 2016-04-22 16:33:37 (8472): VM Completion File Detected. 2016-04-22 16:33:37 (8472): VM Completion Message: 1 2016-04-22 16:35:00 (9444): Guest Log: [INFO] Mounting the shared directory 2016-04-22 16:35:00 (9444): Guest Log: [INFO] Shared directory mounted, enabling vboxmonitor 2016-04-22 16:35:00 (9444): VM Completion File Detected. 2016-04-22 16:35:00 (9444): VM Completion Message: 1 2016-04-22 16:36:43 (9588): Guest Log: [INFO] Mounting the shared directory 2016-04-22 16:36:43 (9588): Guest Log: [INFO] Shared directory mounted, enabling vboxmonitor 2016-04-22 16:36:43 (9588): Guest Log: [INFO] Reading volunteer information 2016-04-22 16:36:43 (9588): Guest Log: [INFO] Volunteer: () Host: 2016-04-22 16:36:43 (9588): Guest Log: [ERROR] BOINC_USERID is not an integer. Shuting down! 2016-04-22 16:36:43 (9588): Guest Log: [INFO] VMID: c7b1ab21-6a52-4050-9335-489372ba8b3d 2016-04-22 16:36:43 (9588): Guest Log: [ERROR] BOINC_USERID is not set. 2016-04-22 16:36:43 (9588): Guest Log: [ERROR] The x509 proxy creation failed. 2016-04-22 16:36:43 (9588): Guest Log: [INFO] application starting. Check log files. 2016-04-22 16:36:43 (9588): Guest Log: [ERROR] App is not supported. Shutting down! 2016-04-22 16:36:43 (9588): VM Completion File Detected. 2016-04-22 16:36:43 (9588): VM Completion Message: 1 |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
It is failing to find the BOINC info but I don't know why. It is almost like the init_data.xml file in the shared directory is empty. Can you check in your slot directories on your host. 2016-04-22 16:33:37 (8472): Guest Log: [INFO] Volunteer: () Host: 2016-04-22 16:33:37 (8472): Guest Log: [ERROR] BOINC_USERID is not an integer. Shuting down! It should exit here. Will clean that up. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
I was 'lucky' to get the error again -> http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=155251 All 4 new tasks ended that way. A 8450 bytes init_data.xml was in the shared directory. I saved the xml-file. Maybe you're not pointing to the right file, due to a typo in file-name like Shuting down! ;) |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Have just added some debug messages. If you see a similar issue in about an hour, please post a link to the task. I think that this is working for others so am a little confused. Fixed the typo but it will push with other fixes. |
Send message Joined: 13 Apr 15 Posts: 138 Credit: 2,969,210 RAC: 0 |
ALL my recent tasks, CMS,ATLAS and Theory, are ending this way after c.2mins. eg. init_data from Task 155406 shows; <userid>196</userid> <teamid>20</teamid> <hostid>508</hostid> <app_name>Theory</app_name> yet console window shows; Guest Log: [INFO] Volunteer: () Host: Guest Log: [ERROR] BOINC_USERID is not an integer. Shuting down! just before the task exits. So at least they're exiting but still aren't seeing this info. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
|
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
|
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Thanks added some more debugging. Will take about 1 hour to propagate. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
Thanks added some more debugging. Will take about 1 hour to propagate. Something has changed, but not solved yet. Maybe this helps: |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
I think I have fixed. We will see in 1 hour. |
Send message Joined: 13 Apr 15 Posts: 138 Credit: 2,969,210 RAC: 0 |
|
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Thanks, just pushed another fix. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Same error messages as in message 2962 , but in /usr/bin/boinc-proxy: line 21 Console F1 and F3 working, app running. Logs working. |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Fixed pushed. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
Job is running. Only Consoles 1 and 3 (top) have output. In CMS however, Console 2 shows the records processing (running.log) and Console 4 the stdout.log Dunno what fix, but my Theory-VM was running already before you pushed. |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
The error should have disappeared by the time the next task starts. In general for all the applications: Console 1 => boot and initialization Console 2 => running.log (job log) Console 3 => top Console 4 => stdout.log (of the job wrapper) Console 5 => stderr.log (of the job wrapper) Console 6 => login prompt 1,2,3 and 6 should always have output and the files are the same ones that can be seen in the Web logs (show graphics). I can confirm that running.log is missing and will look into it asap. EDIT: Fix pushed |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Running.log is back--thank you. Console F2 has it as well. F4 and F5---don't know, have not started a new task. |
©2024 CERN