Message boards :
News :
New CMS Agent
Message board moderation
Author | Message |
---|---|
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
We are just about to push a new CMS Agent to CVMFS. The code has been re-factored to be much simpler, less code = less bugs :) It should appear in a few hours, let us know if there are any problems. |
Send message Joined: 29 May 15 Posts: 147 Credit: 2,842,484 RAC: 0 |
It should appear in a few hoursWhere should it appear ? Is it a new CMS Simulation or is it something inside your server-side ? |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
It is now available! Yeti, it is a new cron job that runs the CMS glidein. Previously the cron job called a python script that runs the CMS glidein but with our re-engineering effort, we don't need the python script anymore. We also provide the user proxy directly rather than providing the user certificate and generating one within the VM. So everything is simpler and hence fewer potential bugs. Hopefully the logging has been fixed and improved to make it easier to debug. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
Something wrong with the new script? I see only the process glidein_startup and for a second condor_master. cron-stdout only reports every minute: 16:32:01 +0200 2015-08-19 [INFO] CMS glidein ended |
Send message Joined: 29 May 15 Posts: 147 Credit: 2,842,484 RAC: 0 |
Same here, 2 PCs are Looping through, but getting (or fetching) no work. Uptime 22 and 19 minutes |
Send message Joined: 29 May 15 Posts: 147 Credit: 2,842,484 RAC: 0 |
|
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
The good news is the logs are working better :) The bad news is that for whatever reason Condor is failing with Failed to authenticate because the subject '/O=Volunteer Computing/O=CERN/CN=Laurence 2' is not currently trusted by you. If it should be, add it to GSI_DAEMON_NAME or undefine GSI_DAEMON_NAME.|AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5006:Failed to authenticate because the subject '/O=Volunteer Computing/O=CERN/CN=Laurence 2' is not currently trusted by you. If it should be, add it to GSI_DAEMON_NAME or undefine GSI_DAEMON_NAME. 08/19/15 16:39:19 (pid:12158) ERROR "FAILED TO SEND INITIAL KEEP ALIVE TO OUR PARENT <128.142.136.111:57293>" at line 9470 in file /slots/12/dir_4417/userdir/src/condor_daemon_core.V6/daemon_core.cpp 08/19/15 16:39:19 (pid:12158) startd exiting because of fatal exception. We are currently investigating this. |
Send message Joined: 20 Mar 15 Posts: 243 Credit: 886,442 RAC: 0 |
Since somebody mistyped "downloading" could it simply be a typo somewhere? Cern systems' pathnames seem to be the product of a very creative mind. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 2 |
Space in name gives a problem? /cvmfs/cms.cern.ch/CMS@Home/agent/CMSJobAgent.sh: line 9: export: `Pellet ': not a valid identifier |
Send message Joined: 4 May 15 Posts: 64 Credit: 55,584 RAC: 0 |
Doesn't like me either: /cvmfs/cms.cern.ch/CMS@Home/agent/CMSJobAgent.sh: line 9: export: `Haselgrove ': not a valid identifier |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Very sorry. A fix will be deployed shortly ... |
©2024 CERN