Message boards : News : New CMS Agent

Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 683 - Posted: 19 Aug 2015, 13:05:07 UTC

We are just about to push a new CMS Agent to CVMFS. The code has been refactored to be much simpler: less code = fewer bugs :)

It should appear in a few hours. Let us know if there are any problems.

Yeti
Joined: 29 May 15
Posts: 147
Credit: 2,842,484
RAC: 0
Message 685 - Posted: 19 Aug 2015, 13:19:59 UTC - in response to Message 683.  
Last modified: 19 Aug 2015, 13:20:16 UTC

"It should appear in a few hours"
Where should it appear?

Is it a new CMS simulation, or is it something on your server side?

Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 687 - Posted: 19 Aug 2015, 13:46:02 UTC - in response to Message 685.  

It is now available!

Yeti, it is a new cron job that runs the CMS glidein. Previously the cron job called a Python script that ran the CMS glidein, but after our re-engineering effort we no longer need the Python script. We also now provide the user proxy directly, rather than providing the user certificate and generating a proxy within the VM. So everything is simpler, and hence there are fewer potential bugs. Hopefully the logging has also been fixed and improved to make debugging easier.

Crystal Pellet
Volunteer tester
Joined: 13 Feb 15
Posts: 1188
Credit: 861,475
RAC: 2
Message 688 - Posted: 19 Aug 2015, 13:47:30 UTC - in response to Message 685.  
Last modified: 19 Aug 2015, 14:40:48 UTC

Something wrong with the new script?

I see only the glidein_startup process and, for a second, condor_master.

cron-stdout only reports every minute: 16:32:01 +0200 2015-08-19 [INFO] CMS glidein ended

Yeti
Joined: 29 May 15
Posts: 147
Credit: 2,842,484
RAC: 0
Message 692 - Posted: 19 Aug 2015, 14:45:39 UTC
Last modified: 19 Aug 2015, 14:45:59 UTC

Same here: 2 PCs are looping through but getting (or fetching) no work.

Uptimes: 22 and 19 minutes.

Yeti
Joined: 29 May 15
Posts: 147
Credit: 2,842,484
RAC: 0
Message 693 - Posted: 19 Aug 2015, 14:48:29 UTC

Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 694 - Posted: 19 Aug 2015, 14:57:36 UTC - in response to Message 693.  

The good news is that the logs are working better :) The bad news is that, for whatever reason, Condor is failing with:

Failed to authenticate because the subject '/O=Volunteer Computing/O=CERN/CN=Laurence 2' is not currently trusted by you. If it should be, add it to GSI_DAEMON_NAME or undefine GSI_DAEMON_NAME.|AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5006:Failed to authenticate because the subject '/O=Volunteer Computing/O=CERN/CN=Laurence 2' is not currently trusted by you. If it should be, add it to GSI_DAEMON_NAME or undefine GSI_DAEMON_NAME.
08/19/15 16:39:19 (pid:12158) ERROR "FAILED TO SEND INITIAL KEEP ALIVE TO OUR PARENT <128.142.136.111:57293>" at line 9470 in file /slots/12/dir_4417/userdir/src/condor_daemon_core.V6/daemon_core.cpp
08/19/15 16:39:19 (pid:12158) startd exiting because of fatal exception.

We are currently investigating this.

m
Volunteer tester
Joined: 20 Mar 15
Posts: 243
Credit: 886,442
RAC: 0
Message 696 - Posted: 19 Aug 2015, 15:10:13 UTC
Last modified: 19 Aug 2015, 15:18:42 UTC

Since somebody mistyped "downloading", could it simply be a typo somewhere? CERN systems' pathnames seem to be the product of a very creative mind.

Crystal Pellet
Volunteer tester
Joined: 13 Feb 15
Posts: 1188
Credit: 861,475
RAC: 2
Message 700 - Posted: 19 Aug 2015, 17:27:29 UTC

Does the space in the name cause a problem?

/cvmfs/cms.cern.ch/CMS@Home/agent/CMSJobAgent.sh: line 9: export: `Pellet
': not a valid identifier

Richard Haselgrove
Joined: 4 May 15
Posts: 64
Credit: 55,584
RAC: 0
Message 701 - Posted: 19 Aug 2015, 18:04:44 UTC

Doesn't like me either:

/cvmfs/cms.cern.ch/CMS@Home/agent/CMSJobAgent.sh: line 9: export: `Haselgrove
': not a valid identifier
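
A note for anyone hitting the same message: the shell prints "not a valid identifier" when `export` is handed a word that is not a legal variable name. The line break inside the quoted word in both reports suggests the value also carried a trailing carriage return or newline. Line 9 of CMSJobAgent.sh is not shown in this thread, so the sketch below is only one way such an error can arise, not a description of the actual script. The names `line` and `VOLUNTEER_NAME` and the `KEY=value` input format are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical input: a KEY=value line whose value contains a space and ends
# with a carriage return (for example, read from a file with CRLF endings).
cr=$(printf '\r')
line="VOLUNTEER_NAME=Crystal Pellet$cr"

# Unquoted expansion: the shell splits on the space, so export receives
# "VOLUNTEER_NAME=Crystal" and "Pellet<CR>". The second word is not a valid
# variable name, so export fails. The subshell keeps the failure contained.
unquoted_error=$( (export $line) 2>&1 )
unquoted_status=$?

# Safer: strip the trailing carriage return, split once on the first "=",
# and quote the assignment so the whole value stays in a single word.
clean=${line%"$cr"}
key=${clean%%=*}
value=${clean#*=}
export "$key=$value"

printf 'unquoted export exit status: %s\n' "$unquoted_status"
printf '%s=[%s]\n' "$key" "$VOLUNTEER_NAME"
```

Under this assumption a single-word name would not trigger the error, because the one resulting word is still a well-formed assignment. That would be consistent with only users whose names contain a space reporting it.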

Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 702 - Posted: 19 Aug 2015, 18:45:49 UTC - in response to Message 701.  

Very sorry. A fix will be deployed shortly ...
