Message boards : News : Change Log
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2172 - Posted: 2 Mar 2016, 20:14:56 UTC

This thread will record all changes made to the project, to help correlate issues with their potential causes. It is needed because not all changes are tied to a new application release; changes to the supporting infrastructure, for example, are not.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2181 - Posted: 3 Mar 2016, 10:42:25 UTC - in response to Message 2172.  

An initial version of the LHCb app has been added.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2182 - Posted: 3 Mar 2016, 10:57:42 UTC - in response to Message 2181.  
Last modified: 3 Mar 2016, 10:57:52 UTC

Added two new discussion topics:

  • CMS Application
  • LHCb Application

Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2185 - Posted: 3 Mar 2016, 14:05:21 UTC - in response to Message 2182.  
Last modified: 3 Mar 2016, 20:21:19 UTC

Added new CMS application v46.26.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2194 - Posted: 3 Mar 2016, 20:21:53 UTC - in response to Message 2185.  

A new link has been added to the menu to show plots of the LHCb jobs.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2196 - Posted: 3 Mar 2016, 20:56:02 UTC - in response to Message 2194.  

Added a plot of application failures to the CMS job stats page.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2211 - Posted: 4 Mar 2016, 9:04:58 UTC - in response to Message 2196.  

Updated the bootstrap script. It was cleaned up generally but, most importantly, the CVMFS reload was removed.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2214 - Posted: 4 Mar 2016, 11:11:02 UTC - in response to Message 2211.  

Enabled the validator for CMS and pushed a new version of the CMSJobAgent that has additional protections.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2226 - Posted: 4 Mar 2016, 15:13:30 UTC - in response to Message 2214.  

Updated HTCondor configuration.
Rasputin42
Volunteer tester

Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 2231 - Posted: 4 Mar 2016, 17:25:41 UTC
Last modified: 4 Mar 2016, 17:30:30 UTC

MOVED FROM SUSPEND/RESUME THREAD.
Originally Posted: 4 Mar 2016, 15:12:52 UTC

The Condor configuration has been updated and pushed to CVMFS. It should be available with new glideins (runs) this evening. The target is to allow suspending for up to 2 hours. Here are the current settings:

NOT_RESPONDING_TIMEOUT = 10800 (was 7200)
CCB_HEARTBEAT_INTERVAL = 0 (was 3600)
ALIVE_INTERVAL = 1800
MAX_CLAIM_ALIVES_MISSED = 6
JobLeaseDuration = 7200

The settings are shown in the glidein-stderr file from the graphics/logs.
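
As a rough check of how these values relate to the 2-hour suspend target, here is a minimal Python sketch. It assumes, from general HTCondor behaviour rather than anything stated in this thread, that a claim is given up once MAX_CLAIM_ALIVES_MISSED consecutive keepalives (one expected every ALIVE_INTERVAL seconds) have been missed. The `windows` table and the `covers_target` helper are illustrative names only.

```python
# Values copied from the post above (all in seconds).
settings = {
    "NOT_RESPONDING_TIMEOUT": 10800,
    "ALIVE_INTERVAL": 1800,
    "MAX_CLAIM_ALIVES_MISSED": 6,
    "JobLeaseDuration": 7200,
}

TARGET_SUSPEND = 2 * 3600  # the 2-hour suspend target, in seconds

# Assumption: the claim survives this many seconds of missed keepalives.
claim_grace = settings["ALIVE_INTERVAL"] * settings["MAX_CLAIM_ALIVES_MISSED"]

windows = {
    "NOT_RESPONDING_TIMEOUT": settings["NOT_RESPONDING_TIMEOUT"],
    "ALIVE_INTERVAL * MAX_CLAIM_ALIVES_MISSED": claim_grace,
    "JobLeaseDuration": settings["JobLeaseDuration"],
}


def covers_target(seconds, target=TARGET_SUSPEND):
    """Return True if a timeout window is at least as long as the target."""
    return seconds >= target


for name, seconds in windows.items():
    print(f"{name}: {seconds / 3600:.1f} h, covers 2 h: {covers_target(seconds)}")
```

Under that assumption, every window listed is at least as long as the 2-hour target, with JobLeaseDuration matching it exactly.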
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2261 - Posted: 7 Mar 2016, 15:49:26 UTC - in response to Message 2231.  

- Updated the bootstrap script to fix the sudo issue
- Updated the CMSJobAgent to collect logs
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2333 - Posted: 11 Mar 2016, 12:41:27 UTC - in response to Message 2261.  

- Updated LHCb to version v0.3
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2385 - Posted: 14 Mar 2016, 14:56:58 UTC - in response to Message 2333.  

- Updated the gfal-copy wrapper to measure the bandwidth of the transfer.
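
For readers curious how such a measurement can work, here is a minimal Python sketch of timing a transfer and deriving a bandwidth figure. It is not the project's wrapper: `measure_bandwidth` is a hypothetical helper, and a short sleep stands in for the real gfal-copy call so that the example runs anywhere.

```python
import time


def measure_bandwidth(copy_fn, size_bytes):
    """Run copy_fn() and return (elapsed_seconds, bytes_per_second)."""
    start = time.monotonic()
    copy_fn()
    elapsed = time.monotonic() - start
    # Guard against a zero reading from a coarse clock.
    rate = size_bytes / elapsed if elapsed > 0 else float("inf")
    return elapsed, rate


# Stand-in for the real transfer: sleep briefly instead of copying a file.
elapsed, rate = measure_bandwidth(lambda: time.sleep(0.05), size_bytes=1_000_000)
print(f"{elapsed:.3f} s, {rate / 1e6:.2f} MB/s")
```

In a real wrapper, `copy_fn` would launch the transfer command and `size_bytes` would be the size of the file that was moved.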
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2393 - Posted: 15 Mar 2016, 15:53:58 UTC - in response to Message 2385.  
Last modified: 15 Mar 2016, 15:57:10 UTC

- Updated the Condor configuration to set CLAIM_WORKLIFE = -1 (was ifThenElse(DynamicSlot =?= true,3600,-1))
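
To illustrate the effect of this change, the sketch below mimics the old and new settings in Python. It assumes that `=?=` is the ClassAd "is identical to" comparison, so an undefined DynamicSlot simply counts as not identical to true, and that a CLAIM_WORKLIFE of -1 means no work-life limit on the claim. `old_claim_worklife` and `new_claim_worklife` are hypothetical helper names.

```python
def old_claim_worklife(dynamic_slot):
    """Old setting: ifThenElse(DynamicSlot =?= true, 3600, -1).

    dynamic_slot is True, False, or None (None stands in for an
    undefined attribute). Only an exact True selects the 3600 s limit.
    """
    return 3600 if dynamic_slot is True else -1


def new_claim_worklife(dynamic_slot):
    """New setting: CLAIM_WORKLIFE = -1, whatever the slot type."""
    return -1


for slot in (True, False, None):
    print(slot, old_claim_worklife(slot), new_claim_worklife(slot))
```

Under these assumptions, the only difference is for dynamic slots, which previously had a one-hour work-life limit and now have none.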
Profile Szarararar
Project administrator
Project developer
Project tester
Project scientist

Joined: 13 Oct 15
Posts: 3
Credit: 21
RAC: 0
Message 2420 - Posted: 16 Mar 2016, 15:56:18 UTC
Last modified: 16 Mar 2016, 15:58:11 UTC

- Updated the server code to revision 27013.
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2471 - Posted: 21 Mar 2016, 12:40:18 UTC - in response to Message 2420.  

- Updated the glidein configuration:

GLIDEIN_Retire_Time = 43200
GLIDEIN_MAX_WALLTIME = 64800
CLAIM_WORKLIFE = 3600
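
These values are easier to read in hours. The short Python sketch below just converts them, assuming, as the magnitudes suggest, that all three are expressed in seconds.

```python
# Values copied from the post above, assumed to be in seconds.
glidein = {
    "GLIDEIN_Retire_Time": 43200,
    "GLIDEIN_MAX_WALLTIME": 64800,
    "CLAIM_WORKLIFE": 3600,
}

# Convert each setting to hours.
hours = {name: seconds / 3600 for name, seconds in glidein.items()}

for name, h in hours.items():
    print(f"{name} = {glidein[name]} s = {h:g} h")
```

That gives 12 hours for the retire time, 18 hours for the maximum wall time, and 1 hour for the claim work life.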
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2481 - Posted: 21 Mar 2016, 14:57:22 UTC - in response to Message 2471.  

- Updated LHCb application to v0.5
Rasputin42
Volunteer tester

Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 2489 - Posted: 21 Mar 2016, 19:31:35 UTC - in response to Message 2471.  

Something is wrong with the new glidein settings.
There is only one job per run:

max wall time, 64800
WARNING: job max time is bigger than max_walltime, lowering it.
job max time, 63644
calculated retire time, 666 (was 21600)
using default retire spread, 66
Retire time set to 629 (was 19656)
Die time set to 64273
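
The numbers in this log can be cross-checked with a little arithmetic. The Python sketch below only restates relationships that the logged values happen to satisfy (a spread of about 10% of the calculated retire time, and die time = retire time + job max time). It is an observation about this log, not a description of the glidein startup script. If the glidein stops accepting new jobs once the retire time is reached, which is an assumption here, a window of 629 s would be consistent with seeing only one job per run.

```python
# Values copied from the log above (all in seconds).
max_walltime = 64800
job_max_time = 63644
calculated_retire = 666
retire_spread = 66
retire_time = 629
die_time = 64273

# Relationships the logged numbers satisfy (observed, not documented).
assert retire_spread == calculated_retire // 10
assert calculated_retire - retire_spread <= retire_time <= calculated_retire
assert die_time == retire_time + job_max_time
assert die_time <= max_walltime

print(f"retire window: {retire_time} s (about {retire_time / 60:.0f} min)")
```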
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2511 - Posted: 22 Mar 2016, 22:43:04 UTC - in response to Message 2489.  

- Updated CMS to version 46.27

  • Reduced the job duration (VM lifetime) to 18 hours

- Reduced the runtime for jobs to 12 hours
Profile Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1069
Credit: 334,882
RAC: 0
Message 2576 - Posted: 26 Mar 2016, 22:27:14 UTC - in response to Message 2511.  

- Added some debugging info for CVMFS


©2024 CERN