Message boards :
LHCb Application :
Ready For Production?
Author | Message |
---|---|
Joined: 12 Sep 14 Posts: 1067 Credit: 329,589 RAC: 87 |
LHCb would like to encourage their members to participate in this activity, so we are considering moving this application into the production project as a beta app, similar to what was done for CMS. Are there any opinions on this suggestion? |
Joined: 13 Feb 15 Posts: 1185 Credit: 849,977 RAC: 1,116 |
I'm not sure whether the application is doing useful work. In the past I saw a Python process using most of the CPU, and my last LHCb task ended very quickly with low CPU usage.
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=193431
Incomplete running log:
Directories in PYTHONPATH: ['']
2016-06-01 11:36:45 UTC INFO [Pilot] Executing commands: ['LHCbGetPilotVersion', 'CheckWorkerNode', 'LHCbInstallDIRAC', 'LHCbConfigureBasics', 'LHCbCleanPilotEnv', 'LHCbConfigureSite', 'LHCbConfigureArchitecture', 'LHCbConfigureCPURequirements', 'LaunchAgent']
2016-06-01 11:36:45 UTC INFO [Pilot] Requested command extensions: ['LHCbPilot']
2016-06-01 11:36:45 UTC INFO [Pilot] Command LHCbGetPilotVersion instantiated from LHCbPilotCommands
2016-06-01 11:36:45 UTC INFO [LHCbGetPilotVersion] Pilot version not requested as pilot script option, going to find it
2016-06-01 11:36:45 UTC INFO [LHCbGetPilotVersion] Setting pilot version to v8r2p45
2016-06-01 11:36:45 UTC INFO [Pilot] Command CheckWorkerNode instantiated from pilotCommands
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] Uname = Linux 38-37-16958 3.10.64-85.cernvm.x86_64 #1 SMP Fri Jan 9 09:53:29 CET 2015 x86_64
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] Host Name = 38-37-16958
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] Host FQDN = localhost.localdomain
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] WorkingDir = /home/boinc/pilot
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] RedHat Release = Scientific Linux release 6.6 (Carbon)
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] Linux release:
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] LSB_VERSION=base-4.0-amd64:base-4.0-noarch:core-4.0-amd64:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-noarch
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] CPU (model) = Intel(R) Core(TM) i7-2600 CPU @ 3.40GHz
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] CPU (MHz) = 1 x 3463.850
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] Memory (kB) = 2050972
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] FreeMem. (kB) = 1777788
2016-06-01 11:36:45 UTC INFO [CheckWorkerNode] DiskSpace (MB) = 17266
2016-06-01 11:36:45 UTC INFO [Pilot] Command LHCbInstallDIRAC instantiated from LHCbPilotCommands
********************************************************************************
* ---- LHCb Login v8r6p1 ---- *
* Building with gcc49 on slc6 x86_64 system (x86_64-slc6-gcc49-opt) *
********************************************************************************
--- User_release_area is set to /home/boinc/pilot/cmtuser
--- LHCBPROJECTPATH is set to:
/cvmfs/lhcb.cern.ch/lib/lhcb
/cvmfs/lhcb.cern.ch/lib/lcg/releases
/cvmfs/lhcb.cern.ch/lib/lcg/app/releases
/cvmfs/lhcb.cern.ch/lib/lcg/external
--------------------------------------------------------------------------------
Using CMTPROJECTPATH = '/cvmfs/lhcb.cern.ch/lib/lhcb:/cvmfs/lhcb.cern.ch/lib/lcg/releases:/cvmfs/lhcb.cern.ch/lib/lcg/app/releases:/cvmfs/lhcb.cern.ch/lib/lcg/external'
Environment for LbScripts v8r6p1 ready.
(Compat v1r19 from /cvmfs/lhcb.cern.ch/lib/lhcb/COMPAT/COMPAT_v1r19, LbScripts v8r6p1 from /cvmfs/lhcb.cern.ch/lib/lhcb/LBSCRIPTS/LBSCRIPTS_v8r6p1, LCGCMT 84 from /cvmfs/lhcb.cern.ch/lib/lcg/releases/LCGCMT/LCGCMT_84, Compat v1r19 from /cvmfs/lhcb.cern.ch/lib/lhcb/COMPAT/COMPAT_v1r19)
2016-06-01 11:37:12 UTC INFO [LHCbInstallDIRAC] lb-run DONE, for release v8r2p45
2016-06-01 11:37:12 UTC INFO [Pilot] Command LHCbConfigureBasics instantiated from LHCbPilotCommands
2016-06-01 11:37:12 UTC WARN [LHCbConfigureBasics] Can't find shared area, forcing it to /cvmfs/lhcb.cern.ch/lib
2016-06-01 11:37:12 UTC INFO [LHCbConfigureBasics] Executing command dirac-configure -S "LHCb-Production" -C "dips://lbvobox46.cern.ch:9135/Configuration/Server" -o /LocalSite/ReleaseProject=LHCb -o /LocalSite/ReleaseVersion=v8r2p45 -o /LocalSite/SharedArea=/cvmfs/lhcb.cern.ch/lib -DMH --UseServerCertificate -o /DIRAC/Security/CertFile=/etc/grid-security/hostcert.pem -o /DIRAC/Security/KeyFile=/etc/grid-security/hostkey.pem -O pilot.cfg
Executing: /cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31/scripts/dirac-configure -S LHCb-Production -C dips://lbvobox46.cern.ch:9135/Configuration/Server -o /LocalSite/ReleaseProject=LHCb -o /LocalSite/ReleaseVersion=v8r2p45 -o /LocalSite/SharedArea=/cvmfs/lhcb.cern.ch/lib -DMH --UseServerCertificate -o /DIRAC/Security/CertFile=/etc/grid-security/hostcert.pem -o /DIRAC/Security/KeyFile=/etc/grid-security/hostkey.pem -O pilot.cfg
Checking DIRAC installation at "/cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31"
URL banned dips://lhcb-conf2-dirac.cern.ch:9135/Configuration/Server
2016-06-01 11:37:28 UTC INFO [Pilot] Command LHCbCleanPilotEnv instantiated from LHCbPilotCommands
2016-06-01 11:37:28 UTC WARN [LHCbCleanPilotEnv] Can't find shared area, forcing it to /cvmfs/lhcb.cern.ch/lib
2016-06-01 11:37:28 UTC INFO [LHCbCleanPilotEnv] Executing command dirac-configure -S "LHCb-Production" -C "dips://lbvobox46.cern.ch:9135/Configuration/Server" -o /LocalSite/ReleaseProject=LHCb -o /LocalSite/ReleaseVersion=v8r2p45 -o /LocalSite/SharedArea=/cvmfs/lhcb.cern.ch/lib -o /DIRAC/Configuration/Servers=dips://lbvobox46.cern.ch:9135/Configuration/Server -DMH --UseServerCertificate -o /DIRAC/Security/CertFile=/etc/grid-security/hostcert.pem -o /DIRAC/Security/KeyFile=/etc/grid-security/hostkey.pem -O pilot.cfg
Executing: /cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31/scripts/dirac-configure -S LHCb-Production -C dips://lbvobox46.cern.ch:9135/Configuration/Server -o /LocalSite/ReleaseProject=LHCb -o /LocalSite/ReleaseVersion=v8r2p45 -o /LocalSite/SharedArea=/cvmfs/lhcb.cern.ch/lib -o /DIRAC/Configuration/Servers=dips://lbvobox46.cern.ch:9135/Configuration/Server -DMH --UseServerCertificate -o /DIRAC/Security/CertFile=/etc/grid-security/hostcert.pem -o /DIRAC/Security/KeyFile=/etc/grid-security/hostkey.pem -O pilot.cfg
Checking DIRAC installation at "/cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31"
2016-06-01 11:37:30 UTC INFO [Pilot] Command LHCbConfigureSite instantiated from LHCbPilotCommands
2016-06-01 11:37:30 UTC INFO [LHCbConfigureSite] Executing command dirac-configure -o /LocalSite/GridMiddleware=DIRAC -n "BOINC.World.org" -S "LHCb-Production" -N "Boinc-World-CE.org" -o /LocalSite/GridCE=Boinc-World-CE.org -o /LocalSite/CEQueue=Boinc.World.Queue --UseServerCertificate -o /DIRAC/Security/CertFile=/etc/grid-security/hostcert.pem -o /DIRAC/Security/KeyFile=/etc/grid-security/hostkey.pem -FDMH -O pilot.cfg pilot.cfg
Executing: /cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31/scripts/dirac-configure -o /LocalSite/GridMiddleware=DIRAC -n BOINC.World.org -S LHCb-Production -N Boinc-World-CE.org -o /LocalSite/GridCE=Boinc-World-CE.org -o /LocalSite/CEQueue=Boinc.World.Queue --UseServerCertificate -o /DIRAC/Security/CertFile=/etc/grid-security/hostcert.pem -o /DIRAC/Security/KeyFile=/etc/grid-security/hostkey.pem -FDMH -O pilot.cfg pilot.cfg
Checking DIRAC installation at "/cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31"
Will update the output file pilot.cfg
Setting /LocalSite/Site = BOINC.World.org
Setting /LocalSite/GridCE = Boinc-World-CE.org
2016-06-01 11:37:32 UTC INFO [Pilot] Command LHCbConfigureArchitecture instantiated from LHCbPilotCommands
2016-06-01 11:37:32 UTC INFO [LHCbConfigureArchitecture] Executing command dirac-architecture -o /DIRAC/Security/UseServerCertificate=yes pilot.cfg
x86_64-slc6
2016-06-01 11:37:36 UTC INFO [LHCbConfigureArchitecture] Executing command dirac-configure -FDMH --UseServerCertificate -O pilot.cfg pilot.cfg -S "LHCb-Production" -o /LocalSite/Architecture=x86_64-slc6
Executing: /cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31/scripts/dirac-configure -FDMH --UseServerCertificate -O pilot.cfg pilot.cfg -S LHCb-Production -o /LocalSite/Architecture=x86_64-slc6
Checking DIRAC installation at "/cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31"
Will update the output file pilot.cfg
Setting /LocalSite/Site = BOINC.World.org
Setting /LocalSite/GridCE = Boinc-World-CE.org
2016-06-01 11:37:38 UTC INFO [LHCbConfigureArchitecture] Setting variable CMTCONFIG=x86_64-slc6
2016-06-01 11:37:38 UTC INFO [Pilot] Command LHCbConfigureCPURequirements instantiated from LHCbPilotCommands
2016-06-01 11:37:38 UTC INFO [LHCbConfigureCPURequirements] Executing command dirac-wms-cpu-normalization -U -o /DIRAC/Security/UseServerCertificate=yes -R pilot.cfg pilot.cfg
Estimated CPU power is 5.9 HS06
MJF not available on this node
2016-06-01 11:39:10 UTC INFO [LHCbConfigureCPURequirements] Current normalized CPU as determined by 'dirac-wms-cpu-normalization' is 5.900000
2016-06-01 11:39:10 UTC INFO [LHCbConfigureCPURequirements] Executing command dirac-wms-get-queue-cpu-time -o /DIRAC/Security/UseServerCertificate=yes pilot.cfg
16949
2016-06-01 11:39:13 UTC INFO [LHCbConfigureCPURequirements] CPUTime left (in seconds) is 16949
2016-06-01 11:39:13 UTC INFO [LHCbConfigureCPURequirements] Queue length (which is also set as CPUTimeLeft) is 99999.100000
2016-06-01 11:39:13 UTC INFO [LHCbConfigureCPURequirements] Executing command dirac-configure -FDMH -o /DIRAC/Security/UseServerCertificate=yes -O pilot.cfg pilot.cfg -o /LocalSite/CPUTimeLeft=99999
Executing: /cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31/scripts/dirac-configure -FDMH -o /DIRAC/Security/UseServerCertificate=yes -O pilot.cfg pilot.cfg -o /LocalSite/CPUTimeLeft=99999
Checking DIRAC installation at "/cvmfs/lhcb.cern.ch/lib/lhcb/DIRAC/DIRAC_v6r14p31"
Will update the output file pilot.cfg
Setting /LocalSite/Site = BOINC.World.org
Setting /LocalSite/GridCE = Boinc-World-CE.org
2016-06-01 11:39:15 UTC INFO [Pilot] Command LaunchAgent instantiated from pilotCommands
2016-06-01 11:39:15 UTC INFO [LaunchAgent] User Name = boinc
2016-06-01 11:39:15 UTC INFO [LaunchAgent] User Id = 500
2016-06-01 11:39:15 UTC INFO [LaunchAgent] Starting JobAgent
2016-06-01 11:39:15 UTC INFO [LaunchAgent] Executing command dirac-agent WorkloadManagement/JobAgent -o MaxCycles=10 -s /Resources/Computing/CEDefaults -o WorkingDirectory=/home/boinc/pilot -o /LocalSite/MaxCPUTime=99999 -o /LocalSite/CPUTime=99999 -o MaxTotalJobs=10 -o /DIRAC/Security/UseServerCertificate=yes -o /LocalSite/InstancePath=/home/boinc/pilot -o /AgentJobRequirements/ExtraOptions=pilot.cfg pilot.cfg /home/boinc/pilot/pilot.cfg |
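For anyone who wants to see where the wall-clock time in a log like this goes, here is a minimal sketch that parses the timestamped pilot lines and reports the longest pause between them. The line layout (`date time UTC LEVEL [Component] message`) is read off the log above; the names `parse_line`, `longest_gap` and `SAMPLE` are hypothetical helpers for illustration, not part of DIRAC or the pilot.

```python
import re
from datetime import datetime

# Matches lines such as:
#   2016-06-01 11:39:15 UTC INFO [LaunchAgent] Starting JobAgent
LOG_LINE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) UTC (\w+)\s+\[(\w+)\] (.*)$"
)


def parse_line(line):
    """Return (timestamp, level, component, message), or None for untimed output."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    stamp, level, component, message = match.groups()
    return datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S"), level, component, message


def longest_gap(lines):
    """Return (seconds, component) for the largest pause between timestamped lines.

    The component is the one that logged the line *after* the pause.
    """
    entries = [e for e in map(parse_line, lines) if e is not None]
    best = (0.0, None)
    for previous, current in zip(entries, entries[1:]):
        gap = (current[0] - previous[0]).total_seconds()
        if gap > best[0]:
            best = (gap, current[2])
    return best


# A few lines copied from the log above (one untimed line included on purpose).
SAMPLE = [
    "2016-06-01 11:36:45 UTC INFO [Pilot] Requested command extensions: ['LHCbPilot']",
    "2016-06-01 11:37:12 UTC INFO [LHCbInstallDIRAC] lb-run DONE, for release v8r2p45",
    "2016-06-01 11:37:38 UTC INFO [LHCbConfigureArchitecture] Setting variable CMTCONFIG=x86_64-slc6",
    "Estimated CPU power is 5.9 HS06",
    "2016-06-01 11:39:10 UTC INFO [LHCbConfigureCPURequirements] Current normalized CPU as determined by 'dirac-wms-cpu-normalization' is 5.900000",
    "2016-06-01 11:39:15 UTC INFO [LaunchAgent] Starting JobAgent",
]

print(longest_gap(SAMPLE))
```

On these sample lines the longest pause is the 92 seconds between 11:37:38 and 11:39:10, which ends at the `LHCbConfigureCPURequirements` line that reports the CPU normalization result.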
Joined: 3 Mar 16 Posts: 10 Credit: 33,623 RAC: 0 |
Hi Crystal, the log is not incomplete. It shows the pilot performing all the steps needed to configure the environment and start a JobAgent, which then picks up a job and executes it. The task may end quickly if we have no waiting jobs. |
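The flow described here can be sketched as a toy model: the pilot runs its setup commands, launches the agent, and the agent takes jobs from a queue until none are left or its cycle limit is reached. The command names and the limit of 10 cycles come from the log (`Executing commands: [...]`, `-o MaxCycles=10`); everything else, including the `run_pilot` function and the choice to stop at the first empty poll, is a hypothetical simplification and not the actual DIRAC JobAgent logic.

```python
from collections import deque

# Command names copied from the "Executing commands" line of the pilot log.
PILOT_COMMANDS = [
    "LHCbGetPilotVersion", "CheckWorkerNode", "LHCbInstallDIRAC",
    "LHCbConfigureBasics", "LHCbCleanPilotEnv", "LHCbConfigureSite",
    "LHCbConfigureArchitecture", "LHCbConfigureCPURequirements", "LaunchAgent",
]


def run_pilot(waiting_jobs, max_cycles=10):
    """Toy model of one pilot run.

    Returns (setup steps performed, jobs executed).
    Assumption: the agent gives up as soon as a poll finds no waiting job;
    the thread does not say how the real agent treats an empty queue.
    """
    queue = deque(waiting_jobs)

    # The setup steps always run, whether or not any job is waiting.
    steps = list(PILOT_COMMANDS)

    executed = []
    for _ in range(max_cycles):
        if not queue:
            break  # nothing to do, so the task ends early
        executed.append(queue.popleft())  # pick one job and "execute" it
    return steps, executed


# With an empty queue all nine setup steps still run, but no job is executed,
# which is the "task ends quickly" case described in the post above.
steps, executed = run_pilot([])
print(len(steps), executed)
```

In this model a run with no waiting jobs still produces the full configuration log, which matches the observation that the log above is complete even though the task finished quickly.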
Joined: 12 Sep 14 Posts: 1067 Credit: 329,589 RAC: 87 |
Apart from jobs not always being available, I don't see any other complaints, so in the absence of further comments I will assume tacit agreement. |
©2024 CERN