Name | dTzMDmZvoDwnShfckohDCDFpABFKDmABFKDmyiALDmNEFKDmsub0Qn_0 |
Workunit | 1973479 |
Created | 20 Jan 2020, 16:18:10 UTC |
Sent | 21 Jan 2020, 6:33:47 UTC |
Report deadline | 28 Jan 2020, 6:33:47 UTC |
Received | 21 Jan 2020, 7:09:37 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 3848 |
Run time | 34 min 35 sec |
CPU time | 45 min 42 sec |
Validate state | Valid |
Credit | 22.37 |
Device peak FLOPS | 4.66 GFLOPS |
Application version | ATLAS Simulation v1.00 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 1.80 GB |
Peak swap size | 2.56 GB |
Peak disk usage | 715.91 MB |
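The run time and CPU time above can be related to each other with a little arithmetic. The following is a minimal sketch, not part of the BOINC output: `parse_duration` is a hypothetical helper, and the thread count of 2 is taken from the `--nthreads 2` argument shown in the stderr output for this task.

```python
import re

def parse_duration(text):
    """Convert a duration such as '34 min 35 sec' into seconds (hypothetical helper)."""
    units = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * units[unit]
    return total

run_time = parse_duration("34 min 35 sec")  # wall-clock time from the table
cpu_time = parse_duration("45 min 42 sec")  # CPU time from the table
threads = 2                                 # from "--nthreads 2" in the log

cores_used = cpu_time / run_time            # average number of busy cores
per_thread = cores_used / threads           # fraction of each thread kept busy

print(f"run time: {run_time} s, CPU time: {cpu_time} s")
print(f"average cores used: {cores_used:.2f}, per-thread utilisation: {per_thread:.2f}")
```

The CPU time in seconds agrees with the `CPU time 2742.075446` value the wrapper prints when `run_atlas` exits.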
<core_client_version>7.16.1</core_client_version> <![CDATA[ <stderr_txt> 07:34:44 (30631): wrapper (7.7.26015): starting 07:34:44 (30631): wrapper: running run_atlas (--nthreads 2) Di 21. Jan 07:34:44 CET 2020: Arguments: --nthreads 2 Di 21. Jan 07:34:44 CET 2020: Threads: 2 Di 21. Jan 07:34:44 CET 2020: Checking for CVMFS Di 21. Jan 07:34:44 CET 2020: Probing /cvmfs/atlas.cern.ch... OK Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/atlas-condb.cern.ch... OK Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/grid.cern.ch... OK Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/cernvm-prod.cern.ch... OK Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/sft.cern.ch... OK Di 21. Jan 07:34:46 CET 2020: Probing /cvmfs/alice.cern.ch... OK Di 21. Jan 07:34:46 CET 2020: VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE Di 21. Jan 07:34:46 CET 2020: 2.7.0.0 3040 9842 81164 59423 0 61 1811145 4194304 1 65024 0 51090 91.1509 900735 3819 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch DIRECT 1 Di 21. Jan 07:34:46 CET 2020: CVMFS is ok Di 21. Jan 07:34:46 CET 2020: Using singularity image /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img Di 21. Jan 07:34:46 CET 2020: Checking for singularity binary... Di 21. Jan 07:34:46 CET 2020: Using singularity found in PATH at /usr/bin/singularity Di 21. Jan 07:34:46 CET 2020: Running /usr/bin/singularity --version Di 21. Jan 07:34:46 CET 2020: singularity version 3.5.2-1.1.el7 Di 21. Jan 07:34:46 CET 2020: Checking singularity works with /usr/bin/singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img hostname Di 21. Jan 07:34:47 CET 2020: ryzcos7 Di 21. Jan 07:34:47 CET 2020: Singularity works Di 21. Jan 07:34:47 CET 2020: Set ATHENA_PROC_NUMBER=2 Di 21. Jan 07:34:47 CET 2020: Starting ATLAS job with PandaID=4002876565 Di 21. 
Jan 07:34:47 CET 2020: Running command: /usr/bin/singularity exec --pwd /var/lib/boinc/slots/0 -B /cvmfs,/var /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img sh start_atlas.sh Di 21. Jan 08:09:17 CET 2020: *** The last 200 lines of the pilot log: *** Di 21. Jan 08:09:17 CET 2020: }, Di 21. Jan 08:09:17 CET 2020: "wallTime": 124 Di 21. Jan 08:09:17 CET 2020: } Di 21. Jan 08:09:17 CET 2020: }, Di 21. Jan 08:09:17 CET 2020: "machine": { Di 21. Jan 08:09:17 CET 2020: "cpu_family": "23", Di 21. Jan 08:09:17 CET 2020: "linux_distribution": [ Di 21. Jan 08:09:17 CET 2020: "CentOS Linux", Di 21. Jan 08:09:17 CET 2020: "7.6.1810", Di 21. Jan 08:09:17 CET 2020: "Core" Di 21. Jan 08:09:17 CET 2020: ], Di 21. Jan 08:09:17 CET 2020: "model": "8", Di 21. Jan 08:09:17 CET 2020: "model_name": "AMD Ryzen 7 2700 Eight-Core Processor", Di 21. Jan 08:09:17 CET 2020: "node": "ryzcos7", Di 21. Jan 08:09:17 CET 2020: "platform": "Linux-3.10.0-693.el7.x86_64-x86_64-with-centos-7.6.1810-Core" Di 21. Jan 08:09:17 CET 2020: }, Di 21. Jan 08:09:17 CET 2020: "transform": { Di 21. Jan 08:09:17 CET 2020: "cpuEfficiency": 0.719, Di 21. Jan 08:09:17 CET 2020: "cpuPWEfficiency": 0.742, Di 21. Jan 08:09:17 CET 2020: "cpuTime": 14, Di 21. Jan 08:09:17 CET 2020: "cpuTimeTotal": 2776, Di 21. Jan 08:09:17 CET 2020: "externalCpuTime": 25, Di 21. Jan 08:09:17 CET 2020: "processedEvents": 10, Di 21. Jan 08:09:17 CET 2020: "trfPredata": null, Di 21. Jan 08:09:17 CET 2020: "wallTime": 1872 Di 21. Jan 08:09:17 CET 2020: } Di 21. Jan 08:09:17 CET 2020: } Di 21. Jan 08:09:17 CET 2020: } Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG | queue_monitor | pilot.util.auxiliary.4002876565 | update_server | xml:will send fileinfo Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | state=finished Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=running Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | INFO | queue_monitor | pilot.control.job.4002876565 | send_state | pilot will not update the server (heartbeat message will be written to file) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | INFO | queue_monitor | pilot.control.job.4002876565 | send_state | job 4002876565 has finished - writing final server update Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,665 | DEBUG | queue_monitor | pilot.control.job.4002876565 | get_data_structure | building data structure to be sent to server with heartbeat Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,665 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | get_job_metrics | will not add max space = -355519053 B to job metrics Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | DEBUG | queue_monitor | pilot.api.analytics | get_fitted_data | removing tails from data to be fitted Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | INFO | queue_monitor | pilot.api.analytics | get_fitted_data | fitting pss+swap vs Time Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | INFO | queue_monitor | pilot.api.analytics | get_fitted_data | current memory leak: 64.71 B/s (using 25 data points, chi2=2713121) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | DEBUG | queue_monitor | pilot.util.auxiliary.4002876565 | get_job_metrics | job metrics="coreCount=2 actualCoreCount=1 nEvents=10 leak=64.71 chi2=2713121" Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,667 | INFO | queue_monitor | pilot.control.job.4002876565 | get_data_structure | total number of processed events: 10 (read) Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,667 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_values | using path: /var/lib/boinc/slots/0/PanDA_Pilot-4002876565/memory_monitor_summary.json (trf name=prmon) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | DEBUG | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | summary_dictionary={'Max': {'rx_packets': 32767, 'nprocs': 10, 'nthreads': 1, 'rx_bytes': 18194544, 'wtime': 1940, 'rss': 5702712, 'write_bytes': 0, 'vmem': 8753116, 'read_bytes': 0, 'stime': 93, 'tx_bytes': 7597689, 'pss': 2247966, 'wchar': 0, 'rchar': 0, 'tx_packets': 21758, 'swap': 0, 'utime': 2669}, 'Avg': {'write_bytes': 0, 'nprocs': 6, 'nthreads': 0, 'rx_bytes': 9376, 'rx_packets': 16, 'vmem': 6055184, 'read_bytes': 0, 'swap': 0, 'tx_bytes': 3915, 'pss': 1640699, 'wchar': 0, 'rchar': 0, 'tx_packets': 11, 'rss': 3755413}} Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard info from prmon json Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard memory fields from prmon json Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | .............................. Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . Timing measurements: Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . get job = 0 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . initial setup = 2 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . 
payload setup = 0 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . total setup = 2 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . stage-in = 0 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . payload execution = 2015 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . stage-out = 2 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | .............................. Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,672 | DEBUG | queue_monitor | pilot.control.job.4002876565 | send_state | wrote heartbeat to file /var/lib/boinc/slots/0/heartbeat.json Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,672 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | job 4002876565 was dequeued from the monitored payloads queue Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,672 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | tmp job object deleted Di 21. Jan 08:09:17 CET 2020: File "/var/lib/boinc/slots/0/pilot2/pilot/common/exception.py", line 431, in run Di 21. Jan 08:09:17 CET 2020: self._Thread__target(**self._Thread__kwargs) Di 21. Jan 08:09:17 CET 2020: File "/var/lib/boinc/slots/0/pilot2/pilot/control/job.py", line 1920, in job_monitor Di 21. Jan 08:09:17 CET 2020: update_time = send_heartbeat_if_time(jobs[i], args, update_time) Di 21. Jan 08:09:17 CET 2020: exception caught by thread run() function: (<type 'exceptions.IndexError'>, IndexError('deque index out of range',), <traceback object at 0x7f71a71144d0>) Di 21. Jan 08:09:17 CET 2020: Traceback (most recent call last): Di 21. 
Jan 08:09:17 CET 2020: File "/var/lib/boinc/slots/0/pilot2/pilot/common/exception.py", line 431, in run Di 21. Jan 08:09:17 CET 2020: self._Thread__target(**self._Thread__kwargs) Di 21. Jan 08:09:17 CET 2020: File "/var/lib/boinc/slots/0/pilot2/pilot/control/job.py", line 1920, in job_monitor Di 21. Jan 08:09:17 CET 2020: update_time = send_heartbeat_if_time(jobs[i], args, update_time) Di 21. Jan 08:09:17 CET 2020: IndexError: deque index out of range Di 21. Jan 08:09:17 CET 2020: Di 21. Jan 08:09:17 CET 2020: None Di 21. Jan 08:09:17 CET 2020: exception has been put in bucket queue belonging to thread 'job_monitor' Di 21. Jan 08:09:17 CET 2020: setting graceful stop in 10 s since there is no point in continuing Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | job summary report Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | -------------------------------------------------- Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | PanDA job id: 4002876565 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | task id: 000649-198069-24222 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | errors: (none) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | status: LOG_TRANSFER = DONE Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | pilot state: finished Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | transexitcode: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exeerrorcode: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exeerrordiag: Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exitcode: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exitmsg: OK Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | cpuconsumptiontime: 2784 s Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | nevents: 10 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | neventsw: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | pid: 5480 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | pgrp: 5480 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | corecount: 2 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | event service: False Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | -------------------------------------------------- Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue jobs has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue payloads has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_in has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_out has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue current_data_in has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_jobs has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_payloads has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue monitored_payloads has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_jobs has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_payloads has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_in has 1 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_out has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_jobs has 0 job(s) Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_payloads has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_in has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_out has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobs has 0 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobids has 1 job(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,105 | INFO | retrieve | pilot.control.job.4002876565 | has_job_completed | job 4002876565 has completed (purged errors) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,105 | INFO | retrieve | pilot.util.processes | cleanup | overall cleanup function is called Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,109 | DEBUG | retrieve | pilot.util.processes | cleanup | work directory was removed: /var/lib/boinc/slots/0/PanDA_Pilot-4002876565 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,907 | WARNING | job | pilot.control.job | control | thread 'job_monitor' received an exception from bucket: deque index out of range Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:08,115 | INFO | retrieve | pilot.info.jobdata | collect_zombies | --- collectZombieJob: --- 10, [5480] Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:08,115 | INFO | retrieve | pilot.info.jobdata | collect_zombies | zombie collector trying to kill pid 5480 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:08,115 | INFO | retrieve | pilot.info.jobdata | collect_zombies | harmless exception when collecting zombies: [Errno 10] No child processes Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,120 | INFO | retrieve | pilot.util.processes | cleanup | collected zombie processes Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,121 | INFO | retrieve | pilot.util.processes | cleanup | will now attempt to kill all subprocesses of pid=5480 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,165 | INFO | retrieve | pilot.util.processes | kill_processes | process IDs to be killed: [5480] (in reverse order) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,201 | WARNING | retrieve | pilot.util.processes | kill_processes | found no corresponding commands to process id(s) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,202 | INFO | retrieve | pilot.util.processes | kill_orphans | Do not look for orphan processes in BOINC jobs Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,202 | INFO | retrieve | pilot.control.job | retrieve | ready for new job Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,202 | INFO | retrieve | root | retrieve | pilot has finished for previous job - re-establishing logging Di 21. Jan 08:09:17 CET 2020: mpi4py not found Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | *** PanDA Pilot version 2.3.4 (12) *** Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | pilot is running in a VM Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,207 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | architecture information: Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,264 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | Di 21. Jan 08:09:17 CET 2020: LSB Version: :core-4.1-amd64:core-4.1-noarch Di 21. Jan 08:09:17 CET 2020: Distributor ID: CentOS Di 21. Jan 08:09:17 CET 2020: Description: CentOS Linux release 7.6.1810 (Core) Di 21. Jan 08:09:17 CET 2020: Release: 7.6.1810 Di 21. Jan 08:09:17 CET 2020: Codename: Core Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,264 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,770 | DEBUG | retrieve | pilot.util.monitoring | check_local_space | checking local space on /var/lib/boinc/slots/0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,784 | INFO | retrieve | pilot.util.monitoring | check_local_space | sufficient remaining disk space (8373927936 B) Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,784 | WARNING | retrieve | pilot.control.job | proceed_with_getjob | since timefloor is set to 0, pilot was only allowed to run one job Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,784 | DEBUG | retrieve | pilot.control.job | retrieve | [job] retrieve thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,798 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 16 threads Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,798 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(job, started 140126076606208)>, <ExcThread(validate_post, started 140125438727936)>, <ExcThread(payload, started 140125958813440)>, <ExcThread(copytool_in, started 140125950420736)>, <ExcThread(queue_monitoring, started 140125975598848)>, <ExcThread(copytool_out, started 140125983991552)>, <ExcThread(failed_post, started 140125421942528)>, <ExcThread(data, started 140125992384256)>, <ExcThread(job_monitor, started 140125430335232)>, <ExcThread(execute_payloads, started 140125413549824)>, <ExcThread(validate, started 140126068213504)>, <ExcThread(monitor, started 140125455513344)>, <ExcThread(queue_monitor, started 140125405157120)>, <ExcThread(validate_pre, started 140125942028032)>, <ExcThread(create_data_payload, started 140125967206144)>] Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,136 | INFO | execute_payloads | pilot.control.payload | execute_payloads | [payload] execute_payloads thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,272 | INFO | monitor | pilot.control.monitor | control | [monitor] control thread has ended Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,294 | DEBUG | data | pilot.control.data | control | data control ending since graceful_stop has been set Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,294 | DEBUG | data | pilot.control.data | control | [data] control thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,387 | DEBUG | job | pilot.control.job | control | job control ending since graceful_stop has been set Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,387 | DEBUG | job | pilot.control.job | control | [job] control thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,698 | INFO | validate_pre | pilot.control.payload | validate_pre | [payload] validate_pre thread has finished Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,830 | DEBUG | validate | pilot.control.job | validate | [job] validate thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,840 | WARNING | copytool_out | pilot.util.common | should_abort | data:copytool_out:received graceful stop - abort after this iteration Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,858 | DEBUG | payload | pilot.control.payload | control | payload control ending since graceful_stop has been set Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,858 | DEBUG | payload | pilot.control.payload | control | [payload] control thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,887 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 9 threads Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,887 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(validate_post, started 140125438727936)>, <ExcThread(copytool_in, started 140125950420736)>, <ExcThread(queue_monitoring, started 140125975598848)>, <ExcThread(copytool_out, started 140125983991552)>, <ExcThread(failed_post, started 140125421942528)>, <ExcThread(job_monitor, started 140125430335232)>, <ExcThread(queue_monitor, started 140125405157120)>, <ExcThread(create_data_payload, started 140125967206144)>] Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,997 | INFO | failed_post | pilot.control.payload | failed_post | [payload] failed_post thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,076 | INFO | validate_post | pilot.control.payload | validate_post | [payload] validate_post thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,276 | DEBUG | copytool_in | pilot.control.data | copytool_in | [data] copytool_in thread has finished Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,282 | DEBUG | create_data_payload | pilot.control.job | create_data_payload | [job] create_data_payload thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,717 | WARNING | queue_monitor | pilot.util.common | should_abort | job:queue_monitor:received graceful stop - abort after this iteration Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,717 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | [job] queue monitor thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,846 | DEBUG | copytool_out | pilot.control.data | copytool_out | [data] copytool_out thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,913 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 3 threads Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,913 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(queue_monitoring, started 140125975598848)>, <ExcThread(job_monitor, started 140125430335232)>] Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:12,407 | WARNING | queue_monitoring | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:15,419 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:15,961 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 2 threads Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:15,961 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(job_monitor, started 140125430335232)>] Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17,009 | INFO | MainThread | pilot.workflow.generic | run | end of generic workflow (traces error code: 0) Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:17,009 | INFO | MainThread | root | wrap_up | traces error code: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17,009 | INFO | MainThread | root | wrap_up | pilot has finished Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== pilot stdout END ==== Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== wrapper stdout RESUME ==== Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] Pilot exit status: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] STATUSCODE: 0 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] apfmon messages muted Di 21. Jan 08:09:17 CET 2020: ---- find pandaID.out ---- Di 21. Jan 08:09:17 CET 2020: total 60 Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 11357 Jul 25 16:38 LICENSE Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 20 Sep 9 13:04 MANIFEST.IN Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 8 Dec 12 19:00 PILOTVERSION Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 2212 Nov 14 11:01 README.md Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 221 Jul 25 16:38 TODO.md Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 11 Jan 21 07:35 pandaIDs.out Di 21. Jan 08:09:17 CET 2020: drwx------. 14 boinc boinc 216 Jan 21 07:35 pilot Di 21. Jan 08:09:17 CET 2020: -rwx------. 1 boinc boinc 21225 Dec 12 19:00 pilot.py Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 766 Oct 10 16:01 setup.py Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 11 Jan 21 07:35 /var/lib/boinc/slots/0/pilot2/pandaIDs.out Di 21. Jan 08:09:17 CET 2020: 4002876565 Di 21. Jan 08:09:17 CET 2020: Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] Test setup, not cleaning Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== wrapper stdout END ==== Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== wrapper stderr END ==== Di 21. 
Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] wrapper wrapperexiting ec=0, duration=2069 Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] apfmon messages muted Di 21. Jan 08:09:17 CET 2020: *** Error codes and diagnostics *** Di 21. Jan 08:09:17 CET 2020: "exeErrorCode": 0, Di 21. Jan 08:09:17 CET 2020: "exeErrorDiag": "", Di 21. Jan 08:09:17 CET 2020: "pilotErrorCode": 0, Di 21. Jan 08:09:17 CET 2020: "pilotErrorDiag": "", Di 21. Jan 08:09:17 CET 2020: *** Listing of results directory *** Di 21. Jan 08:09:17 CET 2020: insgesamt 379016 Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 267260 20. Jan 16:32 pilot2.tar.gz Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 4492 20. Jan 17:15 queuedata.json Di 21. Jan 08:09:17 CET 2020: -rwx------. 1 boinc boinc 12641 20. Jan 17:17 runpilot2-wrapper.sh Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 107 21. Jan 07:34 wrapper_26015_x86_64-pc-linux-gnu Di 21. Jan 08:09:17 CET 2020: -rwxr-xr-x. 1 boinc boinc 5557 21. Jan 07:34 run_atlas Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 112 21. Jan 07:34 job.xml Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 5991 21. Jan 07:34 init_data.xml Di 21. Jan 08:09:17 CET 2020: drwxrwx--x. 2 boinc boinc 86 21. Jan 07:34 shared Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 0 21. Jan 07:34 boinc_lockfile Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 815 21. Jan 07:34 RTE.tar.gz Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 275414 21. Jan 07:34 input.tar.gz Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 365251149 21. Jan 07:34 EVNT.14296418._001447.pool.root.1 Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 8509 21. Jan 07:34 start_atlas.sh Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 2948 21. Jan 07:34 pandaJob.out Di 21. Jan 08:09:17 CET 2020: drwxr-xr-x. 3 boinc boinc 17 21. Jan 07:34 APPS Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 3699286 21. 
Jan 07:35 agis_schedconf.cvmfs.json Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 7792440 21. Jan 07:35 agis_ddmendpoints.json Di 21. Jan 08:09:17 CET 2020: drwx------. 3 boinc boinc 229 21. Jan 07:35 pilot2 Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 535 21. Jan 08:02 boinc_task_state.xml Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 9176612 21. Jan 08:08 HITS.000649-198069-24222._078090.pool.root.1 Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 26 21. Jan 08:08 wrapper_checkpoint.txt Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 8192 21. Jan 08:08 boinc_mmap_file Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 784 21. Jan 08:08 memory_monitor_summary.json Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 512991 21. Jan 08:09 log.000649-198069-24222._078090.job.log.tgz.1 Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 11640 21. Jan 08:09 heartbeat.json Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 8556 21. Jan 08:09 pilotlog.txt Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 217247 21. Jan 08:09 log.000649-198069-24222._078090.job.log.1 Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 490 21. Jan 08:09 dTzMDmZvoDwnShfckohDCDFpABFKDmABFKDmyiALDmNEFKDmsub0Qn.diag Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 7084 21. Jan 08:09 runtime_log.err Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 496 21. Jan 08:09 output.list Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 739 21. Jan 08:09 runtime_log Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 757760 21. Jan 08:09 result.tar.gz Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 2252 21. Jan 08:09 stderr.txt Di 21. Jan 08:09:17 CET 2020: HITS file was successfully produced: Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 9176612 21. Jan 08:08 shared/HITS.pool.root.1 Di 21. Jan 08:09:17 CET 2020: *** Contents of shared directory: *** Di 21. 
Jan 08:09:17 CET 2020: insgesamt 366684 Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 365251149 21. Jan 07:34 ATLAS.root_0 Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 8509 21. Jan 07:34 start_atlas.sh Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 815 21. Jan 07:34 RTE.tar.gz Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 275414 21. Jan 07:34 input.tar.gz Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 9176612 21. Jan 08:08 HITS.pool.root.1 Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 757760 21. Jan 08:09 result.tar.gz 08:09:18 (30631): run_atlas exited; CPU time 2742.075446 08:09:18 (30631): called boinc_finish(0) </stderr_txt> ]]>
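The traceback in the log above shows `jobs[i]` raising `IndexError: deque index out of range` in `job_monitor`, immediately after `queue_monitor` reported that the job was dequeued from the monitored payloads queue. The sketch below illustrates one plausible mechanism. It assumes an index-based loop over a deque that shrinks while the loop is running. It is not the pilot's actual code, and every function name in it is hypothetical. The concurrent removal is simulated in a single thread by a callback that pops from the deque.

```python
from collections import deque

def monitor_by_index(jobs, on_job):
    """Index-based loop: the length is read once, so later indices can go stale."""
    for i in range(len(jobs)):
        on_job(jobs[i])

def monitor_by_snapshot(jobs, on_job):
    """Iterate over a copy so removals from the deque cannot invalidate the loop."""
    for job in list(jobs):
        on_job(job)

def make_callback(jobs, seen):
    """Record each job, then remove one entry to mimic another thread dequeuing."""
    def on_job(job):
        seen.append(job)
        if jobs:
            jobs.pop()
    return on_job

# Index-based loop fails once the deque has shrunk under it.
jobs = deque(["job-a", "job-b"])
seen_by_index = []
try:
    monitor_by_index(jobs, make_callback(jobs, seen_by_index))
    index_error = None
except IndexError as exc:
    index_error = str(exc)

# Snapshot loop visits every job that was present when the loop started.
jobs = deque(["job-a", "job-b"])
seen_by_snapshot = []
monitor_by_snapshot(jobs, make_callback(jobs, seen_by_snapshot))

print(index_error)
print(seen_by_snapshot)
```

In genuinely multi-threaded code, guarding the deque with a lock would likely be more robust than relying on a snapshot alone. In this task the exception was harmless: the pilot still exited with status 0 and the HITS file was produced.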
©2025 CERN