Name dTzMDmZvoDwnShfckohDCDFpABFKDmABFKDmyiALDmNEFKDmsub0Qn_0
Workunit 1973479
Created 20 Jan 2020, 16:18:10 UTC
Sent 21 Jan 2020, 6:33:47 UTC
Report deadline 28 Jan 2020, 6:33:47 UTC
Received 21 Jan 2020, 7:09:37 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 3848
Run time 34 min 35 sec
CPU time 45 min 42 sec
Validate state Valid
Credit 22.37
Device peak FLOPS 4.66 GFLOPS
Application version ATLAS Simulation v1.00 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 1.80 GB
Peak swap size 2.56 GB
Peak disk usage 715.91 MB

Stderr output

<core_client_version>7.16.1</core_client_version>
<![CDATA[
<stderr_txt>
07:34:44 (30631): wrapper (7.7.26015): starting
07:34:44 (30631): wrapper: running run_atlas (--nthreads 2)
Di 21. Jan 07:34:44 CET 2020: Arguments: --nthreads 2
Di 21. Jan 07:34:44 CET 2020: Threads: 2
Di 21. Jan 07:34:44 CET 2020: Checking for CVMFS
Di 21. Jan 07:34:44 CET 2020: Probing /cvmfs/atlas.cern.ch... OK
Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/atlas-condb.cern.ch... OK
Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/grid.cern.ch... OK
Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/cernvm-prod.cern.ch... OK
Di 21. Jan 07:34:45 CET 2020: Probing /cvmfs/sft.cern.ch... OK
Di 21. Jan 07:34:46 CET 2020: Probing /cvmfs/alice.cern.ch... OK
Di 21. Jan 07:34:46 CET 2020: VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
Di 21. Jan 07:34:46 CET 2020: 2.7.0.0 3040 9842 81164 59423 0 61 1811145 4194304 1 65024 0 51090 91.1509 900735 3819 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch DIRECT 1
Di 21. Jan 07:34:46 CET 2020: CVMFS is ok
Di 21. Jan 07:34:46 CET 2020: Using singularity image /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img
Di 21. Jan 07:34:46 CET 2020: Checking for singularity binary...
Di 21. Jan 07:34:46 CET 2020: Using singularity found in PATH at /usr/bin/singularity
Di 21. Jan 07:34:46 CET 2020: Running /usr/bin/singularity --version
Di 21. Jan 07:34:46 CET 2020: singularity version 3.5.2-1.1.el7
Di 21. Jan 07:34:46 CET 2020: Checking singularity works with /usr/bin/singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img hostname
Di 21. Jan 07:34:47 CET 2020: ryzcos7
Di 21. Jan 07:34:47 CET 2020: Singularity works
Di 21. Jan 07:34:47 CET 2020: Set ATHENA_PROC_NUMBER=2
Di 21. Jan 07:34:47 CET 2020: Starting ATLAS job with PandaID=4002876565
Di 21. Jan 07:34:47 CET 2020: Running command: /usr/bin/singularity exec --pwd /var/lib/boinc/slots/0 -B /cvmfs,/var /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img sh start_atlas.sh
Di 21. Jan 08:09:17 CET 2020:  *** The last 200 lines of the pilot log: ***
Di 21. Jan 08:09:17 CET 2020:         }, 
Di 21. Jan 08:09:17 CET 2020:         "wallTime": 124
Di 21. Jan 08:09:17 CET 2020:       }
Di 21. Jan 08:09:17 CET 2020:     }, 
Di 21. Jan 08:09:17 CET 2020:     "machine": {
Di 21. Jan 08:09:17 CET 2020:       "cpu_family": "23", 
Di 21. Jan 08:09:17 CET 2020:       "linux_distribution": [
Di 21. Jan 08:09:17 CET 2020:         "CentOS Linux", 
Di 21. Jan 08:09:17 CET 2020:         "7.6.1810", 
Di 21. Jan 08:09:17 CET 2020:         "Core"
Di 21. Jan 08:09:17 CET 2020:       ], 
Di 21. Jan 08:09:17 CET 2020:       "model": "8", 
Di 21. Jan 08:09:17 CET 2020:       "model_name": "AMD Ryzen 7 2700 Eight-Core Processor", 
Di 21. Jan 08:09:17 CET 2020:       "node": "ryzcos7", 
Di 21. Jan 08:09:17 CET 2020:       "platform": "Linux-3.10.0-693.el7.x86_64-x86_64-with-centos-7.6.1810-Core"
Di 21. Jan 08:09:17 CET 2020:     }, 
Di 21. Jan 08:09:17 CET 2020:     "transform": {
Di 21. Jan 08:09:17 CET 2020:       "cpuEfficiency": 0.719, 
Di 21. Jan 08:09:17 CET 2020:       "cpuPWEfficiency": 0.742, 
Di 21. Jan 08:09:17 CET 2020:       "cpuTime": 14, 
Di 21. Jan 08:09:17 CET 2020:       "cpuTimeTotal": 2776, 
Di 21. Jan 08:09:17 CET 2020:       "externalCpuTime": 25, 
Di 21. Jan 08:09:17 CET 2020:       "processedEvents": 10, 
Di 21. Jan 08:09:17 CET 2020:       "trfPredata": null, 
Di 21. Jan 08:09:17 CET 2020:       "wallTime": 1872
Di 21. Jan 08:09:17 CET 2020:     }
Di 21. Jan 08:09:17 CET 2020:   }
Di 21. Jan 08:09:17 CET 2020: }
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | update_server             | xml:will send fileinfo
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG    | queue_monitor       | pilot.control.job                | get_proper_state          | state=finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG    | queue_monitor       | pilot.control.job                | get_proper_state          | serverstate=running
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | DEBUG    | queue_monitor       | pilot.control.job                | get_proper_state          | serverstate=finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | INFO     | queue_monitor       | pilot.control.job.4002876565     | send_state                | pilot will not update the server (heartbeat message will be written to file)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,664 | INFO     | queue_monitor       | pilot.control.job.4002876565     | send_state                | job 4002876565 has finished - writing final server update
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,665 | DEBUG    | queue_monitor       | pilot.control.job.4002876565     | get_data_structure        | building data structure to be sent to server with heartbeat
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,665 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | get_job_metrics           | will not add max space = -355519053 B to job metrics
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | DEBUG    | queue_monitor       | pilot.api.analytics              | get_fitted_data           | removing tails from data to be fitted
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | INFO     | queue_monitor       | pilot.api.analytics              | get_fitted_data           | fitting pss+swap vs Time
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | INFO     | queue_monitor       | pilot.api.analytics              | get_fitted_data           | current memory leak: 64.71 B/s (using 25 data points, chi2=2713121)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,666 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | get_job_metrics           | job metrics="coreCount=2 actualCoreCount=1 nEvents=10 leak=64.71 chi2=2713121"
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,667 | INFO     | queue_monitor       | pilot.control.job.4002876565     | get_data_structure        | total number of processed events: 10 (read)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,667 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_values         | using path: /var/lib/boinc/slots/0/PanDA_Pilot-4002876565/memory_monitor_summary.json (trf name=prmon)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | DEBUG    | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | summary_dictionary={'Max': {'rx_packets': 32767, 'nprocs': 10, 'nthreads': 1, 'rx_bytes': 18194544, 'wtime': 1940, 'rss': 5702712, 'write_bytes': 0, 'vmem': 8753116, 'read_bytes': 0, 'stime': 93, 'tx_bytes': 7597689, 'pss': 2247966, 'wchar': 0, 'rchar': 0, 'tx_packets': 21758, 'swap': 0, 'utime': 2669}, 'Avg': {'write_bytes': 0, 'nprocs': 6, 'nthreads': 0, 'rx_bytes': 9376, 'rx_packets': 16, 'vmem': 6055184, 'read_bytes': 0, 'swap': 0, 'tx_bytes': 3915, 'pss': 1640699, 'wchar': 0, 'rchar': 0, 'tx_packets': 11, 'rss': 3755413}}
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | extracted standard info from prmon json
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | extracted standard memory fields from prmon json
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | ..............................
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,669 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . Timing measurements:
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . get job = 0 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . initial setup = 2 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . payload setup = 0 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . total setup = 2 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . stage-in = 0 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . payload execution = 2015 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . stage-out = 2 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,670 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | ..............................
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,672 | DEBUG    | queue_monitor       | pilot.control.job.4002876565     | send_state                | wrote heartbeat to file /var/lib/boinc/slots/0/heartbeat.json
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,672 | DEBUG    | queue_monitor       | pilot.control.job                | queue_monitor             | job 4002876565 was dequeued from the monitored payloads queue
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:06,672 | DEBUG    | queue_monitor       | pilot.control.job                | queue_monitor             | tmp job object deleted
Di 21. Jan 08:09:17 CET 2020:   File "/var/lib/boinc/slots/0/pilot2/pilot/common/exception.py", line 431, in run
Di 21. Jan 08:09:17 CET 2020:     self._Thread__target(**self._Thread__kwargs)
Di 21. Jan 08:09:17 CET 2020:   File "/var/lib/boinc/slots/0/pilot2/pilot/control/job.py", line 1920, in job_monitor
Di 21. Jan 08:09:17 CET 2020:     update_time = send_heartbeat_if_time(jobs[i], args, update_time)
Di 21. Jan 08:09:17 CET 2020: exception caught by thread run() function: (<type 'exceptions.IndexError'>, IndexError('deque index out of range',), <traceback object at 0x7f71a71144d0>)
Di 21. Jan 08:09:17 CET 2020: Traceback (most recent call last):
Di 21. Jan 08:09:17 CET 2020:   File "/var/lib/boinc/slots/0/pilot2/pilot/common/exception.py", line 431, in run
Di 21. Jan 08:09:17 CET 2020:     self._Thread__target(**self._Thread__kwargs)
Di 21. Jan 08:09:17 CET 2020:   File "/var/lib/boinc/slots/0/pilot2/pilot/control/job.py", line 1920, in job_monitor
Di 21. Jan 08:09:17 CET 2020:     update_time = send_heartbeat_if_time(jobs[i], args, update_time)
Di 21. Jan 08:09:17 CET 2020: IndexError: deque index out of range
Di 21. Jan 08:09:17 CET 2020: 
Di 21. Jan 08:09:17 CET 2020: None
Di 21. Jan 08:09:17 CET 2020: exception has been put in bucket queue belonging to thread 'job_monitor'
Di 21. Jan 08:09:17 CET 2020: setting graceful stop in 10 s since there is no point in continuing
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | job summary report
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | --------------------------------------------------
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | PanDA job id: 4002876565
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | task id: 000649-198069-24222
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | errors: (none)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,101 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | status: LOG_TRANSFER = DONE 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | pilot state: finished 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | transexitcode: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exeerrorcode: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exeerrordiag: 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exitcode: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exitmsg: OK
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | cpuconsumptiontime: 2784 s
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,102 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | nevents: 10
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | neventsw: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | pid: 5480
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | pgrp: 5480
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | corecount: 2
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | event service: False
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | --------------------------------------------------
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue jobs has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue payloads has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,103 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue data_in has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue data_out has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue current_data_in has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue validated_jobs has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue validated_payloads has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue monitored_payloads has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue finished_jobs has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue finished_payloads has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue finished_data_in has 1 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue finished_data_out has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue failed_jobs has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue failed_payloads has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue failed_data_in has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue failed_data_out has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue completed_jobs has 0 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,104 | INFO     | retrieve            | pilot.util.queuehandling         | queue_report              | queue completed_jobids has 1 job(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,105 | INFO     | retrieve            | pilot.control.job.4002876565     | has_job_completed         | job 4002876565 has completed (purged errors)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,105 | INFO     | retrieve            | pilot.util.processes             | cleanup                   | overall cleanup function is called
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,109 | DEBUG    | retrieve            | pilot.util.processes             | cleanup                   | work directory was removed: /var/lib/boinc/slots/0/PanDA_Pilot-4002876565
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:07,907 | WARNING  | job                 | pilot.control.job                | control                   | thread 'job_monitor' received an exception from bucket: deque index out of range
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:08,115 | INFO     | retrieve            | pilot.info.jobdata               | collect_zombies           | --- collectZombieJob: --- 10, [5480]
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:08,115 | INFO     | retrieve            | pilot.info.jobdata               | collect_zombies           | zombie collector trying to kill pid 5480
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:08,115 | INFO     | retrieve            | pilot.info.jobdata               | collect_zombies           | harmless exception when collecting zombies: [Errno 10] No child processes
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,120 | INFO     | retrieve            | pilot.util.processes             | cleanup                   | collected zombie processes
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,121 | INFO     | retrieve            | pilot.util.processes             | cleanup                   | will now attempt to kill all subprocesses of pid=5480
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,165 | INFO     | retrieve            | pilot.util.processes             | kill_processes            | process IDs to be killed: [5480] (in reverse order)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,201 | WARNING  | retrieve            | pilot.util.processes             | kill_processes            | found no corresponding commands to process id(s)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,202 | INFO     | retrieve            | pilot.util.processes             | kill_orphans              | Do not look for orphan processes in BOINC jobs
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,202 | INFO     | retrieve            | pilot.control.job                | retrieve                  | ready for new job
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,202 | INFO     | retrieve            | root                             | retrieve                  | pilot has finished for previous job - re-establishing logging
Di 21. Jan 08:09:17 CET 2020: mpi4py not found
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | ****************************************
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | ***  PanDA Pilot version 2.3.4 (12)  ***
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | ****************************************
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,206 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | pilot is running in a VM
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,207 | INFO     | retrieve            | pilot.util.auxiliary             | display_architecture_info | architecture information:
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,264 | INFO     | retrieve            | pilot.util.auxiliary             | display_architecture_info | 
Di 21. Jan 08:09:17 CET 2020: LSB Version:	:core-4.1-amd64:core-4.1-noarch
Di 21. Jan 08:09:17 CET 2020: Distributor ID:	CentOS
Di 21. Jan 08:09:17 CET 2020: Description:	CentOS Linux release 7.6.1810 (Core) 
Di 21. Jan 08:09:17 CET 2020: Release:	7.6.1810
Di 21. Jan 08:09:17 CET 2020: Codename:	Core
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,264 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | ****************************************
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,770 | DEBUG    | retrieve            | pilot.util.monitoring            | check_local_space         | checking local space on /var/lib/boinc/slots/0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,784 | INFO     | retrieve            | pilot.util.monitoring            | check_local_space         | sufficient remaining disk space (8373927936 B)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,784 | WARNING  | retrieve            | pilot.control.job                | proceed_with_getjob       | since timefloor is set to 0, pilot was only allowed to run one job
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,784 | DEBUG    | retrieve            | pilot.control.job                | retrieve                  | [job] retrieve thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,798 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 16 threads
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:09,798 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(job, started 140126076606208)>, <ExcThread(validate_post, started 140125438727936)>, <ExcThread(payload, started 140125958813440)>, <ExcThread(copytool_in, started 140125950420736)>, <ExcThread(queue_monitoring, started 140125975598848)>, <ExcThread(copytool_out, started 140125983991552)>, <ExcThread(failed_post, started 140125421942528)>, <ExcThread(data, started 140125992384256)>, <ExcThread(job_monitor, started 140125430335232)>, <ExcThread(execute_payloads, started 140125413549824)>, <ExcThread(validate, started 140126068213504)>, <ExcThread(monitor, started 140125455513344)>, <ExcThread(queue_monitor, started 140125405157120)>, <ExcThread(validate_pre, started 140125942028032)>, <ExcThread(create_data_payload, started 140125967206144)>]
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,136 | INFO     | execute_payloads    | pilot.control.payload            | execute_payloads          | [payload] execute_payloads thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,272 | INFO     | monitor             | pilot.control.monitor            | control                   | [monitor] control thread has ended
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,294 | DEBUG    | data                | pilot.control.data               | control                   | data control ending since graceful_stop has been set
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,294 | DEBUG    | data                | pilot.control.data               | control                   | [data] control thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,387 | DEBUG    | job                 | pilot.control.job                | control                   | job control ending since graceful_stop has been set
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,387 | DEBUG    | job                 | pilot.control.job                | control                   | [job] control thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,698 | INFO     | validate_pre        | pilot.control.payload            | validate_pre              | [payload] validate_pre thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,830 | DEBUG    | validate            | pilot.control.job                | validate                  | [job] validate thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,840 | WARNING  | copytool_out        | pilot.util.common                | should_abort              | data:copytool_out:received graceful stop - abort after this iteration
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,858 | DEBUG    | payload             | pilot.control.payload            | control                   | payload control ending since graceful_stop has been set
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,858 | DEBUG    | payload             | pilot.control.payload            | control                   | [payload] control thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,887 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 9 threads
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,887 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(validate_post, started 140125438727936)>, <ExcThread(copytool_in, started 140125950420736)>, <ExcThread(queue_monitoring, started 140125975598848)>, <ExcThread(copytool_out, started 140125983991552)>, <ExcThread(failed_post, started 140125421942528)>, <ExcThread(job_monitor, started 140125430335232)>, <ExcThread(queue_monitor, started 140125405157120)>, <ExcThread(create_data_payload, started 140125967206144)>]
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:10,997 | INFO     | failed_post         | pilot.control.payload            | failed_post               | [payload] failed_post thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,076 | INFO     | validate_post       | pilot.control.payload            | validate_post             | [payload] validate_post thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,276 | DEBUG    | copytool_in         | pilot.control.data               | copytool_in               | [data] copytool_in thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,282 | DEBUG    | create_data_payload | pilot.control.job                | create_data_payload       | [job] create_data_payload thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,717 | WARNING  | queue_monitor       | pilot.util.common                | should_abort              | job:queue_monitor:received graceful stop - abort after this iteration
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,717 | DEBUG    | queue_monitor       | pilot.control.job                | queue_monitor             | [job] queue monitor thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,846 | DEBUG    | copytool_out        | pilot.control.data               | copytool_out              | [data] copytool_out thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,913 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 3 threads
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:11,913 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(queue_monitoring, started 140125975598848)>, <ExcThread(job_monitor, started 140125430335232)>]
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:12,407 | WARNING  | queue_monitoring    | pilot.util.common                | should_abort              | data:queue_monitoring:received graceful stop - abort after this iteration
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:15,419 | DEBUG    | queue_monitoring    | pilot.control.data               | queue_monitoring          | [data] queue_monitor thread has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:15,961 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 2 threads
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:15,961 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140126239024960)>, <ExcThread(job_monitor, started 140125430335232)>]
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17,009 | INFO     | MainThread          | pilot.workflow.generic           | run                       | end of generic workflow (traces error code: 0)
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17,009 | INFO     | MainThread          | root                             | wrap_up                   | traces error code: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17,009 | INFO     | MainThread          | root                             | wrap_up                   | pilot has finished
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== pilot stdout END ====
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== wrapper stdout RESUME ====
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] Pilot exit status: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] STATUSCODE: 0
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] apfmon messages muted
Di 21. Jan 08:09:17 CET 2020: ---- find pandaID.out ----
Di 21. Jan 08:09:17 CET 2020: total 60
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc 11357 Jul 25 16:38 LICENSE
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc    20 Sep  9 13:04 MANIFEST.IN
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc     8 Dec 12 19:00 PILOTVERSION
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc  2212 Nov 14 11:01 README.md
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc   221 Jul 25 16:38 TODO.md
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc    11 Jan 21 07:35 pandaIDs.out
Di 21. Jan 08:09:17 CET 2020: drwx------. 14 boinc boinc   216 Jan 21 07:35 pilot
Di 21. Jan 08:09:17 CET 2020: -rwx------.  1 boinc boinc 21225 Dec 12 19:00 pilot.py
Di 21. Jan 08:09:17 CET 2020: -rw-------.  1 boinc boinc   766 Oct 10 16:01 setup.py
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 11 Jan 21 07:35 /var/lib/boinc/slots/0/pilot2/pandaIDs.out
Di 21. Jan 08:09:17 CET 2020: 4002876565
Di 21. Jan 08:09:17 CET 2020: 
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] Test setup, not cleaning
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== wrapper stdout END ====
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] ==== wrapper stderr END ====
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] wrapper wrapperexiting ec=0, duration=2069
Di 21. Jan 08:09:17 CET 2020: 2020-01-21 07:09:17 UTC [wrapper] apfmon messages muted
Di 21. Jan 08:09:17 CET 2020:  *** Error codes and diagnostics ***
Di 21. Jan 08:09:17 CET 2020:     "exeErrorCode": 0,
Di 21. Jan 08:09:17 CET 2020:     "exeErrorDiag": "",
Di 21. Jan 08:09:17 CET 2020:     "pilotErrorCode": 0,
Di 21. Jan 08:09:17 CET 2020:     "pilotErrorDiag": "",
Di 21. Jan 08:09:17 CET 2020:  *** Listing of results directory ***
Di 21. Jan 08:09:17 CET 2020: insgesamt 379016
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc    267260 20. Jan 16:32 pilot2.tar.gz
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      4492 20. Jan 17:15 queuedata.json
Di 21. Jan 08:09:17 CET 2020: -rwx------. 1 boinc boinc     12641 20. Jan 17:17 runpilot2-wrapper.sh
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc       107 21. Jan 07:34 wrapper_26015_x86_64-pc-linux-gnu
Di 21. Jan 08:09:17 CET 2020: -rwxr-xr-x. 1 boinc boinc      5557 21. Jan 07:34 run_atlas
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc       112 21. Jan 07:34 job.xml
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      5991 21. Jan 07:34 init_data.xml
Di 21. Jan 08:09:17 CET 2020: drwxrwx--x. 2 boinc boinc        86 21. Jan 07:34 shared
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc         0 21. Jan 07:34 boinc_lockfile
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc       815 21. Jan 07:34 RTE.tar.gz
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc    275414 21. Jan 07:34 input.tar.gz
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 365251149 21. Jan 07:34 EVNT.14296418._001447.pool.root.1
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      8509 21. Jan 07:34 start_atlas.sh
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      2948 21. Jan 07:34 pandaJob.out
Di 21. Jan 08:09:17 CET 2020: drwxr-xr-x. 3 boinc boinc        17 21. Jan 07:34 APPS
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc   3699286 21. Jan 07:35 agis_schedconf.cvmfs.json
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc   7792440 21. Jan 07:35 agis_ddmendpoints.json
Di 21. Jan 08:09:17 CET 2020: drwx------. 3 boinc boinc       229 21. Jan 07:35 pilot2
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc       535 21. Jan 08:02 boinc_task_state.xml
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc   9176612 21. Jan 08:08 HITS.000649-198069-24222._078090.pool.root.1
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc        26 21. Jan 08:08 wrapper_checkpoint.txt
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      8192 21. Jan 08:08 boinc_mmap_file
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc       784 21. Jan 08:08 memory_monitor_summary.json
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc    512991 21. Jan 08:09 log.000649-198069-24222._078090.job.log.tgz.1
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc     11640 21. Jan 08:09 heartbeat.json
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc      8556 21. Jan 08:09 pilotlog.txt
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc    217247 21. Jan 08:09 log.000649-198069-24222._078090.job.log.1
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc       490 21. Jan 08:09 dTzMDmZvoDwnShfckohDCDFpABFKDmABFKDmyiALDmNEFKDmsub0Qn.diag
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      7084 21. Jan 08:09 runtime_log.err
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc       496 21. Jan 08:09 output.list
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc       739 21. Jan 08:09 runtime_log
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc    757760 21. Jan 08:09 result.tar.gz
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      2252 21. Jan 08:09 stderr.txt
Di 21. Jan 08:09:17 CET 2020: HITS file was successfully produced:
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc 9176612 21. Jan 08:08 shared/HITS.pool.root.1
Di 21. Jan 08:09:17 CET 2020:  *** Contents of shared directory: ***
Di 21. Jan 08:09:17 CET 2020: insgesamt 366684
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc 365251149 21. Jan 07:34 ATLAS.root_0
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc      8509 21. Jan 07:34 start_atlas.sh
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc       815 21. Jan 07:34 RTE.tar.gz
Di 21. Jan 08:09:17 CET 2020: -rw-r--r--. 1 boinc boinc    275414 21. Jan 07:34 input.tar.gz
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc   9176612 21. Jan 08:08 HITS.pool.root.1
Di 21. Jan 08:09:17 CET 2020: -rw-------. 1 boinc boinc    757760 21. Jan 08:09 result.tar.gz
08:09:18 (30631): run_atlas exited; CPU time 2742.075446
08:09:18 (30631): called boinc_finish(0)

</stderr_txt>
]]>


©2024 CERN