Name PIRKDm1d2TvnShfckohDCDFpABFKDmABFKDmQ6FKDmfFFKDmr2Lzdn_3
Workunit 1938700
Created 1 Oct 2019, 5:30:39 UTC
Sent 7 Oct 2019, 11:12:02 UTC
Report deadline 14 Oct 2019, 11:12:02 UTC
Received 8 Oct 2019, 0:45:50 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 3911
Run time 19 min 35 sec
CPU time 33 min 27 sec
Validate state Valid
Credit 31.67
Device peak FLOPS 11.64 GFLOPS
Application version ATLAS Simulation v0.74 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 1.79 GB
Peak swap size 2.55 GB
Peak disk usage 731.83 MB

Stderr output

<core_client_version>7.7.0</core_client_version>
<![CDATA[
<stderr_txt>
02:26:03 (4669): wrapper (7.7.26015): starting
02:26:03 (4669): wrapper: running run_atlas (--nthreads 4)
2019-10-08 02:26:03,865: singularity image is /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img
2019-10-08 02:26:03,865: sys.argv = ['run_atlas', '--nthreads', '4']
2019-10-08 02:26:03,866: THREADS=4
2019-10-08 02:26:03,868: Checking for CVMFS
2019-10-08 02:26:06,486: CVMFS is installed
2019-10-08 02:26:06,487: Checking Singularity...
2019-10-08 02:26:07,268: Singularity is installed, version singularity version 3.2.1-1
2019-10-08 02:26:07,268: Testing the function of Singularity...
2019-10-08 02:26:07,268: Checking singularity with cmd:singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img hostname
2019-10-08 02:26:17,272: Singularity Works...
2019-10-08 02:26:17,273: copy /home/grid/slots/0/shared/ATLAS.root_0
2019-10-08 02:26:17,961: copy /home/grid/slots/0/shared/input.tar.gz
2019-10-08 02:26:17,962: copy /home/grid/slots/0/shared/RTE.tar.gz
2019-10-08 02:26:17,962: copy /home/grid/slots/0/shared/start_atlas.sh
2019-10-08 02:26:17,963: export ATHENA_PROC_NUMBER=4;
2019-10-08 02:26:17,995: start atlas job with PandaID=4002876565
2019-10-08 02:26:17,995: cmd = singularity exec --pwd /home/grid/slots/0 -B /cvmfs,/home /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img sh start_atlas.sh > runtime_log 2> runtime_log.err
2019-10-08 02:45:37,434: running cmd return value is 0
2019-10-08 02:45:37,438: Moving ./HITS.000649-403420-13268._078090.pool.root.1 to shared/HITS.pool.root.1
2019-10-08 02:45:37,438: HITS result file:
2019-10-08 02:45:37,464: -rw-------. 1 grid grid 9160611 Oct  8 02:44 shared/HITS.pool.root.1
2019-10-08 02:45:37,465: *****************The last 200 lines of the pilot log******************
2019-10-08 02:45:37,470:           "wallTime": 3
        }, 
        "total": {
          "cpuTime": 58, 
          "wallTime": 128
        }, 
        "validation": {
          "cpuTime": 0, 
          "wallTime": 0
        }, 
        "wallTime": 124
      }
    }, 
    "machine": {
      "cpu_family": "6", 
      "linux_distribution": [
        "CentOS Linux", 
        "7.6.1810", 
        "Core"
      ], 
      "model": "61", 
      "model_name": "Intel Core Processor (Broadwell, IBRS)", 
      "node": "dcameron04.cern.ch", 
      "platform": "Linux-3.10.0-957.el7.x86_64-x86_64-with-centos-7.6.1810-Core"
    }, 
    "transform": {
      "cpuEfficiency": 0.4936, 
      "cpuPWEfficiency": 0.5396, 
      "cpuTime": 5, 
      "cpuTimeTotal": 2054, 
      "externalCpuTime": 7, 
      "processedEvents": 10, 
      "trfPredata": null, 
      "wallTime": 1010
    }
  }
}
2019-10-08 00:44:30,015 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | update_server             | xml:will send fileinfo
2019-10-08 00:44:30,015 | INFO     | queue_monitor       | pilot.control.job.4002876565     | send_state                | pilot will not update the server (heartbeat message will be written to file)
2019-10-08 00:44:30,015 | INFO     | queue_monitor       | pilot.control.job.4002876565     | send_state                | job 4002876565 has finished - writing final server update
2019-10-08 00:44:30,015 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | get_job_metrics           | will not add max space = -355547725 B to job metrics
2019-10-08 00:44:30,016 | WARNING  | queue_monitor       | pilot.api.analytics              | get_fitted_data           | wrong length of table data, x=[1570494435.0, 1570494496.0, 1570494557.0, 1570494618.0, 1570494679.0, 1570494740.0, 1570494801.0, 1570494862.0, 1570494923.0, 1570494984.0, 1570495045.0, 1570495106.0, 1570495167.0, 1570495228.0, 1570495289.0, 1570495350.0, 1570495411.0], y=[] (must be same and length>=2)
2019-10-08 00:44:30,016 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | get_job_metrics           | job metrics="coreCount=4 actualCoreCount=3 nEvents=10"
2019-10-08 00:44:30,016 | INFO     | queue_monitor       | pilot.control.job.4002876565     | get_data_structure        | total number of processed events: 10 (read)
2019-10-08 00:44:30,017 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_values         | using path: /home/grid/slots/0/PanDA_Pilot-4002876565/memory_monitor_summary.json (trf name=prmon)
2019-10-08 00:44:30,018 | DEBUG    | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | summary_dictionary={'Max': {'rx_packets': 75693, 'nprocs': 10, 'nthreads': 1, 'rx_bytes': 19060732, 'wtime': 989, 'rss': 9423896, 'write_bytes': 16957440, 'vmem': 14014680, 'read_bytes': 3124965376, 'stime': 45, 'tx_bytes': 3595631, 'pss': 2507560, 'wchar': 23940871, 'rchar': 1032225262, 'tx_packets': 24359, 'swap': 0, 'utime': 1975}, 'Avg': {'write_bytes': 17143, 'nprocs': 6, 'nthreads': 0, 'rx_bytes': 19270, 'rx_packets': 76, 'vmem': 7641108, 'read_bytes': 3159338, 'swap': 0, 'tx_bytes': 3635, 'pss': 1690995, 'wchar': 24204, 'rchar': 1043579, 'tx_packets': 24, 'rss': 4856631}}
2019-10-08 00:44:30,018 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | extracted standard info from prmon json
2019-10-08 00:44:30,018 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | extracted standard memory fields from prmon json
2019-10-08 00:44:30,018 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | ..............................
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . Timing measurements:
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . get job = 0 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . initial setup = 1 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . payload setup = 0 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . total setup = 1 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . stage-in = 0 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . payload execution = 1043 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . stage-out = 2 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | ..............................
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | get_log_extracts          | building log extracts (sent to the server as 'pilotLog')
2019-10-08 00:44:30,019 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | get_panda_tracer_log      | PanDA tracer log does not exist: /home/grid/slots/0/PanDA_Pilot-4002876565/pandatracerlog.txt (ignoring)
2019-10-08 00:44:30,020 | INFO     | queue_monitor       | pilot.util.container             | execute                   | executing command: tail -n 20 /home/grid/slots/0/PanDA_Pilot-4002876565/pilotlog.txt
2019-10-08 00:44:30,043 | WARNING  | queue_monitor       | pilot.util.auxiliary.4002876565  | get_log_extracts          | detected the following tail of warning/fatal messages in the pilot log:
- Log from pilotlog.txt -2019-10-08 00:44:30,016 | WARNING  | queue_monitor       | pilot.api.analytics              | get_fitted_data           | wrong length of table data, x=[1570494435.0, 1570494496.0, 1570494557.0, 1570494618.0, 1570494679.0, 1570494740.0, 1570494801.0, 1570494862.0, 1570494923.0, 1570494984.0, 1570495045.0, 1570495106.0, 1570495167.0, 1570495228.0, 1570495289.0, 1570495350.0, 1570495411.0], y=[] (must be same and length>=2)
2019-10-08 00:44:30,016 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | get_job_metrics           | job metrics="coreCount=4 actualCoreCount=3 nEvents=10"
2019-10-08 00:44:30,016 | INFO     | queue_monitor       | pilot.control.job.4002876565     | get_data_structure        | total number of processed events: 10 (read)
2019-10-08 00:44:30,017 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_values         | using path: /home/grid/slots/0/PanDA_Pilot-4002876565/memory_monitor_summary.json (trf name=prmon)
2019-10-08 00:44:30,018 | DEBUG    | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | summary_dictionary={'Max': {'rx_packets': 75693, 'nprocs': 10, 'nthreads': 1, 'rx_bytes': 19060732, 'wtime': 989, 'rss': 9423896, 'write_bytes': 16957440, 'vmem': 14014680, 'read_bytes': 3124965376, 'stime': 45, 'tx_bytes': 3595631, 'pss': 2507560, 'wchar': 23940871, 'rchar': 1032225262, 'tx_packets': 24359, 'swap': 0, 'utime': 1975}, 'Avg': {'write_bytes': 17143, 'nprocs': 6, 'nthreads': 0, 'rx_bytes': 19270, 'rx_packets': 76, 'vmem': 7641108, 'read_bytes': 3159338, 'swap': 0, 'tx_bytes': 3635, 'pss': 1690995, 'wchar': 24204, 'rchar': 1043579, 'tx_packets': 24, 'rss': 4856631}}
2019-10-08 00:44:30,018 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | extracted standard info from prmon json
2019-10-08 00:44:30,018 | INFO     | queue_monitor       | pilot.user.atlas.utilities       | get_memory_monitor_info   | extracted standard memory fields from prmon json
2019-10-08 00:44:30,018 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | ..............................
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . Timing measurements:
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . get job = 0 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . initial setup = 1 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . payload setup = 0 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . total setup = 1 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . stage-in = 0 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . payload execution = 1043 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | . stage-out = 2 s
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | timing_report             | ..............................
2019-10-08 00:44:30,019 | INFO     | queue_monitor       | pilot.util.auxiliary.4002876565  | get_log_extracts          | building log extracts (sent to the server as 'pilotLog')
2019-10-08 00:44:30,019 | DEBUG    | queue_monitor       | pilot.util.auxiliary.4002876565  | get_panda_tracer_log      | PanDA tracer log does not exist: /home/grid/slots/0/PanDA_Pilot-4002876565/pandatracerlog.txt (ignoring)
2019-10-08 00:44:30,020 | INFO     | queue_monitor       | pilot.util.container             | execute                   | executing command: tail -n 20 /home/grid/slots/0/PanDA_Pilot-4002876565/pilotlog.txt
2019-10-08 00:44:30,043 | DEBUG    | queue_monitor       | pilot.control.job.4002876565     | send_state                | wrote heartbeat to file /home/grid/slots/0/heartbeat.json
2019-10-08 00:44:30,044 | INFO     | queue_monitor       | pilot.control.job                | queue_monitor             | job 4002876565 was dequeued from the monitored payloads queue
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | 
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | job summary report
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | --------------------------------------------------
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | PanDA job id: 4002876565
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | task id: 000649-403420-13268
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | errors: (none)
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | status: LOG_TRANSFER = DONE 
2019-10-08 00:44:30,599 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | pilot state: finished 
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | transexitcode: 0
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exeerrorcode: 0
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exeerrordiag: 
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exitcode: 0
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | exitmsg: OK
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | cpuconsumptiontime: 2032 s
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | nevents: 10
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | neventsw: 0
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | pid: 11840
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | pgrp: 11840
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | corecount: 4
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | event service: False
2019-10-08 00:44:30,600 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | --------------------------------------------------
2019-10-08 00:44:30,601 | INFO     | retrieve            | pilot.util.auxiliary.4002876565  | make_job_report           | 
2019-10-08 00:44:30,601 | INFO     | retrieve            | pilot.control.job.4002876565     | has_job_completed         | job 4002876565 has completed
2019-10-08 00:44:30,601 | INFO     | retrieve            | pilot.util.processes             | cleanup                   | overall cleanup function is called
2019-10-08 00:44:30,610 | DEBUG    | retrieve            | pilot.util.processes             | cleanup                   | work directory was removed: /home/grid/slots/0/PanDA_Pilot-4002876565
2019-10-08 00:44:31,612 | INFO     | retrieve            | pilot.info.jobdata               | collect_zombies           | --- collectZombieJob: --- 10, [11840]
2019-10-08 00:44:31,612 | INFO     | retrieve            | pilot.info.jobdata               | collect_zombies           | zombie collector trying to kill pid 11840
2019-10-08 00:44:31,612 | INFO     | retrieve            | pilot.info.jobdata               | collect_zombies           | harmless exception when collecting zombies: [Errno 10] No child processes
2019-10-08 00:44:32,618 | INFO     | retrieve            | pilot.util.processes             | cleanup                   | collected zombie processes
2019-10-08 00:44:32,618 | INFO     | retrieve            | pilot.util.processes             | cleanup                   | will now attempt to kill all subprocesses of pid=11840
2019-10-08 00:44:32,699 | INFO     | retrieve            | pilot.util.processes             | kill_processes            | process IDs to be killed: [11840] (in reverse order)
2019-10-08 00:44:32,740 | WARNING  | retrieve            | pilot.util.processes             | kill_processes            | found no corresponding commands to process id(s)
2019-10-08 00:44:32,741 | INFO     | retrieve            | pilot.util.processes             | kill_orphans              | Do not look for orphan processes in BOINC jobs
2019-10-08 00:44:32,741 | INFO     | retrieve            | pilot.control.job                | retrieve                  | ready for new job
2019-10-08 00:44:32,741 | INFO     | retrieve            | root                             | retrieve                  | pilot has finished for previous job - re-establishing logging
No handlers could be found for logger "pilot.util.mpi"
2019-10-08 00:44:32,745 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | *****************************************
2019-10-08 00:44:32,745 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | ***  PanDA Pilot version 2.1.25 (11)  ***
2019-10-08 00:44:32,745 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | *****************************************
2019-10-08 00:44:32,745 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | 
2019-10-08 00:44:32,745 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | pilot is running in a VM
2019-10-08 00:44:32,745 | INFO     | retrieve            | pilot.util.auxiliary             | display_architecture_info | architecture information:
2019-10-08 00:44:32,822 | INFO     | retrieve            | pilot.util.auxiliary             | display_architecture_info | 
LSB Version:	:core-4.1-amd64:core-4.1-noarch
Distributor ID:	CentOS
Description:	CentOS Linux release 7.6.1810 (Core) 
Release:	7.6.1810
Codename:	Core
2019-10-08 00:44:32,822 | INFO     | retrieve            | pilot.util.auxiliary             | pilot_version_banner      | *****************************************
2019-10-08 00:44:33,221 | WARNING  | job_monitor         | pilot.control.job                | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 73 s)
2019-10-08 00:44:33,325 | DEBUG    | retrieve            | pilot.util.monitoring            | check_local_space         | checking local space on /home/grid/slots/0
2019-10-08 00:44:33,349 | INFO     | retrieve            | pilot.util.monitoring            | check_local_space         | sufficient remaining disk space (19167969280 B)
2019-10-08 00:44:33,349 | WARNING  | retrieve            | pilot.control.job                | proceed_with_getjob       | since timefloor is set to 0, pilot was only allowed to run one job
2019-10-08 00:44:33,349 | DEBUG    | retrieve            | pilot.control.job                | retrieve                  | [job] retrieve thread has finished
2019-10-08 00:44:33,410 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 16 threads
2019-10-08 00:44:33,411 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140457539118912)>, <ExcThread(job, started 140457377486592)>, <ExcThread(validate_pre, started 140456813905664)>, <ExcThread(queue_monitoring, started 140456839083776)>, <ExcThread(failed_post, started 140456788727552)>, <ExcThread(monitor, started 140456830691072)>, <ExcThread(validate, started 140457369093888)>, <ExcThread(validate_post, started 140456797120256)>, <ExcThread(queue_monitor, started 140456822298368)>, <ExcThread(copytool_in, started 140456805512960)>, <ExcThread(create_data_payload, started 140457352308480)>, <ExcThread(job_monitor, started 140457318737664)>, <ExcThread(copytool_out, started 140457327130368)>, <ExcThread(execute_payloads, started 140456235104000)>, <ExcThread(payload, started 140457343915776)>, <ExcThread(data, started 140457360701184)>]
2019-10-08 00:44:33,530 | DEBUG    | copytool_in         | pilot.control.data               | copytool_in               | [data] copytool_in thread has finished
2019-10-08 00:44:33,646 | WARNING  | queue_monitoring    | pilot.util.common                | should_abort              | data:queue_monitoring:received graceful stop - abort after this iteration
2019-10-08 00:44:33,724 | WARNING  | job_monitor         | pilot.util.common                | should_abort              | job:job_monitor:received graceful stop - abort after this iteration
2019-10-08 00:44:33,811 | DEBUG    | payload             | pilot.control.payload            | control                   | payload control ending since graceful_stop has been set
2019-10-08 00:44:33,812 | DEBUG    | payload             | pilot.control.payload            | control                   | [payload] control thread has finished
2019-10-08 00:44:33,867 | DEBUG    | validate            | pilot.control.job                | validate                  | [job] validate thread has finished
2019-10-08 00:44:33,913 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 13 threads
2019-10-08 00:44:33,913 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140457539118912)>, <ExcThread(job, started 140457377486592)>, <ExcThread(validate_pre, started 140456813905664)>, <ExcThread(queue_monitoring, started 140456839083776)>, <ExcThread(failed_post, started 140456788727552)>, <ExcThread(monitor, started 140456830691072)>, <ExcThread(validate_post, started 140456797120256)>, <ExcThread(queue_monitor, started 140456822298368)>, <ExcThread(create_data_payload, started 140457352308480)>, <ExcThread(job_monitor, started 140457318737664)>, <ExcThread(copytool_out, started 140457327130368)>, <ExcThread(execute_payloads, started 140456235104000)>, <ExcThread(data, started 140457360701184)>]
2019-10-08 00:44:34,035 | INFO     | monitor             | pilot.control.monitor            | control                   | [monitor] control thread has ended
2019-10-08 00:44:34,058 | DEBUG    | job                 | pilot.control.job                | control                   | job control ending since graceful_stop has been set
2019-10-08 00:44:34,058 | DEBUG    | job                 | pilot.control.job                | control                   | [job] control thread has finished
2019-10-08 00:44:34,067 | INFO     | failed_post         | pilot.control.payload            | failed_post               | [payload] failed_post thread has finished
2019-10-08 00:44:34,068 | INFO     | validate_post       | pilot.control.payload            | validate_post             | [payload] validate_post thread has finished
2019-10-08 00:44:34,112 | DEBUG    | data                | pilot.control.data               | control                   | data control ending since graceful_stop has been set
2019-10-08 00:44:34,112 | DEBUG    | data                | pilot.control.data               | control                   | [data] control thread has finished
2019-10-08 00:44:34,135 | DEBUG    | create_data_payload | pilot.control.job                | create_data_payload       | [job] create_data_payload thread has finished
2019-10-08 00:44:34,135 | INFO     | execute_payloads    | pilot.control.payload            | execute_payloads          | [payload] execute_payloads thread has finished
2019-10-08 00:44:34,136 | INFO     | validate_pre        | pilot.control.payload            | validate_pre              | [payload] validate_pre thread has finished
2019-10-08 00:44:34,142 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 5 threads
2019-10-08 00:44:34,142 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140457539118912)>, <ExcThread(queue_monitoring, started 140456839083776)>, <ExcThread(queue_monitor, started 140456822298368)>, <ExcThread(job_monitor, started 140457318737664)>, <ExcThread(copytool_out, started 140457327130368)>]
2019-10-08 00:44:34,559 | WARNING  | queue_monitor       | pilot.util.common                | should_abort              | job:queue_monitor:received graceful stop - abort after this iteration
2019-10-08 00:44:34,559 | DEBUG    | queue_monitor       | pilot.control.job                | queue_monitor             | [job] queue monitor thread has finished
2019-10-08 00:44:34,569 | WARNING  | copytool_out        | pilot.util.common                | should_abort              | data:copytool_out:received graceful stop - abort after this iteration
2019-10-08 00:44:34,646 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 4 threads
2019-10-08 00:44:34,646 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140457539118912)>, <ExcThread(queue_monitoring, started 140456839083776)>, <ExcThread(job_monitor, started 140457318737664)>, <ExcThread(copytool_out, started 140457327130368)>]
2019-10-08 00:44:35,569 | DEBUG    | copytool_out        | pilot.control.data               | copytool_out              | [data] copytool_out thread has finished
2019-10-08 00:44:35,653 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 3 threads
2019-10-08 00:44:35,653 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140457539118912)>, <ExcThread(queue_monitoring, started 140456839083776)>, <ExcThread(job_monitor, started 140457318737664)>]
2019-10-08 00:44:36,648 | DEBUG    | queue_monitoring    | pilot.control.data               | queue_monitoring          | [data] queue_monitor thread has finished
2019-10-08 00:44:36,660 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | thread count now at 2 threads
2019-10-08 00:44:36,660 | DEBUG    | MainThread          | pilot.workflow.generic           | run                       | enumerate: [<_MainThread(MainThread, started 140457539118912)>, <ExcThread(job_monitor, started 140457318737664)>]
2019-10-08 00:45:33,820 | WARNING  | job_monitor         | pilot.control.job                | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 133 s)
2019-10-08 00:45:33,820 | DEBUG    | job_monitor         | pilot.control.job                | job_monitor               | [job] job monitor thread has finished
2019-10-08 00:45:33,851 | INFO     | MainThread          | pilot.workflow.generic           | run                       | end of generic workflow (traces error code: 0)
2019-10-08 00:45:33,851 | INFO     | MainThread          | root                             | wrap_up                   | traces error code: 0
2019-10-08 00:45:33,851 | INFO     | MainThread          | root                             | wrap_up                   | pilot has finished
2019-10-08 00:45:34 UTC [wrapper] ==== pilot stdout END ====
2019-10-08 00:45:34 UTC [wrapper] ==== wrapper stdout RESUME ====
2019-10-08 00:45:34 UTC [wrapper] Pilot exit status: 0
2019-10-08 00:45:34 UTC [wrapper] STATUSCODE: 0
2019-10-08 00:45:34 UTC [wrapper] apfmon messages muted
---- find pandaID.out ----
total 56
-rw-------.  1 grid grid 11357 Jul 25 16:38 LICENSE
-rw-------.  1 grid grid    20 Sep  9 13:04 MANIFEST.IN
-rw-------.  1 grid grid    11 Oct  8 02:26 pandaIDs.out
drwx------. 14 grid grid   216 Oct  8 02:26 pilot
-rwx------.  1 grid grid 20136 Sep  9 13:04 pilot.py
-rw-------.  1 grid grid     9 Sep  9 13:04 PILOTVERSION
-rw-------.  1 grid grid  2251 Jul 25 16:38 README.md
-rw-------.  1 grid grid   760 Aug 22 11:01 setup.py
-rw-------.  1 grid grid   221 Jul 25 16:38 TODO.md
-rw-------. 1 grid grid 11 Oct  8 02:26 /home/grid/slots/0/pilot2/pandaIDs.out
4002876565

2019-10-08 00:45:34 UTC [wrapper] Test setup, not cleaning
2019-10-08 00:45:34 UTC [wrapper] ==== wrapper stdout END ====
2019-10-08 00:45:34 UTC [wrapper] ==== wrapper stderr END ====
2019-10-08 00:45:34 UTC [wrapper] wrapper wrapperexiting ec=0, duration=1151
2019-10-08 00:45:34 UTC [wrapper] apfmon messages muted
2019-10-08 02:45:37,475: ***************diag file************
2019-10-08 02:45:37,475: runtimeenvironments=APPS/HEP/ATLAS-SITE;
Processors=1
WallTime=1150.21s
KernelTime=77.26s
UserTime=1995.23s
CPUUsage=180%
MaxResidentMemory=1946740kB
AverageResidentMemory=0kB
AverageTotalMemory=0kB
AverageUnsharedMemory=0kB
AverageUnsharedStack=0kB
AverageSharedMemory=0kB
PageSize=4096B
MajorPageFaults=17284
MinorPageFaults=7630716
Swaps=0
ForcedSwitches=204115
WaitSwitches=2044307
Inputs=6317304
Outputs=87240
SocketReceived=0
SocketSent=0
Signals=0

nodename=David_Cameron@dcameron04.cern.ch
exitcode=0
2019-10-08 02:45:37,482: ******************************WorkDir***********************
2019-10-08 02:45:37,482: total 369564
drwxrwx--x. 7 grid grid      4096 Oct  8 02:45 .
drwxr-x--x. 6 grid grid        42 Sep 26 14:51 ..
-rw-------. 1 grid grid   7595462 Oct  8 02:26 agis_ddmendpoints.json
-rw-------. 1 grid grid   3936744 Oct  8 02:26 agis_schedconf.cvmfs.json
drwx------. 2 grid grid         6 Oct  8 02:26 .alrb
drwxr-xr-x. 3 grid grid        17 Oct  8 02:26 APPS
-rw-------. 1 grid grid       538 Oct  8 02:26 .asetup
-rw-------. 1 grid grid      4506 Oct  8 02:27 .asetup.save
drwx------. 2 grid grid         6 Oct  8 02:27 .asetup-sysbin_11841
-rw-rw-r--. 1 grid grid         0 Oct  8 02:26 boinc_lockfile
-rw-rw-r--. 1 grid grid      8192 Oct  8 02:45 boinc_mmap_file
-rw-rw-r--. 1 grid grid       534 Oct  8 02:38 boinc_task_state.xml
-rw-rw-r--. 1 grid grid 365251149 Oct  8 02:26 EVNT.14296418._001447.pool.root.1
-rw-------. 1 grid grid     15123 Oct  8 02:44 heartbeat.json
-rw-rw-r--. 1 grid grid      5867 Oct  8 02:26 init_data.xml
-rw-rw-r--. 1 grid grid    267443 Oct  8 02:26 input.tar.gz
-rw-rw-r--. 1 grid grid       112 Oct  8 02:26 job.xml
-rw-------. 1 grid grid    735318 Oct  8 02:44 log.000649-403420-13268._078090.job.log.tgz.1
-rw-------. 1 grid grid    181946 Oct  8 02:45 log.14568781._078090.job.log.1
-rw-------. 1 grid grid       837 Oct  8 02:44 memory_monitor_summary.json
-rw-------. 1 grid grid       496 Oct  8 02:45 output.list
-rw-------. 1 grid grid      2948 Sep 16 11:38 pandaJobData.out
drwx------. 3 grid grid       229 Oct  8 02:26 pilot2
-rw-r--r--. 1 grid grid    259319 Sep 16 11:27 pilot2.tar.gz
-rw-------. 1 grid grid     10509 Oct  8 02:45 pilotlog.txt
-rw-------. 1 grid grid       509 Oct  8 02:45 PIRKDm1d2TvnShfckohDCDFpABFKDmABFKDmQ6FKDmfFFKDmr2Lzdn.diag
-rw-r--r--. 1 grid grid      4480 Sep 16 11:35 queuedata.json
-rw-rw-r--. 1 grid grid       815 Oct  8 02:26 RTE.tar.gz
-rwxr-xr-x. 1 grid grid      7950 Oct  8 02:26 run_atlas
-rwx------. 1 grid grid     12641 Sep 16 11:38 runpilot2-wrapper.sh
-rw-rw-r--. 1 grid grid       717 Oct  8 02:45 runtime_log
-rw-rw-r--. 1 grid grid      6947 Oct  8 02:45 runtime_log.err
drwxrwx--x. 2 grid grid       131 Oct  8 02:45 shared
-rw-rw-r--. 1 grid grid      8494 Oct  8 02:26 start_atlas.sh
-rw-rw-r--. 1 grid grid     29851 Oct  8 02:45 stderr.txt
-rw-rw-r--. 1 grid grid       107 Oct  8 02:26 wrapper_26015_x86_64-pc-linux-gnu
-rw-rw-r--. 1 grid grid        26 Oct  8 02:45 wrapper_checkpoint.txt

2019-10-08 02:45:37,483: running start_atlas return value is 0
2019-10-08 02:45:37,483: Parent exit 0
2019-10-08 02:45:37,485: child process exit 0
02:45:38 (4669): run_atlas exited; CPU time 2007.664588
02:45:38 (4669): called boinc_finish(0)

</stderr_txt>
]]>


©2024 CERN