Name | yPuNDm7N3JwnShfckohDCDFpABFKDmABFKDmyiALDmgEFKDmqqLvWn_1 |
Workunit | 1980748 |
Created | 7 Feb 2020, 21:14:18 UTC |
Sent | 9 Feb 2020, 3:48:23 UTC |
Report deadline | 16 Feb 2020, 3:48:23 UTC |
Received | 9 Feb 2020, 4:43:49 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 4064 |
Run time | 53 min 17 sec |
CPU time | 1 hour 10 min 38 sec |
Validate state | Valid |
Credit | 29.28 |
Device peak FLOPS | 3.96 GFLOPS |
Application version | ATLAS Simulation v1.03 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 1.79 GB |
Peak swap size | 2.55 GB |
Peak disk usage | 717.73 MB |
<core_client_version>7.16.1</core_client_version> <![CDATA[ <stderr_txt> 04:49:45 (14090): wrapper (7.7.26015): starting 04:49:45 (14090): wrapper: running run_atlas (--nthreads 2) [2020-02-09 04:49:45] Arguments: --nthreads 2 [2020-02-09 04:49:45] Threads: 2 [2020-02-09 04:49:45] Checking for CVMFS [2020-02-09 04:49:46] Probing /cvmfs/atlas.cern.ch... OK [2020-02-09 04:49:46] Probing /cvmfs/atlas-condb.cern.ch... OK [2020-02-09 04:49:46] Probing /cvmfs/grid.cern.ch... OK [2020-02-09 04:49:46] Probing /cvmfs/cernvm-prod.cern.ch... OK [2020-02-09 04:49:47] Probing /cvmfs/sft.cern.ch... OK [2020-02-09 04:49:47] Probing /cvmfs/alice.cern.ch... OK [2020-02-09 04:49:51] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2020-02-09 04:49:51] 2.7.0.0 3672 278 47948 60204 2 61 3961203 4194304 588 65024 0 126702 99.9345 127670 3298 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch DIRECT 1 [2020-02-09 04:49:51] CVMFS is ok [2020-02-09 04:49:51] Using singularity image /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img [2020-02-09 04:49:51] Checking for singularity binary... 
[2020-02-09 04:49:51] Using singularity found in PATH at /usr/bin/singularity [2020-02-09 04:49:51] Running /usr/bin/singularity --version [2020-02-09 04:49:51] singularity version 3.5.2-1.1.el7 [2020-02-09 04:49:51] Checking singularity works with /usr/bin/singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img hostname [2020-02-09 04:49:51] FX8S [2020-02-09 04:49:51] Singularity works [2020-02-09 04:49:52] Set ATHENA_PROC_NUMBER=2 [2020-02-09 04:49:52] Starting ATLAS job with PandaID=4002876565 [2020-02-09 04:49:52] Running command: /usr/bin/singularity exec --pwd /var/lib/boinc/slots/1 -B /cvmfs,/var /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img sh start_atlas.sh [2020-02-09 05:42:59] *** The last 200 lines of the pilot log: *** [2020-02-09 05:42:59] }, [2020-02-09 05:42:59] "preExe": { [2020-02-09 05:42:59] "cpuTime": 6, [2020-02-09 05:42:59] "wallTime": 16 [2020-02-09 05:42:59] }, [2020-02-09 05:42:59] "total": { [2020-02-09 05:42:59] "cpuTime": 163, [2020-02-09 05:42:59] "wallTime": 339 [2020-02-09 05:42:59] }, [2020-02-09 05:42:59] "validation": { [2020-02-09 05:42:59] "cpuTime": 0, [2020-02-09 05:42:59] "wallTime": 0 [2020-02-09 05:42:59] }, [2020-02-09 05:42:59] "wallTime": 322 [2020-02-09 05:42:59] } [2020-02-09 05:42:59] }, [2020-02-09 05:42:59] "machine": { [2020-02-09 05:42:59] "cpu_family": "21", [2020-02-09 05:42:59] "linux_distribution": [ [2020-02-09 05:42:59] "CentOS Linux", [2020-02-09 05:42:59] "7.6.1810", [2020-02-09 05:42:59] "Core" [2020-02-09 05:42:59] ], [2020-02-09 05:42:59] "model": "2", [2020-02-09 05:42:59] "model_name": "AMD FX-8370E Eight-Core Processor", [2020-02-09 05:42:59] "node": "FX8S", [2020-02-09 05:42:59] "platform": "Linux-3.10.0-1062.el7.x86_64-x86_64-with-centos-7.6.1810-Core" [2020-02-09 05:42:59] }, [2020-02-09 05:42:59] "transform": { [2020-02-09 05:42:59] "cpuEfficiency": 0.7123, [2020-02-09 05:42:59] "cpuPWEfficiency": 0.7419, [2020-02-09 
05:42:59] "cpuTime": 19, [2020-02-09 05:42:59] "cpuTimeTotal": 4404, [2020-02-09 05:42:59] "externalCpuTime": 34, [2020-02-09 05:42:59] "processedEvents": 10, [2020-02-09 05:42:59] "trfPredata": null, [2020-02-09 05:42:59] "wallTime": 2968 [2020-02-09 05:42:59] } [2020-02-09 05:42:59] } [2020-02-09 05:42:59] } [2020-02-09 05:42:59] 2020-02-09 04:42:06,039 | DEBUG | queue_monitor | pilot.util.auxiliary.4002876565 | update_server | xml:will send fileinfo [2020-02-09 05:42:59] 2020-02-09 04:42:06,039 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | state=finished [2020-02-09 05:42:59] 2020-02-09 04:42:06,039 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=running [2020-02-09 05:42:59] 2020-02-09 04:42:06,039 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=finished [2020-02-09 05:42:59] 2020-02-09 04:42:06,040 | INFO | queue_monitor | pilot.control.job.4002876565 | send_state | pilot will not update the server (heartbeat message will be written to file) [2020-02-09 05:42:59] 2020-02-09 04:42:06,040 | INFO | queue_monitor | pilot.control.job.4002876565 | send_state | job 4002876565 has finished - writing final server update [2020-02-09 05:42:59] 2020-02-09 04:42:06,040 | DEBUG | queue_monitor | pilot.control.job.4002876565 | get_data_structure | building data structure to be sent to server with heartbeat [2020-02-09 05:42:59] 2020-02-09 04:42:06,040 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | get_job_metrics | will not add max space = -353233485 B to job metrics [2020-02-09 05:42:59] 2020-02-09 04:42:06,042 | DEBUG | queue_monitor | pilot.api.analytics | get_fitted_data | removing tails from data to be fitted [2020-02-09 05:42:59] 2020-02-09 04:42:06,042 | INFO | queue_monitor | pilot.api.analytics | get_fitted_data | fitting pss+swap vs Time [2020-02-09 05:42:59] 2020-02-09 04:42:06,043 | INFO | queue_monitor | pilot.api.analytics | get_fitted_data | current memory leak: -210.21 
B/s (using 43 data points, chi2=7723527) [2020-02-09 05:42:59] 2020-02-09 04:42:06,043 | DEBUG | queue_monitor | pilot.util.auxiliary.4002876565 | get_job_metrics | job metrics="coreCount=2 actualCoreCount=1 nEvents=10 leak=-210.21 chi2=7723527" [2020-02-09 05:42:59] 2020-02-09 04:42:06,043 | INFO | queue_monitor | pilot.control.job.4002876565 | get_data_structure | total number of processed events: 10 (read) [2020-02-09 05:42:59] 2020-02-09 04:42:06,045 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_values | using path: /var/lib/boinc/slots/1/PanDA_Pilot-4002876565/memory_monitor_summary.json (trf name=prmon) [2020-02-09 05:42:59] 2020-02-09 04:42:06,047 | DEBUG | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | summary_dictionary={'Max': {'rx_packets': 118966, 'nprocs': 10, 'nthreads': 1, 'rx_bytes': 79435887, 'wtime': 3034, 'rss': 5681040, 'write_bytes': 0, 'vmem': 8678420, 'read_bytes': 0, 'stime': 138, 'tx_bytes': 151093508, 'pss': 2221625, 'wchar': 0, 'rchar': 0, 'tx_packets': 117354, 'swap': 0, 'utime': 4180}, 'Avg': {'write_bytes': 0, 'nprocs': 6, 'nthreads': 0, 'rx_bytes': 26173, 'rx_packets': 39, 'vmem': 6206800, 'read_bytes': 0, 'swap': 0, 'tx_bytes': 49784, 'pss': 1684223, 'wchar': 0, 'rchar': 0, 'tx_packets': 38, 'rss': 3826828}} [2020-02-09 05:42:59] 2020-02-09 04:42:06,047 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard info from prmon json [2020-02-09 05:42:59] 2020-02-09 04:42:06,047 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard memory fields from prmon json [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | .............................. [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . 
Timing measurements: [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . get job = 0 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . initial setup = 2 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . payload setup = 0 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . total setup = 2 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,048 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . stage-in = 0 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,049 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . payload execution = 3087 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,049 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | . stage-out = 4 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,049 | INFO | queue_monitor | pilot.util.auxiliary.4002876565 | timing_report | .............................. 
[2020-02-09 05:42:59] 2020-02-09 04:42:06,055 | DEBUG | queue_monitor | pilot.control.job.4002876565 | send_state | wrote heartbeat to file /var/lib/boinc/slots/1/heartbeat.json [2020-02-09 05:42:59] 2020-02-09 04:42:06,055 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | job 4002876565 was dequeued from the monitored payloads queue [2020-02-09 05:42:59] 2020-02-09 04:42:06,055 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | tmp job object deleted [2020-02-09 05:42:59] 2020-02-09 04:42:06,187 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | [2020-02-09 05:42:59] 2020-02-09 04:42:06,188 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | job summary report [2020-02-09 05:42:59] 2020-02-09 04:42:06,188 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | -------------------------------------------------- [2020-02-09 05:42:59] 2020-02-09 04:42:06,188 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | PanDA job id: 4002876565 [2020-02-09 05:42:59] 2020-02-09 04:42:06,188 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | task id: 000649-1749162-26069 [2020-02-09 05:42:59] 2020-02-09 04:42:06,188 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | errors: (none) [2020-02-09 05:42:59] 2020-02-09 04:42:06,188 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | status: LOG_TRANSFER = DONE [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | pilot state: finished [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | transexitcode: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exeerrorcode: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | 
exeerrordiag: [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exitcode: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | exitmsg: OK [2020-02-09 05:42:59] 2020-02-09 04:42:06,189 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | cpuconsumptiontime: 4346 s [2020-02-09 05:42:59] 2020-02-09 04:42:06,190 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | nevents: 10 [2020-02-09 05:42:59] 2020-02-09 04:42:06,190 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | neventsw: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:06,190 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | pid: 20718 [2020-02-09 05:42:59] 2020-02-09 04:42:06,190 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | pgrp: 20718 [2020-02-09 05:42:59] 2020-02-09 04:42:06,190 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | corecount: 2 [2020-02-09 05:42:59] 2020-02-09 04:42:06,190 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | event service: False [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | -------------------------------------------------- [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.auxiliary.4002876565 | make_job_report | [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue jobs has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue payloads has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_in has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.queuehandling | 
queue_report | queue data_out has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,191 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue current_data_in has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_jobs has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_payloads has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue monitored_payloads has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_jobs has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_payloads has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_in has 1 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,192 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_out has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_jobs has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_payloads has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_in has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_out has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobs has 0 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.util.queuehandling | 
queue_report | queue completed_jobids has 1 job(s) [2020-02-09 05:42:59] 2020-02-09 04:42:06,193 | INFO | retrieve | pilot.control.job.4002876565 | has_job_completed | job 4002876565 has completed (purged errors) [2020-02-09 05:42:59] 2020-02-09 04:42:06,194 | INFO | retrieve | pilot.util.processes | cleanup | overall cleanup function is called [2020-02-09 05:42:59] 2020-02-09 04:42:06,201 | DEBUG | retrieve | pilot.util.processes | cleanup | work directory was removed: /var/lib/boinc/slots/1/PanDA_Pilot-4002876565 [2020-02-09 05:42:59] 2020-02-09 04:42:07,209 | INFO | retrieve | pilot.info.jobdata | collect_zombies | --- collectZombieJob: --- 10, [20718] [2020-02-09 05:42:59] 2020-02-09 04:42:07,209 | INFO | retrieve | pilot.info.jobdata | collect_zombies | zombie collector trying to kill pid 20718 [2020-02-09 05:42:59] 2020-02-09 04:42:07,209 | INFO | retrieve | pilot.info.jobdata | collect_zombies | harmless exception when collecting zombies: [Errno 10] No child processes [2020-02-09 05:42:59] 2020-02-09 04:42:08,225 | INFO | retrieve | pilot.util.processes | cleanup | collected zombie processes [2020-02-09 05:42:59] 2020-02-09 04:42:08,225 | INFO | retrieve | pilot.util.processes | cleanup | will now attempt to kill all subprocesses of pid=20718 [2020-02-09 05:42:59] 2020-02-09 04:42:08,367 | INFO | retrieve | pilot.util.processes | kill_processes | process IDs to be killed: [20718] (in reverse order) [2020-02-09 05:42:59] 2020-02-09 04:42:08,464 | WARNING | retrieve | pilot.util.processes | kill_processes | found no corresponding commands to process id(s) [2020-02-09 05:42:59] 2020-02-09 04:42:08,464 | INFO | retrieve | pilot.util.processes | kill_orphans | Do not look for orphan processes in BOINC jobs [2020-02-09 05:42:59] 2020-02-09 04:42:08,465 | INFO | retrieve | pilot.control.job | retrieve | ready for new job [2020-02-09 05:42:59] 2020-02-09 04:42:08,465 | INFO | retrieve | root | retrieve | pilot has finished for previous job - re-establishing logging 
[2020-02-09 05:42:59] mpi4py not found [2020-02-09 05:42:59] 2020-02-09 04:42:08,482 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2020-02-09 05:42:59] 2020-02-09 04:42:08,482 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | *** PanDA Pilot version 2.3.4 (12) *** [2020-02-09 05:42:59] 2020-02-09 04:42:08,482 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2020-02-09 05:42:59] 2020-02-09 04:42:08,483 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | [2020-02-09 05:42:59] 2020-02-09 04:42:08,483 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | pilot is running in a VM [2020-02-09 05:42:59] 2020-02-09 04:42:08,483 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | architecture information: [2020-02-09 05:42:59] 2020-02-09 04:42:08,770 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | [2020-02-09 05:42:59] LSB Version: :core-4.1-amd64:core-4.1-noarch [2020-02-09 05:42:59] Distributor ID: CentOS [2020-02-09 05:42:59] Description: CentOS Linux release 7.6.1810 (Core) [2020-02-09 05:42:59] Release: 7.6.1810 [2020-02-09 05:42:59] Codename: Core [2020-02-09 05:42:59] 2020-02-09 04:42:08,770 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2020-02-09 05:42:59] 2020-02-09 04:42:09,273 | DEBUG | retrieve | pilot.util.monitoring | check_local_space | checking local space on /var/lib/boinc/slots/1 [2020-02-09 05:42:59] 2020-02-09 04:42:09,320 | INFO | retrieve | pilot.util.monitoring | check_local_space | sufficient remaining disk space (8638169088 B) [2020-02-09 05:42:59] 2020-02-09 04:42:09,320 | WARNING | retrieve | pilot.control.job | proceed_with_getjob | since timefloor is set to 0, pilot was only allowed to run one job [2020-02-09 05:42:59] 2020-02-09 04:42:09,320 | DEBUG | retrieve | pilot.control.job | 
retrieve | [job] retrieve thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:09,328 | DEBUG | data | pilot.control.data | control | data control ending since graceful_stop has been set [2020-02-09 05:42:59] 2020-02-09 04:42:09,328 | DEBUG | data | pilot.control.data | control | [data] control thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:09,352 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 15 threads [2020-02-09 05:42:59] 2020-02-09 04:42:09,353 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 139748097488704)>, <ExcThread(job, started 139747933988608)>, <ExcThread(queue_monitoring, started 139747338995456)>, <ExcThread(validate_pre, started 139747800364800)>, <ExcThread(validate_post, started 139747783579392)>, <ExcThread(job_monitor, started 139747364173568)>, <ExcThread(payload, started 139747817150208)>, <ExcThread(execute_payloads, started 139747347388160)>, <ExcThread(copytool_in, started 139747330602752)>, <ExcThread(failed_post, started 139747355780864)>, <ExcThread(copytool_out, started 139747825542912)>, <ExcThread(queue_monitor, started 139747322210048)>, <ExcThread(create_data_payload, started 139747833935616)>, <ExcThread(monitor, started 139747791972096)>, <ExcThread(validate, started 139747925595904)>] [2020-02-09 05:42:59] 2020-02-09 04:42:09,385 | INFO | monitor | pilot.control.monitor | control | [monitor] control thread has ended [2020-02-09 05:42:59] 2020-02-09 04:42:09,532 | INFO | validate_post | pilot.control.payload | validate_post | [payload] validate_post thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:09,536 | DEBUG | validate | pilot.control.job | validate | [job] validate thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:09,749 | DEBUG | payload | pilot.control.payload | control | payload control ending since graceful_stop has been set [2020-02-09 05:42:59] 2020-02-09 04:42:09,750 | DEBUG | payload | 
pilot.control.payload | control | [payload] control thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,167 | DEBUG | job | pilot.control.job | control | job control ending since graceful_stop has been set [2020-02-09 05:42:59] 2020-02-09 04:42:10,168 | DEBUG | job | pilot.control.job | control | [job] control thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,215 | DEBUG | copytool_in | pilot.control.data | copytool_in | [data] copytool_in thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,354 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 9 threads [2020-02-09 05:42:59] 2020-02-09 04:42:10,354 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 139748097488704)>, <ExcThread(queue_monitoring, started 139747338995456)>, <ExcThread(validate_pre, started 139747800364800)>, <ExcThread(job_monitor, started 139747364173568)>, <ExcThread(execute_payloads, started 139747347388160)>, <ExcThread(failed_post, started 139747355780864)>, <ExcThread(copytool_out, started 139747825542912)>, <ExcThread(queue_monitor, started 139747322210048)>, <ExcThread(create_data_payload, started 139747833935616)>] [2020-02-09 05:42:59] 2020-02-09 04:42:10,377 | INFO | execute_payloads | pilot.control.payload | execute_payloads | [payload] execute_payloads thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,387 | INFO | validate_pre | pilot.control.payload | validate_pre | [payload] validate_pre thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,390 | DEBUG | create_data_payload | pilot.control.job | create_data_payload | [job] create_data_payload thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,623 | INFO | failed_post | pilot.control.payload | failed_post | [payload] failed_post thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:10,649 | WARNING | copytool_out | pilot.util.common | should_abort | data:copytool_out:received graceful stop - abort 
after this iteration [2020-02-09 05:42:59] 2020-02-09 04:42:11,075 | WARNING | queue_monitor | pilot.util.common | should_abort | job:queue_monitor:received graceful stop - abort after this iteration [2020-02-09 05:42:59] 2020-02-09 04:42:11,076 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | [job] queue monitor thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:11,167 | WARNING | queue_monitoring | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration [2020-02-09 05:42:59] 2020-02-09 04:42:11,363 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 4 threads [2020-02-09 05:42:59] 2020-02-09 04:42:11,363 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 139748097488704)>, <ExcThread(queue_monitoring, started 139747338995456)>, <ExcThread(job_monitor, started 139747364173568)>, <ExcThread(copytool_out, started 139747825542912)>] [2020-02-09 05:42:59] 2020-02-09 04:42:11,650 | DEBUG | copytool_out | pilot.control.data | copytool_out | [data] copytool_out thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:12,369 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 3 threads [2020-02-09 05:42:59] 2020-02-09 04:42:12,369 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 139748097488704)>, <ExcThread(queue_monitoring, started 139747338995456)>, <ExcThread(job_monitor, started 139747364173568)>] [2020-02-09 05:42:59] 2020-02-09 04:42:14,176 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:14,378 | DEBUG | MainThread | pilot.workflow.generic | run | thread count now at 2 threads [2020-02-09 05:42:59] 2020-02-09 04:42:14,379 | DEBUG | MainThread | pilot.workflow.generic | run | enumerate: [<_MainThread(MainThread, started 139748097488704)>, 
<ExcThread(job_monitor, started 139747364173568)>] [2020-02-09 05:42:59] 2020-02-09 04:42:58,476 | WARNING | job_monitor | pilot.control.job | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 72 s) [2020-02-09 05:42:59] 2020-02-09 04:42:58,476 | DEBUG | job_monitor | pilot.control.job | job_monitor | [job] job monitor thread has finished [2020-02-09 05:42:59] 2020-02-09 04:42:59,146 | INFO | MainThread | pilot.workflow.generic | run | end of generic workflow (traces error code: 0) [2020-02-09 05:42:59] 2020-02-09 04:42:59,147 | INFO | MainThread | root | wrap_up | traces error code: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:59,147 | INFO | MainThread | root | wrap_up | pilot has finished [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] ==== pilot stdout END ==== [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] ==== wrapper stdout RESUME ==== [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] Pilot exit status: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] STATUSCODE: 0 [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] apfmon messages muted [2020-02-09 05:42:59] ---- find pandaID.out ---- [2020-02-09 05:42:59] total 60 [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 11357 Jul 25 2019 LICENSE [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 20 Sep 9 13:04 MANIFEST.IN [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 8 Dec 12 19:00 PILOTVERSION [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 2212 Nov 14 11:01 README.md [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 221 Jul 25 2019 TODO.md [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 11 Feb 9 04:50 pandaIDs.out [2020-02-09 05:42:59] drwx------. 14 boinc boinc 216 Feb 9 04:50 pilot [2020-02-09 05:42:59] -rwx------. 1 boinc boinc 21225 Dec 12 19:00 pilot.py [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 766 Oct 10 16:01 setup.py [2020-02-09 05:42:59] -rw-------. 
1 boinc boinc 11 Feb 9 04:50 /var/lib/boinc/slots/1/pilot2/pandaIDs.out [2020-02-09 05:42:59] 4002876565 [2020-02-09 05:42:59] [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] Test setup, not cleaning [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] ==== wrapper stdout END ==== [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] ==== wrapper stderr END ==== [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] wrapper wrapperexiting ec=0, duration=3187 [2020-02-09 05:42:59] 2020-02-09 04:42:59 UTC [wrapper] apfmon messages muted [2020-02-09 05:42:59] *** Error codes and diagnostics *** [2020-02-09 05:42:59] "exeErrorCode": 0, [2020-02-09 05:42:59] "exeErrorDiag": "", [2020-02-09 05:42:59] "pilotErrorCode": 0, [2020-02-09 05:42:59] "pilotErrorDiag": "", [2020-02-09 05:42:59] *** Listing of results directory *** [2020-02-09 05:42:59] insgesamt 379024 [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 267260 6. Feb 21:30 pilot2.tar.gz [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 4492 6. Feb 21:55 queuedata.json [2020-02-09 05:42:59] -rwx------. 1 boinc boinc 12641 6. Feb 21:57 runpilot2-wrapper.sh [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 107 9. Feb 04:49 wrapper_26015_x86_64-pc-linux-gnu [2020-02-09 05:42:59] -rwxr-xr-x. 1 boinc boinc 5573 9. Feb 04:49 run_atlas [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 112 9. Feb 04:49 job.xml [2020-02-09 05:42:59] drwxrwx--x. 2 boinc boinc 86 9. Feb 04:49 shared [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 5902 9. Feb 04:49 init_data.xml [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 0 9. Feb 04:49 boinc_lockfile [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 365251149 9. Feb 04:49 EVNT.14296418._001447.pool.root.1 [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 8513 9. Feb 04:49 start_atlas.sh [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 926 9. Feb 04:49 RTE.tar.gz [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 275418 9. Feb 04:49 input.tar.gz [2020-02-09 05:42:59] -rw-r--r--. 
1 boinc boinc 2958 9. Feb 04:49 pandaJob.out [2020-02-09 05:42:59] drwxr-xr-x. 3 boinc boinc 17 9. Feb 04:49 APPS [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 3641907 9. Feb 04:50 agis_schedconf.cvmfs.json [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 7859698 9. Feb 04:50 agis_ddmendpoints.json [2020-02-09 05:42:59] drwx------. 3 boinc boinc 229 9. Feb 04:50 pilot2 [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 535 9. Feb 05:28 boinc_task_state.xml [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 9055738 9. Feb 05:41 HITS.000649-1749162-26069._078090.pool.root.1 [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 791 9. Feb 05:41 memory_monitor_summary.json [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 516643 9. Feb 05:42 log.000649-1749162-26069._078090.job.log.tgz.1 [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 12292 9. Feb 05:42 heartbeat.json [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 8192 9. Feb 05:42 boinc_mmap_file [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 26 9. Feb 05:42 wrapper_checkpoint.txt [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 9354 9. Feb 05:42 pilotlog.txt [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 262444 9. Feb 05:42 log.000649-1749162-26069._078090.job.log.1 [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 488 9. Feb 05:42 yPuNDm7N3JwnShfckohDCDFpABFKDmABFKDmyiALDmgEFKDmqqLvWn.diag [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 499 9. Feb 05:42 output.list [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 7241 9. Feb 05:42 runtime_log.err [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 742 9. Feb 05:42 runtime_log [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 798720 9. Feb 05:42 result.tar.gz [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 31639 9. Feb 05:42 stderr.txt [2020-02-09 05:42:59] HITS file was successfully produced: [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 9055738 9. 
Feb 05:41 shared/HITS.pool.root.1 [2020-02-09 05:42:59] *** Contents of shared directory: *** [2020-02-09 05:42:59] insgesamt 366604 [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 8513 9. Feb 04:49 start_atlas.sh [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 926 9. Feb 04:49 RTE.tar.gz [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 275418 9. Feb 04:49 input.tar.gz [2020-02-09 05:42:59] -rw-r--r--. 1 boinc boinc 365251149 9. Feb 04:49 ATLAS.root_0 [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 9055738 9. Feb 05:41 HITS.pool.root.1 [2020-02-09 05:42:59] -rw-------. 1 boinc boinc 798720 9. Feb 05:42 result.tar.gz 05:43:01 (14090): run_atlas exited; CPU time 4238.647809 05:43:01 (14090): called boinc_finish(0) </stderr_txt> ]]>
©2024 CERN