Name P7XKDmEsGu4n7Olcko1bjSoqABFKDmABFKDmyeZQDmIoKKDmSZQSBo_2
Workunit 2388708
Created 14 Feb 2024, 3:36:16 UTC
Sent 14 Feb 2024, 3:53:56 UTC
Report deadline 21 Feb 2024, 3:53:56 UTC
Received 14 Feb 2024, 4:43:46 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 4911
Run time 28 min 48 sec
CPU time 10 min 25 sec
Validate state Valid
Credit 33.32
Device peak FLOPS 8.00 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu
Peak working set size 1.51 GB
Peak swap size 2.12 GB
Peak disk usage 82.33 MB

Stderr output

<core_client_version>7.16.6</core_client_version>
<![CDATA[
<stderr_txt>
22:54:35 (1332519): wrapper (7.7.26015): starting
22:54:35 (1332519): wrapper: running run_atlas (--nthreads 1)
[2024-02-13 22:54:35] Arguments: --nthreads 1
[2024-02-13 22:54:35] Threads: 1
[2024-02-13 22:54:35] Checking for CVMFS
[2024-02-13 22:54:35] Probing /cvmfs/atlas.cern.ch... OK
[2024-02-13 22:54:35] Probing /cvmfs/atlas-condb.cern.ch... OK
[2024-02-13 22:54:35] Running cvmfs_config stat atlas.cern.ch
[2024-02-13 22:54:36] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2024-02-13 22:54:36] 2.9.2.0 1282470 27 47824 129367 0 86 2552741 4194305 89 130560 0 144020 99.991 3053 764 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch DIRECT 1
[2024-02-13 22:54:36] CVMFS is ok
[2024-02-13 22:54:36] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2024-02-13 22:54:36] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2024-02-13 22:54:36] Small home clusters do not require a local http proxy but it is suggested if
[2024-02-13 22:54:36] more than 10 cores throughout the same LAN segment are regularly running ATLAS like tasks.
[2024-02-13 22:54:36] Further information can be found at the LHC@home message board.
[2024-02-13 22:54:36] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2024-02-13 22:54:36] Checking for apptainer binary...
[2024-02-13 22:54:36] apptainer is not installed, using version from CVMFS
[2024-02-13 22:54:36] Checking apptainer works with /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2024-02-13 22:54:36] WC5950
[2024-02-13 22:54:36] apptainer works
[2024-02-13 22:54:36] Starting ATLAS job with PandaID=6108682108
[2024-02-13 22:54:36] Running command: /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs,/var/lib/boinc-client/slots/4 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2024-02-13 23:23:21]  *** The last 200 lines of the pilot log: ***
[2024-02-13 23:23:21] 2024-02-14 04:22:17,964 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:20,467 | INFO     | monitor loop #224: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:20,467 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:22,968 | INFO     | monitor loop #225: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:22,968 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:25,476 | INFO     | monitor loop #226: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:25,476 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:27,982 | INFO     | monitor loop #227: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:27,983 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:30,486 | INFO     | monitor loop #228: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:30,486 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:32,992 | INFO     | monitor loop #229: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:32,992 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:35,460 | INFO     | 1665s have passed since pilot start
[2024-02-13 23:23:21] 2024-02-14 04:22:35,498 | INFO     | monitor loop #230: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:35,498 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:38,000 | INFO     | monitor loop #231: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:38,000 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:40,503 | INFO     | monitor loop #232: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:40,503 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:43,010 | INFO     | monitor loop #233: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:43,010 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:45,512 | INFO     | monitor loop #234: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:45,512 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:48,024 | INFO     | monitor loop #235: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:48,024 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:50,532 | INFO     | monitor loop #236: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:50,532 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:52,284 | INFO     | [attempt=3/3] loading data from url=https://atlas-cric.cern.ch/cache/ddmendpoints.json
[2024-02-13 23:23:21] 2024-02-14 04:22:53,035 | INFO     | monitor loop #237: job 0:6108682108 is in state 'stageout'
[2024-02-13 23:23:21] 2024-02-14 04:22:53,035 | WARNING  | aborting job monitor tasks since payload process 1343628 is not running
[2024-02-13 23:23:21] 2024-02-14 04:22:53,449 | WARNING  | failed to load data from url=https://atlas-cric.cern.ch/cache/ddmendpoints.json, error: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: 
[2024-02-13 23:23:21] 2024-02-14 04:22:53,449 | WARNING  | cache file=/var/lib/boinc-client/slots/4/agis_ddmendpoints.agis.ALL.json is not available: [Errno 2] No such file or directory: '/var/lib/boinc-client/slots/4/agis
[2024-02-13 23:23:21] 2024-02-14 04:22:53,494 | INFO     | transferring file 6461c328-266a-45f9-aa90-9062f28df712_64799.1.job.log.tgz from /var/lib/boinc-client/slots/4/PanDA_Pilot-6108682108/6461c328-266a-45f9-aa90-9062f2
[2024-02-13 23:23:21] 2024-02-14 04:22:53,494 | INFO     | executing command: /usr/bin/env mv /var/lib/boinc-client/slots/4/PanDA_Pilot-6108682108/6461c328-266a-45f9-aa90-9062f28df712_64799.1.job.log.tgz /var/lib/boinc-cli
[2024-02-13 23:23:21] 2024-02-14 04:22:53,515 | INFO     | Adding to output.list: 6461c328-266a-45f9-aa90-9062f28df712_64799.1.job.log.tgz davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/hc_test/8a/aa/6461c328-266a-
[2024-02-13 23:23:21] 2024-02-14 04:22:53,516 | INFO     | summary of transferred files:
[2024-02-13 23:23:21] 2024-02-14 04:22:53,516 | INFO     |  -- lfn=6461c328-266a-45f9-aa90-9062f28df712_64799.1.job.log.tgz, status_code=0, status=transferred
[2024-02-13 23:23:21] 2024-02-14 04:22:53,516 | INFO     | stage-out finished correctly
[2024-02-13 23:23:21] 2024-02-14 04:22:55,546 | INFO     | monitor loop #238: job 0:6108682108 is in state 'finished'
[2024-02-13 23:23:21] 2024-02-14 04:22:55,547 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-02-13 23:23:21] 2024-02-14 04:22:55,639 | INFO     | finished stage-out for finished payload, adding job to finished_jobs queue
[2024-02-13 23:23:21] 2024-02-14 04:22:55,702 | INFO     | job 6108682108 has state=finished
[2024-02-13 23:23:21] 2024-02-14 04:22:55,703 | INFO     | preparing for final server update for job 6108682108 in state='finished'
[2024-02-13 23:23:21] 2024-02-14 04:22:55,703 | INFO     | this job has now completed (state=finished)
[2024-02-13 23:23:21] 2024-02-14 04:22:55,703 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2024-02-13 23:23:21] 2024-02-14 04:22:55,703 | INFO     | job 6108682108 has finished - writing final server update
[2024-02-13 23:23:21] 2024-02-14 04:22:55,703 | INFO     | total number of processed events: 2 (read)
[2024-02-13 23:23:21] 2024-02-14 04:22:55,724 | INFO     | executing command: lscpu
[2024-02-13 23:23:21] 2024-02-14 04:22:55,749 | INFO     | found 16 cores (16 cores per socket, 1 sockets)
[2024-02-13 23:23:21] 2024-02-14 04:22:55,749 | INFO     | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo
[2024-02-13 23:23:21] 2024-02-14 04:22:55,792 | INFO     | executing command: export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase;source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh --quiet;lsetup
[2024-02-13 23:23:21] 2024-02-14 04:22:58,048 | INFO     | monitor loop #239: job 0:6108682108 is in state 'finished'
[2024-02-13 23:23:21] 2024-02-14 04:22:58,052 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-02-13 23:23:21] 2024-02-14 04:22:59,576 | INFO     | CPU arch script returned: x86-64-v3
[2024-02-13 23:23:21] 2024-02-14 04:22:59,576 | INFO     | using path: /var/lib/boinc-client/slots/4/PanDA_Pilot-6108682108/memory_monitor_summary.json (trf name=prmon)
[2024-02-13 23:23:21] 2024-02-14 04:22:59,577 | INFO     | extracted standard info from prmon json
[2024-02-13 23:23:21] 2024-02-14 04:22:59,577 | INFO     | extracted standard memory fields from prmon json
[2024-02-13 23:23:21] 2024-02-14 04:22:59,577 | INFO     | fitting pss+swap vs Time
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | current memory leak: 1542.71 B/s (using 11 data points, chi2=0.08)
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | ..............................
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . Timing measurements:
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . get job = 63 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . initial setup = 69 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . payload setup = 7 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . stage-in = 62 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . payload execution = 1098 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . stage-out = 362 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | . log creation = 0 s
[2024-02-13 23:23:21] 2024-02-14 04:22:59,578 | INFO     | ..............................
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | 
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | job summary report
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | --------------------------------------------------
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | PanDA job id: 6108682108
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | task id: NULL
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | errors: (none)
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | status: LOG_TRANSFER = DONE 
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | pilot state: finished 
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | transexitcode: 0
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | exeerrorcode: 0
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | exeerrordiag: 
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | exitcode: 0
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | exitmsg: OK
[2024-02-13 23:23:21] 2024-02-14 04:23:00,060 | INFO     | cpuconsumptiontime: 629 s
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | nevents: 2
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | neventsw: 0
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | pid: 1343628
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | pgrp: 1343628
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | corecount: 1
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | event service: False
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | sizes: {0: 2407163, 11: 2407309, 22: 2407337, 32: 2407365, 43: 2407393, 53: 2407421, 64: 2407577, 67: 2407693, 75: 2407721, 1109: 2429182, 1472: 2438246, 1474: 243
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | --------------------------------------------------
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | 
[2024-02-13 23:23:21] 2024-02-14 04:23:00,061 | INFO     | executing command: ls -lF /var/lib/boinc-client/slots/4
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue jobs had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue payloads had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue data_in had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue data_out had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue current_data_in had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,081 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | queue completed_jobids has 1 job(s)
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | queue messages had 0 job(s) [purged]
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | job 6108682108 has completed (purged errors)
[2024-02-13 23:23:21] 2024-02-14 04:23:00,082 | INFO     | overall cleanup function is called
[2024-02-13 23:23:21] 2024-02-14 04:23:01,088 | INFO     | --- collectZombieJob: --- 10, [1343628]
[2024-02-13 23:23:21] 2024-02-14 04:23:01,088 | INFO     | zombie collector waiting for pid 1343628
[2024-02-13 23:23:21] 2024-02-14 04:23:01,088 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2024-02-13 23:23:21] 2024-02-14 04:23:02,092 | INFO     | collected zombie processes
[2024-02-13 23:23:21] 2024-02-14 04:23:02,092 | INFO     | will now attempt to kill all subprocesses of pid=1343628
[2024-02-13 23:23:21] 2024-02-14 04:23:02,155 | INFO     | process IDs to be killed: [1343628] (in reverse order)
[2024-02-13 23:23:21] 2024-02-14 04:23:02,207 | WARNING  | found no corresponding commands to process id(s)
[2024-02-13 23:23:21] 2024-02-14 04:23:02,208 | INFO     | Do not look for orphan processes in BOINC jobs
[2024-02-13 23:23:21] 2024-02-14 04:23:02,210 | INFO     | did not find any defunct processes belonging to 1343628
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | did not find any defunct processes belonging to 1343628
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | ready for new job
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | pilot has finished with previous job - re-establishing logging
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | **************************************
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | ***  PanDA Pilot version 3.7.0.36  ***
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | **************************************
[2024-02-13 23:23:21] 2024-02-14 04:23:02,213 | INFO     | 
[2024-02-13 23:23:21] 2024-02-14 04:23:02,237 | INFO     | architecture information:
[2024-02-13 23:23:21] 2024-02-14 04:23:02,237 | INFO     | executing command: cat /etc/os-release
[2024-02-13 23:23:21] 2024-02-14 04:23:02,252 | INFO     | cat /etc/os-release:
[2024-02-13 23:23:21] NAME="CentOS Linux"
[2024-02-13 23:23:21] VERSION="7 (Core)"
[2024-02-13 23:23:21] ID="centos"
[2024-02-13 23:23:21] ID_LIKE="rhel fedora"
[2024-02-13 23:23:21] VERSION_ID="7"
[2024-02-13 23:23:21] PRETTY_NAME="CentOS Linux 7 (Core)"
[2024-02-13 23:23:21] ANSI_COLOR="0;31"
[2024-02-13 23:23:21] CPE_NAME="cpe:/o:centos:centos:7"
[2024-02-13 23:23:21] HOME_URL="https://www.centos.org/"
[2024-02-13 23:23:21] BUG_REPORT_URL="https://bugs.centos.org/"
[2024-02-13 23:23:21] 
[2024-02-13 23:23:21] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2024-02-13 23:23:21] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2024-02-13 23:23:21] REDHAT_SUPPORT_PRODUCT="centos"
[2024-02-13 23:23:21] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2024-02-13 23:23:21] 
[2024-02-13 23:23:21] 2024-02-14 04:23:02,252 | INFO     | **************************************
[2024-02-13 23:23:21] 2024-02-14 04:23:02,755 | INFO     | executing command: df -mP /var/lib/boinc-client/slots/4
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | INFO     | sufficient remaining disk space (368745381888 B)
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | WARNING  | aborting monitor loop since graceful_stop has been set (timing out remaining threads)
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | INFO     | found 0 job(s) in 20 queues
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2024-02-13 23:23:21] 2024-02-14 04:23:02,782 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2024-02-13 23:23:21] 2024-02-14 04:23:03,058 | WARNING  | job monitor detected an abort_job request (signal=args.signal)
[2024-02-13 23:23:21] 2024-02-14 04:23:03,059 | WARNING  | cannot recover job monitoring - aborting pilot
[2024-02-13 23:23:21] 2024-02-14 04:23:03,059 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2024-02-13 23:23:21] 2024-02-14 04:23:03,059 | INFO     | will abort loop
[2024-02-13 23:23:21] 2024-02-14 04:23:03,784 | INFO     | [job] retrieve thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,020 | INFO     | [job] create_data_payload thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,060 | INFO     | [job] job monitor thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,112 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2024-02-13 23:23:21] 2024-02-14 04:23:04,228 | INFO     | [payload] validate_post thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,348 | INFO     | [data] copytool_in thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,380 | INFO     | [job] control thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,392 | INFO     | [payload] control thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,408 | INFO     | [payload] validate_pre thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,540 | INFO     | [payload] failed_post thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,648 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2024-02-13 23:23:21] 2024-02-14 04:23:04,756 | INFO     | [payload] execute_payloads thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:04,832 | INFO     | [data] control thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:05,172 | INFO     | [job] validate thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:05,652 | INFO     | [job] queue monitor thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:06,120 | INFO     | [data] copytool_out thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:06,215 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2024-02-13 23:23:21] 2024-02-14 04:23:10,228 | INFO     | [data] queue_monitor thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:13,396 | INFO     | job.realtimelogging is not enabled
[2024-02-13 23:23:21] 2024-02-14 04:23:14,404 | INFO     | [payload] run_realtimelog thread has finished
[2024-02-13 23:23:21] 2024-02-14 04:23:16,180 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 139939421607744)>', '<ExcThread(monitor, started 139938406323968)>']
[2024-02-13 23:23:21] 2024-02-14 04:23:16,856 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2024-02-13 23:23:21] 2024-02-14 04:23:16,856 | INFO     | [monitor] control thread has ended
[2024-02-13 23:23:21] 2024-02-14 04:23:21,204 | INFO     | end of generic workflow (traces error code: 0)
[2024-02-13 23:23:21] 2024-02-14 04:23:21,204 | INFO     | traces error code: 0
[2024-02-13 23:23:21] 2024-02-14 04:23:21,204 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2024-02-13 23:23:21] 2024-02-14 04:23:21,262 [wrapper] ==== pilot stdout END ====
[2024-02-13 23:23:21] 2024-02-14 04:23:21,273 [wrapper] ==== wrapper stdout RESUME ====
[2024-02-13 23:23:21] 2024-02-14 04:23:21,278 [wrapper] pilotpid: 1336273
[2024-02-13 23:23:21] 2024-02-14 04:23:21,284 [wrapper] Pilot exit status: 0
[2024-02-13 23:23:21] 2024-02-14 04:23:21,309 [wrapper] pandaids: 6108682108
[2024-02-13 23:23:21] 2024-02-14 04:23:21,369 [wrapper] cleanup: SIGTERM to supervisor_pilot 1351243 1336274
[2024-02-13 23:23:21] 2024-02-14 04:23:21,385 [wrapper] Test setup, not cleaning
[2024-02-13 23:23:21] 2024-02-14 04:23:21,401 [wrapper] ==== wrapper stdout END ====
[2024-02-13 23:23:21] 2024-02-14 04:23:21,409 [wrapper] ==== wrapper stderr END ====
[2024-02-13 23:23:21] 2024-02-14 04:23:21,436 [wrapper] apfmon messages muted
[2024-02-13 23:23:21]  *** Error codes and diagnostics ***
[2024-02-13 23:23:21]     "exeErrorCode": 0,
[2024-02-13 23:23:21]     "exeErrorDiag": "",
[2024-02-13 23:23:21]     "pilotErrorCode": 0,
[2024-02-13 23:23:21]     "pilotErrorDiag": "",
[2024-02-13 23:23:21]  *** Listing of results directory ***
[2024-02-13 23:23:21] total 46484
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc   441289 Feb 13 17:38 pilot3.tar.gz
[2024-02-13 23:23:21] -rwx------ 1 boinc boinc    31345 Feb 13 17:56 runpilot2-wrapper.sh
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc     4388 Feb 13 17:56 queuedata.json
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc      107 Feb 13 22:54 wrapper_26015_x86_64-pc-linux-gnu
[2024-02-13 23:23:21] -rwxr-xr-x 1 boinc boinc     7986 Feb 13 22:54 run_atlas
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc      112 Feb 13 22:54 job.xml
[2024-02-13 23:23:21] -rw-r--r-- 2 boinc boinc    17618 Feb 13 22:54 start_atlas.sh
[2024-02-13 23:23:21] drwxrwx--x 2 boinc boinc     4096 Feb 13 22:54 shared
[2024-02-13 23:23:21] -rw-r--r-- 2 boinc boinc   453816 Feb 13 22:54 input.tar.gz
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc     9512 Feb 13 22:54 init_data.xml
[2024-02-13 23:23:21] -rw-r--r-- 2 boinc boinc 37595586 Feb 13 22:54 EVNT.04972714._000034.pool.root.1
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc        0 Feb 13 22:54 boinc_lockfile
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc     2751 Feb 13 22:54 pandaJob.out
[2024-02-13 23:23:21] -rw------- 1 boinc boinc      424 Feb 13 22:54 setup.sh.local
[2024-02-13 23:23:21] -rw------- 1 boinc boinc  1341875 Feb 13 22:55 cric_ddmendpoints.json
[2024-02-13 23:23:21] -rw------- 1 boinc boinc   960914 Feb 13 22:56 agis_schedconf.cvmfs.json
[2024-02-13 23:23:21] drwx------ 4 boinc boinc     4096 Feb 13 22:57 pilot3
[2024-02-13 23:23:21] -rw------- 1 boinc boinc  2930643 Feb 13 23:16 output.1.6461c328-266a-45f9-aa90-9062f28df712_64799.pool.root
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc      533 Feb 13 23:16 boinc_task_state.xml
[2024-02-13 23:23:21] -rw------- 1 boinc boinc     1005 Feb 13 23:16 memory_monitor_summary.json
[2024-02-13 23:23:21] -rw------- 1 boinc boinc   159710 Feb 13 23:19 6461c328-266a-45f9-aa90-9062f28df712_64799.1.job.log.tgz
[2024-02-13 23:23:21] -rw------- 1 boinc boinc     7725 Feb 13 23:22 heartbeat.json
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc       25 Feb 13 23:23 wrapper_checkpoint.txt
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc     8192 Feb 13 23:23 boinc_mmap_file
[2024-02-13 23:23:21] -rw------- 1 boinc boinc     4290 Feb 13 23:23 pilotlog.txt
[2024-02-13 23:23:21] -rw------- 1 boinc boinc   200575 Feb 13 23:23 6461c328-266a-45f9-aa90-9062f28df712_64799.1.job.log
[2024-02-13 23:23:21] -rw------- 1 boinc boinc      464 Feb 13 23:23 output.list
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc      748 Feb 13 23:23 runtime_log
[2024-02-13 23:23:21] -rw------- 1 boinc boinc  3307520 Feb 13 23:23 result.tar.gz
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc    11951 Feb 13 23:23 runtime_log.err
[2024-02-13 23:23:21] -rw------- 1 boinc boinc      625 Feb 13 23:23 P7XKDmEsGu4n7Olcko1bjSoqABFKDmABFKDmyeZQDmIoKKDmSZQSBo.diag
[2024-02-13 23:23:21] -rw-r--r-- 1 boinc boinc    23268 Feb 13 23:23 stderr.txt
[2024-02-13 23:23:21] HITS file was successfully produced:
[2024-02-13 23:23:21] -rw------- 1 boinc boinc 2930643 Feb 13 23:16 shared/HITS.pool.root.1
[2024-02-13 23:23:21]  *** Contents of shared directory: ***
[2024-02-13 23:23:21] total 43276
[2024-02-13 23:23:21] -rw-r--r-- 2 boinc boinc    17618 Feb 13 22:54 start_atlas.sh
[2024-02-13 23:23:21] -rw-r--r-- 2 boinc boinc   453816 Feb 13 22:54 input.tar.gz
[2024-02-13 23:23:21] -rw-r--r-- 2 boinc boinc 37595586 Feb 13 22:54 ATLAS.root_0
[2024-02-13 23:23:21] -rw------- 1 boinc boinc  2930643 Feb 13 23:16 HITS.pool.root.1
[2024-02-13 23:23:21] -rw------- 1 boinc boinc  3307520 Feb 13 23:23 result.tar.gz
23:23:23 (1332519): run_atlas exited; CPU time 625.965310
23:23:23 (1332519): called boinc_finish(0)

</stderr_txt>
]]>

