Name fDJMDmrgam4n7Olcko1bjSoqABFKDmABFKDmyeZQDm3uJKDmhO3jfn_2
Workunit 2378887
Created 23 Jan 2024, 20:11:06 UTC
Sent 23 Jan 2024, 20:16:10 UTC
Report deadline 30 Jan 2024, 20:16:10 UTC
Received 23 Jan 2024, 20:54:20 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 4889
Run time 27 min 22 sec
CPU time 7 min 12 sec
Validate state Valid
Credit 32.01
Device peak FLOPS 31.99 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 1.56 GB
Peak swap size 2.12 GB
Peak disk usage 80.02 MB

Stderr output

<core_client_version>7.18.1</core_client_version>
<![CDATA[
<stderr_txt>
15:16:39 (3116442): wrapper (7.7.26015): starting
15:16:39 (3116442): wrapper: running run_atlas (--nthreads 4)
[2024-01-23 15:16:39] Arguments: --nthreads 4
[2024-01-23 15:16:39] Threads: 4
[2024-01-23 15:16:39] Checking for CVMFS
[2024-01-23 15:16:41] Probing /cvmfs/atlas.cern.ch... OK
[2024-01-23 15:16:43] Probing /cvmfs/atlas-condb.cern.ch... OK
[2024-01-23 15:16:43] Running cvmfs_config stat atlas.cern.ch
[2024-01-23 15:16:45] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2024-01-23 15:16:45] 2.11.1.0 81175 5318 63968 128526 3 1 3098346 4194305 336 130560 0 1872965 98.457 3183508 1174 http://s1fnal-cvmfs.openhtc.io:8080/cvmfs/atlas.cern.ch DIRECT 1
[2024-01-23 15:16:45] CVMFS is ok
[2024-01-23 15:16:45] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2024-01-23 15:16:45] Small home clusters do not require a local http proxy but it is suggested if
[2024-01-23 15:16:45] more than 10 cores throughout the same LAN segment are regularly running ATLAS like tasks.
[2024-01-23 15:16:45] Further information can be found at the LHC@home message board.
[2024-01-23 15:16:45] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2024-01-23 15:16:45] Checking for apptainer binary...
[2024-01-23 15:16:45] apptainer is not installed, using version from CVMFS
[2024-01-23 15:16:45] Checking apptainer works with /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2024-01-23 15:16:48] 3070Ti
[2024-01-23 15:16:48] apptainer works
[2024-01-23 15:16:48] Set ATHENA_PROC_NUMBER=4
[2024-01-23 15:16:48] Set ATHENA_CORE_NUMBER=4
[2024-01-23 15:16:48] Starting ATLAS job with PandaID=6086221707
[2024-01-23 15:16:48] Running command: /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs,/var/lib/boinc-client/slots/5 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2024-01-23 15:43:58]  *** The last 200 lines of the pilot log: ***
[2024-01-23 15:43:58] 2024-01-23 20:42:55,418 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:42:57,921 | INFO     | monitor loop #249: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:42:57,922 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:00,430 | INFO     | monitor loop #250: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:00,430 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:02,939 | INFO     | monitor loop #251: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:02,939 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:05,444 | INFO     | monitor loop #252: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:05,444 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:07,949 | INFO     | monitor loop #253: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:07,950 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:10,457 | INFO     | monitor loop #254: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:10,458 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:12,966 | INFO     | monitor loop #255: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:12,966 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:15,476 | INFO     | monitor loop #256: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:15,476 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:17,981 | INFO     | monitor loop #257: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:17,982 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:20,485 | INFO     | monitor loop #258: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:20,486 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:22,996 | INFO     | monitor loop #259: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:22,996 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:25,030 | INFO     | [attempt=3/3] loading data from url=https://atlas-cric.cern.ch/cache/ddmendpoints.json
[2024-01-23 15:43:58] 2024-01-23 20:43:25,507 | INFO     | monitor loop #260: job 0:6086221707 is in state 'stageout'
[2024-01-23 15:43:58] 2024-01-23 20:43:25,507 | WARNING  | aborting job monitor tasks since payload process 3129893 is not running
[2024-01-23 15:43:58] 2024-01-23 20:43:25,614 | WARNING  | failed to load data from url=https://atlas-cric.cern.ch/cache/ddmendpoints.json, error: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: 
[2024-01-23 15:43:58] 2024-01-23 20:43:25,614 | WARNING  | cache file=/var/lib/boinc-client/slots/5/agis_ddmendpoints.agis.ALL.json is not available: [Errno 2] No such file or directory: '/var/lib/boinc-client/slots/5/agis
[2024-01-23 15:43:58] 2024-01-23 20:43:25,659 | INFO     | transferring file 116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.job.log.tgz from /var/lib/boinc-client/slots/5/PanDA_Pilot-6086221707/116b7ff2-b2d2-49de-9c18-5f3c2fff
[2024-01-23 15:43:58] 2024-01-23 20:43:25,660 | INFO     | executing command: /usr/bin/env mv /var/lib/boinc-client/slots/5/PanDA_Pilot-6086221707/116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.job.log.tgz /var/lib/boinc-clien
[2024-01-23 15:43:58] 2024-01-23 20:43:25,680 | INFO     | Adding to output.list: 116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.job.log.tgz davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/hc_test/47/fd/116b7ff2-b2d2-49
[2024-01-23 15:43:58] 2024-01-23 20:43:25,681 | INFO     | summary of transferred files:
[2024-01-23 15:43:58] 2024-01-23 20:43:25,681 | INFO     |  -- lfn=116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.job.log.tgz, status_code=0, status=transferred
[2024-01-23 15:43:58] 2024-01-23 20:43:25,681 | INFO     | stage-out finished correctly
[2024-01-23 15:43:58] 2024-01-23 20:43:27,125 | INFO     | finished stage-out for finished payload, adding job to finished_jobs queue
[2024-01-23 15:43:58] 2024-01-23 20:43:28,021 | INFO     | monitor loop #261: job 0:6086221707 is in state 'finished'
[2024-01-23 15:43:58] 2024-01-23 20:43:28,021 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-01-23 15:43:58] 2024-01-23 20:43:29,866 | INFO     | job 6086221707 has state=finished
[2024-01-23 15:43:58] 2024-01-23 20:43:29,866 | INFO     | preparing for final server update for job 6086221707 in state='finished'
[2024-01-23 15:43:58] 2024-01-23 20:43:29,866 | INFO     | this job has now completed (state=finished)
[2024-01-23 15:43:58] 2024-01-23 20:43:29,866 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2024-01-23 15:43:58] 2024-01-23 20:43:29,866 | INFO     | job 6086221707 has finished - writing final server update
[2024-01-23 15:43:58] 2024-01-23 20:43:29,867 | INFO     | total number of processed events: 2 (read)
[2024-01-23 15:43:58] 2024-01-23 20:43:29,885 | INFO     | executing command: lscpu
[2024-01-23 15:43:58] 2024-01-23 20:43:29,924 | INFO     | found 16 cores (16 cores per socket, 1 sockets)
[2024-01-23 15:43:58] 2024-01-23 20:43:29,924 | INFO     | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo
[2024-01-23 15:43:58] 2024-01-23 20:43:29,960 | INFO     | executing command: export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase;source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh --quiet;lsetup
[2024-01-23 15:43:58] 2024-01-23 20:43:30,526 | INFO     | monitor loop #262: job 0:6086221707 is in state 'finished'
[2024-01-23 15:43:58] 2024-01-23 20:43:30,526 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-01-23 15:43:58] 2024-01-23 20:43:33,032 | INFO     | monitor loop #263: job 0:6086221707 is in state 'finished'
[2024-01-23 15:43:58] 2024-01-23 20:43:33,032 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-01-23 15:43:58] 2024-01-23 20:43:35,535 | INFO     | monitor loop #264: job 0:6086221707 is in state 'finished'
[2024-01-23 15:43:58] 2024-01-23 20:43:35,535 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-01-23 15:43:58] 2024-01-23 20:43:37,550 | INFO     | CPU arch script returned: x86-64-v3
[2024-01-23 15:43:58] 2024-01-23 20:43:37,551 | INFO     | using path: /var/lib/boinc-client/slots/5/PanDA_Pilot-6086221707/memory_monitor_summary.json (trf name=prmon)
[2024-01-23 15:43:58] 2024-01-23 20:43:37,551 | INFO     | extracted standard info from prmon json
[2024-01-23 15:43:58] 2024-01-23 20:43:37,552 | INFO     | extracted standard memory fields from prmon json
[2024-01-23 15:43:58] 2024-01-23 20:43:37,552 | WARNING  | wrong length of table data, x=[1706041596.0, 1706041657.0, 1706041718.0, 1706041779.0, 1706041840.0, 1706041901.0, 1706041962.0], y=[990501.0, 1089255.0, 1190894.0
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | ..............................
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . Timing measurements:
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . get job = 73 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . initial setup = 61 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . payload setup = 11 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . stage-in = 82 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . payload execution = 854 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . stage-out = 463 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | . log creation = 0 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,553 | INFO     | ..............................
[2024-01-23 15:43:58] 2024-01-23 20:43:37,635 | INFO     | 
[2024-01-23 15:43:58] 2024-01-23 20:43:37,635 | INFO     | job summary report
[2024-01-23 15:43:58] 2024-01-23 20:43:37,635 | INFO     | --------------------------------------------------
[2024-01-23 15:43:58] 2024-01-23 20:43:37,635 | INFO     | PanDA job id: 6086221707
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | task id: NULL
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | errors: (none)
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | status: LOG_TRANSFER = DONE 
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | pilot state: finished 
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | transexitcode: 0
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | exeerrorcode: 0
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | exeerrordiag: 
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | exitcode: 0
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | exitmsg: OK
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | cpuconsumptiontime: 489 s
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | nevents: 2
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | neventsw: 0
[2024-01-23 15:43:58] 2024-01-23 20:43:37,636 | INFO     | pid: 3129893
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | pgrp: 3129893
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | corecount: 4
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | event service: False
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | sizes: {0: 2399238, 11: 2399384, 21: 2399412, 32: 2399440, 43: 2399468, 53: 2399496, 64: 2399652, 75: 2399708, 85: 2399736, 88: 2399824, 96: 2399852, 869: 2421154,
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | --------------------------------------------------
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | 
[2024-01-23 15:43:58] 2024-01-23 20:43:37,637 | INFO     | executing command: ls -lF /var/lib/boinc-client/slots/5
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue jobs had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue payloads had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue data_in had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue data_out had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue current_data_in had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,670 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue completed_jobids has 1 job(s)
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,671 | INFO     | queue messages had 0 job(s) [purged]
[2024-01-23 15:43:58] 2024-01-23 20:43:37,672 | INFO     | job 6086221707 has completed (purged errors)
[2024-01-23 15:43:58] 2024-01-23 20:43:37,672 | INFO     | overall cleanup function is called
[2024-01-23 15:43:58] 2024-01-23 20:43:38,681 | INFO     | --- collectZombieJob: --- 10, [3129893]
[2024-01-23 15:43:58] 2024-01-23 20:43:38,682 | INFO     | zombie collector waiting for pid 3129893
[2024-01-23 15:43:58] 2024-01-23 20:43:38,682 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2024-01-23 15:43:58] 2024-01-23 20:43:39,689 | INFO     | collected zombie processes
[2024-01-23 15:43:58] 2024-01-23 20:43:39,690 | INFO     | will now attempt to kill all subprocesses of pid=3129893
[2024-01-23 15:43:58] 2024-01-23 20:43:39,801 | INFO     | process IDs to be killed: [3129893] (in reverse order)
[2024-01-23 15:43:58] 2024-01-23 20:43:39,911 | WARNING  | found no corresponding commands to process id(s)
[2024-01-23 15:43:58] 2024-01-23 20:43:39,911 | INFO     | Do not look for orphan processes in BOINC jobs
[2024-01-23 15:43:58] 2024-01-23 20:43:39,917 | INFO     | did not find any defunct processes belonging to 3129893
[2024-01-23 15:43:58] 2024-01-23 20:43:39,922 | INFO     | did not find any defunct processes belonging to 3129893
[2024-01-23 15:43:58] 2024-01-23 20:43:39,922 | INFO     | ready for new job
[2024-01-23 15:43:58] 2024-01-23 20:43:39,923 | INFO     | pilot has finished with previous job - re-establishing logging
[2024-01-23 15:43:58] 2024-01-23 20:43:39,924 | INFO     | **************************************
[2024-01-23 15:43:58] 2024-01-23 20:43:39,924 | INFO     | ***  PanDA Pilot version 3.7.0.36  ***
[2024-01-23 15:43:58] 2024-01-23 20:43:39,924 | INFO     | **************************************
[2024-01-23 15:43:58] 2024-01-23 20:43:39,924 | INFO     | 
[2024-01-23 15:43:58] 2024-01-23 20:43:39,926 | INFO     | architecture information:
[2024-01-23 15:43:58] 2024-01-23 20:43:39,926 | INFO     | executing command: cat /etc/os-release
[2024-01-23 15:43:58] 2024-01-23 20:43:39,942 | INFO     | cat /etc/os-release:
[2024-01-23 15:43:58] NAME="CentOS Linux"
[2024-01-23 15:43:58] VERSION="7 (Core)"
[2024-01-23 15:43:58] ID="centos"
[2024-01-23 15:43:58] ID_LIKE="rhel fedora"
[2024-01-23 15:43:58] VERSION_ID="7"
[2024-01-23 15:43:58] PRETTY_NAME="CentOS Linux 7 (Core)"
[2024-01-23 15:43:58] ANSI_COLOR="0;31"
[2024-01-23 15:43:58] CPE_NAME="cpe:/o:centos:centos:7"
[2024-01-23 15:43:58] HOME_URL="https://www.centos.org/"
[2024-01-23 15:43:58] BUG_REPORT_URL="https://bugs.centos.org/"
[2024-01-23 15:43:58] 
[2024-01-23 15:43:58] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2024-01-23 15:43:58] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2024-01-23 15:43:58] REDHAT_SUPPORT_PRODUCT="centos"
[2024-01-23 15:43:58] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2024-01-23 15:43:58] 
[2024-01-23 15:43:58] 2024-01-23 20:43:39,942 | INFO     | **************************************
[2024-01-23 15:43:58] 2024-01-23 20:43:40,449 | INFO     | executing command: df -mP /var/lib/boinc-client/slots/5
[2024-01-23 15:43:58] 2024-01-23 20:43:40,475 | INFO     | sufficient remaining disk space (1816493293568 B)
[2024-01-23 15:43:58] 2024-01-23 20:43:40,475 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2024-01-23 15:43:58] 2024-01-23 20:43:40,475 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2024-01-23 15:43:58] 2024-01-23 20:43:40,549 | WARNING  | job monitor detected an abort_job request (signal=args.signal)
[2024-01-23 15:43:58] 2024-01-23 20:43:40,550 | WARNING  | cannot recover job monitoring - aborting pilot
[2024-01-23 15:43:58] 2024-01-23 20:43:40,550 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2024-01-23 15:43:58] 2024-01-23 20:43:40,550 | INFO     | will abort loop
[2024-01-23 15:43:58] 2024-01-23 20:43:41,166 | INFO     | found 0 job(s) in 20 queues
[2024-01-23 15:43:58] 2024-01-23 20:43:41,166 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2024-01-23 15:43:58] 2024-01-23 20:43:41,166 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2024-01-23 15:43:58] 2024-01-23 20:43:41,275 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2024-01-23 15:43:58] 2024-01-23 20:43:41,482 | INFO     | [job] retrieve thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:41,554 | INFO     | [job] job monitor thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:41,738 | INFO     | [job] create_data_payload thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:41,854 | INFO     | [payload] execute_payloads thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:41,964 | INFO     | [payload] validate_pre thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,202 | INFO     | [data] copytool_in thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,205 | INFO     | [payload] validate_post thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,237 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2024-01-23 15:43:58] 2024-01-23 20:43:42,282 | INFO     | [payload] control thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,477 | INFO     | [data] control thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,648 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2024-01-23 15:43:58] 2024-01-23 20:43:42,662 | INFO     | [job] control thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,679 | INFO     | [payload] failed_post thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:42,986 | INFO     | [job] validate thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:43,279 | INFO     | [data] copytool_out thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:43,654 | INFO     | [job] queue monitor thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:46,249 | INFO     | [data] queue_monitor thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:50,861 | INFO     | job.realtimelogging is not enabled
[2024-01-23 15:43:58] 2024-01-23 20:43:51,865 | INFO     | [payload] run_realtimelog thread has finished
[2024-01-23 15:43:58] 2024-01-23 20:43:53,541 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 139767588448064)>', '<ExcThread(monitor, started 139766490199808)>']
[2024-01-23 15:43:58] 2024-01-23 20:43:54,233 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2024-01-23 15:43:58] 2024-01-23 20:43:54,233 | INFO     | [monitor] control thread has ended
[2024-01-23 15:43:58] 2024-01-23 20:43:58,567 | INFO     | end of generic workflow (traces error code: 0)
[2024-01-23 15:43:58] 2024-01-23 20:43:58,568 | INFO     | traces error code: 0
[2024-01-23 15:43:58] 2024-01-23 20:43:58,568 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2024-01-23 15:43:58] 2024-01-23 20:43:58,639 [wrapper] ==== pilot stdout END ====
[2024-01-23 15:43:58] 2024-01-23 20:43:58,641 [wrapper] ==== wrapper stdout RESUME ====
[2024-01-23 15:43:58] 2024-01-23 20:43:58,644 [wrapper] pilotpid: 3120781
[2024-01-23 15:43:58] 2024-01-23 20:43:58,647 [wrapper] Pilot exit status: 0
[2024-01-23 15:43:58] 2024-01-23 20:43:58,658 [wrapper] pandaids: 6086221707
[2024-01-23 15:43:58] 2024-01-23 20:43:58,662 [wrapper] Sending SIGTERM to SUPERVISOR_PID=3120782
[2024-01-23 15:43:58] 2024-01-23 20:43:58,664 [wrapper] Sending SIGTERM to SUPERVISOR_PID=3120782
[2024-01-23 15:43:58] 2024-01-23 20:43:58,673 [wrapper] apfmon messages muted
[2024-01-23 15:43:58] 2024-01-23 20:43:58,675 [wrapper] Test setup, not cleaning
[2024-01-23 15:43:58] 2024-01-23 20:43:58,678 [wrapper] ==== wrapper stdout END ====
[2024-01-23 15:43:58] 2024-01-23 20:43:58,681 [wrapper] ==== wrapper stderr END ====
[2024-01-23 15:43:58] 2024-01-23 20:43:58,686 [wrapper] wrapperexiting ec=0, duration=1627
[2024-01-23 15:43:58] 2024-01-23 20:43:58,689 [wrapper] apfmon messages muted
[2024-01-23 15:43:58]  *** Error codes and diagnostics ***
[2024-01-23 15:43:58]     "exeErrorCode": 0,
[2024-01-23 15:43:58]     "exeErrorDiag": "",
[2024-01-23 15:43:58]     "pilotErrorCode": 0,
[2024-01-23 15:43:58]     "pilotErrorDiag": "",
[2024-01-23 15:43:58]  *** Listing of results directory ***
[2024-01-23 15:43:58] total 41804
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc   441289 Jan 23 13:25 pilot3.tar.gz
[2024-01-23 15:43:58] -rwx------ 1 boinc boinc    30164 Jan 23 13:28 runpilot2-wrapper.sh
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc     4388 Jan 23 13:28 queuedata.json
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc      107 Jan 23 15:16 wrapper_26015_x86_64-pc-linux-gnu
[2024-01-23 15:43:58] -rwxr-xr-x 1 boinc boinc     7986 Jan 23 15:16 run_atlas
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc      112 Jan 23 15:16 job.xml
[2024-01-23 15:43:58] -rw-r--r-- 2 boinc boinc    17630 Jan 23 15:16 start_atlas.sh
[2024-01-23 15:43:58] drwxrwx--x 2 boinc boinc     4096 Jan 23 15:16 shared
[2024-01-23 15:43:58] -rw-r--r-- 2 boinc boinc   453568 Jan 23 15:16 input.tar.gz
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc     9503 Jan 23 15:16 init_data.xml
[2024-01-23 15:43:58] -rw-r--r-- 2 boinc boinc 37438704 Jan 23 15:16 EVNT.04972714._000033.pool.root.1
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc        0 Jan 23 15:16 boinc_lockfile
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc     2796 Jan 23 15:16 pandaJob.out
[2024-01-23 15:43:58] -rw------- 1 boinc boinc      424 Jan 23 15:16 setup.sh.local
[2024-01-23 15:43:58] -rw------- 1 boinc boinc  1338676 Jan 23 15:18 cric_ddmendpoints.json
[2024-01-23 15:43:58] -rw------- 1 boinc boinc   949395 Jan 23 15:18 agis_schedconf.cvmfs.json
[2024-01-23 15:43:58] drwx------ 4 boinc boinc     4096 Jan 23 15:19 pilot3
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc      533 Jan 23 15:35 boinc_task_state.xml
[2024-01-23 15:43:58] -rw------- 1 boinc boinc  1260460 Jan 23 15:35 Hits.hc_20284782.BOINC-TEST.116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.HITS.pool.root
[2024-01-23 15:43:58] -rw------- 1 boinc boinc     1001 Jan 23 15:35 memory_monitor_summary.json
[2024-01-23 15:43:58] -rw------- 1 boinc boinc   157600 Jan 23 15:39 116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.job.log.tgz
[2024-01-23 15:43:58] -rw------- 1 boinc boinc     7390 Jan 23 15:43 heartbeat.json
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc     8192 Jan 23 15:43 boinc_mmap_file
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc       25 Jan 23 15:43 wrapper_checkpoint.txt
[2024-01-23 15:43:58] -rw------- 1 boinc boinc     4168 Jan 23 15:43 pilotlog.txt
[2024-01-23 15:43:58] -rw------- 1 boinc boinc   188353 Jan 23 15:43 116b7ff2-b2d2-49de-9c18-5f3c2fff74a7_37030.job.log
[2024-01-23 15:43:58] -rw------- 1 boinc boinc      510 Jan 23 15:43 output.list
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc      680 Jan 23 15:43 runtime_log
[2024-01-23 15:43:58] -rw------- 1 boinc boinc   368640 Jan 23 15:43 result.tar.gz
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc    11762 Jan 23 15:43 runtime_log.err
[2024-01-23 15:43:58] -rw------- 1 boinc boinc      623 Jan 23 15:43 fDJMDmrgam4n7Olcko1bjSoqABFKDmABFKDmyeZQDm3uJKDmhO3jfn.diag
[2024-01-23 15:43:58] -rw-r--r-- 1 boinc boinc    23294 Jan 23 15:43 stderr.txt
[2024-01-23 15:43:58] HITS file was successfully produced:
[2024-01-23 15:43:58] -rw------- 1 boinc boinc 1260460 Jan 23 15:35 shared/HITS.pool.root.1
[2024-01-23 15:43:58]  *** Contents of shared directory: ***
[2024-01-23 15:43:58] total 38624
[2024-01-23 15:43:58] -rw-r--r-- 2 boinc boinc    17630 Jan 23 15:16 start_atlas.sh
[2024-01-23 15:43:58] -rw-r--r-- 2 boinc boinc   453568 Jan 23 15:16 input.tar.gz
[2024-01-23 15:43:58] -rw-r--r-- 2 boinc boinc 37438704 Jan 23 15:16 ATLAS.root_0
[2024-01-23 15:43:58] -rw------- 1 boinc boinc  1260460 Jan 23 15:35 HITS.pool.root.1
[2024-01-23 15:43:58] -rw------- 1 boinc boinc   368640 Jan 23 15:43 result.tar.gz
15:44:00 (3116442): run_atlas exited; CPU time 477.047455
15:44:00 (3116442): called boinc_finish(0)

</stderr_txt>
]]>


©2025 CERN