Name 0lxNDm8nuI3n7Olcko1bjSoqABFKDmABFKDm7AsVDmPXJKDmQpbGhn_0
Workunit 2306651
Created 15 May 2023, 21:21:14 UTC
Sent 16 May 2023, 0:41:55 UTC
Report deadline 23 May 2023, 0:41:55 UTC
Received 16 May 2023, 5:49:41 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1498
Run time 30 min 56 sec
CPU time 11 min 13 sec
Validate state Valid
Credit 21.79
Device peak FLOPS 12.99 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 1.51 GB
Peak swap size 2.12 GB
Peak disk usage 80.46 MB

Stderr output

<core_client_version>7.4.25</core_client_version>
<![CDATA[
<stderr_txt>
06:17:50 (4503): wrapper (7.7.26015): starting
06:17:50 (4503): wrapper: running run_atlas (--nthreads 4)
[2023-05-16 06:17:50] Arguments: --nthreads 4
[2023-05-16 06:17:50] Threads: 4
[2023-05-16 06:17:50] Checking for CVMFS
[2023-05-16 06:17:54] Probing /cvmfs/atlas.cern.ch... OK
[2023-05-16 06:17:55] Probing /cvmfs/atlas-condb.cern.ch... OK
[2023-05-16 06:17:55] Running cvmfs_config stat atlas.cern.ch
[2023-05-16 06:17:55] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2023-05-16 06:17:55] 2.9.0.0 4728 0 24756 119047 3 1 3481651 4194305 0 130560 0 0 0.000 824 714 http://s1ral-cvmfs.openhtc.io/cvmfs/atlas.cern.ch http://192.168.100.152:3128 1
[2023-05-16 06:17:55] CVMFS is ok
[2023-05-16 06:17:55] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2023-05-16 06:17:55] Checking for apptainer binary...
[2023-05-16 06:17:55] Using apptainer found in PATH at /usr/bin/apptainer
[2023-05-16 06:17:55] Running /usr/bin/apptainer --version
[2023-05-16 06:17:56] apptainer version 1.0.3
[2023-05-16 06:17:56] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2023-05-16 06:17:57] TeeC16
[2023-05-16 06:17:57] apptainer works
[2023-05-16 06:17:57] Set ATHENA_PROC_NUMBER=4
[2023-05-16 06:17:57] Set ATHENA_CORE_NUMBER=4
[2023-05-16 06:17:57] Starting ATLAS job with PandaID=5847613469
[2023-05-16 06:17:57] Running command: /usr/bin/apptainer exec -B /cvmfs,/home/m/BOINC/slots/1 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2023-05-16 06:48:43]  *** The last 200 lines of the pilot log: ***
[2023-05-16 06:48:43] 2023-05-16 05:43:24,933 | INFO     | pilot.control.job                | make_job_report           | errors: (none)
[2023-05-16 06:48:43] 2023-05-16 05:43:24,933 | INFO     | pilot.control.job                | make_job_report           | status: LOG_TRANSFER = DONE 
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | pilot state: finished 
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | transexitcode: 0
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | exeerrorcode: 0
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | exeerrordiag: 
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | exitcode: 0
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | exitmsg: OK
[2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO     | pilot.control.job                | make_job_report           | cpuconsumptiontime: 687 s
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | nevents: 2
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | neventsw: 0
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | pid: 15499
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | pgrp: 15499
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | corecount: 4
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | event service: False
[2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO     | pilot.control.job                | make_job_report           | sizes: {0: 2545978, 1: 2546177, 11: 2546323, 22: 2546351, 33: 2546379, 43: 2546407, 54: 2546563, 64:
[2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | INFO     | pilot.control.job                | make_job_report           | --------------------------------------------------
[2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | INFO     | pilot.control.job                | make_job_report           | 
[2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | DEBUG    | pilot.control.job                | has_job_completed         | ls -lF /home/m/BOINC/slots/1:
[2023-05-16 06:48:43] 
[2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | INFO     | pilot.util.container             | print_executable          | executing command: ls -lF /home/m/BOINC/slots/1
[2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | DEBUG    | pilot.control.job                | has_job_completed         | total 43408
[2023-05-16 06:48:43] -rw------- 1 m m      129 May 16 06:17 0lxNDm8nuI3n7Olcko1bjSoqABFKDmABFKDm7AsVDmPXJKDmQpbGhn.diag
[2023-05-16 06:48:43] -rw-r--r-- 2 m m 37620382 May 16 06:17 EVNT.04972714._000038.pool.root.1
[2023-05-16 06:48:43] drwxrwx--- 2 m m     4096 May 16 06:43 PanDA_Pilot-5847613469/
[2023-05-16 06:48:43] -rw------- 1 m m  1016777 May 16 06:19 agis_schedconf.cvmfs.json
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m        0 May 16 06:17 boinc_lockfile
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     8192 May 16 06:43 boinc_mmap_file
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      533 May 16 06:37 boinc_task_state.xml
[2023-05-16 06:48:43] -rw------- 1 m m  1425006 May 16 06:19 cric_ddmendpoints.json
[2023-05-16 06:48:43] -rw------- 1 m m   669062 May 16 06:43 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log
[2023-05-16 06:48:43] -rw------- 1 m m   186696 May 16 06:40 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log.tgz
[2023-05-16 06:48:43] -rw------- 1 m m     7383 May 16 06:43 heartbeat.json
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     5708 May 16 06:17 init_data.xml
[2023-05-16 06:48:43] -rw-r--r-- 2 m m   425322 May 16 06:17 input.tar.gz
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      112 May 16 06:17 job.xml
[2023-05-16 06:48:43] -rw------- 1 m m     1018 May 16 06:37 memory_monitor_summary.json
[2023-05-16 06:48:43] -rw------- 1 m m  1859045 May 16 06:37 output.1.fda160fa-9103-40db-86b5-09da938b92c9_6527.pool.root
[2023-05-16 06:48:43] -rw------- 1 m m      460 May 16 06:43 output.list
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     2658 May 16 06:17 pandaJob.out
[2023-05-16 06:48:43] drwx------ 4 m m     4096 May 16 06:20 pilot3/
[2023-05-16 06:48:43] -rw------- 1 m m   413616 May 15 22:21 pilot3.tar.gz
[2023-05-16 06:48:43] -rw------- 1 m m   655549 May 16 06:43 pilotlog.txt
[2023-05-16 06:48:43] -rw-r--r-- 1 m m     4388 May 15 22:20 queuedata.json
[2023-05-16 06:48:43] -rwxr-xr-x 1 m m     7986 May 16 06:17 run_atlas*
[2023-05-16 06:48:43] -rwx------ 1 m m    27540 May 15 22:21 runpilot2-wrapper.sh*
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      407 May 16 06:17 runtime_log
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     7697 May 16 06:17 runtime_log.err
[2023-05-16 06:48:43] -rw------- 1 m m      424 May 16 06:17 setup.sh.local
[2023-05-16 06:48:43] drwxrwx--x 2 m m     4096 May 16 06:17 shared/
[2023-05-16 06:48:43] -rw-r--r-- 2 m m    17628 May 16 06:17 start_atlas.sh
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     1715 May 16 06:17 stderr.txt
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      107 May 16 06:17 wrapper_26015_x86_64-pc-linux-gnu
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m       25 May 16 06:43 wrapper_checkpoint.txt
[2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO     | pilot.util.queuehandling         | queue_report              | queue jobs had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO     | pilot.util.queuehandling         | queue_report              | queue payloads had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO     | pilot.util.queuehandling         | queue_report              | queue data_in had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO     | pilot.util.queuehandling         | queue_report              | queue data_out had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue current_data_in had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue validated_jobs had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue validated_payloads had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue monitored_payloads had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue finished_jobs had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue finished_payloads had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue finished_data_in had 1 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue finished_data_out had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO     | pilot.util.queuehandling         | queue_report              | queue failed_jobs had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue failed_payloads had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue failed_data_in had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue failed_data_out had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue completed_jobs had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue completed_jobids has 1 job(s)
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue realtimelog_payloads had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.util.queuehandling         | queue_report              | queue messages had 0 job(s) [purged]
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO     | pilot.control.job                | has_job_completed         | job 5847613469 has completed (purged errors)
[2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | DEBUG    | pilot.util.realtimelogger        | cleanup                   | attempting real-time logger cleanup
[2023-05-16 06:48:43] 2023-05-16 05:43:24,962 | INFO     | pilot.util.processes             | cleanup                   | overall cleanup function is called
[2023-05-16 06:48:43] 2023-05-16 05:43:24,963 | DEBUG    | pilot.util.processes             | cleanup                   | work directory was removed: /home/m/BOINC/slots/1/PanDA_Pilot-5847613469
[2023-05-16 06:48:43] 2023-05-16 05:43:25,969 | INFO     | pilot.info.jobdata               | collect_zombies           | --- collectZombieJob: --- 10, [15499]
[2023-05-16 06:48:43] 2023-05-16 05:43:25,969 | INFO     | pilot.info.jobdata               | collect_zombies           | zombie collector trying to kill pid 15499
[2023-05-16 06:48:43] 2023-05-16 05:43:25,969 | INFO     | pilot.info.jobdata               | collect_zombies           | harmless exception when collecting zombies: [Errno 10] No child processes
[2023-05-16 06:48:43] 2023-05-16 05:43:26,974 | INFO     | pilot.util.processes             | cleanup                   | collected zombie processes
[2023-05-16 06:48:43] 2023-05-16 05:43:26,974 | INFO     | pilot.util.processes             | cleanup                   | will now attempt to kill all subprocesses of pid=15499
[2023-05-16 06:48:43] 2023-05-16 05:43:27,018 | INFO     | pilot.util.processes             | kill_processes            | process IDs to be killed: [15499] (in reverse order)
[2023-05-16 06:48:43] 2023-05-16 05:43:27,047 | WARNING  | pilot.util.processes             | kill_processes            | found no corresponding commands to process id(s)
[2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | INFO     | pilot.util.processes             | kill_orphans              | Do not look for orphan processes in BOINC jobs
[2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | DEBUG    | pilot.util.queuehandling         | purge_queue               | queue purged
[2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | INFO     | pilot.control.job                | retrieve                  | ready for new job
[2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | INFO     | root                             | retrieve                  | pilot has finished with previous job - re-establishing logging
[2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO     | pilot.util.auxiliary             | pilot_version_banner      | *****************************************
[2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO     | pilot.util.auxiliary             | pilot_version_banner      | ***  PanDA Pilot version 3.6.0 (100)  ***
[2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO     | pilot.util.auxiliary             | pilot_version_banner      | *****************************************
[2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO     | pilot.util.auxiliary             | pilot_version_banner      | 
[2023-05-16 06:48:43] 2023-05-16 05:43:27,073 | INFO     | pilot.util.auxiliary             | display_architecture_info | architecture information:
[2023-05-16 06:48:43] 2023-05-16 05:43:27,074 | INFO     | pilot.util.container             | print_executable          | executing command: cat /etc/os-release
[2023-05-16 06:48:43] 2023-05-16 05:43:27,091 | INFO     | pilot.util.filehandling          | dump                      | cat /etc/os-release:
[2023-05-16 06:48:43] NAME="CentOS Linux"
[2023-05-16 06:48:43] VERSION="7 (Core)"
[2023-05-16 06:48:43] ID="centos"
[2023-05-16 06:48:43] ID_LIKE="rhel fedora"
[2023-05-16 06:48:43] VERSION_ID="7"
[2023-05-16 06:48:43] PRETTY_NAME="CentOS Linux 7 (Core)"
[2023-05-16 06:48:43] ANSI_COLOR="0;31"
[2023-05-16 06:48:43] CPE_NAME="cpe:/o:centos:centos:7"
[2023-05-16 06:48:43] HOME_URL="https://www.centos.org/"
[2023-05-16 06:48:43] BUG_REPORT_URL="https://bugs.centos.org/"
[2023-05-16 06:48:43] 
[2023-05-16 06:48:43] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2023-05-16 06:48:43] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2023-05-16 06:48:43] REDHAT_SUPPORT_PRODUCT="centos"
[2023-05-16 06:48:43] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2023-05-16 06:48:43] 
[2023-05-16 06:48:43] 2023-05-16 05:43:27,091 | INFO     | pilot.util.auxiliary             | pilot_version_banner      | *****************************************
[2023-05-16 06:48:43] 2023-05-16 05:43:27,594 | DEBUG    | pilot.util.monitoring            | check_local_space         | checking local space on /home/m/BOINC/slots/1
[2023-05-16 06:48:43] 2023-05-16 05:43:27,595 | INFO     | pilot.util.container             | print_executable          | executing command: df -mP /home/m/BOINC/slots/1
[2023-05-16 06:48:43] 2023-05-16 05:43:27,613 | DEBUG    | pilot.util.workernode            | get_local_disk_space      | stdout=Filesystem     1048576-blocks  Used Available Capacity Mounted on
[2023-05-16 06:48:43] /dev/sda1              455417 28552    403709       7% /home/m/BOINC/slots/1
[2023-05-16 06:48:43] 2023-05-16 05:43:27,613 | DEBUG    | pilot.util.workernode            | get_local_disk_space      | stderr=
[2023-05-16 06:48:43] 2023-05-16 05:43:27,613 | INFO     | pilot.util.monitoring            | check_local_space         | sufficient remaining disk space (423319568384 B)
[2023-05-16 06:48:43] 2023-05-16 05:43:27,614 | WARNING  | pilot.control.job                | proceed_with_getjob       | since timefloor is set to 0, pilot was only allowed to run one job
[2023-05-16 06:48:43] 2023-05-16 05:43:27,614 | WARNING  | pilot.control.job                | retrieve                  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2023-05-16 06:48:43] 2023-05-16 05:43:27,614 | INFO     | pilot.control.job                | retrieve                  | [job] retrieve thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:27,736 | INFO     | pilot.util.queuehandling         | abort_jobs_in_queues      | found 0 job(s) in 20 queues
[2023-05-16 06:48:43] 2023-05-16 05:43:27,736 | WARNING  | pilot.control.monitor            | run_checks                | pilot monitor received instruction that args.graceful_stop has been set
[2023-05-16 06:48:43] 2023-05-16 05:43:27,736 | WARNING  | pilot.control.monitor            | run_checks                | will wait for a maximum of 300 s for threads to finish
[2023-05-16 06:48:43] 2023-05-16 05:43:27,752 | DEBUG    | pilot.control.job                | control                   | job control ending since graceful_stop has been set
[2023-05-16 06:48:43] 2023-05-16 05:43:27,752 | INFO     | pilot.control.job                | control                   | [job] control thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:27,892 | INFO     | pilot.control.payload            | failed_post               | [payload] failed_post thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,017 | INFO     | pilot.control.job                | validate                  | [job] validate thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,245 | INFO     | pilot.control.job                | create_data_payload       | [job] create_data_payload thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,254 | DEBUG    | pilot.control.data               | control                   | data control ending since graceful_stop has been set
[2023-05-16 06:48:43] 2023-05-16 05:43:28,254 | INFO     | pilot.control.data               | control                   | [data] control thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,350 | INFO     | pilot.control.payload            | validate_pre              | [payload] validate_pre thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,382 | WARNING  | pilot.util.common                | should_abort              | data:copytool_out:received graceful stop - abort after this iteration
[2023-05-16 06:48:43] 2023-05-16 05:43:28,424 | INFO     | pilot.control.payload            | validate_post             | [payload] validate_post thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,657 | WARNING  | pilot.control.job                | check_for_abort_job       | job monitor detected an abort_job request (signal=None)
[2023-05-16 06:48:43] 2023-05-16 05:43:28,657 | WARNING  | pilot.util.common                | should_abort              | job:job_monitor:received graceful stop - abort after this iteration
[2023-05-16 06:48:43] 2023-05-16 05:43:28,657 | INFO     | pilot.control.job                | job_monitor               | [job] job monitor thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,755 | DEBUG    | pilot.control.payload            | control                   | payload control ending since graceful_stop has been set
[2023-05-16 06:48:43] 2023-05-16 05:43:28,756 | INFO     | pilot.control.payload            | control                   | [payload] control thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:28,861 | INFO     | pilot.control.payload            | execute_payloads          | [payload] execute_payloads thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:29,057 | INFO     | pilot.control.data               | copytool_in               | [data] copytool_in thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:29,383 | INFO     | pilot.control.data               | copytool_out              | [data] copytool_out thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:29,474 | WARNING  | pilot.util.common                | should_abort              | job:queue_monitor:received graceful stop - abort after this iteration
[2023-05-16 06:48:43] 2023-05-16 05:43:29,474 | INFO     | pilot.control.job                | queue_monitor             | [job] queue monitor thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:29,834 | WARNING  | pilot.util.common                | should_abort              | data:queue_monitoring:received graceful stop - abort after this iteration
[2023-05-16 06:48:43] 2023-05-16 05:43:32,834 | INFO     | pilot.control.data               | queue_monitoring          | [data] queue_monitor thread has finished
[2023-05-16 06:48:43] 2023-05-16 05:43:38,600 | INFO     | pilot.control.payload            | get_logging_info          | job.realtimelogging is not enabled
[2023-05-16 06:48:43] 2023-05-16 05:43:38,601 | DEBUG    | pilot.control.payload            | run_realtimelog           | real-time logging not needed at this point
[2023-05-16 06:48:43] 2023-05-16 05:43:38,601 | DEBUG    | pilot.control.payload            | run_realtimelog           | realtime logger was not found, waiting ..
[2023-05-16 06:48:43] 2023-05-16 05:43:38,601 | INFO     | pilot.control.payload            | run_realtimelog           | [payload] run_realtimelog thread has finished
[2023-05-16 06:48:43]   File "/home/m/BOINC/slots/1/pilot3/pilot/common/exception.py", line 424, in run
[2023-05-16 06:48:43]     self._target(**self._kwargs)
[2023-05-16 06:48:43]   File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 142, in control
[2023-05-16 06:48:43]     raise PilotException(error)
[2023-05-16 06:48:43] received exception from bucket queue in generic workflow: error code: 1301, message: An unknown pilot exception has occurred
[2023-05-16 06:48:43] details: error code: 1317, message: Exceeded maximum waiting time
[2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception)
[2023-05-16 06:48:43] monitor: exception caught: error code: 1317, message: Exceeded maximum waiting time
[2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception)
[2023-05-16 06:48:43] exception caught by thread run() function: (<class 'pilot.common.exception.PilotException'>, PilotException(ExceededMaxWaitTime(('reached maximum waiting time - threads should have finished (ignore ex
[2023-05-16 06:48:43] Traceback (most recent call last):
[2023-05-16 06:48:43]   File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 126, in control
[2023-05-16 06:48:43]     run_checks(queues, args)
[2023-05-16 06:48:43]   File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 319, in run_checks
[2023-05-16 06:48:43]     raise ExceededMaxWaitTime(diagnostics)
[2023-05-16 06:48:43] pilot.common.exception.ExceededMaxWaitTime: error code: 1317, message: Exceeded maximum waiting time
[2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception)
[2023-05-16 06:48:43] 
[2023-05-16 06:48:43] During handling of the above exception, another exception occurred:
[2023-05-16 06:48:43] 
[2023-05-16 06:48:43] Traceback (most recent call last):
[2023-05-16 06:48:43]   File "/home/m/BOINC/slots/1/pilot3/pilot/common/exception.py", line 424, in run
[2023-05-16 06:48:43]     self._target(**self._kwargs)
[2023-05-16 06:48:43]   File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 142, in control
[2023-05-16 06:48:43]     raise PilotException(error)
[2023-05-16 06:48:43] pilot.common.exception.PilotException: error code: 1301, message: An unknown pilot exception has occurred
[2023-05-16 06:48:43] details: error code: 1317, message: Exceeded maximum waiting time
[2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception)
[2023-05-16 06:48:43] 
[2023-05-16 06:48:43] None
[2023-05-16 06:48:43] exception has been put in bucket queue belonging to thread 'monitor'
[2023-05-16 06:48:43] setting graceful stop in 10 s since there is no point in continuing
[2023-05-16 06:48:43] 2023-05-16 05:48:38,533 | INFO     | pilot.util.processes             | threads_aborted           | caller=run is remaining thread - safe to abort (names=['<_MainThread(MainThread, started 13981182045
[2023-05-16 06:48:43] 2023-05-16 05:48:38,533 | DEBUG    | pilot.workflow.generic           | run                       | will proceed to set job_aborted
[2023-05-16 06:48:43] 2023-05-16 05:48:43,558 | DEBUG    | pilot.workflow.generic           | run                       | all relevant threads have aborted (thread count=1)
[2023-05-16 06:48:43] 2023-05-16 05:48:43,559 | INFO     | pilot.workflow.generic           | run                       | end of generic workflow (traces error code: 0)
[2023-05-16 06:48:43] 2023-05-16 05:48:43,559 | INFO     | root                             | wrap_up                   | traces error code: 0
[2023-05-16 06:48:43] 2023-05-16 05:48:43,559 | INFO     | root                             | wrap_up                   | pilot has finished (exit code=0, shell exit code=0)
[2023-05-16 06:48:43] 2023-05-16 05:48:43,612 [wrapper] ==== pilot stdout END ====
[2023-05-16 06:48:43] 2023-05-16 05:48:43,616 [wrapper] ==== wrapper stdout RESUME ====
[2023-05-16 06:48:43] 2023-05-16 05:48:43,621 [wrapper] pilotpid: 8272
[2023-05-16 06:48:43] 2023-05-16 05:48:43,626 [wrapper] Pilot exit status: 0
[2023-05-16 06:48:43] 2023-05-16 05:48:43,638 [wrapper] pandaids: 5847613469
[2023-05-16 06:48:43] 2023-05-16 05:48:43,645 [wrapper] apfmon messages muted
[2023-05-16 06:48:43] 2023-05-16 05:48:43,649 [wrapper] Test setup, not cleaning
[2023-05-16 06:48:43] 2023-05-16 05:48:43,652 [wrapper] ==== wrapper stdout END ====
[2023-05-16 06:48:43] 2023-05-16 05:48:43,656 [wrapper] ==== wrapper stderr END ====
[2023-05-16 06:48:43] 2023-05-16 05:48:43,665 [wrapper] wrapperexiting ec=0, duration=1845
[2023-05-16 06:48:43] 2023-05-16 05:48:43,670 [wrapper] apfmon messages muted
[2023-05-16 06:48:43]  *** Error codes and diagnostics ***
[2023-05-16 06:48:43]     "exeErrorCode": 0,
[2023-05-16 06:48:43]     "exeErrorDiag": "",
[2023-05-16 06:48:43]     "pilotErrorCode": 0,
[2023-05-16 06:48:43]     "pilotErrorDiag": "",
[2023-05-16 06:48:43]  *** Listing of results directory ***
[2023-05-16 06:48:43] total 45500
[2023-05-16 06:48:43] -rw-r--r-- 1 m m     4388 May 15 22:20 queuedata.json
[2023-05-16 06:48:43] -rwx------ 1 m m    27540 May 15 22:21 runpilot2-wrapper.sh
[2023-05-16 06:48:43] -rw------- 1 m m   413616 May 15 22:21 pilot3.tar.gz
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      107 May 16 06:17 wrapper_26015_x86_64-pc-linux-gnu
[2023-05-16 06:48:43] -rwxr-xr-x 1 m m     7986 May 16 06:17 run_atlas
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      112 May 16 06:17 job.xml
[2023-05-16 06:48:43] -rw-r--r-- 2 m m    17628 May 16 06:17 start_atlas.sh
[2023-05-16 06:48:43] drwxrwx--x 2 m m     4096 May 16 06:17 shared
[2023-05-16 06:48:43] -rw-r--r-- 2 m m   425322 May 16 06:17 input.tar.gz
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     5708 May 16 06:17 init_data.xml
[2023-05-16 06:48:43] -rw-r--r-- 2 m m 37620382 May 16 06:17 EVNT.04972714._000038.pool.root.1
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m        0 May 16 06:17 boinc_lockfile
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     2658 May 16 06:17 pandaJob.out
[2023-05-16 06:48:43] -rw------- 1 m m      424 May 16 06:17 setup.sh.local
[2023-05-16 06:48:43] -rw------- 1 m m  1425006 May 16 06:19 cric_ddmendpoints.json
[2023-05-16 06:48:43] -rw------- 1 m m  1016777 May 16 06:19 agis_schedconf.cvmfs.json
[2023-05-16 06:48:43] drwx------ 4 m m     4096 May 16 06:20 pilot3
[2023-05-16 06:48:43] -rw------- 1 m m  1859045 May 16 06:37 output.1.fda160fa-9103-40db-86b5-09da938b92c9_6527.pool.root
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      533 May 16 06:37 boinc_task_state.xml
[2023-05-16 06:48:43] -rw------- 1 m m     1018 May 16 06:37 memory_monitor_summary.json
[2023-05-16 06:48:43] -rw------- 1 m m   186696 May 16 06:40 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log.tgz
[2023-05-16 06:48:43] -rw------- 1 m m     7383 May 16 06:43 heartbeat.json
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m     8192 May 16 06:48 boinc_mmap_file
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m       25 May 16 06:48 wrapper_checkpoint.txt
[2023-05-16 06:48:43] -rw------- 1 m m     7928 May 16 06:48 pilotlog.txt
[2023-05-16 06:48:43] -rw------- 1 m m   686866 May 16 06:48 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log
[2023-05-16 06:48:43] -rw------- 1 m m      460 May 16 06:48 output.list
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m      744 May 16 06:48 runtime_log
[2023-05-16 06:48:43] -rw------- 1 m m  2754560 May 16 06:48 result.tar.gz
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m    11879 May 16 06:48 runtime_log.err
[2023-05-16 06:48:43] -rw------- 1 m m      626 May 16 06:48 0lxNDm8nuI3n7Olcko1bjSoqABFKDmABFKDm7AsVDmPXJKDmQpbGhn.diag
[2023-05-16 06:48:43] -rw-rw-r-- 1 m m    26704 May 16 06:48 stderr.txt
[2023-05-16 06:48:43] HITS file was successfully produced:
[2023-05-16 06:48:43] -rw------- 1 m m 1859045 May 16 06:37 shared/HITS.pool.root.1
[2023-05-16 06:48:43]  *** Contents of shared directory: ***
[2023-05-16 06:48:43] total 41684
[2023-05-16 06:48:43] -rw-r--r-- 2 m m    17628 May 16 06:17 start_atlas.sh
[2023-05-16 06:48:43] -rw-r--r-- 2 m m   425322 May 16 06:17 input.tar.gz
[2023-05-16 06:48:43] -rw-r--r-- 2 m m 37620382 May 16 06:17 ATLAS.root_0
[2023-05-16 06:48:43] -rw------- 1 m m  1859045 May 16 06:37 HITS.pool.root.1
[2023-05-16 06:48:43] -rw------- 1 m m  2754560 May 16 06:48 result.tar.gz
06:48:45 (4503): run_atlas exited; CPU time 673.365423
06:48:45 (4503): called boinc_finish(0)

</stderr_txt>
]]>


©2024 CERN