Name | 0lxNDm8nuI3n7Olcko1bjSoqABFKDmABFKDm7AsVDmPXJKDmQpbGhn_0 |
Workunit | 2306651 |
Created | 15 May 2023, 21:21:14 UTC |
Sent | 16 May 2023, 0:41:55 UTC |
Report deadline | 23 May 2023, 0:41:55 UTC |
Received | 16 May 2023, 5:49:41 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1498 |
Run time | 30 min 56 sec |
CPU time | 11 min 13 sec |
Validate state | Valid |
Credit | 21.79 |
Device peak FLOPS | 12.99 GFLOPS |
Application version | ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 1.51 GB |
Peak swap size | 2.12 GB |
Peak disk usage | 80.46 MB |
<core_client_version>7.4.25</core_client_version> <![CDATA[ <stderr_txt> 06:17:50 (4503): wrapper (7.7.26015): starting 06:17:50 (4503): wrapper: running run_atlas (--nthreads 4) [2023-05-16 06:17:50] Arguments: --nthreads 4 [2023-05-16 06:17:50] Threads: 4 [2023-05-16 06:17:50] Checking for CVMFS [2023-05-16 06:17:54] Probing /cvmfs/atlas.cern.ch... OK [2023-05-16 06:17:55] Probing /cvmfs/atlas-condb.cern.ch... OK [2023-05-16 06:17:55] Running cvmfs_config stat atlas.cern.ch [2023-05-16 06:17:55] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2023-05-16 06:17:55] 2.9.0.0 4728 0 24756 119047 3 1 3481651 4194305 0 130560 0 0 0.000 824 714 http://s1ral-cvmfs.openhtc.io/cvmfs/atlas.cern.ch http://192.168.100.152:3128 1 [2023-05-16 06:17:55] CVMFS is ok [2023-05-16 06:17:55] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 [2023-05-16 06:17:55] Checking for apptainer binary... [2023-05-16 06:17:55] Using apptainer found in PATH at /usr/bin/apptainer [2023-05-16 06:17:55] Running /usr/bin/apptainer --version [2023-05-16 06:17:56] apptainer version 1.0.3 [2023-05-16 06:17:56] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname [2023-05-16 06:17:57] TeeC16 [2023-05-16 06:17:57] apptainer works [2023-05-16 06:17:57] Set ATHENA_PROC_NUMBER=4 [2023-05-16 06:17:57] Set ATHENA_CORE_NUMBER=4 [2023-05-16 06:17:57] Starting ATLAS job with PandaID=5847613469 [2023-05-16 06:17:57] Running command: /usr/bin/apptainer exec -B /cvmfs,/home/m/BOINC/slots/1 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh [2023-05-16 06:48:43] *** The last 200 lines of the pilot log: *** [2023-05-16 06:48:43] 2023-05-16 05:43:24,933 | INFO | pilot.control.job | make_job_report | errors: (none) [2023-05-16 06:48:43] 2023-05-16 05:43:24,933 | INFO | pilot.control.job | make_job_report | status: LOG_TRANSFER = DONE [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | pilot state: finished [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | transexitcode: 0 [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | exeerrorcode: 0 [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | exeerrordiag: [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | exitcode: 0 [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | exitmsg: OK [2023-05-16 06:48:43] 2023-05-16 05:43:24,934 | INFO | pilot.control.job | make_job_report | cpuconsumptiontime: 687 s [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | nevents: 2 [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | neventsw: 0 [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | pid: 15499 [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | pgrp: 15499 [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | corecount: 4 [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | event service: False [2023-05-16 06:48:43] 2023-05-16 05:43:24,935 | INFO | pilot.control.job | make_job_report | sizes: {0: 2545978, 1: 2546177, 11: 2546323, 22: 2546351, 33: 2546379, 43: 2546407, 54: 2546563, 64: [2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | INFO | pilot.control.job | make_job_report | -------------------------------------------------- [2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | INFO | pilot.control.job | make_job_report | [2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | DEBUG | pilot.control.job | has_job_completed | ls -lF /home/m/BOINC/slots/1: [2023-05-16 06:48:43] [2023-05-16 06:48:43] 2023-05-16 05:43:24,936 | INFO | pilot.util.container | print_executable | executing command: ls -lF /home/m/BOINC/slots/1 [2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | DEBUG | pilot.control.job | has_job_completed | total 43408 [2023-05-16 06:48:43] -rw------- 1 m m 129 May 16 06:17 0lxNDm8nuI3n7Olcko1bjSoqABFKDmABFKDm7AsVDmPXJKDmQpbGhn.diag [2023-05-16 06:48:43] -rw-r--r-- 2 m m 37620382 May 16 06:17 EVNT.04972714._000038.pool.root.1 [2023-05-16 06:48:43] drwxrwx--- 2 m m 4096 May 16 06:43 PanDA_Pilot-5847613469/ [2023-05-16 06:48:43] -rw------- 1 m m 1016777 May 16 06:19 agis_schedconf.cvmfs.json [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 0 May 16 06:17 boinc_lockfile [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 8192 May 16 06:43 boinc_mmap_file [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 533 May 16 06:37 boinc_task_state.xml [2023-05-16 06:48:43] -rw------- 1 m m 1425006 May 16 06:19 cric_ddmendpoints.json [2023-05-16 06:48:43] -rw------- 1 m m 669062 May 16 06:43 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log [2023-05-16 06:48:43] -rw------- 1 m m 186696 May 16 06:40 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log.tgz [2023-05-16 06:48:43] -rw------- 1 m m 7383 May 16 06:43 heartbeat.json [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 5708 May 16 06:17 init_data.xml [2023-05-16 06:48:43] -rw-r--r-- 2 m m 425322 May 16 06:17 input.tar.gz [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 112 May 16 06:17 job.xml [2023-05-16 06:48:43] -rw------- 1 m m 1018 May 16 06:37 memory_monitor_summary.json [2023-05-16 06:48:43] -rw------- 1 m m 1859045 May 16 06:37 output.1.fda160fa-9103-40db-86b5-09da938b92c9_6527.pool.root [2023-05-16 06:48:43] -rw------- 1 m m 460 May 16 06:43 output.list [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 2658 May 16 06:17 pandaJob.out [2023-05-16 06:48:43] drwx------ 4 m m 4096 May 16 06:20 pilot3/ [2023-05-16 06:48:43] -rw------- 1 m m 413616 May 15 22:21 pilot3.tar.gz [2023-05-16 06:48:43] -rw------- 1 m m 655549 May 16 06:43 pilotlog.txt [2023-05-16 06:48:43] -rw-r--r-- 1 m m 4388 May 15 22:20 queuedata.json [2023-05-16 06:48:43] -rwxr-xr-x 1 m m 7986 May 16 06:17 run_atlas* [2023-05-16 06:48:43] -rwx------ 1 m m 27540 May 15 22:21 runpilot2-wrapper.sh* [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 407 May 16 06:17 runtime_log [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 7697 May 16 06:17 runtime_log.err [2023-05-16 06:48:43] -rw------- 1 m m 424 May 16 06:17 setup.sh.local [2023-05-16 06:48:43] drwxrwx--x 2 m m 4096 May 16 06:17 shared/ [2023-05-16 06:48:43] -rw-r--r-- 2 m m 17628 May 16 06:17 start_atlas.sh [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 1715 May 16 06:17 stderr.txt [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 107 May 16 06:17 wrapper_26015_x86_64-pc-linux-gnu [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 25 May 16 06:43 wrapper_checkpoint.txt [2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO | pilot.util.queuehandling | queue_report | queue jobs had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO | pilot.util.queuehandling | queue_report | queue payloads had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO | pilot.util.queuehandling | queue_report | queue data_in had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,959 | INFO | pilot.util.queuehandling | queue_report | queue data_out had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue current_data_in had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue validated_jobs had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue validated_payloads had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue monitored_payloads had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue finished_jobs had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue finished_payloads had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue finished_data_in had 1 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue finished_data_out had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,960 | INFO | pilot.util.queuehandling | queue_report | queue failed_jobs had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue failed_payloads had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue failed_data_in had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue failed_data_out had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue completed_jobs had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue completed_jobids has 1 job(s) [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue realtimelog_payloads had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.util.queuehandling | queue_report | queue messages had 0 job(s) [purged] [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | INFO | pilot.control.job | has_job_completed | job 5847613469 has completed (purged errors) [2023-05-16 06:48:43] 2023-05-16 05:43:24,961 | DEBUG | pilot.util.realtimelogger | cleanup | attempting real-time logger cleanup [2023-05-16 06:48:43] 2023-05-16 05:43:24,962 | INFO | pilot.util.processes | cleanup | overall cleanup function is called [2023-05-16 06:48:43] 2023-05-16 05:43:24,963 | DEBUG | pilot.util.processes | cleanup | work directory was removed: /home/m/BOINC/slots/1/PanDA_Pilot-5847613469 [2023-05-16 06:48:43] 2023-05-16 05:43:25,969 | INFO | pilot.info.jobdata | collect_zombies | --- collectZombieJob: --- 10, [15499] [2023-05-16 06:48:43] 2023-05-16 05:43:25,969 | INFO | pilot.info.jobdata | collect_zombies | zombie collector trying to kill pid 15499 [2023-05-16 06:48:43] 2023-05-16 05:43:25,969 | INFO | pilot.info.jobdata | collect_zombies | harmless exception when collecting zombies: [Errno 10] No child processes [2023-05-16 06:48:43] 2023-05-16 05:43:26,974 | INFO | pilot.util.processes | cleanup | collected zombie processes [2023-05-16 06:48:43] 2023-05-16 05:43:26,974 | INFO | pilot.util.processes | cleanup | will now attempt to kill all subprocesses of pid=15499 [2023-05-16 06:48:43] 2023-05-16 05:43:27,018 | INFO | pilot.util.processes | kill_processes | process IDs to be killed: [15499] (in reverse order) [2023-05-16 06:48:43] 2023-05-16 05:43:27,047 | WARNING | pilot.util.processes | kill_processes | found no corresponding commands to process id(s) [2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | INFO | pilot.util.processes | kill_orphans | Do not look for orphan processes in BOINC jobs [2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | DEBUG | pilot.util.queuehandling | purge_queue | queue purged [2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | INFO | pilot.control.job | retrieve | ready for new job [2023-05-16 06:48:43] 2023-05-16 05:43:27,048 | INFO | root | retrieve | pilot has finished with previous job - re-establishing logging [2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO | pilot.util.auxiliary | pilot_version_banner | ***************************************** [2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO | pilot.util.auxiliary | pilot_version_banner | *** PanDA Pilot version 3.6.0 (100) *** [2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO | pilot.util.auxiliary | pilot_version_banner | ***************************************** [2023-05-16 06:48:43] 2023-05-16 05:43:27,049 | INFO | pilot.util.auxiliary | pilot_version_banner | [2023-05-16 06:48:43] 2023-05-16 05:43:27,073 | INFO | pilot.util.auxiliary | display_architecture_info | architecture information: [2023-05-16 06:48:43] 2023-05-16 05:43:27,074 | INFO | pilot.util.container | print_executable | executing command: cat /etc/os-release [2023-05-16 06:48:43] 2023-05-16 05:43:27,091 | INFO | pilot.util.filehandling | dump | cat /etc/os-release: [2023-05-16 06:48:43] NAME="CentOS Linux" [2023-05-16 06:48:43] VERSION="7 (Core)" [2023-05-16 06:48:43] ID="centos" [2023-05-16 06:48:43] ID_LIKE="rhel fedora" [2023-05-16 06:48:43] VERSION_ID="7" [2023-05-16 06:48:43] PRETTY_NAME="CentOS Linux 7 (Core)" [2023-05-16 06:48:43] ANSI_COLOR="0;31" [2023-05-16 06:48:43] CPE_NAME="cpe:/o:centos:centos:7" [2023-05-16 06:48:43] HOME_URL="https://www.centos.org/" [2023-05-16 06:48:43] BUG_REPORT_URL="https://bugs.centos.org/" [2023-05-16 06:48:43] [2023-05-16 06:48:43] CENTOS_MANTISBT_PROJECT="CentOS-7" [2023-05-16 06:48:43] CENTOS_MANTISBT_PROJECT_VERSION="7" [2023-05-16 06:48:43] REDHAT_SUPPORT_PRODUCT="centos" [2023-05-16 06:48:43] REDHAT_SUPPORT_PRODUCT_VERSION="7" [2023-05-16 06:48:43] [2023-05-16 06:48:43] 2023-05-16 05:43:27,091 | INFO | pilot.util.auxiliary | pilot_version_banner | ***************************************** [2023-05-16 06:48:43] 2023-05-16 05:43:27,594 | DEBUG | pilot.util.monitoring | check_local_space | checking local space on /home/m/BOINC/slots/1 [2023-05-16 06:48:43] 2023-05-16 05:43:27,595 | INFO | pilot.util.container | print_executable | executing command: df -mP /home/m/BOINC/slots/1 [2023-05-16 06:48:43] 2023-05-16 05:43:27,613 | DEBUG | pilot.util.workernode | get_local_disk_space | stdout=Filesystem 1048576-blocks Used Available Capacity Mounted on [2023-05-16 06:48:43] /dev/sda1 455417 28552 403709 7% /home/m/BOINC/slots/1 [2023-05-16 06:48:43] 2023-05-16 05:43:27,613 | DEBUG | pilot.util.workernode | get_local_disk_space | stderr= [2023-05-16 06:48:43] 2023-05-16 05:43:27,613 | INFO | pilot.util.monitoring | check_local_space | sufficient remaining disk space (423319568384 B) [2023-05-16 06:48:43] 2023-05-16 05:43:27,614 | WARNING | pilot.control.job | proceed_with_getjob | since timefloor is set to 0, pilot was only allowed to run one job [2023-05-16 06:48:43] 2023-05-16 05:43:27,614 | WARNING | pilot.control.job | retrieve | setting graceful_stop since proceed_with_getjob() returned False (pilot will end) [2023-05-16 06:48:43] 2023-05-16 05:43:27,614 | INFO | pilot.control.job | retrieve | [job] retrieve thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:27,736 | INFO | pilot.util.queuehandling | abort_jobs_in_queues | found 0 job(s) in 20 queues [2023-05-16 06:48:43] 2023-05-16 05:43:27,736 | WARNING | pilot.control.monitor | run_checks | pilot monitor received instruction that args.graceful_stop has been set [2023-05-16 06:48:43] 2023-05-16 05:43:27,736 | WARNING | pilot.control.monitor | run_checks | will wait for a maximum of 300 s for threads to finish [2023-05-16 06:48:43] 2023-05-16 05:43:27,752 | DEBUG | pilot.control.job | control | job control ending since graceful_stop has been set [2023-05-16 06:48:43] 2023-05-16 05:43:27,752 | INFO | pilot.control.job | control | [job] control thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:27,892 | INFO | pilot.control.payload | failed_post | [payload] failed_post thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,017 | INFO | pilot.control.job | validate | [job] validate thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,245 | INFO | pilot.control.job | create_data_payload | [job] create_data_payload thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,254 | DEBUG | pilot.control.data | control | data control ending since graceful_stop has been set [2023-05-16 06:48:43] 2023-05-16 05:43:28,254 | INFO | pilot.control.data | control | [data] control thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,350 | INFO | pilot.control.payload | validate_pre | [payload] validate_pre thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,382 | WARNING | pilot.util.common | should_abort | data:copytool_out:received graceful stop - abort after this iteration [2023-05-16 06:48:43] 2023-05-16 05:43:28,424 | INFO | pilot.control.payload | validate_post | [payload] validate_post thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,657 | WARNING | pilot.control.job | check_for_abort_job | job monitor detected an abort_job request (signal=None) [2023-05-16 06:48:43] 2023-05-16 05:43:28,657 | WARNING | pilot.util.common | should_abort | job:job_monitor:received graceful stop - abort after this iteration [2023-05-16 06:48:43] 2023-05-16 05:43:28,657 | INFO | pilot.control.job | job_monitor | [job] job monitor thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,755 | DEBUG | pilot.control.payload | control | payload control ending since graceful_stop has been set [2023-05-16 06:48:43] 2023-05-16 05:43:28,756 | INFO | pilot.control.payload | control | [payload] control thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:28,861 | INFO | pilot.control.payload | execute_payloads | [payload] execute_payloads thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:29,057 | INFO | pilot.control.data | copytool_in | [data] copytool_in thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:29,383 | INFO | pilot.control.data | copytool_out | [data] copytool_out thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:29,474 | WARNING | pilot.util.common | should_abort | job:queue_monitor:received graceful stop - abort after this iteration [2023-05-16 06:48:43] 2023-05-16 05:43:29,474 | INFO | pilot.control.job | queue_monitor | [job] queue monitor thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:29,834 | WARNING | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration [2023-05-16 06:48:43] 2023-05-16 05:43:32,834 | INFO | pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished [2023-05-16 06:48:43] 2023-05-16 05:43:38,600 | INFO | pilot.control.payload | get_logging_info | job.realtimelogging is not enabled [2023-05-16 06:48:43] 2023-05-16 05:43:38,601 | DEBUG | pilot.control.payload | run_realtimelog | real-time logging not needed at this point [2023-05-16 06:48:43] 2023-05-16 05:43:38,601 | DEBUG | pilot.control.payload | run_realtimelog | realtime logger was not found, waiting .. [2023-05-16 06:48:43] 2023-05-16 05:43:38,601 | INFO | pilot.control.payload | run_realtimelog | [payload] run_realtimelog thread has finished [2023-05-16 06:48:43] File "/home/m/BOINC/slots/1/pilot3/pilot/common/exception.py", line 424, in run [2023-05-16 06:48:43] self._target(**self._kwargs) [2023-05-16 06:48:43] File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 142, in control [2023-05-16 06:48:43] raise PilotException(error) [2023-05-16 06:48:43] received exception from bucket queue in generic workflow: error code: 1301, message: An unknown pilot exception has occurred [2023-05-16 06:48:43] details: error code: 1317, message: Exceeded maximum waiting time [2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception) [2023-05-16 06:48:43] monitor: exception caught: error code: 1317, message: Exceeded maximum waiting time [2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception) [2023-05-16 06:48:43] exception caught by thread run() function: (<class 'pilot.common.exception.PilotException'>, PilotException(ExceededMaxWaitTime(('reached maximum waiting time - threads should have finished (ignore ex [2023-05-16 06:48:43] Traceback (most recent call last): [2023-05-16 06:48:43] File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 126, in control [2023-05-16 06:48:43] run_checks(queues, args) [2023-05-16 06:48:43] File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 319, in run_checks [2023-05-16 06:48:43] raise ExceededMaxWaitTime(diagnostics) [2023-05-16 06:48:43] pilot.common.exception.ExceededMaxWaitTime: error code: 1317, message: Exceeded maximum waiting time [2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception) [2023-05-16 06:48:43] [2023-05-16 06:48:43] During handling of the above exception, another exception occurred: [2023-05-16 06:48:43] [2023-05-16 06:48:43] Traceback (most recent call last): [2023-05-16 06:48:43] File "/home/m/BOINC/slots/1/pilot3/pilot/common/exception.py", line 424, in run [2023-05-16 06:48:43] self._target(**self._kwargs) [2023-05-16 06:48:43] File "/home/m/BOINC/slots/1/pilot3/pilot/control/monitor.py", line 142, in control [2023-05-16 06:48:43] raise PilotException(error) [2023-05-16 06:48:43] pilot.common.exception.PilotException: error code: 1301, message: An unknown pilot exception has occurred [2023-05-16 06:48:43] details: error code: 1317, message: Exceeded maximum waiting time [2023-05-16 06:48:43] details: reached maximum waiting time - threads should have finished (ignore exception) [2023-05-16 06:48:43] [2023-05-16 06:48:43] None [2023-05-16 06:48:43] exception has been put in bucket queue belonging to thread 'monitor' [2023-05-16 06:48:43] setting graceful stop in 10 s since there is no point in continuing [2023-05-16 06:48:43] 2023-05-16 05:48:38,533 | INFO | pilot.util.processes | threads_aborted | caller=run is remaining thread - safe to abort (names=['<_MainThread(MainThread, started 13981182045 [2023-05-16 06:48:43] 2023-05-16 05:48:38,533 | DEBUG | pilot.workflow.generic | run | will proceed to set job_aborted [2023-05-16 06:48:43] 2023-05-16 05:48:43,558 | DEBUG | pilot.workflow.generic | run | all relevant threads have aborted (thread count=1) [2023-05-16 06:48:43] 2023-05-16 05:48:43,559 | INFO | pilot.workflow.generic | run | end of generic workflow (traces error code: 0) [2023-05-16 06:48:43] 2023-05-16 05:48:43,559 | INFO | root | wrap_up | traces error code: 0 [2023-05-16 06:48:43] 2023-05-16 05:48:43,559 | INFO | root | wrap_up | pilot has finished (exit code=0, shell exit code=0) [2023-05-16 06:48:43] 2023-05-16 05:48:43,612 [wrapper] ==== pilot stdout END ==== [2023-05-16 06:48:43] 2023-05-16 05:48:43,616 [wrapper] ==== wrapper stdout RESUME ==== [2023-05-16 06:48:43] 2023-05-16 05:48:43,621 [wrapper] pilotpid: 8272 [2023-05-16 06:48:43] 2023-05-16 05:48:43,626 [wrapper] Pilot exit status: 0 [2023-05-16 06:48:43] 2023-05-16 05:48:43,638 [wrapper] pandaids: 5847613469 [2023-05-16 06:48:43] 2023-05-16 05:48:43,645 [wrapper] apfmon messages muted [2023-05-16 06:48:43] 2023-05-16 05:48:43,649 [wrapper] Test setup, not cleaning [2023-05-16 06:48:43] 2023-05-16 05:48:43,652 [wrapper] ==== wrapper stdout END ==== [2023-05-16 06:48:43] 2023-05-16 05:48:43,656 [wrapper] ==== wrapper stderr END ==== [2023-05-16 06:48:43] 2023-05-16 05:48:43,665 [wrapper] wrapperexiting ec=0, duration=1845 [2023-05-16 06:48:43] 2023-05-16 05:48:43,670 [wrapper] apfmon messages muted [2023-05-16 06:48:43] *** Error codes and diagnostics *** [2023-05-16 06:48:43] "exeErrorCode": 0, [2023-05-16 06:48:43] "exeErrorDiag": "", [2023-05-16 06:48:43] "pilotErrorCode": 0, [2023-05-16 06:48:43] "pilotErrorDiag": "", [2023-05-16 06:48:43] *** Listing of results directory *** [2023-05-16 06:48:43] total 45500 [2023-05-16 06:48:43] -rw-r--r-- 1 m m 4388 May 15 22:20 queuedata.json [2023-05-16 06:48:43] -rwx------ 1 m m 27540 May 15 22:21 runpilot2-wrapper.sh [2023-05-16 06:48:43] -rw------- 1 m m 413616 May 15 22:21 pilot3.tar.gz [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 107 May 16 06:17 wrapper_26015_x86_64-pc-linux-gnu [2023-05-16 06:48:43] -rwxr-xr-x 1 m m 7986 May 16 06:17 run_atlas [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 112 May 16 06:17 job.xml [2023-05-16 06:48:43] -rw-r--r-- 2 m m 17628 May 16 06:17 start_atlas.sh [2023-05-16 06:48:43] drwxrwx--x 2 m m 4096 May 16 06:17 shared [2023-05-16 06:48:43] -rw-r--r-- 2 m m 425322 May 16 06:17 input.tar.gz [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 5708 May 16 06:17 init_data.xml [2023-05-16 06:48:43] -rw-r--r-- 2 m m 37620382 May 16 06:17 EVNT.04972714._000038.pool.root.1 [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 0 May 16 06:17 boinc_lockfile [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 2658 May 16 06:17 pandaJob.out [2023-05-16 06:48:43] -rw------- 1 m m 424 May 16 06:17 setup.sh.local [2023-05-16 06:48:43] -rw------- 1 m m 1425006 May 16 06:19 cric_ddmendpoints.json [2023-05-16 06:48:43] -rw------- 1 m m 1016777 May 16 06:19 agis_schedconf.cvmfs.json [2023-05-16 06:48:43] drwx------ 4 m m 4096 May 16 06:20 pilot3 [2023-05-16 06:48:43] -rw------- 1 m m 1859045 May 16 06:37 output.1.fda160fa-9103-40db-86b5-09da938b92c9_6527.pool.root [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 533 May 16 06:37 boinc_task_state.xml [2023-05-16 06:48:43] -rw------- 1 m m 1018 May 16 06:37 memory_monitor_summary.json [2023-05-16 06:48:43] -rw------- 1 m m 186696 May 16 06:40 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log.tgz [2023-05-16 06:48:43] -rw------- 1 m m 7383 May 16 06:43 heartbeat.json [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 8192 May 16 06:48 boinc_mmap_file [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 25 May 16 06:48 wrapper_checkpoint.txt [2023-05-16 06:48:43] -rw------- 1 m m 7928 May 16 06:48 pilotlog.txt [2023-05-16 06:48:43] -rw------- 1 m m 686866 May 16 06:48 fda160fa-9103-40db-86b5-09da938b92c9_6527.1.job.log [2023-05-16 06:48:43] -rw------- 1 m m 460 May 16 06:48 output.list [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 744 May 16 06:48 runtime_log [2023-05-16 06:48:43] -rw------- 1 m m 2754560 May 16 06:48 result.tar.gz [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 11879 May 16 06:48 runtime_log.err [2023-05-16 06:48:43] -rw------- 1 m m 626 May 16 06:48 0lxNDm8nuI3n7Olcko1bjSoqABFKDmABFKDm7AsVDmPXJKDmQpbGhn.diag [2023-05-16 06:48:43] -rw-rw-r-- 1 m m 26704 May 16 06:48 stderr.txt [2023-05-16 06:48:43] HITS file was successfully produced: [2023-05-16 06:48:43] -rw------- 1 m m 1859045 May 16 06:37 shared/HITS.pool.root.1 [2023-05-16 06:48:43] *** Contents of shared directory: *** [2023-05-16 06:48:43] total 41684 [2023-05-16 06:48:43] -rw-r--r-- 2 m m 17628 May 16 06:17 start_atlas.sh [2023-05-16 06:48:43] -rw-r--r-- 2 m m 425322 May 16 06:17 input.tar.gz [2023-05-16 06:48:43] -rw-r--r-- 2 m m 37620382 May 16 06:17 ATLAS.root_0 [2023-05-16 06:48:43] -rw------- 1 m m 1859045 May 16 06:37 HITS.pool.root.1 [2023-05-16 06:48:43] -rw------- 1 m m 2754560 May 16 06:48 result.tar.gz 06:48:45 (4503): run_atlas exited; CPU time 673.365423 06:48:45 (4503): called boinc_finish(0) </stderr_txt> ]]>
©2024 CERN