Name | kqaMDmmcLRzn7Olcko1bjSoqABFKDmABFKDmumdXDmPIGKDmm7B2pn_0 |
Workunit | 2109670 |
Created | 24 Jul 2021, 1:27:16 UTC |
Sent | 24 Jul 2021, 1:39:41 UTC |
Report deadline | 31 Jul 2021, 1:39:41 UTC |
Received | 24 Jul 2021, 1:56:36 UTC |
Server state | Over |
Outcome | Validate error |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 4357 |
Run time | 15 min 59 sec |
CPU time | 1 min 22 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 3.04 GFLOPS |
Application version | ATLAS Simulation v1.04 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 745.01 MB |
Peak swap size | 30.47 GB |
Peak disk usage | 79.42 MB |
<core_client_version>7.9.3</core_client_version> <![CDATA[ <stderr_txt> 21:40:26 (12404): wrapper (7.7.26015): starting 21:40:26 (12404): wrapper: running run_atlas (--nthreads 1) [2021-07-23 21:40:26] Arguments: --nthreads 1 [2021-07-23 21:40:26] Threads: 1 [2021-07-23 21:40:26] Checking for CVMFS [2021-07-23 21:40:26] Probing /cvmfs/atlas.cern.ch... OK [2021-07-23 21:40:27] Probing /cvmfs/atlas-condb.cern.ch... OK [2021-07-23 21:40:27] Running cvmfs_config stat atlas.cern.ch [2021-07-23 21:40:28] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2021-07-23 21:40:28] 2.9.0.0 12719 221 59536 88914 2 12 3261333 4194305 737 130560 0 518095 99.729 160596 1856 http://cvmfs-s1bnl.opensciencegrid.org:8000/cvmfs/atlas.cern.ch DIRECT 0 [2021-07-23 21:40:28] CVMFS is ok [2021-07-23 21:40:28] Efficiency of ATLAS tasks can be improved by the following measure(s): [2021-07-23 21:40:28] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io. [2021-07-23 21:40:28] Small home clusters do not require a local http proxy but it is suggested if [2021-07-23 21:40:28] more than 10 cores throughout the same LAN segment are regularly running ATLAS like tasks. [2021-07-23 21:40:28] Further information can be found at the LHC@home message board. [2021-07-23 21:40:28] Using singularity image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 [2021-07-23 21:40:28] Checking for singularity binary... 
[2021-07-23 21:40:28] Using singularity found in PATH at /usr/local/bin/singularity [2021-07-23 21:40:28] Running /usr/local/bin/singularity --version [2021-07-23 21:40:28] singularity version 3.7.2+10-ga969f0f8c [2021-07-23 21:40:28] Checking singularity works with /usr/local/bin/singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname [2021-07-23 21:40:30] E5-2670 [2021-07-23 21:40:30] Singularity works [2021-07-23 21:40:30] Starting ATLAS job with PandaID=5129440837 [2021-07-23 21:40:30] Running command: /usr/local/bin/singularity exec --pwd /var/lib/boinc-client/slots/9 -B /cvmfs,/var /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh [2021-07-23 21:56:22] *** The last 200 lines of the pilot log: *** [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | .............................. [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . Timing measurements: [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . get job = 0 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . initial setup = 1 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . payload setup = 43 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . total setup = 44 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . stage-in = 0 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . payload execution = 665 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . stage-out = 0 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | INFO | pilot.util.timing | timing_report | .............................. 
[2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | INFO | pilot.user.atlas.diagnose | get_log_extracts | building log extracts (sent to the server as 'pilotLog') [2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | DEBUG | pilot.user.atlas.diagnose | get_panda_tracer_log | PanDA tracer log does not exist: /var/lib/boinc-client/slots/9/PanDA_Pilot-5129440837/pandatracerlog [2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | INFO | pilot.util.container | execute | executing command: tail -n 20 /var/lib/boinc-client/slots/9/PanDA_Pilot-5129440837/pilotlog.txt [2021-07-23 21:56:22] 2021-07-24 01:56:10,231 | WARNING | pilot.control.job | add_timing_and_extracts | [2021-07-23 21:56:22] XXXXXXXXXXXXXXXXXXXXX[begin log extracts] [2021-07-23 21:56:22] - Log from pilotlog.txt - [2021-07-23 21:56:22] 2021-07-24 01:56:10,056 | INFO | pilot.control.job | get_data_structure | mean actualcorecount: 4.333333 [2021-07-23 21:56:22] 2021-07-24 01:56:10,056 | INFO | pilot.control.job | get_data_structure | payload/TRF did not report the number of read events [2021-07-23 21:56:22] 2021-07-24 01:56:10,081 | INFO | pilot.util.container | execute | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo [2021-07-23 21:56:22] 2021-07-24 01:56:10,191 | INFO | pilot.user.atlas.utilities | get_memory_values | using path: /var/lib/boinc-client/slots/9/PanDA_Pilot-5129440837/memory_monitor_summary.json (trf na [2021-07-23 21:56:22] 2021-07-24 01:56:10,193 | DEBUG | pilot.user.atlas.utilities | get_memory_monitor_info | summary_dictionary={'Avg': {'nprocs': 5.181, 'nthreads': 7.545, 'pss': 677889.0, 'rchar': 145075.0, [2021-07-23 21:56:22] 2021-07-24 01:56:10,193 | INFO | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard info from prmon json [2021-07-23 21:56:22] 2021-07-24 01:56:10,193 | INFO | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard memory fields from prmon json [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | 
timing_report | .............................. [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . Timing measurements: [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . get job = 0 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . initial setup = 1 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . payload setup = 43 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . total setup = 44 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . stage-in = 0 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . payload execution = 665 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,194 | INFO | pilot.util.timing | timing_report | . stage-out = 0 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | INFO | pilot.util.timing | timing_report | .............................. 
[2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | INFO | pilot.user.atlas.diagnose | get_log_extracts | building log extracts (sent to the server as 'pilotLog') [2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | DEBUG | pilot.user.atlas.diagnose | get_panda_tracer_log | PanDA tracer log does not exist: /var/lib/boinc-client/slots/9/PanDA_Pilot-5129440837/pandatracerlog [2021-07-23 21:56:22] 2021-07-24 01:56:10,195 | INFO | pilot.util.container | execute | executing command: tail -n 20 /var/lib/boinc-client/slots/9/PanDA_Pilot-5129440837/pilotlog.txt [2021-07-23 21:56:22] XXXXXXXXXXXXXXXXXXXXX[end log extracts] [2021-07-23 21:56:22] 2021-07-24 01:56:10,231 | WARNING | pilot.control.job | add_error_codes | pilotErrorCodes = [1305] (will report primary/first error code) [2021-07-23 21:56:22] 2021-07-24 01:56:10,231 | WARNING | pilot.control.job | add_error_codes | pilotErrorDiags = ['Failed to execute payload'] (will report primary/first error diag) [2021-07-23 21:56:22] 2021-07-24 01:56:10,231 | DEBUG | pilot.control.job | send_state | is_harvester_mode(args) : False [2021-07-23 21:56:22] 2021-07-24 01:56:10,232 | DEBUG | pilot.control.job | write_heartbeat_to_file | heartbeat dictionary: {'jobId': '5129440837', 'state': 'failed', 'timestamp': '2021-07-23T21:56:10-0 [2021-07-23 21:56:22] 2021-07-24 01:56:10,233 | DEBUG | pilot.control.job | write_heartbeat_to_file | wrote heartbeat to file /var/lib/boinc-client/slots/9/heartbeat.json [2021-07-23 21:56:22] 2021-07-24 01:56:10,233 | DEBUG | pilot.control.job | queue_monitor | job 5129440837 was dequeued from the monitored payloads queue [2021-07-23 21:56:22] 2021-07-24 01:56:10,385 | DEBUG | pilot.control.job | queue_monitor | tmp job object deleted [2021-07-23 21:56:22] 2021-07-24 01:56:10,385 | INFO | pilot.control.job | make_job_report | [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | job summary report [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | 
make_job_report | -------------------------------------------------- [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | PanDA job id: 5129440837 [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | task id: NULL [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | error 1/1: 1305: Failed to execute payload [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | status: LOG_TRANSFER = DONE [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | pilot state: failed [2021-07-23 21:56:22] 2021-07-24 01:56:10,386 | INFO | pilot.control.job | make_job_report | transexitcode: 65 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | exeerrorcode: 65 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | exeerrordiag: Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: "GeoModelS [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | exitcode: 65 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | exitmsg: Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: "GeoModelSvc [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | cpuconsumptiontime: 90 s [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | nevents: 0 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | neventsw: 0 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | pid: 32891 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | make_job_report | pgrp: 32891 [2021-07-23 21:56:22] 2021-07-24 01:56:10,387 | INFO | pilot.control.job | 
make_job_report | corecount: 1 [2021-07-23 21:56:22] 2021-07-24 01:56:10,388 | INFO | pilot.control.job | make_job_report | event service: False [2021-07-23 21:56:22] 2021-07-24 01:56:10,388 | INFO | pilot.control.job | make_job_report | sizes: {24824028: 3855540, 24824029: 3855652, 24824030: 3855817, 24824742: 3879594, 24824743: 388392 [2021-07-23 21:56:22] 2021-07-24 01:56:10,388 | INFO | pilot.control.job | make_job_report | -------------------------------------------------- [2021-07-23 21:56:22] 2021-07-24 01:56:10,388 | INFO | pilot.control.job | make_job_report | [2021-07-23 21:56:22] 2021-07-24 01:56:10,388 | DEBUG | pilot.control.job | has_job_completed | ls -lF /var/lib/boinc-client/slots/9: [2021-07-23 21:56:22] [2021-07-23 21:56:22] 2021-07-24 01:56:10,388 | INFO | pilot.util.container | execute | executing command: ls -lF /var/lib/boinc-client/slots/9 [2021-07-23 21:56:22] 2021-07-24 01:56:10,445 | DEBUG | pilot.control.job | has_job_completed | total 41500 [2021-07-23 21:56:22] -rw------- 1 boinc boinc 164282 Jul 23 21:56 838411c0-5e09-4ad0-ba04-8a8f216bf2a0_88559.1.job.log [2021-07-23 21:56:22] -rw------- 1 boinc boinc 75751 Jul 23 21:54 838411c0-5e09-4ad0-ba04-8a8f216bf2a0_88559.1.job.log.tgz [2021-07-23 21:56:22] -rw------- 1 boinc boinc 1068585 Jul 23 21:42 agis_schedconf.cvmfs.json [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 0 Jul 23 21:40 boinc_lockfile [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 8192 Jul 23 21:56 boinc_mmap_file [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 531 Jul 23 21:53 boinc_task_state.xml [2021-07-23 21:56:22] -rw------- 1 boinc boinc 1961526 Jul 23 21:42 cric_ddmendpoints.json [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 38217911 Jul 23 21:40 EVNT.04972714._000030.pool.root.1 [2021-07-23 21:56:22] -rw------- 1 boinc boinc 8540 Jul 23 21:56 heartbeat.json [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 6187 Jul 23 21:40 init_data.xml [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 350495 Jul 23 21:40 input.tar.gz 
[2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 112 Jul 23 21:40 job.xml [2021-07-23 21:56:22] -rw------- 1 boinc boinc 130 Jul 23 21:40 kqaMDmmcLRzn7Olcko1bjSoqABFKDmABFKDmumdXDmPIGKDmm7B2pn.diag [2021-07-23 21:56:22] -rw------- 1 boinc boinc 1010 Jul 23 21:54 memory_monitor_summary.json [2021-07-23 21:56:22] -rw------- 1 boinc boinc 263 Jul 23 21:54 output.list [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 2613 Jul 23 21:40 pandaJob.out [2021-07-23 21:56:22] drwxrwx--- 2 boinc boinc 4096 Jul 23 21:54 PanDA_Pilot-5129440837/ [2021-07-23 21:56:22] drwx------ 4 boinc boinc 4096 Jul 23 21:42 pilot2/ [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 340725 Jul 23 20:30 pilot2.tar.gz [2021-07-23 21:56:22] -rw------- 1 boinc boinc 145323 Jul 23 21:56 pilotlog.txt [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 4534 Jul 23 21:27 queuedata.json [2021-07-23 21:56:22] -rwxr-xr-x 1 boinc boinc 6966 Jul 23 21:40 run_atlas* [2021-07-23 21:56:22] -rwx------ 1 boinc boinc 20591 Jul 23 21:27 runpilot2-wrapper.sh* [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 407 Jul 23 21:40 runtime_log [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 5560 Jul 23 21:40 runtime_log.err [2021-07-23 21:56:22] drwxrwx--x 2 boinc boinc 4096 Jul 23 21:40 shared/ [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 16551 Jul 23 21:40 start_atlas.sh [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 2209 Jul 23 21:40 stderr.txt [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 107 Jul 23 21:40 wrapper_26015_x86_64-pc-linux-gnu [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 23 Jul 23 21:56 wrapper_checkpoint.txt [2021-07-23 21:56:22] 2021-07-24 01:56:10,446 | INFO | pilot.util.queuehandling | queue_report | queue jobs has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,446 | INFO | pilot.util.queuehandling | queue_report | queue payloads has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,446 | INFO | pilot.util.queuehandling | queue_report | queue data_in has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,446 | 
INFO | pilot.util.queuehandling | queue_report | queue data_out has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,446 | INFO | pilot.util.queuehandling | queue_report | queue current_data_in has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,446 | INFO | pilot.util.queuehandling | queue_report | queue validated_jobs has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue validated_payloads has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue monitored_payloads has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue finished_jobs has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue finished_payloads has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue finished_data_in has 1 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue finished_data_out has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue failed_jobs has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue failed_payloads has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,447 | INFO | pilot.util.queuehandling | queue_report | queue failed_data_in has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,448 | INFO | pilot.util.queuehandling | queue_report | queue failed_data_out has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,448 | INFO | pilot.util.queuehandling | queue_report | queue completed_jobs has 0 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,448 | INFO | pilot.util.queuehandling | queue_report | queue completed_jobids has 1 job(s) [2021-07-23 21:56:22] 2021-07-24 01:56:10,448 | INFO | 
pilot.control.job | has_job_completed | job 5129440837 has completed (purged errors) [2021-07-23 21:56:22] 2021-07-24 01:56:10,448 | INFO | pilot.util.processes | cleanup | overall cleanup function is called [2021-07-23 21:56:22] 2021-07-24 01:56:10,451 | DEBUG | pilot.util.processes | cleanup | work directory was removed: /var/lib/boinc-client/slots/9/PanDA_Pilot-5129440837 [2021-07-23 21:56:22] 2021-07-24 01:56:11,456 | INFO | pilot.info.jobdata | collect_zombies | --- collectZombieJob: --- 10, [32891] [2021-07-23 21:56:22] 2021-07-24 01:56:11,456 | INFO | pilot.info.jobdata | collect_zombies | zombie collector trying to kill pid 32891 [2021-07-23 21:56:22] 2021-07-24 01:56:11,457 | INFO | pilot.info.jobdata | collect_zombies | harmless exception when collecting zombies: [Errno 10] No child processes [2021-07-23 21:56:22] 2021-07-24 01:56:12,460 | INFO | pilot.util.processes | cleanup | collected zombie processes [2021-07-23 21:56:22] 2021-07-24 01:56:12,460 | INFO | pilot.util.processes | cleanup | will now attempt to kill all subprocesses of pid=32891 [2021-07-23 21:56:22] 2021-07-24 01:56:12,641 | INFO | pilot.util.processes | kill_processes | process IDs to be killed: [32891] (in reverse order) [2021-07-23 21:56:22] 2021-07-24 01:56:12,745 | WARNING | pilot.util.processes | kill_processes | found no corresponding commands to process id(s) [2021-07-23 21:56:22] 2021-07-24 01:56:12,745 | INFO | pilot.util.processes | kill_orphans | Do not look for orphan processes in BOINC jobs [2021-07-23 21:56:22] 2021-07-24 01:56:12,746 | DEBUG | pilot.util.queuehandling | purge_queue | queue purged [2021-07-23 21:56:22] 2021-07-24 01:56:12,746 | INFO | pilot.control.job | retrieve | ready for new job [2021-07-23 21:56:22] 2021-07-24 01:56:12,746 | INFO | root | retrieve | pilot has finished for previous job - re-establishing logging [2021-07-23 21:56:22] 2021-07-24 01:56:12,747 | INFO | pilot.util.auxiliary | pilot_version_banner | ***************************************** 
[2021-07-23 21:56:22] 2021-07-24 01:56:12,747 | INFO | pilot.util.auxiliary | pilot_version_banner | *** PanDA Pilot version 2.12.3 (13) *** [2021-07-23 21:56:22] 2021-07-24 01:56:12,748 | INFO | pilot.util.auxiliary | pilot_version_banner | ***************************************** [2021-07-23 21:56:22] 2021-07-24 01:56:12,748 | INFO | pilot.util.auxiliary | pilot_version_banner | [2021-07-23 21:56:22] 2021-07-24 01:56:12,770 | INFO | pilot.util.auxiliary | display_architecture_info | architecture information: [2021-07-23 21:56:22] 2021-07-24 01:56:13,279 | INFO | pilot.util.auxiliary | display_architecture_info | [2021-07-23 21:56:22] LSB Version: :core-4.1-amd64:core-4.1-noarch [2021-07-23 21:56:22] Distributor ID: CentOS [2021-07-23 21:56:22] Description: CentOS Linux release 7.8.2003 (Core) [2021-07-23 21:56:22] Release: 7.8.2003 [2021-07-23 21:56:22] Codename: Core [2021-07-23 21:56:22] 2021-07-24 01:56:13,279 | INFO | pilot.util.auxiliary | pilot_version_banner | ***************************************** [2021-07-23 21:56:22] 2021-07-24 01:56:13,780 | DEBUG | pilot.util.monitoring | check_local_space | checking local space on /var/lib/boinc-client/slots/9 [2021-07-23 21:56:22] 2021-07-24 01:56:13,805 | INFO | pilot.util.monitoring | check_local_space | sufficient remaining disk space (29501685760 B) [2021-07-23 21:56:22] 2021-07-24 01:56:13,805 | WARNING | pilot.control.job | proceed_with_getjob | since timefloor is set to 0, pilot was only allowed to run one job [2021-07-23 21:56:22] 2021-07-24 01:56:13,806 | DEBUG | pilot.control.job | retrieve | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:13,806 | DEBUG | pilot.control.job | retrieve | [job] retrieve thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:13,967 | DEBUG | pilot.control.job | control | job control ending since graceful_stop has been set [2021-07-23 21:56:22] 2021-07-24 01:56:13,967 | DEBUG | pilot.control.job | control | will not set job_aborted yet [2021-07-23 
21:56:22] 2021-07-24 01:56:13,967 | DEBUG | pilot.control.job | control | [job] control thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:14,224 | DEBUG | pilot.control.payload | control | payload control ending since graceful_stop has been set [2021-07-23 21:56:22] 2021-07-24 01:56:14,224 | DEBUG | pilot.control.payload | control | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:14,225 | DEBUG | pilot.control.payload | control | [payload] control thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:14,323 | DEBUG | pilot.control.payload | validate_pre | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:14,324 | INFO | pilot.control.payload | validate_pre | [payload] validate_pre thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:14,332 | DEBUG | pilot.control.data | control | data control ending since graceful_stop has been set [2021-07-23 21:56:22] 2021-07-24 01:56:14,332 | DEBUG | pilot.control.data | control | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:14,333 | DEBUG | pilot.control.data | control | [data] control thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:14,412 | DEBUG | pilot.control.payload | execute_payloads | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:14,412 | INFO | pilot.control.payload | execute_payloads | [payload] execute_payloads thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:14,492 | DEBUG | pilot.control.data | copytool_in | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:14,493 | DEBUG | pilot.control.data | copytool_in | [data] copytool_in thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:14,572 | INFO | pilot.control.monitor | control | [monitor] control thread has ended [2021-07-23 21:56:22] 2021-07-24 01:56:14,942 | WARNING | pilot.util.common | should_abort | data:copytool_out:received graceful stop - abort after this iteration [2021-07-23 21:56:22] 2021-07-24 01:56:14,997 | 
DEBUG | pilot.control.payload | failed_post | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:14,997 | INFO | pilot.control.payload | failed_post | [payload] failed_post thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:15,128 | DEBUG | pilot.control.job | validate | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:15,128 | DEBUG | pilot.control.job | validate | [job] validate thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:15,225 | DEBUG | pilot.control.job | create_data_payload | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:15,225 | DEBUG | pilot.control.job | create_data_payload | [job] create_data_payload thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:15,273 | DEBUG | pilot.control.payload | validate_post | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:15,273 | INFO | pilot.control.payload | validate_post | [payload] validate_post thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:15,392 | WARNING | pilot.util.common | should_abort | job:queue_monitor:received graceful stop - abort after this iteration [2021-07-23 21:56:22] 2021-07-24 01:56:15,392 | DEBUG | pilot.control.job | queue_monitor | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:15,393 | DEBUG | pilot.control.job | queue_monitor | [job] queue monitor thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:15,942 | DEBUG | pilot.control.data | copytool_out | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:15,943 | DEBUG | pilot.control.data | copytool_out | [data] copytool_out thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:15,963 | WARNING | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration [2021-07-23 21:56:22] 2021-07-24 01:56:18,964 | DEBUG | pilot.control.data | queue_monitoring | will not set job_aborted yet [2021-07-23 21:56:22] 2021-07-24 01:56:18,965 | DEBUG | 
pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:20,516 | WARNING | pilot.control.job | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 62 s) [2021-07-23 21:56:22] 2021-07-24 01:56:20,517 | DEBUG | pilot.util.processes | threads_aborted | aborting since the last relevant thread is about to finish [2021-07-23 21:56:22] 2021-07-24 01:56:20,517 | DEBUG | pilot.control.job | job_monitor | will proceed to set job_aborted [2021-07-23 21:56:22] 2021-07-24 01:56:20,517 | DEBUG | pilot.control.job | job_monitor | [job] job monitor thread has finished [2021-07-23 21:56:22] 2021-07-24 01:56:21,188 | INFO | pilot.workflow.generic | run | end of generic workflow (traces error code: 0) [2021-07-23 21:56:22] 2021-07-24 01:56:21,189 | INFO | root | wrap_up | traces error code: 0 [2021-07-23 21:56:22] 2021-07-24 01:56:21,189 | INFO | root | wrap_up | pilot has finished [2021-07-23 21:56:22] 2021-07-24 01:56:21,299 [wrapper] ==== pilot stdout END ==== [2021-07-23 21:56:22] 2021-07-24 01:56:21,322 [wrapper] ==== wrapper stdout RESUME ==== [2021-07-23 21:56:22] 2021-07-24 01:56:21,346 [wrapper] Pilot exit status: 0 [2021-07-23 21:56:22] 2021-07-24 01:56:21,493 [wrapper] pandaids: 5129440837 [2021-07-23 21:56:22] 2021-07-24 01:56:21,542 [wrapper] apfmon messages muted [2021-07-23 21:56:22] 2021-07-24 01:56:21,566 [wrapper] Test setup, not cleaning [2021-07-23 21:56:22] 2021-07-24 01:56:21,590 [wrapper] ==== wrapper stdout END ==== [2021-07-23 21:56:22] 2021-07-24 01:56:21,614 [wrapper] ==== wrapper stderr END ==== [2021-07-23 21:56:22] 2021-07-24 01:56:21,666 [wrapper] wrapperexiting ec=0, duration=949 [2021-07-23 21:56:22] 2021-07-24 01:56:21,689 [wrapper] apfmon messages muted [2021-07-23 21:56:22] *** Error codes and diagnostics *** [2021-07-23 21:56:22] "exeErrorCode": 65, [2021-07-23 21:56:22] "exeErrorDiag": "Non-zero return code from EVNTtoHITS (33); Logfile error in 
log.EVNTtoHITS: \"GeoModelSvc FATAL in sysInitialize(): standard std::exception is caught\"", [2021-07-23 21:56:22] "pilotErrorCode": 1305, [2021-07-23 21:56:22] "pilotErrorDiag": "Failed to execute payload", [2021-07-23 21:56:22] *** Listing of results directory *** [2021-07-23 21:56:22] total 41672 [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 340725 Jul 23 20:30 pilot2.tar.gz [2021-07-23 21:56:22] -rwx------ 1 boinc boinc 20591 Jul 23 21:27 runpilot2-wrapper.sh [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 4534 Jul 23 21:27 queuedata.json [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 107 Jul 23 21:40 wrapper_26015_x86_64-pc-linux-gnu [2021-07-23 21:56:22] -rwxr-xr-x 1 boinc boinc 6966 Jul 23 21:40 run_atlas [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 112 Jul 23 21:40 job.xml [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 6187 Jul 23 21:40 init_data.xml [2021-07-23 21:56:22] drwxrwx--x 2 boinc boinc 4096 Jul 23 21:40 shared [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 0 Jul 23 21:40 boinc_lockfile [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 16551 Jul 23 21:40 start_atlas.sh [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 350495 Jul 23 21:40 input.tar.gz [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 38217911 Jul 23 21:40 EVNT.04972714._000030.pool.root.1 [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 2613 Jul 23 21:40 pandaJob.out [2021-07-23 21:56:22] -rw------- 1 boinc boinc 1068585 Jul 23 21:42 agis_schedconf.cvmfs.json [2021-07-23 21:56:22] -rw------- 1 boinc boinc 1961526 Jul 23 21:42 cric_ddmendpoints.json [2021-07-23 21:56:22] drwx------ 4 boinc boinc 4096 Jul 23 21:42 pilot2 [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 531 Jul 23 21:53 boinc_task_state.xml [2021-07-23 21:56:22] -rw------- 1 boinc boinc 1010 Jul 23 21:54 memory_monitor_summary.json [2021-07-23 21:56:22] -rw------- 1 boinc boinc 75751 Jul 23 21:54 838411c0-5e09-4ad0-ba04-8a8f216bf2a0_88559.1.job.log.tgz [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 8192 Jul 23 21:56 
boinc_mmap_file [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 23 Jul 23 21:56 wrapper_checkpoint.txt [2021-07-23 21:56:22] -rw------- 1 boinc boinc 8540 Jul 23 21:56 heartbeat.json [2021-07-23 21:56:22] -rw------- 1 boinc boinc 7366 Jul 23 21:56 pilotlog.txt [2021-07-23 21:56:22] -rw------- 1 boinc boinc 178845 Jul 23 21:56 838411c0-5e09-4ad0-ba04-8a8f216bf2a0_88559.1.job.log [2021-07-23 21:56:22] -rw------- 1 boinc boinc 263 Jul 23 21:56 output.list [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 8833 Jul 23 21:56 runtime_log.err [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 686 Jul 23 21:56 runtime_log [2021-07-23 21:56:22] -rw------- 1 boinc boinc 276480 Jul 23 21:56 result.tar.gz [2021-07-23 21:56:22] -rw------- 1 boinc boinc 558 Jul 23 21:56 kqaMDmmcLRzn7Olcko1bjSoqABFKDmABFKDmumdXDmPIGKDmm7B2pn.diag [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 31093 Jul 23 21:56 stderr.txt [2021-07-23 21:56:22] No HITS result produced [2021-07-23 21:56:22] *** Contents of shared directory: *** [2021-07-23 21:56:22] total 37960 [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 38217911 Jul 23 21:40 ATLAS.root_0 [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 16551 Jul 23 21:40 start_atlas.sh [2021-07-23 21:56:22] -rw-r--r-- 1 boinc boinc 350495 Jul 23 21:40 input.tar.gz [2021-07-23 21:56:22] -rw------- 1 boinc boinc 276480 Jul 23 21:56 result.tar.gz 21:56:23 (12404): run_atlas exited; CPU time 82.147304 21:56:23 (12404): called boinc_finish(0) </stderr_txt> ]]>