Name | hGENDmjR5B3n7Olcko1bjSoqABFKDmABFKDm7AsVDmKKIKDmqepgRn_0 |
Workunit | 2302746 |
Created | 27 Apr 2023, 1:17:51 UTC |
Sent | 27 Apr 2023, 1:30:17 UTC |
Report deadline | 4 May 2023, 1:30:17 UTC |
Received | 27 Apr 2023, 1:55:16 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1498 |
Run time | 23 min 0 sec |
CPU time | 1 min 8 sec |
Validate state | Valid |
Credit | 11.91 |
Device peak FLOPS | 9.74 GFLOPS |
Application version | ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 950.36 MB |
Peak swap size | 1.57 GB |
Peak disk usage | 81.03 MB |
<core_client_version>7.4.25</core_client_version> <![CDATA[ <stderr_txt> 02:31:43 (27300): wrapper (7.7.26015): starting 02:31:43 (27300): wrapper: running run_atlas (--nthreads 3) [2023-04-27 02:31:43] Arguments: --nthreads 3 [2023-04-27 02:31:43] Threads: 3 [2023-04-27 02:31:43] Checking for CVMFS [2023-04-27 02:31:43] Probing /cvmfs/atlas.cern.ch... OK [2023-04-27 02:31:43] Probing /cvmfs/atlas-condb.cern.ch... OK [2023-04-27 02:31:43] Running cvmfs_config stat atlas.cern.ch [2023-04-27 02:31:43] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2023-04-27 02:31:43] 2.9.0.0 2872 63 49416 118322 0 230 3988830 4194304 831 130560 0 89038 97.570 238812 179 http://s1ral-cvmfs.openhtc.io/cvmfs/atlas.cern.ch http://192.168.100.152:3128 1 [2023-04-27 02:31:43] CVMFS is ok [2023-04-27 02:31:43] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 [2023-04-27 02:31:43] Checking for apptainer binary... [2023-04-27 02:31:43] Using apptainer found in PATH at /usr/bin/apptainer [2023-04-27 02:31:43] Running /usr/bin/apptainer --version [2023-04-27 02:31:43] apptainer version 1.0.3 [2023-04-27 02:31:43] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname [2023-04-27 02:31:43] TeeC16 [2023-04-27 02:31:43] apptainer works [2023-04-27 02:31:43] Set ATHENA_PROC_NUMBER=3 [2023-04-27 02:31:43] Set ATHENA_CORE_NUMBER=3 [2023-04-27 02:31:43] Starting ATLAS job with PandaID=5832219782 [2023-04-27 02:31:43] Running command: /usr/bin/apptainer exec -B /cvmfs,/home/m/BOINC/slots/1 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh [2023-04-27 02:54:40] *** The last 200 lines of the pilot log: *** [2023-04-27 02:54:40] 2023-04-27 01:52:22,615 | INFO | -- lfn=e7049c97-d9e0-4b82-8e92-1590d32ec8b5_70670.1.job.log.tgz, status_code=0, status=transferred [2023-04-27 02:54:40] 2023-04-27 01:52:22,615 | INFO | stage-out finished correctly [2023-04-27 02:54:40] 2023-04-27 01:52:24,101 | INFO | finished stage-out (of log) for failed payload [2023-04-27 02:54:40] 2023-04-27 01:52:24,774 | INFO | job 5832219782 has state=failed [2023-04-27 02:54:40] 2023-04-27 01:52:24,775 | INFO | preparing for final server update for job 5832219782 in state='failed' [2023-04-27 02:54:40] 2023-04-27 01:52:28,694 | INFO | monitor loop #14: job 0:5832219782 is in state 'failed' [2023-04-27 02:54:40] 2023-04-27 01:52:28,695 | INFO | will abort job monitoring soon since job state=failed (job is still in queue) [2023-04-27 02:54:40] 2023-04-27 01:52:50,041 | INFO | collecting machine features [2023-04-27 02:54:40] 2023-04-27 01:52:50,041 | INFO | machine features path does not exist (path="") [2023-04-27 02:54:40] 2023-04-27 01:53:20,110 | INFO | 1289s have passed since pilot start [2023-04-27 02:54:40] 2023-04-27 01:53:30,294 | INFO | monitor loop #15: job 0:5832219782 is in state 'failed' [2023-04-27 02:54:40] 2023-04-27 01:53:30,295 | INFO | will abort job monitoring soon since job state=failed (job is still in queue) [2023-04-27 02:54:40] 2023-04-27 01:53:50,169 | INFO | collecting machine features [2023-04-27 02:54:40] 2023-04-27 01:53:50,169 | INFO | machine features path does not exist (path="") [2023-04-27 02:54:40] 2023-04-27 01:54:20,191 | INFO | proceeding with final server update [2023-04-27 02:54:40] 2023-04-27 01:54:20,192 | INFO | pilot will not update the server (heartbeat message will be written to file) [2023-04-27 02:54:40] 2023-04-27 01:54:20,192 | INFO | job 5832219782 has failed - writing final server update [2023-04-27 02:54:40] 2023-04-27 01:54:20,192 | WARNING | making sure that job.state is set to failed since a pilot error code is set [2023-04-27 02:54:40] 2023-04-27 01:54:20,192 | WARNING | format EVNTtoHITS has no such key: dbData [2023-04-27 02:54:40] 2023-04-27 01:54:20,192 | WARNING | format EVNTtoHITS has no such key: dbTime [2023-04-27 02:54:40] 2023-04-27 01:54:20,193 | INFO | collecting machine features [2023-04-27 02:54:40] 2023-04-27 01:54:20,193 | INFO | machine features path does not exist (path="") [2023-04-27 02:54:40] 2023-04-27 01:54:20,193 | INFO | collecting job features [2023-04-27 02:54:40] 2023-04-27 01:54:20,193 | INFO | job features path does not exist (path="") [2023-04-27 02:54:40] 2023-04-27 01:54:20,194 | INFO | fitting pss+swap vs Time [2023-04-27 02:54:40] 2023-04-27 01:54:20,194 | INFO | current memory leak: 484.14 B/s (using 8 data points, chi2=0.01) [2023-04-27 02:54:40] 2023-04-27 01:54:20,195 | INFO | payload/TRF did not report the number of read events [2023-04-27 02:54:40] 2023-04-27 01:54:20,215 | INFO | executing command: lscpu [2023-04-27 02:54:40] 2023-04-27 01:54:20,235 | INFO | found 2 cores (2 cores per socket, 1 sockets) [2023-04-27 02:54:40] 2023-04-27 01:54:20,235 | INFO | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo [2023-04-27 02:54:40] 2023-04-27 01:54:20,266 | INFO | using path: /home/m/BOINC/slots/1/PanDA_Pilot-5832219782/memory_monitor_summary.json (trf name=prmon) [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | extracted standard info from prmon json [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | extracted standard memory fields from prmon json [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | .............................. [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | . Timing measurements: [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | . get job = 73 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . initial setup = 56 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . payload setup = 10 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . stage-in = 59 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . payload execution = 873 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . stage-out = 150 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | .............................. [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | building log extracts (sent to the server as 'pilotLog') [2023-04-27 02:54:40] 2023-04-27 01:54:20,270 | INFO | executing command: tail -n 20 /home/m/BOINC/slots/1/PanDA_Pilot-5832219782/pilotlog.txt [2023-04-27 02:54:40] 2023-04-27 01:54:20,295 | WARNING | detected the following tail of warning/fatal messages in the pilot log: [2023-04-27 02:54:40] - Log from pilotlog.txt - [2023-04-27 02:54:40] 2023-04-27 01:54:20,194 | INFO | fitting pss+swap vs Time [2023-04-27 02:54:40] 2023-04-27 01:54:20,194 | INFO | current memory leak: 484.14 B/s (using 8 data points, chi2=0.01) [2023-04-27 02:54:40] 2023-04-27 01:54:20,195 | INFO | payload/TRF did not report the number of read events [2023-04-27 02:54:40] 2023-04-27 01:54:20,215 | INFO | executing command: lscpu [2023-04-27 02:54:40] 2023-04-27 01:54:20,235 | INFO | found 2 cores (2 cores per socket, 1 sockets) [2023-04-27 02:54:40] 2023-04-27 01:54:20,235 | INFO | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo [2023-04-27 02:54:40] 2023-04-27 01:54:20,266 | INFO | using path: /home/m/BOINC/slots/1/PanDA_Pilot-5832219782/memory_monitor_summary.json (trf name=prmon) [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | extracted standard info from prmon json [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | extracted standard memory fields from prmon json [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | .............................. [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | . Timing measurements: [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | . get job = 73 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . initial setup = 56 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . payload setup = 10 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . stage-in = 59 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . payload execution = 873 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . stage-out = 150 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | .............................. [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | building log extracts (sent to the server as 'pilotLog') [2023-04-27 02:54:40] 2023-04-27 01:54:20,270 | INFO | executing command: tail -n 20 /home/m/BOINC/slots/1/PanDA_Pilot-5832219782/pilotlog.txt [2023-04-27 02:54:40] 2023-04-27 01:54:20,295 | WARNING | [2023-04-27 02:54:40] [begin log extracts] [2023-04-27 02:54:40] - Log from pilotlog.txt - [2023-04-27 02:54:40] 2023-04-27 01:54:20,194 | INFO | fitting pss+swap vs Time [2023-04-27 02:54:40] 2023-04-27 01:54:20,194 | INFO | current memory leak: 484.14 B/s (using 8 data points, chi2=0.01) [2023-04-27 02:54:40] 2023-04-27 01:54:20,195 | INFO | payload/TRF did not report the number of read events [2023-04-27 02:54:40] 2023-04-27 01:54:20,215 | INFO | executing command: lscpu [2023-04-27 02:54:40] 2023-04-27 01:54:20,235 | INFO | found 2 cores (2 cores per socket, 1 sockets) [2023-04-27 02:54:40] 2023-04-27 01:54:20,235 | INFO | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo [2023-04-27 02:54:40] 2023-04-27 01:54:20,266 | INFO | using path: /home/m/BOINC/slots/1/PanDA_Pilot-5832219782/memory_monitor_summary.json (trf name=prmon) [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | extracted standard info from prmon json [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | extracted standard memory fields from prmon json [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | .............................. [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | . Timing measurements: [2023-04-27 02:54:40] 2023-04-27 01:54:20,268 | INFO | . get job = 73 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . initial setup = 56 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . payload setup = 10 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . stage-in = 59 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . payload execution = 873 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | . stage-out = 150 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | .............................. [2023-04-27 02:54:40] 2023-04-27 01:54:20,269 | INFO | building log extracts (sent to the server as 'pilotLog') [2023-04-27 02:54:40] 2023-04-27 01:54:20,270 | INFO | executing command: tail -n 20 /home/m/BOINC/slots/1/PanDA_Pilot-5832219782/pilotlog.txt [2023-04-27 02:54:40] [end log extracts] [2023-04-27 02:54:40] 2023-04-27 01:54:20,295 | WARNING | pilotErrorCodes = [1305] (will report primary/first error code) [2023-04-27 02:54:40] 2023-04-27 01:54:20,295 | WARNING | pilotErrorDiags = ['Failed to execute payload:PyJobTransforms.transform.execute 2023-04-27 02:49:44,965 CRITICAL Transform executor raised TransformValidationExcep [2023-04-27 02:54:40] 2023-04-27 01:54:20,907 | INFO | [2023-04-27 02:54:40] 2023-04-27 01:54:20,907 | INFO | job summary report [2023-04-27 02:54:40] 2023-04-27 01:54:20,907 | INFO | -------------------------------------------------- [2023-04-27 02:54:40] 2023-04-27 01:54:20,907 | INFO | PanDA job id: 5832219782 [2023-04-27 02:54:40] 2023-04-27 01:54:20,907 | INFO | task id: NULL [2023-04-27 02:54:40] 2023-04-27 01:54:20,907 | INFO | error 1/1: 1305: Failed to execute payload:PyJobTransforms.transform.execute 2023-04-27 02:49:44,965 CRITICAL Transform executor raised TransformValidationExceptio [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | status: LOG_TRANSFER = DONE [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | pilot state: failed [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | transexitcode: 65 [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | exeerrorcode: 65 [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | exeerrordiag: Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: "Segmentation fault: Event counter: 0; Run: unknown; Evt: unknown; Curren [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | exitcode: 65 [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | exitmsg: Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: "Segmentation fault: Event counter: 0; Run: unknown; Evt: unknown; Current alg [2023-04-27 02:54:40] 2023-04-27 01:54:20,908 | INFO | cpuconsumptiontime: 75 s [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | nevents: 0 [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | neventsw: 0 [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | pid: 1241 [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | pgrp: 1241 [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | corecount: 3 [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | event service: False [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | sizes: {0: 2584891, 11: 2585037, 21: 2585065, 32: 2585093, 43: 2585121, 53: 2585149, 59: 2585365, 64: 2585393, 887: 2609113, 1037: 2613376, 1039: 2613432, 1155: 26 [2023-04-27 02:54:40] 2023-04-27 01:54:20,909 | INFO | -------------------------------------------------- [2023-04-27 02:54:40] 2023-04-27 01:54:20,910 | INFO | [2023-04-27 02:54:40] 2023-04-27 01:54:20,910 | INFO | executing command: ls -lF /home/m/BOINC/slots/1 [2023-04-27 02:54:40] 2023-04-27 01:54:20,930 | INFO | queue jobs had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue payloads had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue data_in had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue data_out had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue current_data_in had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue validated_jobs had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue validated_payloads had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,931 | INFO | queue monitored_payloads had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue finished_jobs had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue finished_payloads had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue finished_data_in had 1 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue finished_data_out had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue failed_jobs had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue failed_payloads had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue failed_data_in had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,932 | INFO | queue failed_data_out had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,933 | INFO | queue completed_jobs had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,933 | INFO | queue completed_jobids has 1 job(s) [2023-04-27 02:54:40] 2023-04-27 01:54:20,933 | INFO | queue realtimelog_payloads had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,933 | INFO | queue messages had 0 job(s) [purged] [2023-04-27 02:54:40] 2023-04-27 01:54:20,933 | INFO | job 5832219782 has completed (purged errors) [2023-04-27 02:54:40] 2023-04-27 01:54:20,933 | INFO | overall cleanup function is called [2023-04-27 02:54:40] 2023-04-27 01:54:21,939 | INFO | --- collectZombieJob: --- 10, [1241] [2023-04-27 02:54:40] 2023-04-27 01:54:21,940 | INFO | zombie collector trying to kill pid 1241 [2023-04-27 02:54:40] 2023-04-27 01:54:21,940 | INFO | harmless exception when collecting zombies: [Errno 10] No child processes [2023-04-27 02:54:40] 2023-04-27 01:54:22,945 | INFO | collected zombie processes [2023-04-27 02:54:40] 2023-04-27 01:54:22,945 | INFO | will now attempt to kill all subprocesses of pid=1241 [2023-04-27 02:54:40] 2023-04-27 01:54:23,007 | INFO | process IDs to be killed: [1241] (in reverse order) [2023-04-27 02:54:40] 2023-04-27 01:54:23,056 | WARNING | found no corresponding commands to process id(s) [2023-04-27 02:54:40] 2023-04-27 01:54:23,056 | INFO | Do not look for orphan processes in BOINC jobs [2023-04-27 02:54:40] 2023-04-27 01:54:23,056 | INFO | ready for new job [2023-04-27 02:54:40] 2023-04-27 01:54:23,056 | INFO | pilot has finished with previous job - re-establishing logging [2023-04-27 02:54:40] 2023-04-27 01:54:23,057 | INFO | **************************************** [2023-04-27 02:54:40] 2023-04-27 01:54:23,058 | INFO | *** PanDA Pilot version 3.5.1 (17) *** [2023-04-27 02:54:40] 2023-04-27 01:54:23,058 | INFO | **************************************** [2023-04-27 02:54:40] 2023-04-27 01:54:23,058 | INFO | [2023-04-27 02:54:40] 2023-04-27 01:54:23,074 | INFO | architecture information: [2023-04-27 02:54:40] 2023-04-27 01:54:23,213 | INFO | [2023-04-27 02:54:40] LSB Version: :core-4.1-amd64:core-4.1-noarch [2023-04-27 02:54:40] Distributor ID: CentOS [2023-04-27 02:54:40] Description: CentOS Linux release 7.9.2009 (Core) [2023-04-27 02:54:40] Release: 7.9.2009 [2023-04-27 02:54:40] Codename: Core [2023-04-27 02:54:40] 2023-04-27 01:54:23,213 | INFO | **************************************** [2023-04-27 02:54:40] 2023-04-27 01:54:23,716 | INFO | executing command: df -mP /home/m/BOINC/slots/1 [2023-04-27 02:54:40] 2023-04-27 01:54:23,733 | INFO | sufficient remaining disk space (422915866624 B) [2023-04-27 02:54:40] 2023-04-27 01:54:23,733 | WARNING | since timefloor is set to 0, pilot was only allowed to run one job [2023-04-27 02:54:40] 2023-04-27 01:54:23,733 | INFO | [job] retrieve thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:23,734 | WARNING | data:copytool_out:received graceful stop - abort after this iteration [2023-04-27 02:54:40] 2023-04-27 01:54:23,734 | WARNING | aborting monitor loop since graceful_stop has been set [2023-04-27 02:54:40] 2023-04-27 01:54:23,734 | INFO | [monitor] control thread has ended [2023-04-27 02:54:40] 2023-04-27 01:54:23,734 | WARNING | data:queue_monitoring:received graceful stop - abort after this iteration [2023-04-27 02:54:40] 2023-04-27 01:54:23,839 | INFO | [data] control thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:23,918 | INFO | [payload] execute_payloads thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,158 | INFO | [payload] control thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,182 | INFO | [job] create_data_payload thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,293 | INFO | [payload] validate_pre thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,374 | INFO | [job] validate thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,582 | INFO | [payload] failed_post thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,734 | INFO | [data] copytool_out thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:24,738 | INFO | [payload] validate_post thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:25,047 | INFO | [data] copytool_in thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:25,137 | INFO | [job] control thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:25,447 | WARNING | job:queue_monitor:received graceful stop - abort after this iteration [2023-04-27 02:54:40] 2023-04-27 01:54:25,447 | INFO | [job] queue monitor thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:26,735 | INFO | [data] queue_monitor thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:31,894 | WARNING | no jobs in monitored_payloads queue (waited for 61 s) [2023-04-27 02:54:40] 2023-04-27 01:54:31,894 | INFO | [job] job monitor thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:39,396 | INFO | job.realtimelogging is not enabled [2023-04-27 02:54:40] 2023-04-27 01:54:39,396 | INFO | [payload] run_realtimelog thread has finished [2023-04-27 02:54:40] 2023-04-27 01:54:40,395 | INFO | end of generic workflow (traces error code: 0) [2023-04-27 02:54:40] 2023-04-27 01:54:40,395 | INFO | traces error code: 0 [2023-04-27 02:54:40] 2023-04-27 01:54:40,395 | INFO | pilot has finished [2023-04-27 02:54:40] 2023-04-27 01:54:40,468 [wrapper] ==== pilot stdout END ==== [2023-04-27 02:54:40] 2023-04-27 01:54:40,472 [wrapper] ==== wrapper stdout RESUME ==== [2023-04-27 02:54:40] 2023-04-27 01:54:40,476 [wrapper] pilotpid: 30438 [2023-04-27 02:54:40] 2023-04-27 01:54:40,480 [wrapper] Pilot exit status: 0 [2023-04-27 02:54:40] 2023-04-27 01:54:40,506 [wrapper] pandaids: 5832219782 [2023-04-27 02:54:40] 2023-04-27 01:54:40,525 [wrapper] apfmon messages muted [2023-04-27 02:54:40] 2023-04-27 01:54:40,530 [wrapper] Test setup, not cleaning [2023-04-27 02:54:40] 2023-04-27 01:54:40,534 [wrapper] ==== wrapper stdout END ==== [2023-04-27 02:54:40] 2023-04-27 01:54:40,549 [wrapper] ==== wrapper stderr END ==== [2023-04-27 02:54:40] 2023-04-27 01:54:40,558 [wrapper] wrapperexiting ec=0, duration=1377 [2023-04-27 02:54:40] 2023-04-27 01:54:40,566 [wrapper] apfmon messages muted [2023-04-27 02:54:40] *** Error codes and diagnostics *** [2023-04-27 02:54:40] "exeErrorCode": 65, [2023-04-27 02:54:40] "exeErrorDiag": "Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: \"Segmentation fault: Event counter: 0; Run: unknown; Evt: unknown; Current algorithm: PyG4AtlasAlg; Current Function: unknown\"", [2023-04-27 02:54:40] "pilotErrorCode": 1305, [2023-04-27 02:54:40] "pilotErrorDiag": "Failed to execute payload:PyJobTransforms.transform.execute 2023-04-27 02:49:44,965 CRITICAL Transform executor raised TransformValidationException: Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: \"Segmentation fault: Event counter: 0; Run: unknown; Evt: unknown; Current algorith", [2023-04-27 02:54:40] *** Listing of results directory *** [2023-04-27 02:54:40] total 41840 [2023-04-27 02:54:40] -rw-r--r-- 1 m m 393247 Apr 27 01:31 pilot3.tar.gz [2023-04-27 02:54:40] -rwx------ 1 m m 27542 Apr 27 02:17 runpilot2-wrapper.sh [2023-04-27 02:54:40] -rw-r--r-- 1 m m 4388 Apr 27 02:17 queuedata.json [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 107 Apr 27 02:31 wrapper_26015_x86_64-pc-linux-gnu [2023-04-27 02:54:40] -rwxr-xr-x 1 m m 7986 Apr 27 02:31 run_atlas [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 112 Apr 27 02:31 job.xml [2023-04-27 02:54:40] -rw-r--r-- 2 m m 17604 Apr 27 02:31 start_atlas.sh [2023-04-27 02:54:40] drwxrwx--x 2 m m 4096 Apr 27 02:31 shared [2023-04-27 02:54:40] -rw-r--r-- 2 m m 403672 Apr 27 02:31 input.tar.gz [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 5708 Apr 27 02:31 init_data.xml [2023-04-27 02:54:40] -rw-r--r-- 2 m m 38971704 Apr 27 02:31 EVNT.04972714._000035.pool.root.1 [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 0 Apr 27 02:31 boinc_lockfile [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 2756 Apr 27 02:31 pandaJob.out [2023-04-27 02:54:40] -rw------- 1 m m 424 Apr 27 02:31 setup.sh.local [2023-04-27 02:54:40] -rw------- 1 m m 1012562 Apr 27 02:31 agis_schedconf.cvmfs.json [2023-04-27 02:54:40] -rw------- 1 m m 1450330 Apr 27 02:32 cric_ddmendpoints.json [2023-04-27 02:54:40] drwx------ 4 m m 4096 Apr 27 02:34 pilot3 [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 531 Apr 27 02:49 boinc_task_state.xml [2023-04-27 02:54:40] -rw------- 1 m m 1015 Apr 27 02:49 memory_monitor_summary.json [2023-04-27 02:54:40] -rw------- 1 m m 84595 Apr 27 02:49 e7049c97-d9e0-4b82-8e92-1590d32ec8b5_70670.1.job.log.tgz [2023-04-27 02:54:40] -rw------- 1 m m 9381 Apr 27 02:54 heartbeat.json [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 24 Apr 27 02:54 wrapper_checkpoint.txt [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 8192 Apr 27 02:54 boinc_mmap_file [2023-04-27 02:54:40] -rw------- 1 m m 2962 Apr 27 02:54 pilotlog.txt [2023-04-27 02:54:40] -rw------- 1 m m 106292 Apr 27 02:54 e7049c97-d9e0-4b82-8e92-1590d32ec8b5_70670.1.job.log [2023-04-27 02:54:40] -rw------- 1 m m 227 Apr 27 02:54 output.list [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 686 Apr 27 02:54 runtime_log [2023-04-27 02:54:40] -rw------- 1 m m 215040 Apr 27 02:54 result.tar.gz [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 11324 Apr 27 02:54 runtime_log.err [2023-04-27 02:54:40] -rw------- 1 m m 621 Apr 27 02:54 hGENDmjR5B3n7Olcko1bjSoqABFKDmABFKDm7AsVDmKKIKDmqepgRn.diag [2023-04-27 02:54:40] -rw-rw-r-- 1 m m 22492 Apr 27 02:54 stderr.txt [2023-04-27 02:54:40] No HITS result produced [2023-04-27 02:54:40] *** Contents of shared directory: *** [2023-04-27 02:54:40] total 38688 [2023-04-27 02:54:40] -rw-r--r-- 2 m m 17604 Apr 27 02:31 start_atlas.sh [2023-04-27 02:54:40] -rw-r--r-- 2 m m 403672 Apr 27 02:31 input.tar.gz [2023-04-27 02:54:40] -rw-r--r-- 2 m m 38971704 Apr 27 02:31 ATLAS.root_0 [2023-04-27 02:54:40] -rw------- 1 m m 215040 Apr 27 02:54 result.tar.gz 02:54:42 (27300): run_atlas exited; CPU time 68.189445 02:54:42 (27300): called boinc_finish(0) </stderr_txt> ]]>
©2024 CERN