Name q5oNDmSpgh3n7Olcko1bjSoqABFKDmABFKDm7AsVDmSqNKDmoaRxKo_0
Workunit 2320217
Created 23 Jul 2023, 5:29:00 UTC
Sent 23 Jul 2023, 5:37:53 UTC
Report deadline 30 Jul 2023, 5:37:53 UTC
Received 23 Jul 2023, 5:50:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 4721
Run time 6 min 48 sec
CPU time 10 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 5.90 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 63.23 MB
Peak swap size 2.89 GB
Peak disk usage 35.69 MB

Stderr output

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
07:38:02 (3328868): wrapper (7.7.26015): starting
07:38:02 (3328868): wrapper: running run_atlas (--nthreads 1)
[2023-07-23 07:38:02] Arguments: --nthreads 1
[2023-07-23 07:38:02] Threads: 1
[2023-07-23 07:38:02] Checking for CVMFS
[2023-07-23 07:38:05] Probing /cvmfs/atlas.cern.ch... OK
[2023-07-23 07:38:05] Probing /cvmfs/atlas-condb.cern.ch... OK
[2023-07-23 07:38:05] Running cvmfs_config stat atlas.cern.ch
[2023-07-23 07:38:05] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2023-07-23 07:38:05] 2.10.1.0 3075980 193 45028 121609 2 73 3238750 4096001 0 130560 0 956132 99.994 27516 1450 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch http://10.116.178.201:3128 1
[2023-07-23 07:38:05] CVMFS is ok
[2023-07-23 07:38:05] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2023-07-23 07:38:05] Checking for apptainer binary...
[2023-07-23 07:38:05] Using apptainer found in PATH at /usr/bin/apptainer
[2023-07-23 07:38:05] Running /usr/bin/apptainer --version
[2023-07-23 07:38:05] apptainer version 1.1.9-1.el9
[2023-07-23 07:38:05] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2023-07-23 07:38:05] P620-CentOS9
[2023-07-23 07:38:05] apptainer works
[2023-07-23 07:38:05] Starting ATLAS job with PandaID=5911382216
[2023-07-23 07:38:05] Running command: /usr/bin/apptainer exec -B /cvmfs,/var/lib/boinc/slots/4 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2023-07-23 07:44:47]  *** The last 200 lines of the pilot log: ***
[2023-07-23 07:44:47] 2023-07-23 05:42:05,574 | WARNING  | main payload execution returned non-zero exit code: 1
[2023-07-23 07:44:47] 2023-07-23 05:42:05,575 | INFO     | scanning dmesg message for subprocess=3341780 for memory errors
[2023-07-23 07:44:47] 2023-07-23 05:42:05,575 | INFO     | executing command: dmesg|grep 3341780
[2023-07-23 07:44:47] 2023-07-23 05:42:07,471 | INFO     | monitor loop #6: job 0:5911382216 is in state 'failed'
[2023-07-23 07:44:47] 2023-07-23 05:42:07,471 | INFO     | will abort job monitoring soon since job state=failed (job is still in queue)
[2023-07-23 07:44:47] 2023-07-23 05:42:08,343 | CRITICAL | execute payloads caught an exception (cannot recover): [Errno 107] Transport endpoint is not connected: '/bin/bash', Traceback (most recent call last):
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/payload.py", line 257, in execute_payloads
[2023-07-23 07:44:47]     perform_initial_payload_error_analysis(job, exit_code)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/payload.py", line 577, in perform_initial_payload_error_analysis
[2023-07-23 07:44:47]     msg = scan_for_memory_errors(job.subprocesses)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/payload.py", line 651, in scan_for_memory_errors
[2023-07-23 07:44:47]     _, out, _ = execute(cmd)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/util/container.py", line 64, in execute
[2023-07-23 07:44:47]     process = subprocess.Popen(exe,
[2023-07-23 07:44:47]   File "/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.14-x86_64-centos7/lib/python3.9/subprocess.py", line 951, in __init__
[2023-07-23 07:44:47]     self._execute_child(args, executable, preexec_fn, close_fds,
[2023-07-23 07:44:47]   File "/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.14-x86_64-centos7/lib/python3.9/subprocess.py", line 1821, in _execute_child
[2023-07-23 07:44:47]     raise child_exception_type(errno_num, err_msg, err_filename)
[2023-07-23 07:44:47] OSError: [Errno 107] Transport endpoint is not connected: '/bin/bash'
[2023-07-23 07:44:47] 
[2023-07-23 07:44:47] 2023-07-23 05:42:09,974 | INFO     | monitor loop #7: job 0:5911382216 is in state 'failed'
[2023-07-23 07:44:47] 2023-07-23 05:42:09,974 | INFO     | will abort job monitoring soon since job state=failed (job is still in queue)
[2023-07-23 07:44:47] 2023-07-23 05:42:12,479 | INFO     | monitor loop #8: job 0:5911382216 is in state 'failed'
[2023-07-23 07:44:47] 2023-07-23 05:42:12,479 | INFO     | will abort job monitoring soon since job state=failed (job is still in queue)
[2023-07-23 07:44:47] 2023-07-23 05:42:14,363 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2023-07-23 07:44:47] 2023-07-23 05:42:14,363 | INFO     | aborting loop
[2023-07-23 07:44:47] 2023-07-23 05:42:14,363 | WARNING  | aborting monitor loop since graceful_stop has been set (timing out remaining threads)
[2023-07-23 07:44:47] 2023-07-23 05:42:14,363 | INFO     | found 1 job(s) in 20 queues
[2023-07-23 07:44:47] 2023-07-23 05:42:14,363 | INFO     | aborting job 5911382216
[2023-07-23 07:44:47] 2023-07-23 05:42:14,406 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2023-07-23 07:44:47] 2023-07-23 05:42:14,406 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2023-07-23 07:44:47] 2023-07-23 05:42:14,812 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2023-07-23 07:44:47] 2023-07-23 05:42:14,813 | WARNING  | since job:queue_monitor is responsible for sending job updates, we sleep for 20 s
[2023-07-23 07:44:47] 2023-07-23 05:42:15,368 | INFO     | [job] job monitor thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:15,812 | INFO     | [job] validate thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:15,832 | INFO     | [job] retrieve thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:15,845 | INFO     | [payload] failed_post thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:15,986 | INFO     | [job] create_data_payload thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:16,094 | INFO     | [payload] validate_pre thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:16,124 | INFO     | [job] control thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:16,215 | INFO     | [data] control thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:16,274 | INFO     | [data] copytool_in thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:16,297 | INFO     | [payload] validate_post thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:16,676 | INFO     | [payload] control thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:17,316 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2023-07-23 07:44:47] 2023-07-23 05:42:19,435 | INFO     | [payload] execute_payloads thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:21,319 | INFO     | [data] queue_monitor thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:28,317 | INFO     | job.realtimelogging is not enabled
[2023-07-23 07:44:47] 2023-07-23 05:42:29,318 | INFO     | [payload] run_realtimelog thread has finished
[2023-07-23 07:44:47] 2023-07-23 05:42:35,909 | INFO     | job 5911382216 has state=failed
[2023-07-23 07:44:47] 2023-07-23 05:42:35,909 | INFO     | preparing for final server update for job 5911382216 in state='failed'
[2023-07-23 07:44:47] 2023-07-23 05:42:36,499 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2023-07-23 07:44:47] 2023-07-23 05:42:36,499 | INFO     | [monitor] control thread has ended
[2023-07-23 07:44:47] 2023-07-23 05:42:38,256 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:40,263 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:42,270 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:44,279 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:46,285 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:48,297 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:50,305 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:52,314 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:54,325 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:56,337 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:42:58,343 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:00,351 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:02,359 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:04,367 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:06,374 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:08,382 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:10,390 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:12,399 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:14,408 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:16,417 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:18,426 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:20,435 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:22,441 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:24,450 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:26,458 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:28,468 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:30,476 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:32,484 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:34,494 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:36,504 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:38,512 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:40,522 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:42,530 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:44,539 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:46,548 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:48,554 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:50,563 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:52,572 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:54,582 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:56,590 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:43:58,596 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:00,603 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:02,611 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:04,619 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:06,628 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:08,638 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:10,648 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:12,658 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:14,665 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:16,675 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:18,682 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:20,691 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:22,701 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:24,709 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:26,719 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:28,726 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:30,733 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:30,984 | INFO     | proceeding with final server update
[2023-07-23 07:44:47] 2023-07-23 05:44:30,985 | INFO     | this job has now completed (state=failed)
[2023-07-23 07:44:47] 2023-07-23 05:44:30,985 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2023-07-23 07:44:47] 2023-07-23 05:44:30,985 | INFO     | job 5911382216 has failed - writing final server update
[2023-07-23 07:44:47] 2023-07-23 05:44:30,985 | WARNING  | making sure that job.state is set to failed since a pilot error code is set
[2023-07-23 07:44:47] 2023-07-23 05:44:30,985 | WARNING  | wrong length of table data, x=[1690090872.0], y=[1741.0] (must be same and length>=4)
[2023-07-23 07:44:47] 2023-07-23 05:44:30,985 | INFO     | payload/TRF did not report the number of read events
[2023-07-23 07:44:47] 2023-07-23 05:44:30,989 | WARNING  | command={cmd} does not exist - cannot check number of available cores
[2023-07-23 07:44:47] 2023-07-23 05:44:30,989 | INFO     | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/common/exception.py", line 424, in run
[2023-07-23 07:44:47]     self._target(**self._kwargs)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 2432, in queue_monitor
[2023-07-23 07:44:47]     update_server(job, args)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 2483, in update_server
[2023-07-23 07:44:47]     send_state(job, args, job.state, metadata=metadata)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 329, in send_state
[2023-07-23 07:44:47]     data = get_data_structure(job, state, args, xml=xml, metadata=metadata, final=final)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 653, in get_data_structure
[2023-07-23 07:44:47]     instruction_sets = has_instruction_sets(['AVX2'])
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/util/auxiliary.py", line 492, in has_instruction_sets
[2023-07-23 07:44:47]     exit_code, stdout, stderr = execute(cmd)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/util/container.py", line 64, in execute
[2023-07-23 07:44:47]     process = subprocess.Popen(exe,
[2023-07-23 07:44:47]   File "/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.14-x86_64-centos7/lib/python3.9/subprocess.py", line 951, in __init__
[2023-07-23 07:44:47]     self._execute_child(args, executable, preexec_fn, close_fds,
[2023-07-23 07:44:47]   File "/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.14-x86_64-centos7/lib/python3.9/subprocess.py", line 1821, in _execute_child
[2023-07-23 07:44:47]     raise child_exception_type(errno_num, err_msg, err_filename)
[2023-07-23 07:44:47] exception caught by thread run() function: (<class 'OSError'>, OSError(107, 'Transport endpoint is not connected'), <traceback object at 0x7f46ffd1e900>)
[2023-07-23 07:44:47] Traceback (most recent call last):
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/common/exception.py", line 424, in run
[2023-07-23 07:44:47]     self._target(**self._kwargs)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 2432, in queue_monitor
[2023-07-23 07:44:47]     update_server(job, args)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 2483, in update_server
[2023-07-23 07:44:47]     send_state(job, args, job.state, metadata=metadata)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 329, in send_state
[2023-07-23 07:44:47]     data = get_data_structure(job, state, args, xml=xml, metadata=metadata, final=final)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/control/job.py", line 653, in get_data_structure
[2023-07-23 07:44:47]     instruction_sets = has_instruction_sets(['AVX2'])
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/util/auxiliary.py", line 492, in has_instruction_sets
[2023-07-23 07:44:47]     exit_code, stdout, stderr = execute(cmd)
[2023-07-23 07:44:47]   File "/var/lib/boinc/slots/4/pilot3/pilot/util/container.py", line 64, in execute
[2023-07-23 07:44:47]     process = subprocess.Popen(exe,
[2023-07-23 07:44:47]   File "/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.14-x86_64-centos7/lib/python3.9/subprocess.py", line 951, in __init__
[2023-07-23 07:44:47]     self._execute_child(args, executable, preexec_fn, close_fds,
[2023-07-23 07:44:47]   File "/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.14-x86_64-centos7/lib/python3.9/subprocess.py", line 1821, in _execute_child
[2023-07-23 07:44:47]     raise child_exception_type(errno_num, err_msg, err_filename)
[2023-07-23 07:44:47] OSError: [Errno 107] Transport endpoint is not connected: '/bin/bash'
[2023-07-23 07:44:47] 
[2023-07-23 07:44:47] None
[2023-07-23 07:44:47] exception has been put in bucket queue belonging to thread 'queue_monitor'
[2023-07-23 07:44:47] setting graceful stop in 10 s since there is no point in continuing
[2023-07-23 07:44:47] 2023-07-23 05:44:32,744 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:34,752 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:36,760 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:38,769 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:40,776 | INFO     | waiting for thread to finish: ['<_MainThread(MainThread, started 139942965872448)>', '<ExcThread(queue_monitor, started 139942281852672)>']
[2023-07-23 07:44:47] 2023-07-23 05:44:42,784 | INFO     | caller=run is remaining thread - safe to abort (names=['<_MainThread(MainThread, started 139942965872448)>'])
[2023-07-23 07:44:47] 2023-07-23 05:44:47,809 | INFO     | end of generic workflow (traces error code: 1354)
[2023-07-23 07:44:47] 2023-07-23 05:44:47,809 | INFO     | traces error code: 1354
[2023-07-23 07:44:47] 2023-07-23 05:44:47,809 | INFO     | an exit code was already set: 1354 (will be converted to a standard shell code)
[2023-07-23 07:44:47] no translation to shell exit code for error code 1354
[2023-07-23 07:44:47] 2023-07-23 05:44:47,809 | INFO     | pilot has finished (exit code=1354, shell exit code=1)
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  ==== pilot stdout END ====
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  ==== wrapper stdout RESUME ====
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  pilotpid: 3332021
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  Pilot exit status: 1
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 852: cut: command not found
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 852: xargs: command not found
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 852: /usr/bin/cat: Transport endpoint is not connected
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  pandaids: 
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 860: date: command not found
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  apfmon messages muted
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  Test setup, not cleaning
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  ==== wrapper stdout END ====
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 10: date: command not found
[2023-07-23 07:44:47]  ==== wrapper stderr END ====
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 474: date: command not found
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  wrapperexiting ec=0, duration=-1690090685
[2023-07-23 07:44:47] ./runpilot2-wrapper.sh: line 15: date: command not found
[2023-07-23 07:44:47]  apfmon messages muted
[2023-07-23 07:44:47]  *** Error codes and diagnostics ***
[2023-07-23 07:44:47]  *** Listing of results directory ***
[2023-07-23 07:44:47] insgesamt 39540
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc   418016 23. Jul 06:24 pilot3.tar.gz
[2023-07-23 07:44:47] -rwx------. 1 boinc boinc    27277 23. Jul 07:28 runpilot2-wrapper.sh
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc     4388 23. Jul 07:28 queuedata.json
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc      107 23. Jul 07:38 wrapper_26015_x86_64-pc-linux-gnu
[2023-07-23 07:44:47] -rwxr-xr-x. 1 boinc boinc     7986 23. Jul 07:38 run_atlas
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc      112 23. Jul 07:38 job.xml
[2023-07-23 07:44:47] -rw-r--r--. 2 boinc boinc    17604 23. Jul 07:38 start_atlas.sh
[2023-07-23 07:44:47] drwxrwx--x. 2 boinc boinc       68 23. Jul 07:38 shared
[2023-07-23 07:44:47] -rw-r--r--. 2 boinc boinc   428867 23. Jul 07:38 input.tar.gz
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc     6174 23. Jul 07:38 init_data.xml
[2023-07-23 07:44:47] -rw-r--r--. 2 boinc boinc 36949600 23. Jul 07:38 EVNT.04972714._000039.pool.root.1
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc        0 23. Jul 07:38 boinc_lockfile
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc     2755 23. Jul 07:38 pandaJob.out
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc      424 23. Jul 07:38 setup.sh.local
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc  1370854 23. Jul 07:39 cric_ddmendpoints.json
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc  1015272 23. Jul 07:39 agis_schedconf.cvmfs.json
[2023-07-23 07:44:47] drwx------. 4 boinc boinc     4096 23. Jul 07:39 pilot3
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc      515 23. Jul 07:41 heartbeat.json
[2023-07-23 07:44:47] drwxrwx---. 2 boinc boinc     4096 23. Jul 07:42 PanDA_Pilot-5911382216
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc      959 23. Jul 07:42 memory_monitor_summary.json
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc      528 23. Jul 07:42 boinc_task_state.xml
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc     8192 23. Jul 07:44 boinc_mmap_file
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc       22 23. Jul 07:44 wrapper_checkpoint.txt
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc    52535 23. Jul 07:44 pilotlog.txt
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc    11134 23. Jul 07:44 runtime_log.err
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc      428 23. Jul 07:44 runtime_log
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc      599 23. Jul 07:44 q5oNDmSpgh3n7Olcko1bjSoqABFKDmABFKDm7AsVDmSqNKDmoaRxKo.diag
[2023-07-23 07:44:47] -rw-------. 1 boinc boinc    71986 23. Jul 07:44 f054ae60-35b0-4c9c-aa4b-bb694fe3da88_36830.1.job.log
[2023-07-23 07:44:47] -rw-r--r--. 1 boinc boinc    27646 23. Jul 07:44 stderr.txt
[2023-07-23 07:44:48] No HITS result produced
[2023-07-23 07:44:48]  *** Contents of shared directory: ***
[2023-07-23 07:44:48] insgesamt 36524
[2023-07-23 07:44:48] -rw-r--r--. 2 boinc boinc    17604 23. Jul 07:38 start_atlas.sh
[2023-07-23 07:44:48] -rw-r--r--. 2 boinc boinc   428867 23. Jul 07:38 input.tar.gz
[2023-07-23 07:44:48] -rw-r--r--. 2 boinc boinc 36949600 23. Jul 07:38 ATLAS.root_0
07:44:49 (3328868): run_atlas exited; CPU time 10.842553
07:44:49 (3328868): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>q5oNDmSpgh3n7Olcko1bjSoqABFKDmABFKDm7AsVDmSqNKDmoaRxKo_0_r910086635_ATLAS_result</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
</message>
]]>


©2024 CERN