Name | gwCNDmMK1gynfZGDcpSWOuwoABFKDmABFKDm2IFNDmdCFKDmccSXTn_0 |
Workunit | 2063705 |
Created | 18 Mar 2021, 6:28:47 UTC |
Sent | 18 Mar 2021, 6:29:11 UTC |
Report deadline | 25 Mar 2021, 6:29:11 UTC |
Received | 18 Mar 2021, 7:30:31 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 4389 |
Run time | 10 min 58 sec |
CPU time | 8 min 11 sec |
Validate state | Valid |
Credit | 1.53 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | ATLAS long simulation v1.00 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 1.43 GB |
Peak swap size | 2.13 GB |
Peak disk usage | 88.07 MB |
<core_client_version>7.16.11</core_client_version> <![CDATA[ <stderr_txt> 09:19:24 (14415): wrapper (7.7.26015): starting 09:19:24 (14415): wrapper: running run_atlas (--nthreads 1) [2021-03-18 09:19:24] Arguments: --nthreads 1 [2021-03-18 09:19:24] Threads: 1 [2021-03-18 09:19:24] Checking for CVMFS [2021-03-18 09:19:24] Probing /cvmfs/atlas.cern.ch... OK [2021-03-18 09:19:24] Probing /cvmfs/atlas-condb.cern.ch... OK [2021-03-18 09:19:25] Probing /cvmfs/grid.cern.ch... OK [2021-03-18 09:19:25] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2021-03-18 09:19:25] 2.8.0.0 1755 140 62484 81002 3 11 3937499 4096000 0 130560 0 171260 99.9591 3566 385 http://cvmfs-stratum-one.cern.ch:8000/cvmfs/atlas.cern.ch DIRECT 1 [2021-03-18 09:19:25] CVMFS is ok [2021-03-18 09:19:25] Using singularity image /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img [2021-03-18 09:19:25] Checking for singularity binary... 
[2021-03-18 09:19:25] Using singularity found in PATH at /usr/bin/singularity [2021-03-18 09:19:25] Running /usr/bin/singularity --version [2021-03-18 09:19:25] singularity version 3.7.1-1.el7 [2021-03-18 09:19:25] Checking singularity works with /usr/bin/singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img hostname [2021-03-18 09:19:25] CentOS7 [2021-03-18 09:19:25] Singularity works [2021-03-18 09:19:25] Starting ATLAS job with PandaID=5002789745 [2021-03-18 09:19:25] Running command: /usr/bin/singularity exec --pwd /var/lib/BASE/BOINC01/slots/1 -B /cvmfs,/var /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img sh start_atlas.sh [2021-03-18 09:30:19] *** The last 200 lines of the pilot log: *** [2021-03-18 09:30:19] "cpuTimeTotal": 479, [2021-03-18 09:30:19] "externalCpuTime": 2, [2021-03-18 09:30:19] "processedEvents": 2, [2021-03-18 09:30:19] "trfPredata": null, [2021-03-18 09:30:19] "wallTime": 591 [2021-03-18 09:30:19] } [2021-03-18 09:30:19] } [2021-03-18 09:30:19] } [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | DEBUG | queue_monitor | pilot.user.atlas.common | update_server | no need to update logstash for this job [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | state=finished [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=running [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=finished [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | INFO | queue_monitor | pilot.control.job | send_state | pilot will not update the server (heartbeat message will be written to file) [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | INFO | queue_monitor | pilot.control.job | send_state | job 5002789745 has finished - writing final server update [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | 
DEBUG | queue_monitor | pilot.control.job | get_data_structure | building data structure to be sent to server with heartbeat [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | DEBUG | queue_monitor | pilot.user.atlas.jobmetrics | get_job_metrics_string | job definition core count: 1 [2021-03-18 09:30:19] 2021-03-18 07:30:09,728 | INFO | queue_monitor | pilot.user.atlas.jobmetrics | get_job_metrics_string | will not add max space = -37139005 B to job metrics [2021-03-18 09:30:19] 2021-03-18 07:30:09,729 | DEBUG | queue_monitor | pilot.api.analytics | get_fitted_data | removing tails from data to be fitted [2021-03-18 09:30:19] 2021-03-18 07:30:09,729 | WARNING | queue_monitor | pilot.api.analytics | get_fitted_data | wrong length of table data, x=[1616052306.0, 1616052367.0, 1616052428.0], y=[1565184.0, 1576068.0, 1578332.0] (must be same and length>=4) [2021-03-18 09:30:19] 2021-03-18 07:30:09,729 | DEBUG | queue_monitor | pilot.user.atlas.jobmetrics | get_job_metrics | job metrics="actualCoreCount=4 nEvents=2 dbTime=28.97 dbData=4274626" [2021-03-18 09:30:19] 2021-03-18 07:30:09,729 | INFO | queue_monitor | pilot.control.job | get_data_structure | mean actualcorecount: 3.625000 [2021-03-18 09:30:19] 2021-03-18 07:30:09,729 | INFO | queue_monitor | pilot.control.job | get_data_structure | total number of processed events: 2 (read) [2021-03-18 09:30:19] 2021-03-18 07:30:09,730 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_values | using path: /var/lib/BASE/BOINC01/slots/1/PanDA_Pilot-5002789745/memory_monitor_summary.json (trf name=prmon) [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | DEBUG | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | summary_dictionary={'Max': {'rx_packets': 64361, 'nprocs': 5, 'nthreads': 14, 'rx_bytes': 90045477, 'wtime': 552, 'rss': 1596536, 'write_bytes': 2138112, 'vmem': 3342460, 'read_bytes': 1629094912, 'stime': 12, 'tx_bytes': 4969332, 'pss': 1589268, 'wchar': 5450159, 'rchar': 573404965, 
'tx_packets': 18657, 'swap': 0, 'utime': 435}, 'Avg': {'write_bytes': 3867.0, 'nprocs': 4.799, 'nthreads': 6.7, 'rx_bytes': 162863.0, 'rx_packets': 116.408, 'vmem': 2708037.0, 'read_bytes': 2946508.0, 'swap': 0.0, 'tx_bytes': 8987.0, 'pss': 1196631.0, 'wchar': 9857.0, 'rchar': 1037104.0, 'tx_packets': 33.744, 'rss': 1203407.0}, 'HW': {'mem': {'MemTotal': 4654084}, 'cpu': {'CoresPerSocket': 4, 'ModelName': 'Intel(R) Core(TM) i5-4440 CPU @ 3.10GHz', 'ThreadsPerCore': 1, 'CPUs': 4, 'Sockets': 1}}, 'prmon': {'Version': '2.2.0'}} [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard info from prmon json [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard memory fields from prmon json [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | .............................. [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . Timing measurements: [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . get job = 0 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . initial setup = 0 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . payload setup = 9 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . total setup = 9 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . stage-in = 0 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . 
payload execution = 603 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | . stage-out = 0 s [2021-03-18 09:30:19] 2021-03-18 07:30:09,731 | INFO | queue_monitor | pilot.util.timing | timing_report | .............................. [2021-03-18 09:30:19] 2021-03-18 07:30:09,732 | DEBUG | queue_monitor | pilot.control.job | send_state | is_harvester_mode(args) : False [2021-03-18 09:30:19] 2021-03-18 07:30:09,732 | DEBUG | queue_monitor | pilot.control.job | write_heartbeat_to_file | heartbeat dictionary: {'pilotErrorCode': 0, 'rateWBYTES': 3867.0, 'pilotID': 'http://aipanda403.cern.ch/data/jobs/2021-03-18/BOINC-TEST/5002789745.out|PR|2.9.6 (20)', 'meanCoreCount': 3.625, 'totRBYTES': 1629094912, 'siteName': 'BOINC-TEST', 'avgVMEM': 2708037.0, 'coreCount': 1, 'totWCHAR': 5450159, 'rateRCHAR': 1037104.0, 'jobId': '5002789745', 'totRCHAR': 573404965, 'exeErrorCode': 0, 'rateWCHAR': 9857.0, 'metaData': '{\n "cmdLine": "\'/cvmfs/atlas.cern.ch/repo/sw/software/21.0/AtlasOffline/21.0.16/InstallArea/x86_64-slc6-gcc49-opt/share/Sim_tf.py\' \'--maxEvents=2\' \'--skipEvents=0\' \'--firstEvent=118001\' \'--randomSeed=119\' \'--DBRelease=all:current\' \'--geometryVersion=default:ATLAS-R2-2016-01-00-01_VALIDATION\' \'--conditionsTag\' \'default:OFLCOND-MC16-SDR-14\' \'--physicsList=FTFP_BERT_ATL_VALIDATION\' \'--preExec\' \'EVNTtoHITS:simFlags.SimBarcodeOffset.set_Value_and_Lock(200000)\' \'EVNTtoHITS:simFlags.TRTRangeCut=30.0;simFlags.TightMuonStepping=True\' \'--postInclude\' \'default:RecJobTransforms/UseFrontier.py\' \'--simulator=FullG4\' \'--truthStrategy=MC15aPlus\' \'--DataRunNumber=361106\' \'--outputHitsFile\' \'output.1.fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.pool.root\' \'--inputEvgenFile\' \'EVNT.04972714._000023.pool.root.1\'", \n "created": "2021-03-18T09:30:00", \n "executor": [\n {\n "asetup": null, \n "errMsg": "", \n "exeConfig": {\n "inputs": [\n "EVNT"\n ], \n "outputs": [\n "HITS"\n ], \n "script": 
"athena.py", \n "substep": "sim"\n }, \n "logfileReport": {\n "countSummary": {\n "CATASTROPHE": 0, \n "CRITICAL": 0, \n "DEBUG": 0, \n "ERROR": 0, \n "FATAL": 0, \n "IGNORED": 0, \n "INFO": 2772, \n "UNKNOWN": 6337, \n "VERBOSE": 0, \n "WARNING": 31\n }, \n "details": {}\n }, \n "metaData": {}, \n "name": "EVNTtoHITS", \n "rc": 0, \n "statusOK": true, \n "validation": true\n }\n ], \n "exitAcronym": "OK", \n "exitCode": 0, \n "exitMsg": "OK", \n "exitMsgExtra": "", \n "files": {\n "input": [\n {\n "dataset": null, \n "nentries": 1000, \n "subFiles": [\n {\n "file_guid": "527922C2-75F2-064D-A171-672D7A39A6CB", \n "name": "EVNT.04972714._000023.pool.root.1"\n }\n ], \n "type": "EVNT"\n }\n ], \n "output": [\n {\n "argName": "outputHITSFile", \n "dataset": null, \n "subFiles": [\n {\n "file_guid": "F39FAC8B-8AC2-0B4D-A4C2-523B18F3610F", \n "file_size": 1769754, \n "name": "output.1.fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.pool.root", \n "nentries": 2\n }\n ], \n "type": "HITS"\n }\n ]\n }, \n "name": "Sim_tf", \n "reportVersion": "2.0.7", \n "resource": {\n "dbDataTotal": 8549252, \n "dbTimeTotal": 57.94, \n "executor": {\n "EVNTtoHITS": {\n "cpuTime": 477, \n "dbData": 4274626, \n "dbTime": 28.97, \n "memory": {\n "Avg": {\n "avgPSS": 1162010, \n "avgRSS": 1165205, \n "avgSwap": 0, \n "avgVMEM": 1866442, \n "rateRBYTES": 2446382, \n "rateRCHAR": 946116, \n "rateWBYTES": 2047, \n "rateWCHAR": 6264\n }, \n "Max": {\n "maxPSS": 1493596, \n "maxRSS": 1496828, \n "maxSwap": 0, \n "maxVMEM": 2247228, \n "totRBYTES": 1394438144, \n "totRCHAR": 539286250, \n "totWBYTES": 1167360, \n "totWCHAR": 3570860\n }\n }, \n "nevents": 2, \n "postExe": {\n "cpuTime": 1, \n "wallTime": 1\n }, \n "preExe": {\n "cpuTime": 0, \n "wallTime": 1\n }, \n "total": {\n "cpuTime": 479, \n "wallTime": 581\n }, \n "validation": {\n "cpuTime": 0, \n "wallTime": 1\n }, \n "wallTime": 578\n }\n }, \n "machine": {\n "cpu_family": "6", \n "linux_distribution": [\n "CentOS Linux", \n "7.8.2003", \n 
"Core"\n ], \n "model": "60", \n "model_name": "Intel(R) Core(TM) i5-4440 CPU @ 3.10GHz", \n "node": "CentOS7", \n "platform": "Linux-3.10.0-1160.15.2.el7.x86_64-x86_64-with-centos-7.8.2003-Core"\n }, \n "transform": {\n "cpuEfficiency": 0.8139, \n "cpuPWEfficiency": 0.8139, \n "cpuTime": 4, \n "cpuTimeTotal": 479, \n "externalCpuTime": 2, \n "processedEvents": 2, \n "trfPredata": null, \n "wallTime": 591\n }\n }\n}', 'xml': '{"fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.1.job.log.tgz": {"adler32": "d39bf94b", "surl": "root://eosatlas.cern.ch:1094//eos/atlas/atlasdatadisk/rucio/hc_test/1b/f6/fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.1.job.log.tgz", "guid": "bb4742a7-f2a0-4082-ae02-39090eb71398", "fsize": 159953}, "output.1.fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.pool.root": {"adler32": "ed145c9e", "surl": "root://eosatlas.cern.ch:1094//eos/atlas/atlasdatadisk/rucio/hc_test/11/5f/output.1.fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.pool.root", "guid": "F39FAC8B-8AC2-0B4D-A4C2-523B18F3610F", "fsize": 1769754}}', 'maxVMEM': 3342460, 'cpuConversionFactor': 1.0, 'avgSWAP': 0.0, 'state': 'finished', 'transExitCode': 0, 'pilotErrorDiag': '', 'node': 'CentOS7', 'avgRSS': 1203407.0, 'avgPSS': 1196631.0, 'timestamp': '2021-03-18T09:30:09+02:00', 'pilotTiming': '0|0|603|0|9', 'attemptNr': 0, 'totWBYTES': 2138112, 'nEvents': 2, 'rateRBYTES': 2946508.0, 'pilotLog': '', 'cpuConsumptionTime': 496, 'startTime': 1616051975.474575, 'cpuConsumptionUnit': 's+Intel(R) Core(TM) i5-4440 CPU @ 3.10GHz 6144 KB', 'exeErrorDiag': '', 'maxSWAP': 0, 'jobMetrics': 'actualCoreCount=4 nEvents=2 dbTime=28.97 dbData=4274626', 'maxRSS': 1596536, 'schedulerID': 'harvester-CERN_central_ACTA', 'endTime': 1616052609.731988, 'maxPSS': 1589268} [2021-03-18 09:30:19] 2021-03-18 07:30:09,732 | DEBUG | queue_monitor | pilot.control.job | write_heartbeat_to_file | wrote heartbeat to file /var/lib/BASE/BOINC01/slots/1/heartbeat.json [2021-03-18 09:30:19] 2021-03-18 07:30:09,732 | DEBUG | queue_monitor | 
pilot.control.job | queue_monitor | job 5002789745 was dequeued from the monitored payloads queue [2021-03-18 09:30:19] 2021-03-18 07:30:09,863 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | tmp job object deleted [2021-03-18 09:30:19] 2021-03-18 07:30:10,111 | INFO | retrieve | pilot.control.job | make_job_report | [2021-03-18 09:30:19] 2021-03-18 07:30:10,111 | INFO | retrieve | pilot.control.job | make_job_report | job summary report [2021-03-18 09:30:19] 2021-03-18 07:30:10,111 | INFO | retrieve | pilot.control.job | make_job_report | -------------------------------------------------- [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | PanDA job id: 5002789745 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | task id: NULL [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | errors: (none) [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | status: LOG_TRANSFER = DONE [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | pilot state: finished [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | transexitcode: 0 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | exeerrorcode: 0 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | exeerrordiag: [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | exitcode: 0 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | exitmsg: OK [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | cpuconsumptiontime: 496 s [2021-03-18 09:30:19] 2021-03-18 
07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | nevents: 2 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | neventsw: 0 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | pid: 21249 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | pgrp: 21249 [2021-03-18 09:30:19] 2021-03-18 07:30:10,112 | INFO | retrieve | pilot.control.job | make_job_report | corecount: 1 [2021-03-18 09:30:19] 2021-03-18 07:30:10,113 | INFO | retrieve | pilot.control.job | make_job_report | event service: False [2021-03-18 09:30:19] 2021-03-18 07:30:10,113 | INFO | retrieve | pilot.control.job | make_job_report | sizes: {0: 8514585, 1: 8515021, 2: 8515045, 3: 8515244, 616: 8548022, 617: 8548439, 618: 8557592, 13: 8515268, 622: 8557782, 621: 8557640} [2021-03-18 09:30:19] 2021-03-18 07:30:10,113 | INFO | retrieve | pilot.control.job | make_job_report | -------------------------------------------------- [2021-03-18 09:30:19] 2021-03-18 07:30:10,113 | INFO | retrieve | pilot.control.job | make_job_report | [2021-03-18 09:30:19] 2021-03-18 07:30:10,113 | DEBUG | retrieve | pilot.control.job | has_job_completed | ls -lF /var/lib/BASE/BOINC01/slots/1: [2021-03-18 09:30:19] [2021-03-18 09:30:19] 2021-03-18 07:30:10,113 | INFO | retrieve | pilot.util.container | execute | executing command: ls -lF /var/lib/BASE/BOINC01/slots/1 [2021-03-18 09:30:19] 2021-03-18 07:30:10,130 | DEBUG | retrieve | pilot.control.job | has_job_completed | total 44700 [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1174577 Mar 18 09:19 agis_schedconf.cvmfs.json [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 0 Mar 18 09:19 boinc_lockfile [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 8192 Mar 18 09:29 boinc_mmap_file [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 532 Mar 18 09:29 boinc_task_state.xml [2021-03-18 09:30:19] -rw-------. 
1 boinc boinc 1948795 Mar 18 09:19 cric_ddmendpoints.json [2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 38019645 Mar 18 09:19 EVNT.04972714._000023.pool.root.1 [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 176041 Mar 18 09:30 fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.1.job.log [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 159953 Mar 18 09:30 fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.1.job.log.tgz [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 130 Mar 18 09:19 gwCNDmMK1gynfZGDcpSWOuwoABFKDmABFKDm2IFNDmdCFKDmccSXTn.diag [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 6563 Mar 18 09:30 heartbeat.json [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 6096 Mar 18 09:19 init_data.xml [2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 1048546 Mar 18 09:19 input.tar.gz [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 112 Mar 18 09:19 job.xml [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1007 Mar 18 09:30 memory_monitor_summary.json [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1769754 Mar 18 09:29 output.1.fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.pool.root [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 546 Mar 18 09:30 output.list [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 2618 Mar 18 09:19 pandaJob.out [2021-03-18 09:30:19] drwxrwx---. 2 boinc boinc 4096 Mar 18 09:30 PanDA_Pilot-5002789745/ [2021-03-18 09:30:19] drwx------. 5 boinc boinc 275 Mar 18 09:19 pilot2/ [2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 1042975 Mar 18 08:24 pilot2.tar.gz [2021-03-18 09:30:19] -rw-------. 1 boinc boinc 154534 Mar 18 09:30 pilotlog.txt [2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 4974 Mar 18 08:28 queuedata.json [2021-03-18 09:30:19] -rwxr-xr-x. 1 boinc boinc 5573 Mar 18 09:19 run_atlas* [2021-03-18 09:30:19] -rwx------. 1 boinc boinc 20043 Mar 18 08:28 runpilot2-wrapper.sh* [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 407 Mar 18 09:19 runtime_log [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 5550 Mar 18 09:19 runtime_log.err [2021-03-18 09:30:19] drwxrwx--x. 
2 boinc boinc 68 Mar 18 09:19 shared/ [2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 16507 Mar 18 09:19 start_atlas.sh [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 1684 Mar 18 09:19 stderr.txt [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 107 Mar 18 09:19 wrapper_26015_x86_64-pc-linux-gnu [2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 24 Mar 18 09:29 wrapper_checkpoint.txt [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue jobs has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue payloads has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_in has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_out has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue current_data_in has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_jobs has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_payloads has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue monitored_payloads has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_jobs has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_payloads has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_in has 1 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | 
pilot.util.queuehandling | queue_report | queue finished_data_out has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,131 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_jobs has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_payloads has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_in has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_out has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobs has 0 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobids has 1 job(s) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.control.job | has_job_completed | job 5002789745 has completed (purged errors) [2021-03-18 09:30:19] 2021-03-18 07:30:10,132 | INFO | retrieve | pilot.util.processes | cleanup | overall cleanup function is called [2021-03-18 09:30:19] 2021-03-18 07:30:10,133 | DEBUG | retrieve | pilot.util.processes | cleanup | work directory was removed: /var/lib/BASE/BOINC01/slots/1/PanDA_Pilot-5002789745 [2021-03-18 09:30:19] 2021-03-18 07:30:11,138 | INFO | retrieve | pilot.info.jobdata | collect_zombies | --- collectZombieJob: --- 10, [21249] [2021-03-18 09:30:19] 2021-03-18 07:30:11,138 | INFO | retrieve | pilot.info.jobdata | collect_zombies | zombie collector trying to kill pid 21249 [2021-03-18 09:30:19] 2021-03-18 07:30:11,138 | INFO | retrieve | pilot.info.jobdata | collect_zombies | harmless exception when collecting zombies: [Errno 10] No child processes [2021-03-18 09:30:19] 2021-03-18 07:30:12,144 | INFO | retrieve | pilot.util.processes | cleanup | collected zombie processes 
[2021-03-18 09:30:19] 2021-03-18 07:30:12,144 | INFO | retrieve | pilot.util.processes | cleanup | will now attempt to kill all subprocesses of pid=21249 [2021-03-18 09:30:19] 2021-03-18 07:30:12,175 | INFO | retrieve | pilot.util.processes | kill_processes | process IDs to be killed: [21249] (in reverse order) [2021-03-18 09:30:19] 2021-03-18 07:30:12,202 | WARNING | retrieve | pilot.util.processes | kill_processes | found no corresponding commands to process id(s) [2021-03-18 09:30:19] 2021-03-18 07:30:12,202 | INFO | retrieve | pilot.util.processes | kill_orphans | Do not look for orphan processes in BOINC jobs [2021-03-18 09:30:19] 2021-03-18 07:30:12,202 | DEBUG | retrieve | pilot.util.queuehandling | purge_queue | queue purged [2021-03-18 09:30:19] 2021-03-18 07:30:12,203 | INFO | retrieve | pilot.control.job | retrieve | ready for new job [2021-03-18 09:30:19] 2021-03-18 07:30:12,203 | INFO | retrieve | root | retrieve | pilot has finished for previous job - re-establishing logging [2021-03-18 09:30:19] 2021-03-18 07:30:12,203 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2021-03-18 09:30:19] 2021-03-18 07:30:12,203 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | *** PanDA Pilot version 2.9.6 (20) *** [2021-03-18 09:30:19] 2021-03-18 07:30:12,203 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2021-03-18 09:30:19] 2021-03-18 07:30:12,203 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | [2021-03-18 09:30:19] 2021-03-18 07:30:12,204 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | pilot is running in a VM [2021-03-18 09:30:19] 2021-03-18 07:30:12,204 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | architecture information: [2021-03-18 09:30:19] 2021-03-18 07:30:12,259 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | [2021-03-18 09:30:19] LSB 
Version: :core-4.1-amd64:core-4.1-noarch [2021-03-18 09:30:19] Distributor ID: CentOS [2021-03-18 09:30:19] Description: CentOS Linux release 7.8.2003 (Core) [2021-03-18 09:30:19] Release: 7.8.2003 [2021-03-18 09:30:19] Codename: Core [2021-03-18 09:30:19] 2021-03-18 07:30:12,260 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2021-03-18 09:30:19] 2021-03-18 07:30:12,761 | DEBUG | retrieve | pilot.util.monitoring | check_local_space | checking local space on /var/lib/BASE/BOINC01/slots/1 [2021-03-18 09:30:19] 2021-03-18 07:30:12,773 | INFO | retrieve | pilot.util.monitoring | check_local_space | sufficient remaining disk space (13964935168 B) [2021-03-18 09:30:19] 2021-03-18 07:30:12,773 | WARNING | retrieve | pilot.control.job | proceed_with_getjob | since timefloor is set to 0, pilot was only allowed to run one job [2021-03-18 09:30:19] 2021-03-18 07:30:12,773 | DEBUG | retrieve | pilot.control.job | retrieve | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:12,773 | DEBUG | retrieve | pilot.control.job | retrieve | [job] retrieve thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:12,797 | WARNING | monitor | pilot.control.monitor | control | aborting monitor loop since graceful_stop has been set [2021-03-18 09:30:19] 2021-03-18 07:30:12,797 | INFO | monitor | pilot.control.monitor | control | [monitor] control thread has ended [2021-03-18 09:30:19] 2021-03-18 07:30:12,802 | WARNING | copytool_out | pilot.util.common | should_abort | data:copytool_out:received graceful stop - abort after this iteration [2021-03-18 09:30:19] 2021-03-18 07:30:12,838 | DEBUG | validate_post | pilot.control.payload | validate_post | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:12,838 | INFO | validate_post | pilot.control.payload | validate_post | [payload] validate_post thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:12,985 | DEBUG | payload | 
pilot.control.payload | control | payload control ending since graceful_stop has been set [2021-03-18 09:30:19] 2021-03-18 07:30:12,985 | DEBUG | payload | pilot.control.payload | control | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:12,985 | DEBUG | payload | pilot.control.payload | control | [payload] control thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,119 | DEBUG | validate_pre | pilot.control.payload | validate_pre | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,120 | INFO | validate_pre | pilot.control.payload | validate_pre | [payload] validate_pre thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,224 | DEBUG | failed_post | pilot.control.payload | failed_post | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,224 | INFO | failed_post | pilot.control.payload | failed_post | [payload] failed_post thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,224 | DEBUG | execute_payloads | pilot.control.payload | execute_payloads | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,225 | INFO | execute_payloads | pilot.control.payload | execute_payloads | [payload] execute_payloads thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,457 | DEBUG | data | pilot.control.data | control | data control ending since graceful_stop has been set [2021-03-18 09:30:19] 2021-03-18 07:30:13,457 | DEBUG | data | pilot.control.data | control | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,457 | DEBUG | data | pilot.control.data | control | [data] control thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,541 | DEBUG | copytool_in | pilot.control.data | copytool_in | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,542 | DEBUG | copytool_in | pilot.control.data | copytool_in | [data] copytool_in thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,568 | DEBUG | job | 
pilot.control.job | control | job control ending since graceful_stop has been set [2021-03-18 09:30:19] 2021-03-18 07:30:13,569 | DEBUG | job | pilot.control.job | control | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,569 | DEBUG | job | pilot.control.job | control | [job] control thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,630 | DEBUG | create_data_payload | pilot.control.job | create_data_payload | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,631 | DEBUG | create_data_payload | pilot.control.job | create_data_payload | [job] create_data_payload thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:13,806 | DEBUG | copytool_out | pilot.control.data | copytool_out | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:13,806 | DEBUG | copytool_out | pilot.control.data | copytool_out | [data] copytool_out thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:14,055 | DEBUG | validate | pilot.control.job | validate | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:14,055 | DEBUG | validate | pilot.control.job | validate | [job] validate thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:14,834 | WARNING | job_monitor | pilot.control.job | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 72 s) [2021-03-18 09:30:19] 2021-03-18 07:30:14,834 | DEBUG | job_monitor | pilot.control.job | job_monitor | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 07:30:14,834 | DEBUG | job_monitor | pilot.control.job | job_monitor | [job] job monitor thread has finished [2021-03-18 09:30:19] 2021-03-18 07:30:14,889 | WARNING | queue_monitor | pilot.util.common | should_abort | job:queue_monitor:received graceful stop - abort after this iteration [2021-03-18 09:30:19] 2021-03-18 07:30:14,889 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | will not set job_aborted yet [2021-03-18 09:30:19] 2021-03-18 
07:30:14,889 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | [job] queue monitor thread has finished
[2021-03-18 09:30:19] 2021-03-18 07:30:15,724 | WARNING | queue_monitoring | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration
[2021-03-18 09:30:19] 2021-03-18 07:30:18,726 | DEBUG | queue_monitoring | pilot.util.processes | threads_aborted | aborting since the last relevant thread is about to finish
[2021-03-18 09:30:19] 2021-03-18 07:30:18,726 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | will proceed to set job_aborted
[2021-03-18 09:30:19] 2021-03-18 07:30:18,726 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished
[2021-03-18 09:30:19] 2021-03-18 07:30:19,315 | INFO | MainThread | pilot.workflow.generic | run | end of generic workflow (traces error code: 0)
[2021-03-18 09:30:19] 2021-03-18 07:30:19,315 | INFO | MainThread | root | wrap_up | traces error code: 0
[2021-03-18 09:30:19] 2021-03-18 07:30:19,315 | INFO | MainThread | root | wrap_up | pilot has finished
[2021-03-18 09:30:19] 2021-03-18 07:30:19,363 [wrapper] ==== pilot stdout END ====
[2021-03-18 09:30:19] 2021-03-18 07:30:19,366 [wrapper] ==== wrapper stdout RESUME ====
[2021-03-18 09:30:19] 2021-03-18 07:30:19,368 [wrapper] Pilot exit status: 0
[2021-03-18 09:30:19] 2021-03-18 07:30:19,380 [wrapper] pandaids: 5002789745
[2021-03-18 09:30:19] 2021-03-18 07:30:19,385 [wrapper] apfmon messages muted
[2021-03-18 09:30:19] 2021-03-18 07:30:19,387 [wrapper] Test setup, not cleaning
[2021-03-18 09:30:19] 2021-03-18 07:30:19,390 [wrapper] ==== wrapper stdout END ====
[2021-03-18 09:30:19] 2021-03-18 07:30:19,392 [wrapper] ==== wrapper stderr END ====
[2021-03-18 09:30:19] 2021-03-18 07:30:19,397 [wrapper] wrapperexiting ec=0, duration=653
[2021-03-18 09:30:19] 2021-03-18 07:30:19,400 [wrapper] apfmon messages muted
[2021-03-18 09:30:19] *** Error codes and diagnostics ***
[2021-03-18 09:30:19] "exeErrorCode": 0,
[2021-03-18 09:30:19] "exeErrorDiag": "",
[2021-03-18 09:30:19] "pilotErrorCode": 0,
[2021-03-18 09:30:19] "pilotErrorDiag": "",
[2021-03-18 09:30:19] *** Listing of results directory ***
[2021-03-18 09:30:19] total 46584
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 1042975 Mar 18 08:24 pilot2.tar.gz
[2021-03-18 09:30:19] -rwx------. 1 boinc boinc 20043 Mar 18 08:28 runpilot2-wrapper.sh
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 4974 Mar 18 08:28 queuedata.json
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 107 Mar 18 09:19 wrapper_26015_x86_64-pc-linux-gnu
[2021-03-18 09:30:19] -rwxr-xr-x. 1 boinc boinc 5573 Mar 18 09:19 run_atlas
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 112 Mar 18 09:19 job.xml
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 6096 Mar 18 09:19 init_data.xml
[2021-03-18 09:30:19] drwxrwx--x. 2 boinc boinc 68 Mar 18 09:19 shared
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 0 Mar 18 09:19 boinc_lockfile
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 38019645 Mar 18 09:19 EVNT.04972714._000023.pool.root.1
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 16507 Mar 18 09:19 start_atlas.sh
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 1048546 Mar 18 09:19 input.tar.gz
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 2618 Mar 18 09:19 pandaJob.out
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1174577 Mar 18 09:19 agis_schedconf.cvmfs.json
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1948795 Mar 18 09:19 cric_ddmendpoints.json
[2021-03-18 09:30:19] drwx------. 5 boinc boinc 275 Mar 18 09:19 pilot2
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 24 Mar 18 09:29 wrapper_checkpoint.txt
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 8192 Mar 18 09:29 boinc_mmap_file
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 532 Mar 18 09:29 boinc_task_state.xml
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1769754 Mar 18 09:29 output.1.fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.pool.root
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1007 Mar 18 09:30 memory_monitor_summary.json
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 159953 Mar 18 09:30 fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.1.job.log.tgz
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 6563 Mar 18 09:30 heartbeat.json
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 8830 Mar 18 09:30 pilotlog.txt
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 192934 Mar 18 09:30 fee03ebf-9e28-42cc-9f74-f58a7546aa05_96343.1.job.log
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 546 Mar 18 09:30 output.list
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 9441 Mar 18 09:30 runtime_log.err
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 748 Mar 18 09:30 runtime_log
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 2140160 Mar 18 09:30 result.tar.gz
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 565 Mar 18 09:30 gwCNDmMK1gynfZGDcpSWOuwoABFKDmABFKDm2IFNDmdCFKDmccSXTn.diag
[2021-03-18 09:30:19] -rw-rw-r--. 1 boinc boinc 39492 Mar 18 09:30 stderr.txt
[2021-03-18 09:30:19] HITS file was successfully produced:
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1769754 Mar 18 09:29 shared/HITS.pool.root.1
[2021-03-18 09:30:19] *** Contents of shared directory: ***
[2021-03-18 09:30:19] total 42000
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 38019645 Mar 18 09:19 ATLAS.root_0
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 16507 Mar 18 09:19 start_atlas.sh
[2021-03-18 09:30:19] -rw-r--r--. 1 boinc boinc 1048546 Mar 18 09:19 input.tar.gz
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 1769754 Mar 18 09:29 HITS.pool.root.1
[2021-03-18 09:30:19] -rw-------. 1 boinc boinc 2140160 Mar 18 09:30 result.tar.gz
09:30:21 (14415): run_atlas exited; CPU time 491.857426
09:30:21 (14415): called boinc_finish(0)
</stderr_txt>
]]>
©2024 CERN