Name | 2xoKDmpa2gynfZGDcpSWOuwoABFKDmABFKDm2IFNDmlCFKDm9DMZ8m_0 |
Workunit | 2063713 |
Created | 18 Mar 2021, 7:49:49 UTC |
Sent | 18 Mar 2021, 7:55:23 UTC |
Report deadline | 25 Mar 2021, 7:55:23 UTC |
Received | 18 Mar 2021, 8:21:30 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 4389 |
Run time | 10 min 59 sec |
CPU time | 8 min 16 sec |
Validate state | Valid |
Credit | 10.91 |
Device peak FLOPS | 3.56 GFLOPS |
Application version | ATLAS long simulation v1.00 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 1.37 GB |
Peak swap size | 2.23 GB |
Peak disk usage | 88.05 MB |
<core_client_version>7.16.11</core_client_version> <![CDATA[ <stderr_txt> 10:10:21 (8569): wrapper (7.7.26015): starting 10:10:21 (8569): wrapper: running run_atlas (--nthreads 1) [2021-03-18 10:10:21] Arguments: --nthreads 1 [2021-03-18 10:10:21] Threads: 1 [2021-03-18 10:10:21] Checking for CVMFS [2021-03-18 10:10:22] Probing /cvmfs/atlas.cern.ch... OK [2021-03-18 10:10:22] Probing /cvmfs/atlas-condb.cern.ch... OK [2021-03-18 10:10:23] Probing /cvmfs/grid.cern.ch... OK [2021-03-18 10:10:24] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2021-03-18 10:10:24] 2.8.0.0 15600 24 55004 81003 3 11 3947899 4096000 0 130560 0 32466 99.9969 536 309 http://cvmfs-stratum-one.cern.ch:8000/cvmfs/atlas.cern.ch DIRECT 1 [2021-03-18 10:10:24] CVMFS is ok [2021-03-18 10:10:24] Using singularity image /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img [2021-03-18 10:10:24] Checking for singularity binary... 
[2021-03-18 10:10:24] Using singularity found in PATH at /usr/bin/singularity [2021-03-18 10:10:24] Running /usr/bin/singularity --version [2021-03-18 10:10:24] singularity version 3.7.1-1.el7 [2021-03-18 10:10:24] Checking singularity works with /usr/bin/singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img hostname [2021-03-18 10:10:24] CentOS7 [2021-03-18 10:10:24] Singularity works [2021-03-18 10:10:24] Starting ATLAS job with PandaID=5002834296 [2021-03-18 10:10:24] Running command: /usr/bin/singularity exec --pwd /var/lib/boinc/slots/1 -B /cvmfs,/var /cvmfs/atlas.cern.ch/repo/containers/images/singularity/x86_64-centos7.img sh start_atlas.sh [2021-03-18 10:21:18] *** The last 200 lines of the pilot log: *** [2021-03-18 10:21:18] "externalCpuTime": 2, [2021-03-18 10:21:18] "processedEvents": 2, [2021-03-18 10:21:18] "trfPredata": null, [2021-03-18 10:21:18] "wallTime": 592 [2021-03-18 10:21:18] } [2021-03-18 10:21:18] } [2021-03-18 10:21:18] } [2021-03-18 10:21:18] 2021-03-18 08:21:11,694 | DEBUG | queue_monitor | pilot.user.atlas.common | update_server | no need to update logstash for this job [2021-03-18 10:21:18] 2021-03-18 08:21:11,694 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | state=finished [2021-03-18 10:21:18] 2021-03-18 08:21:11,694 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=running [2021-03-18 10:21:18] 2021-03-18 08:21:11,694 | DEBUG | queue_monitor | pilot.control.job | get_proper_state | serverstate=finished [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | INFO | queue_monitor | pilot.control.job | send_state | pilot will not update the server (heartbeat message will be written to file) [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | INFO | queue_monitor | pilot.control.job | send_state | job 5002834296 has finished - writing final server update [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | DEBUG | queue_monitor | pilot.control.job | 
get_data_structure | building data structure to be sent to server with heartbeat [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | DEBUG | queue_monitor | pilot.user.atlas.jobmetrics | get_job_metrics_string | job definition core count: 1 [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | INFO | queue_monitor | pilot.user.atlas.jobmetrics | get_job_metrics_string | will not add max space = -36651581 B to job metrics [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | DEBUG | queue_monitor | pilot.api.analytics | get_fitted_data | removing tails from data to be fitted [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | WARNING | queue_monitor | pilot.api.analytics | get_fitted_data | wrong length of table data, x=[1616055365.0, 1616055426.0, 1616055487.0], y=[1508620.0, 1520140.0, 1522384.0] (must be same and length>=4) [2021-03-18 10:21:18] 2021-03-18 08:21:11,695 | DEBUG | queue_monitor | pilot.user.atlas.jobmetrics | get_job_metrics | job metrics="actualCoreCount=3 nEvents=2 dbTime=29.41 dbData=4274626" [2021-03-18 10:21:18] 2021-03-18 08:21:11,696 | INFO | queue_monitor | pilot.control.job | get_data_structure | mean actualcorecount: 2.875000 [2021-03-18 10:21:18] 2021-03-18 08:21:11,696 | INFO | queue_monitor | pilot.control.job | get_data_structure | total number of processed events: 2 (read) [2021-03-18 10:21:18] 2021-03-18 08:21:11,696 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_values | using path: /var/lib/boinc/slots/1/PanDA_Pilot-5002834296/memory_monitor_summary.json (trf name=prmon) [2021-03-18 10:21:18] 2021-03-18 08:21:11,697 | DEBUG | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | summary_dictionary={'Max': {'rx_packets': 11194, 'nprocs': 5, 'nthreads': 6, 'rx_bytes': 9467399, 'wtime': 552, 'rss': 1540492, 'write_bytes': 2134016, 'vmem': 2916808, 'read_bytes': 1774447616, 'stime': 11, 'tx_bytes': 3712883, 'pss': 1533288, 'wchar': 5446590, 'rchar': 573739822, 'tx_packets': 7269, 'swap': 0, 'utime': 438}, 'Avg': 
{'write_bytes': 3863.0, 'nprocs': 4.7, 'nthreads': 5.599, 'rx_bytes': 17138.0, 'rx_packets': 20.264, 'vmem': 2392546.0, 'read_bytes': 3212251.0, 'swap': 0.0, 'tx_bytes': 6721.0, 'pss': 1161498.0, 'wchar': 9859.0, 'rchar': 1038631.0, 'tx_packets': 13.158, 'rss': 1168171.0}, 'HW': {'mem': {'MemTotal': 4654084}, 'cpu': {'CoresPerSocket': 4, 'ModelName': 'Intel(R) Core(TM) i5-4440 CPU @ 3.10GHz', 'ThreadsPerCore': 1, 'CPUs': 4, 'Sockets': 1}}, 'prmon': {'Version': '2.2.0'}} [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard info from prmon json [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.user.atlas.utilities | get_memory_monitor_info | extracted standard memory fields from prmon json [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | .............................. [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . Timing measurements: [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . get job = 0 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . initial setup = 0 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . payload setup = 8 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . total setup = 8 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . stage-in = 0 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . payload execution = 604 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | . 
stage-out = 0 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | INFO | queue_monitor | pilot.util.timing | timing_report | .............................. [2021-03-18 10:21:18] 2021-03-18 08:21:11,698 | DEBUG | queue_monitor | pilot.control.job | send_state | is_harvester_mode(args) : False [2021-03-18 10:21:18] 2021-03-18 08:21:11,699 | DEBUG | queue_monitor | pilot.control.job | write_heartbeat_to_file | heartbeat dictionary: {'pilotErrorCode': 0, 'rateWBYTES': 3863.0, 'pilotID': 'http://aipanda404.cern.ch/data/jobs/2021-03-18/BOINC-TEST/5002834296.out|PR|2.9.6 (20)', 'meanCoreCount': 2.875, 'totRBYTES': 1774447616, 'siteName': 'BOINC-TEST', 'avgVMEM': 2392546.0, 'coreCount': 1, 'totWCHAR': 5446590, 'rateRCHAR': 1038631.0, 'jobId': '5002834296', 'totRCHAR': 573739822, 'exeErrorCode': 0, 'rateWCHAR': 9859.0, 'metaData': '{\n "cmdLine": "\'/cvmfs/atlas.cern.ch/repo/sw/software/21.0/AtlasOffline/21.0.16/InstallArea/x86_64-slc6-gcc49-opt/share/Sim_tf.py\' \'--maxEvents=2\' \'--skipEvents=0\' \'--firstEvent=118001\' \'--randomSeed=119\' \'--DBRelease=all:current\' \'--geometryVersion=default:ATLAS-R2-2016-01-00-01_VALIDATION\' \'--conditionsTag\' \'default:OFLCOND-MC16-SDR-14\' \'--physicsList=FTFP_BERT_ATL_VALIDATION\' \'--preExec\' \'EVNTtoHITS:simFlags.SimBarcodeOffset.set_Value_and_Lock(200000)\' \'EVNTtoHITS:simFlags.TRTRangeCut=30.0;simFlags.TightMuonStepping=True\' \'--postInclude\' \'default:RecJobTransforms/UseFrontier.py\' \'--simulator=FullG4\' \'--truthStrategy=MC15aPlus\' \'--DataRunNumber=361106\' \'--outputHitsFile\' \'output.1.628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.pool.root\' \'--inputEvgenFile\' \'EVNT.04972714._000023.pool.root.1\'", \n "created": "2021-03-18T10:21:00", \n "executor": [\n {\n "asetup": null, \n "errMsg": "", \n "exeConfig": {\n "inputs": [\n "EVNT"\n ], \n "outputs": [\n "HITS"\n ], \n "script": "athena.py", \n "substep": "sim"\n }, \n "logfileReport": {\n "countSummary": {\n "CATASTROPHE": 0, \n "CRITICAL": 0, \n "DEBUG": 0, \n 
"ERROR": 0, \n "FATAL": 0, \n "IGNORED": 0, \n "INFO": 2772, \n "UNKNOWN": 6336, \n "VERBOSE": 0, \n "WARNING": 31\n }, \n "details": {}\n }, \n "metaData": {}, \n "name": "EVNTtoHITS", \n "rc": 0, \n "statusOK": true, \n "validation": true\n }\n ], \n "exitAcronym": "OK", \n "exitCode": 0, \n "exitMsg": "OK", \n "exitMsgExtra": "", \n "files": {\n "input": [\n {\n "dataset": null, \n "nentries": 1000, \n "subFiles": [\n {\n "file_guid": "527922C2-75F2-064D-A171-672D7A39A6CB", \n "name": "EVNT.04972714._000023.pool.root.1"\n }\n ], \n "type": "EVNT"\n }\n ], \n "output": [\n {\n "argName": "outputHITSFile", \n "dataset": null, \n "subFiles": [\n {\n "file_guid": "832CA19A-8589-1A4C-8F6C-5AB49B0B34B3", \n "file_size": 1769748, \n "name": "output.1.628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.pool.root", \n "nentries": 2\n }\n ], \n "type": "HITS"\n }\n ]\n }, \n "name": "Sim_tf", \n "reportVersion": "2.0.7", \n "resource": {\n "dbDataTotal": 8549252, \n "dbTimeTotal": 58.82, \n "executor": {\n "EVNTtoHITS": {\n "cpuTime": 482, \n "dbData": 4274626, \n "dbTime": 29.41, \n "memory": {\n "Avg": {\n "avgPSS": 1123874, \n "avgRSS": 1130944, \n "avgSwap": 0, \n "avgVMEM": 1904492, \n "rateRBYTES": 2698466, \n "rateRCHAR": 946105, \n "rateWBYTES": 2033, \n "rateWCHAR": 6259\n }, \n "Max": {\n "maxPSS": 1437712, \n "maxRSS": 1440912, \n "maxSwap": 0, \n "maxVMEM": 2247724, \n "totRBYTES": 1538125824, \n "totRCHAR": 539279917, \n "totWBYTES": 1159168, \n "totWCHAR": 3568185\n }\n }, \n "nevents": 2, \n "postExe": {\n "cpuTime": 1, \n "wallTime": 1\n }, \n "preExe": {\n "cpuTime": 0, \n "wallTime": 1\n }, \n "total": {\n "cpuTime": 483, \n "wallTime": 585\n }, \n "validation": {\n "cpuTime": 0, \n "wallTime": 1\n }, \n "wallTime": 583\n }\n }, \n "machine": {\n "cpu_family": "6", \n "linux_distribution": [\n "CentOS Linux", \n "7.8.2003", \n "Core"\n ], \n "model": "60", \n "model_name": "Intel(R) Core(TM) i5-4440 CPU @ 3.10GHz", \n "node": "CentOS7", \n "platform": 
"Linux-3.10.0-1160.15.2.el7.x86_64-x86_64-with-centos-7.8.2003-Core"\n }, \n "transform": {\n "cpuEfficiency": 0.8209, \n "cpuPWEfficiency": 0.8209, \n "cpuTime": 4, \n "cpuTimeTotal": 483, \n "externalCpuTime": 2, \n "processedEvents": 2, \n "trfPredata": null, \n "wallTime": 592\n }\n }\n}', 'xml': '{"output.1.628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.pool.root": {"adler32": "36e453d6", "surl": "root://eosatlas.cern.ch:1094//eos/atlas/atlasdatadisk/rucio/hc_test/3d/ce/output.1.628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.pool.root", "guid": "832CA19A-8589-1A4C-8F6C-5AB49B0B34B3", "fsize": 1769748}, "628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.1.job.log.tgz": {"adler32": "e3adc229", "surl": "root://eosatlas.cern.ch:1094//eos/atlas/atlasdatadisk/rucio/hc_test/84/be/628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.1.job.log.tgz", "guid": "903076d4-581f-4934-9ce0-543149291101", "fsize": 159187}}', 'maxVMEM': 2916808, 'cpuConversionFactor': 1.0, 'avgSWAP': 0.0, 'state': 'finished', 'transExitCode': 0, 'pilotErrorDiag': '', 'node': 'CentOS7', 'avgRSS': 1168171.0, 'avgPSS': 1161498.0, 'timestamp': '2021-03-18T10:21:11+02:00', 'pilotTiming': '0|0|604|0|8', 'attemptNr': 0, 'totWBYTES': 2134016, 'nEvents': 2, 'rateRBYTES': 3212251.0, 'pilotLog': '', 'cpuConsumptionTime': 500, 'startTime': 1616055034.701567, 'cpuConsumptionUnit': 's+Intel(R) Core(TM) i5-4440 CPU @ 3.10GHz 6144 KB', 'exeErrorDiag': '', 'maxSWAP': 0, 'jobMetrics': 'actualCoreCount=3 nEvents=2 dbTime=29.41 dbData=4274626', 'maxRSS': 1540492, 'schedulerID': 'harvester-CERN_central_ACTA', 'endTime': 1616055671.698799, 'maxPSS': 1533288} [2021-03-18 10:21:18] 2021-03-18 08:21:11,699 | DEBUG | queue_monitor | pilot.control.job | write_heartbeat_to_file | wrote heartbeat to file /var/lib/boinc/slots/1/heartbeat.json [2021-03-18 10:21:18] 2021-03-18 08:21:11,699 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | job 5002834296 was dequeued from the monitored payloads queue [2021-03-18 10:21:18] 2021-03-18 
08:21:11,825 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | tmp job object deleted [2021-03-18 10:21:18] 2021-03-18 08:21:11,860 | INFO | retrieve | pilot.control.job | make_job_report | [2021-03-18 10:21:18] 2021-03-18 08:21:11,860 | INFO | retrieve | pilot.control.job | make_job_report | job summary report [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | -------------------------------------------------- [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | PanDA job id: 5002834296 [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | task id: NULL [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | errors: (none) [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | status: LOG_TRANSFER = DONE [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | pilot state: finished [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | transexitcode: 0 [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | exeerrorcode: 0 [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | exeerrordiag: [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | exitcode: 0 [2021-03-18 10:21:18] 2021-03-18 08:21:11,861 | INFO | retrieve | pilot.control.job | make_job_report | exitmsg: OK [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | cpuconsumptiontime: 500 s [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | nevents: 2 [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | 
retrieve | pilot.control.job | make_job_report | neventsw: 0 [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | pid: 15480 [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | pgrp: 15480 [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | corecount: 1 [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | event service: False [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | sizes: {0: 8514990, 1: 8515213, 2: 8515213, 616: 8547984, 617: 8547633, 618: 8557540, 12: 8515237, 621: 8557588, 624: 8557730} [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | -------------------------------------------------- [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.control.job | make_job_report | [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | DEBUG | retrieve | pilot.control.job | has_job_completed | ls -lF /var/lib/boinc/slots/1: [2021-03-18 10:21:18] [2021-03-18 10:21:18] 2021-03-18 08:21:11,862 | INFO | retrieve | pilot.util.container | execute | executing command: ls -lF /var/lib/boinc/slots/1 [2021-03-18 10:21:18] 2021-03-18 08:21:11,878 | DEBUG | retrieve | pilot.control.job | has_job_completed | total 44700 [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 130 Mar 18 10:10 2xoKDmpa2gynfZGDcpSWOuwoABFKDmABFKDm2IFNDmlCFKDm9DMZ8m.diag [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 171380 Mar 18 10:21 628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.1.job.log [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 159187 Mar 18 10:21 628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.1.job.log.tgz [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1174577 Mar 18 10:10 agis_schedconf.cvmfs.json [2021-03-18 10:21:18] -rw-r--r--. 
1 boinc boinc 0 Mar 18 10:10 boinc_lockfile [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 8192 Mar 18 10:21 boinc_mmap_file [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 532 Mar 18 10:21 boinc_task_state.xml [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1948795 Mar 18 10:10 cric_ddmendpoints.json [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 38019645 Mar 18 10:10 EVNT.04972714._000023.pool.root.1 [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 6563 Mar 18 10:21 heartbeat.json [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 6082 Mar 18 10:10 init_data.xml [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 1048537 Mar 18 10:10 input.tar.gz [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 112 Mar 18 10:10 job.xml [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1002 Mar 18 10:21 memory_monitor_summary.json [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1769748 Mar 18 10:20 output.1.628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.pool.root [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 546 Mar 18 10:21 output.list [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 2618 Mar 18 10:10 pandaJob.out [2021-03-18 10:21:18] drwxrwx---. 2 boinc boinc 4096 Mar 18 10:21 PanDA_Pilot-5002834296/ [2021-03-18 10:21:18] drwx------. 5 boinc boinc 4096 Mar 18 10:10 pilot2/ [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 1042975 Mar 18 09:24 pilot2.tar.gz [2021-03-18 10:21:18] -rw-------. 1 boinc boinc 153624 Mar 18 10:21 pilotlog.txt [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 4974 Mar 18 09:49 queuedata.json [2021-03-18 10:21:18] -rwxr-xr-x. 1 boinc boinc 5573 Mar 18 10:10 run_atlas* [2021-03-18 10:21:18] -rwx------. 1 boinc boinc 20043 Mar 18 09:49 runpilot2-wrapper.sh* [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 407 Mar 18 10:10 runtime_log [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 5452 Mar 18 10:10 runtime_log.err [2021-03-18 10:21:18] drwxrwx--x. 2 boinc boinc 68 Mar 18 10:10 shared/ [2021-03-18 10:21:18] -rw-r--r--. 
1 boinc boinc 16507 Mar 18 10:10 start_atlas.sh [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 1673 Mar 18 10:10 stderr.txt [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 107 Mar 18 10:10 wrapper_26015_x86_64-pc-linux-gnu [2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 24 Mar 18 10:21 wrapper_checkpoint.txt [2021-03-18 10:21:18] 2021-03-18 08:21:11,878 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue jobs has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,878 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue payloads has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,878 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_in has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,878 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue data_out has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue current_data_in has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_jobs has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue validated_payloads has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue monitored_payloads has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_jobs has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_payloads has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_in has 1 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue finished_data_out has 0 
job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_jobs has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_payloads has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_in has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue failed_data_out has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobs has 0 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.queuehandling | queue_report | queue completed_jobids has 1 job(s) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.control.job | has_job_completed | job 5002834296 has completed (purged errors) [2021-03-18 10:21:18] 2021-03-18 08:21:11,879 | INFO | retrieve | pilot.util.processes | cleanup | overall cleanup function is called [2021-03-18 10:21:18] 2021-03-18 08:21:11,881 | DEBUG | retrieve | pilot.util.processes | cleanup | work directory was removed: /var/lib/boinc/slots/1/PanDA_Pilot-5002834296 [2021-03-18 10:21:18] 2021-03-18 08:21:12,886 | INFO | retrieve | pilot.info.jobdata | collect_zombies | --- collectZombieJob: --- 10, [15480] [2021-03-18 10:21:18] 2021-03-18 08:21:12,886 | INFO | retrieve | pilot.info.jobdata | collect_zombies | zombie collector trying to kill pid 15480 [2021-03-18 10:21:18] 2021-03-18 08:21:12,886 | INFO | retrieve | pilot.info.jobdata | collect_zombies | harmless exception when collecting zombies: [Errno 10] No child processes [2021-03-18 10:21:18] 2021-03-18 08:21:13,814 | WARNING | job_monitor | pilot.control.job | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 72 s) [2021-03-18 10:21:18] 2021-03-18 
08:21:13,907 | INFO | retrieve | pilot.util.processes | cleanup | collected zombie processes [2021-03-18 10:21:18] 2021-03-18 08:21:13,907 | INFO | retrieve | pilot.util.processes | cleanup | will now attempt to kill all subprocesses of pid=15480 [2021-03-18 10:21:18] 2021-03-18 08:21:13,937 | INFO | retrieve | pilot.util.processes | kill_processes | process IDs to be killed: [15480] (in reverse order) [2021-03-18 10:21:18] 2021-03-18 08:21:13,964 | WARNING | retrieve | pilot.util.processes | kill_processes | found no corresponding commands to process id(s) [2021-03-18 10:21:18] 2021-03-18 08:21:13,964 | INFO | retrieve | pilot.util.processes | kill_orphans | Do not look for orphan processes in BOINC jobs [2021-03-18 10:21:18] 2021-03-18 08:21:13,964 | DEBUG | retrieve | pilot.util.queuehandling | purge_queue | queue purged [2021-03-18 10:21:18] 2021-03-18 08:21:13,965 | INFO | retrieve | pilot.control.job | retrieve | ready for new job [2021-03-18 10:21:18] 2021-03-18 08:21:13,965 | INFO | retrieve | root | retrieve | pilot has finished for previous job - re-establishing logging [2021-03-18 10:21:18] 2021-03-18 08:21:13,965 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2021-03-18 10:21:18] 2021-03-18 08:21:13,965 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | *** PanDA Pilot version 2.9.6 (20) *** [2021-03-18 10:21:18] 2021-03-18 08:21:13,965 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2021-03-18 10:21:18] 2021-03-18 08:21:13,965 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | [2021-03-18 10:21:18] 2021-03-18 08:21:13,966 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | pilot is running in a VM [2021-03-18 10:21:18] 2021-03-18 08:21:13,966 | INFO | retrieve | pilot.util.auxiliary | display_architecture_info | architecture information: [2021-03-18 10:21:18] 2021-03-18 08:21:14,024 | INFO 
| retrieve | pilot.util.auxiliary | display_architecture_info | [2021-03-18 10:21:18] LSB Version: :core-4.1-amd64:core-4.1-noarch [2021-03-18 10:21:18] Distributor ID: CentOS [2021-03-18 10:21:18] Description: CentOS Linux release 7.8.2003 (Core) [2021-03-18 10:21:18] Release: 7.8.2003 [2021-03-18 10:21:18] Codename: Core [2021-03-18 10:21:18] 2021-03-18 08:21:14,025 | INFO | retrieve | pilot.util.auxiliary | pilot_version_banner | **************************************** [2021-03-18 10:21:18] 2021-03-18 08:21:14,528 | DEBUG | retrieve | pilot.util.monitoring | check_local_space | checking local space on /var/lib/boinc/slots/1 [2021-03-18 10:21:18] 2021-03-18 08:21:14,539 | INFO | retrieve | pilot.util.monitoring | check_local_space | sufficient remaining disk space (13954449408 B) [2021-03-18 10:21:18] 2021-03-18 08:21:14,539 | WARNING | retrieve | pilot.control.job | proceed_with_getjob | since timefloor is set to 0, pilot was only allowed to run one job [2021-03-18 10:21:18] 2021-03-18 08:21:14,539 | DEBUG | retrieve | pilot.control.job | retrieve | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:14,539 | DEBUG | retrieve | pilot.control.job | retrieve | [job] retrieve thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:14,542 | WARNING | copytool_out | pilot.util.common | should_abort | data:copytool_out:received graceful stop - abort after this iteration [2021-03-18 10:21:18] 2021-03-18 08:21:14,542 | WARNING | monitor | pilot.control.monitor | control | aborting monitor loop since graceful_stop has been set [2021-03-18 10:21:18] 2021-03-18 08:21:14,543 | INFO | monitor | pilot.control.monitor | control | [monitor] control thread has ended [2021-03-18 10:21:18] 2021-03-18 08:21:14,579 | WARNING | job_monitor | pilot.util.common | should_abort | job:job_monitor:received graceful stop - abort after this iteration [2021-03-18 10:21:18] 2021-03-18 08:21:14,649 | DEBUG | validate_post | pilot.control.payload | validate_post | will not 
set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:14,649 | INFO | validate_post | pilot.control.payload | validate_post | [payload] validate_post thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:14,774 | WARNING | queue_monitoring | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration [2021-03-18 10:21:18] 2021-03-18 08:21:14,864 | DEBUG | execute_payloads | pilot.control.payload | execute_payloads | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:14,864 | INFO | execute_payloads | pilot.control.payload | execute_payloads | [payload] execute_payloads thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:14,907 | DEBUG | validate | pilot.control.job | validate | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:14,907 | DEBUG | validate | pilot.control.job | validate | [job] validate thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:14,910 | DEBUG | validate_pre | pilot.control.payload | validate_pre | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:14,910 | INFO | validate_pre | pilot.control.payload | validate_pre | [payload] validate_pre thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:15,437 | DEBUG | create_data_payload | pilot.control.job | create_data_payload | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:15,437 | DEBUG | create_data_payload | pilot.control.job | create_data_payload | [job] create_data_payload thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:15,437 | DEBUG | copytool_in | pilot.control.data | copytool_in | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:15,438 | DEBUG | copytool_in | pilot.control.data | copytool_in | [data] copytool_in thread has finished [2021-03-18 10:21:18] 2021-03-18 08:21:15,569 | DEBUG | copytool_out | pilot.control.data | copytool_out | will not set job_aborted yet [2021-03-18 10:21:18] 2021-03-18 08:21:15,570 | 
DEBUG | copytool_out | pilot.control.data | copytool_out | [data] copytool_out thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:15,585 | DEBUG | job_monitor | pilot.control.job | job_monitor | will not set job_aborted yet
[2021-03-18 10:21:18] 2021-03-18 08:21:15,585 | DEBUG | job_monitor | pilot.control.job | job_monitor | [job] job monitor thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:15,604 | DEBUG | data | pilot.control.data | control | data control ending since graceful_stop has been set
[2021-03-18 10:21:18] 2021-03-18 08:21:15,604 | DEBUG | data | pilot.control.data | control | will not set job_aborted yet
[2021-03-18 10:21:18] 2021-03-18 08:21:15,604 | DEBUG | data | pilot.control.data | control | [data] control thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:15,617 | DEBUG | job | pilot.control.job | control | job control ending since graceful_stop has been set
[2021-03-18 10:21:18] 2021-03-18 08:21:15,617 | DEBUG | job | pilot.control.job | control | will not set job_aborted yet
[2021-03-18 10:21:18] 2021-03-18 08:21:15,617 | DEBUG | job | pilot.control.job | control | [job] control thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:15,642 | DEBUG | payload | pilot.control.payload | control | payload control ending since graceful_stop has been set
[2021-03-18 10:21:18] 2021-03-18 08:21:15,642 | DEBUG | payload | pilot.control.payload | control | will not set job_aborted yet
[2021-03-18 10:21:18] 2021-03-18 08:21:15,642 | DEBUG | payload | pilot.control.payload | control | [payload] control thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:15,822 | DEBUG | failed_post | pilot.control.payload | failed_post | will not set job_aborted yet
[2021-03-18 10:21:18] 2021-03-18 08:21:15,822 | INFO | failed_post | pilot.control.payload | failed_post | [payload] failed_post thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:16,880 | WARNING | queue_monitor | pilot.util.common | should_abort | job:queue_monitor:received graceful stop - abort after this iteration
[2021-03-18 10:21:18] 2021-03-18 08:21:16,880 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | will not set job_aborted yet
[2021-03-18 10:21:18] 2021-03-18 08:21:16,880 | DEBUG | queue_monitor | pilot.control.job | queue_monitor | [job] queue monitor thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:17,778 | DEBUG | queue_monitoring | pilot.util.processes | threads_aborted | aborting since the last relevant thread is about to finish
[2021-03-18 10:21:18] 2021-03-18 08:21:17,778 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | will proceed to set job_aborted
[2021-03-18 10:21:18] 2021-03-18 08:21:17,778 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:18,781 | INFO | MainThread | pilot.workflow.generic | run | end of generic workflow (traces error code: 0)
[2021-03-18 10:21:18] 2021-03-18 08:21:18,781 | INFO | MainThread | root | wrap_up | traces error code: 0
[2021-03-18 10:21:18] 2021-03-18 08:21:18,781 | INFO | MainThread | root | wrap_up | pilot has finished
[2021-03-18 10:21:18] 2021-03-18 08:21:18,829 [wrapper] ==== pilot stdout END ====
[2021-03-18 10:21:18] 2021-03-18 08:21:18,832 [wrapper] ==== wrapper stdout RESUME ====
[2021-03-18 10:21:18] 2021-03-18 08:21:18,835 [wrapper] Pilot exit status: 0
[2021-03-18 10:21:18] 2021-03-18 08:21:18,843 [wrapper] pandaids: 5002834296
[2021-03-18 10:21:18] 2021-03-18 08:21:18,848 [wrapper] apfmon messages muted
[2021-03-18 10:21:18] 2021-03-18 08:21:18,851 [wrapper] Test setup, not cleaning
[2021-03-18 10:21:18] 2021-03-18 08:21:18,854 [wrapper] ==== wrapper stdout END ====
[2021-03-18 10:21:18] 2021-03-18 08:21:18,856 [wrapper] ==== wrapper stderr END ====
[2021-03-18 10:21:18] 2021-03-18 08:21:18,861 [wrapper] wrapperexiting ec=0, duration=653
[2021-03-18 10:21:18] 2021-03-18 08:21:18,863 [wrapper] apfmon messages muted
[2021-03-18 10:21:18] *** Error codes and diagnostics ***
[2021-03-18 10:21:18] "exeErrorCode": 0,
[2021-03-18 10:21:18] "exeErrorDiag": "",
[2021-03-18 10:21:18] "pilotErrorCode": 0,
[2021-03-18 10:21:18] "pilotErrorDiag": "",
[2021-03-18 10:21:18] *** Listing of results directory ***
[2021-03-18 10:21:18] total 46580
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 1042975 Mar 18 09:24 pilot2.tar.gz
[2021-03-18 10:21:18] -rwx------. 1 boinc boinc 20043 Mar 18 09:49 runpilot2-wrapper.sh
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 4974 Mar 18 09:49 queuedata.json
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 107 Mar 18 10:10 wrapper_26015_x86_64-pc-linux-gnu
[2021-03-18 10:21:18] -rwxr-xr-x. 1 boinc boinc 5573 Mar 18 10:10 run_atlas
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 112 Mar 18 10:10 job.xml
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 6082 Mar 18 10:10 init_data.xml
[2021-03-18 10:21:18] drwxrwx--x. 2 boinc boinc 68 Mar 18 10:10 shared
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 0 Mar 18 10:10 boinc_lockfile
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 38019645 Mar 18 10:10 EVNT.04972714._000023.pool.root.1
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 16507 Mar 18 10:10 start_atlas.sh
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 1048537 Mar 18 10:10 input.tar.gz
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 2618 Mar 18 10:10 pandaJob.out
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1174577 Mar 18 10:10 agis_schedconf.cvmfs.json
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1948795 Mar 18 10:10 cric_ddmendpoints.json
[2021-03-18 10:21:18] drwx------. 5 boinc boinc 4096 Mar 18 10:10 pilot2
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1769748 Mar 18 10:20 output.1.628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.pool.root
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1002 Mar 18 10:21 memory_monitor_summary.json
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 8192 Mar 18 10:21 boinc_mmap_file
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 24 Mar 18 10:21 wrapper_checkpoint.txt
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 159187 Mar 18 10:21 628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.1.job.log.tgz
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 532 Mar 18 10:21 boinc_task_state.xml
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 6563 Mar 18 10:21 heartbeat.json
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 8832 Mar 18 10:21 pilotlog.txt
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 188449 Mar 18 10:21 628fdc1e-9592-478c-a65f-b4b4c5fc13ec_87131.1.job.log
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 546 Mar 18 10:21 output.list
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 748 Mar 18 10:21 runtime_log
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 2140160 Mar 18 10:21 result.tar.gz
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 9336 Mar 18 10:21 runtime_log.err
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 566 Mar 18 10:21 2xoKDmpa2gynfZGDcpSWOuwoABFKDmABFKDm2IFNDmlCFKDm9DMZ8m.diag
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 39584 Mar 18 10:21 stderr.txt
[2021-03-18 10:21:18] HITS file was successfully produced:
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1769748 Mar 18 10:20 shared/HITS.pool.root.1
[2021-03-18 10:21:18] *** Contents of shared directory: ***
[2021-03-18 10:21:18] total 42000
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 38019645 Mar 18 10:10 ATLAS.root_0
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 16507 Mar 18 10:10 start_atlas.sh
[2021-03-18 10:21:18] -rw-r--r--. 1 boinc boinc 1048537 Mar 18 10:10 input.tar.gz
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 1769748 Mar 18 10:20 HITS.pool.root.1
[2021-03-18 10:21:18] -rw-------. 1 boinc boinc 2140160 Mar 18 10:21 result.tar.gz
10:21:20 (8569): run_atlas exited; CPU time 496.453814
10:21:20 (8569): called boinc_finish(0)
</stderr_txt>
]]>
©2024 CERN