Message boards : Theory Application : New version 5.00
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1021
Credit: 274,753
RAC: 0
Message 6733 - Posted: 1 Oct 2019, 13:45:48 UTC - in response to Message 6732.  

I have just defined the variable. Let's see how far that gets us.
ID: 6733 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1010
Credit: 591,548
RAC: 2
Message 6734 - Posted: 1 Oct 2019, 14:09:37 UTC - in response to Message 6729.  
Last modified: 1 Oct 2019, 14:14:18 UTC

A new version (v5.15) is available that hopefully fixes all outstanding issues.
This job info (example from a task, where we still had several jobs in 1 task until 12 hours elapsed time was past):
2019-07-02 16:33:11 (372): Guest Log: [INFO] New Job Starting in slot1
2019-07-02 16:33:11 (372): Guest Log: [INFO] Condor JobID:  502264.4 in slot1
2019-07-02 16:33:16 (372): Guest Log: [INFO] MCPlots JobID: 50563426 in slot1
2019-07-02 16:33:22 (372): Guest Log: [INFO] ===> [runRivet] Tue Jul  2 16:33:08 CEST 2019 [boinc ee zhad 206 - - pythia6 6.427 358 100000 76]
2019-07-02 16:44:39 (372): Guest Log: [INFO] Job finished in slot1 with 0.
did not make it in the code so far (or it must be in v5.16)
The line ===> [runRivet] etc etc would suffice, when added to stderr.txt directly after runc has started.
ID: 6734 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Avatar

Send message
Joined: 28 Jul 16
Posts: 263
Credit: 232,222
RAC: 0
Message 6735 - Posted: 1 Oct 2019, 15:23:30 UTC - in response to Message 6733.  

I have just defined the variable. Let's see how far that gets us.

The VM now writes the local wpad file and the file contains the local proxy.
Nonetheless the settings are ignored, hence I checked the VM's /persistent/etc/cvmfs/ where the basic settings should be.

site.conf is present (should be deleted):
CVMFS_PAC_URLS="http://grid-wpad/wpad.dat;http://wpad/wpad.dat;http://wlcg-wpad.cern.ch/wpad.dat;http://wlcg-wpad.fnal.gov/wpad.dat"
CVMFS_HTTP_PROXY="auto;DIRECT"
CVMFS_PAC_URLS="http://grid-wpad/wpad.dat;http://wpad/wpad.dat;http://wlcg-wpad.fnal.gov/wpad.dat;http://wlcg-wpad.cern.ch/wpad.dat"
CVMFS_HTTP_PROXY="auto;DIRECT"
CVMFS_PAC_URLS="http://grid-wpad/wpad.dat;http://wpad/wpad.dat;http://wlcg-wpad.fnal.gov/wpad.dat;http://wlcg-wpad.cern.ch/wpad.dat"
CVMFS_HTTP_PROXY="auto;DIRECT"

This contains a proxy configuration that will only work for client IPs inside WLCG.
IPs from outside WLCG get an HTTP 400 instead of a wpad file.


default.local is missing.
Should be created with at least the following lines:
CVMFS_HTTP_PROXY="auto;DIRECT"
CVMFS_PAC_URLS="http://localhost/wpad.dat;http://lhchomeproxy.cern.ch/wpad.dat;http://lhchomeproxy.fnal.gov/wpad.dat"
CVMFS_SEND_INFO_HEADER=yes



All of that corresponds to the entries in my proxy log.
ID: 6735 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1021
Credit: 274,753
RAC: 0
Message 6736 - Posted: 2 Oct 2019, 7:15:11 UTC - in response to Message 6734.  

A new version (v5.15) is available that hopefully fixes all outstanding issues.
This job info (example from a task, where we still had several jobs in 1 task until 12 hours elapsed time was past):
2019-07-02 16:33:11 (372): Guest Log: [INFO] New Job Starting in slot1
2019-07-02 16:33:11 (372): Guest Log: [INFO] Condor JobID:  502264.4 in slot1
2019-07-02 16:33:16 (372): Guest Log: [INFO] MCPlots JobID: 50563426 in slot1
2019-07-02 16:33:22 (372): Guest Log: [INFO] ===> [runRivet] Tue Jul  2 16:33:08 CEST 2019 [boinc ee zhad 206 - - pythia6 6.427 358 100000 76]
2019-07-02 16:44:39 (372): Guest Log: [INFO] Job finished in slot1 with 0.
did not make it in the code so far (or it must be in v5.16)
The line ===> [runRivet] etc etc would suffice, when added to stderr.txt directly after runc has started.

Fixed in v5.18.
ID: 6736 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1021
Credit: 274,753
RAC: 0
Message 6737 - Posted: 2 Oct 2019, 7:15:35 UTC - in response to Message 6735.  

The changes are included in v5.18.
ID: 6737 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1010
Credit: 591,548
RAC: 2
Message 6738 - Posted: 2 Oct 2019, 7:24:06 UTC - in response to Message 6736.  

The line ===> [runRivet] etc etc would suffice, when added to stderr.txt directly after runc has started.
Fixed in v5.18.
It does. Thanks!
2019-10-02 09:15:36 (2032): Guest Log: 09:15:33 CEST +02:00 2019-10-02: cranky: [INFO] ===> [runRivet] Wed Oct  2 07:15:31 UTC 2019 [boinc pp z1j 7000 250 - herwig++ 2.5.2 default 100000 132]
ID: 6738 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Avatar

Send message
Joined: 28 Jul 16
Posts: 263
Credit: 232,222
RAC: 0
Message 6739 - Posted: 2 Oct 2019, 8:01:33 UTC - in response to Message 6737.  

The changes are included in v5.18.

CVMFS and proxy configuration looks fine in v5.18.
ID: 6739 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 8
Credit: 13,798
RAC: 0
Message 6786 - Posted: 25 Oct 2019, 7:03:43 UTC

Some errors after 25 minutes
2832554
2832551
2832550
etc
ID: 6786 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Luigi R.

Send message
Joined: 29 Sep 15
Posts: 5
Credit: 35,723
RAC: 0
Message 6787 - Posted: 25 Oct 2019, 9:22:49 UTC

ID: 6787 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1021
Credit: 274,753
RAC: 0
Message 6788 - Posted: 28 Oct 2019, 14:48:46 UTC - in response to Message 6737.  

I am planning to put v5.18 on the production server tomorrow. Any objections?
ID: 6788 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1010
Credit: 591,548
RAC: 2
Message 6789 - Posted: 28 Oct 2019, 18:02:21 UTC - in response to Message 6788.  

No objections.

I suppose the tasks will always run single core except when the user have setup in app_config.xml to run multi-core.
No influence from the Max # of CPUs set in the preferences.
ID: 6789 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 8 Apr 15
Posts: 536
Credit: 7,500,539
RAC: 4,901
Message 6790 - Posted: 29 Oct 2019, 0:29:29 UTC

As long as they are working and you give them the basics (such as cores and ram and maybe running time)
Then many more will be run over there.
ID: 6790 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1021
Credit: 274,753
RAC: 0
Message 6791 - Posted: 29 Oct 2019, 13:00:13 UTC - in response to Message 6788.  

I am planning to put v5.18 on the production server tomorrow. Any objections?

I will have to do this later as the 32bit app also needs to be updated.
ID: 6791 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 8
Credit: 13,798
RAC: 0
Message 6792 - Posted: 29 Oct 2019, 14:59:05 UTC - in response to Message 6788.  

I am planning to put v5.18 on the production server tomorrow. Any objections?

No.
What's new? Bugfix??
ID: 6792 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
m
Volunteer tester

Send message
Joined: 20 Mar 15
Posts: 242
Credit: 855,269
RAC: 19
Message 6806 - Posted: 9 Nov 2019, 2:17:15 UTC - in response to Message 6788.  

I am planning to put v5.18 on the production server tomorrow. Any objections?

A quick test produced:-
v5.18 failing on Linux (with CVMFS installed) and on Windows.

This is from Linux:-

2019-11-08 00:45:16 (3522): Guest Log: 00:45:14 GMT +00:00 2019-11-08: cranky: [INFO] Checking CVMFS.
2019-11-08 00:45:34 (3522): Guest Log: 00:45:34 GMT +00:00 2019-11-08: cranky: [WARNING] 'cvmfs_config probe sft.cern.ch' failed.
2019-11-08 00:45:34 (3522): Guest Log: 00:45:34 GMT +00:00 2019-11-08: cranky: [INFO] Creating local CVMFS repository.
2019-11-08 00:45:34 (3522): Guest Log: sed: can't read cvmfs-mini-0.1-amd64.tgz: No such file or directory
2019-11-08 00:45:34 (3522): Guest Log: tar: option requires an argument -- 'f'
2019-11-08 00:45:34 (3522): Guest Log: Try `tar --help' or `tar --usage' for more information.
2019-11-08 00:45:34 (3522): Guest Log: /home/boinc/cranky: line 62: ./cvmfs-mini-0.1-amd64/mount_cvmfs.sh: No such file or directory
2019-11-08 00:45:34 (3522): Guest Log: 00:45:34 GMT +00:00 2019-11-08: cranky: [WARNING] 'cvmfs_config probe sft.cern.ch' failed.

..and from Windows

2019-11-09 01:09:16 (288): Guest Log: 01:09:13 GMT +00:00 2019-11-09: cranky: [INFO] Checking CVMFS.
2019-11-09 01:09:38 (288): Guest Log: 01:09:34 GMT +00:00 2019-11-09: cranky: [WARNING] 'cvmfs_config probe sft.cern.ch' failed.
2019-11-09 01:09:38 (288): Guest Log: 01:09:34 GMT +00:00 2019-11-09: cranky: [INFO] Creating local CVMFS repository.
2019-11-09 01:09:38 (288): Guest Log: sed: can't read cvmfs-mini-0.1-amd64.tgz: No such file or directory
2019-11-09 01:09:38 (288): Guest Log: tar: option requires an argument -- 'f'
2019-11-09 01:09:38 (288): Guest Log: Try `tar --help' or `tar --usage' for more information.
2019-11-09 01:09:38 (288): Guest Log: /home/boinc/cranky: line 62: ./cvmfs-mini-0.1-amd64/mount_cvmfs.sh: No such file or directory
2019-11-09 01:09:38 (288): Guest Log: 01:09:34 GMT +00:00 2019-11-09: cranky: [WARNING] 'cvmfs_config probe sft.cern.ch' failed.

The startup messages are as CPs post here

This is via the local proxy
and this isn't so it doesn't seem to be a cache problem.
ID: 6806 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 8
Credit: 13,798
RAC: 0
Message 6814 - Posted: 12 Nov 2019, 14:53:15 UTC

My wus are very variables (from 700 to 36000 seconds), but now i'm crunching a wu at 11% after 26hs.
Is it normal?
ID: 6814 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 8
Credit: 13,798
RAC: 0
Message 6815 - Posted: 12 Nov 2019, 17:51:32 UTC - in response to Message 6814.  

My wus are very variables (from 700 to 36000 seconds), but now i'm crunching a wu at 11% after 26hs.
Is it normal?


Finished!!!
2836069
ID: 6815 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 8
Credit: 13,798
RAC: 0
Message 6862 - Posted: 29 Nov 2019, 8:06:23 UTC

A lot of errors
2840748
2840750
2840779
etc
ID: 6862 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1010
Credit: 591,548
RAC: 2
Message 6863 - Posted: 29 Nov 2019, 9:47:06 UTC - in response to Message 6862.  

A lot of errors
2840748
2840750
2840779
etc
All the errors are caused by: VM Heartbeat file specified, but missing file system status. (errno = '2')
That means the VirtualBox COM (VBoxSVC.exe) can't communicate fast enough with the wrapper, mostly caused by a too busy system.
ID: 6863 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boboviz

Send message
Joined: 24 Oct 19
Posts: 8
Credit: 13,798
RAC: 0
Message 6864 - Posted: 29 Nov 2019, 10:35:25 UTC - in response to Message 6863.  

That means the VirtualBox COM (VBoxSVC.exe) can't communicate fast enough with the wrapper, mostly caused by a too busy system.

That's strange. I crunch with my pc during the night, when i'm not working....
ID: 6864 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : Theory Application : New version 5.00


©2020 CERN