21) Message boards : CMS Application : New version v48.30 (Message 5811)
Posted 8 Feb 2019 by m
Post:
Has anyone tried one of these CMS tasks lately?.

Not deliberately but they crept in by accident.

https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2751387
22) Message boards : Theory Application : New Native App - Linux Only (Message 5809)
Posted 8 Feb 2019 by m
Post:
From this:-
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2752133
came this:-
cranky-0.0.13 INFO: Running Container 'runc'.
nsenter: failed to unshare user namespace: Invalid argument
container_linux.go:336: starting container process caused "process_linux.go:279: running exec setns process for init caused \"exit status 39\""
cranky-0.0.13 ERROR: Container 'runc' failed.

For those more knowledgeable than I, there may be some ideas here:-
https://coderwall.com/p/s_ydlq/using-user-namespaces-on-docker
I've followed the instructions to enable namespaces on the kernel, but now there's no work to try it out... and it's Friday.
23) Message boards : Theory Application : New Native App - Linux Only (Message 5808)
Posted 8 Feb 2019 by m
Post:
I've no idea what happened here.
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2752133
This is Centos7 which (apparently) must have Python 2.7. I've installed Python 3.6 (as well) which fixed this:-
14:34:52 (4883): wrapper: running ../../projects/lhcathomedev.cern.ch_lhcathome-dev/cranky-0.0.13 ()
/usr/bin/env: python3: No such file or directory

but it still doesn't work...
24) Message boards : CMS Application : New version v48.30 (Message 5774)
Posted 20 Jan 2019 by m
Post:
These:-
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2749184
and
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2749215
look as though they might have worked if there had been any...

Didn't see any errors during the (long and complicated) setup
25) Message boards : CMS Application : New version v48.30 (Message 5756)
Posted 21 Dec 2018 by m
Post:
There's been one of these running here for an hour or so but it isn't actually doing any work. The set up takes quite a while. It seems to install singularity (among much else) in the VM. A quick look at the proxy log shows about 255M download for two tasks, one started the other waiting.. The setup appears to complete successfully but, although "cmsrun" appears at intervals in the "top" console and takes ca 50% CPU, no "running job", "wrapper" nor "error" outputs appear and it hasn't timed out.
The host is shut down at the moment but will start itself up again later and run until 0700 GMT. I'll leave it to see what, if anything, happens.
26) Message boards : News : Dev server updated (Message 5664)
Posted 17 Nov 2018 by m
Post:
Feedback on the new opt-in and auto-attach features is welcome.

After a read through, these points spring to mind. I'm sorry if they're somewhat ill-organised but if I
don't do it now, I never will...
Opt in

" In order to by fully-compliant with GDPR, only allow users to create accounts through the Web site. "

This seems to set the scene for frustrations and difficulties due to the large number of different browsers, versions,
add ons, functions turned off etc. We get to the state of needing a later browser version which needs a later OS which
needs new hardware... I know this can happen with other organisations but don't want LHC@home to be one of them.
********
There needs to be a proper way of drawing user's attention to any changes to the Ts and Cs..
*******
Perhaps a reminder similar to that shown on CERN's login page would also be a good idea. Something like this, maybe:-

Remember: you have agreed to follow <this project's> Code of Conduct, and you have read our Privacy Notice.

Not sure on which page this should appear... somewhere on the message boards index pages.

There should also be links to both the Code and the Notice on each page. They're not easy to find at the moment.
The page footer might well not be visible.

Auto attach.

Will prospective users be able to use different versions of software? For example client and manager from different versions,
or no manager at all, or compile their own, or use third party versions or...

Also bear in mind that we have all been fiddling around with BOINC and these web sites for some time. New users will find
problems that we don't. There needs to be an easy to find "How to get going" or "If you're having trouble" page which
allows people to ask for (and get) help before they sign up or agree to anything.

Data

"0 = Users are not given the option to delete their account (Default value)"

If users are not allowed to delete (or amend) their own data, surely there must be a clear explanation of how they can get
the project operators to do it, and make sure it's done in a timely manner.


" 1 = User data is anonymized. This means that user records and host records are left in the database but personal
information is replaced with nonsense data. Other user related records not required for processing are deleted.
2 = All user data is deleted. This means that all user related records are deleted from the database.
3 = Project defined implementation. Projects can implement a function in project.inc: project_delete_account
($user) and this function will then be used when a user delete's their account.
"

If one of these options is used, the user needs to be clearly told what will happen.
27) Message boards : Theory Application : New version v3.10 (Message 5534)
Posted 22 Sep 2018 by m
Post:
... how can you verify that the VM really uses OpenHTC.io for CernVM? ...

... you may check your network traffic.

From the local proxy log:-
a host running dev theory
22/09/2018 01:15:27      2 192.168.100.122 TCP_MEM_HIT/200 1080 GET http://s1ral-cvmfs.openhtc.io/cvmfs/grid.cern.ch/.cvmfspublished - NONE/- application/x-cvmfs
22/09/2018 01:15:45      2 192.168.100.122 TCP_MEM_HIT/200 1087 GET http://s1cern-cvmfs.openhtc.io/cvmfs/cvmfs-config.cern.ch/.cvmfspublished - NONE/- application/x-cvmfs
22/09/2018 01:16:28     70 192.168.100.122 TCP_MISS/200 1219 GET http://s1cern-cvmfs.openhtc.io/cvmfs/cvmfs-config.cern.ch/.cvmfspublished - DIRECT/104.18.55.33 application/x-cvmfs
22/09/2018 01:16:32     26 192.168.100.122 TCP_MISS/200 1214 GET http://s1cern-cvmfs.openhtc.io/cvmfs/grid.cern.ch/.cvmfspublished - DIRECT/104.18.55.33 application/x-cvmfs

and from one running production theory
22/09/2018 01:16:38     60 192.168.100.123 TCP_MISS/200 618 GET http://s1cern-cvmfs.openhtc.io/cvmfs/sft.cern.ch/api/v1.0/geo/192.168.100.137/s1bnl-cvmfs.openhtc.io,s1cern-cvmfs.openhtc.io,s1fnal-cvmfs.openhtc.io,s1ral-cvmfs.openhtc.io - DIRECT/104.18.55.33 text/plain
22/09/2018 01:16:53     18 192.168.100.123 TCP_MEM_HIT/200 98178 GET http://s1ral-cvmfs.openhtc.io/cvmfs/sft.cern.ch/data/d1/9d8bd664ae174d4eab80c023f7efb48ef9f495C - NONE/- text/plain
22/09/2018 01:16:53     12 192.168.100.123 TCP_MEM_HIT/200 61402 GET http://s1ral-cvmfs.openhtc.io/cvmfs/sft.cern.ch/data/71/948c137be8baa03f6a6fb4b3bf8042ab8d161bC - NONE/- text/plain
22/09/2018 01:16:55    389 192.168.100.123 TCP_HIT/200 1754447 GET http://s1ral-cvmfs.openhtc.io/cvmfs/sft.cern.ch/data/69/9c932053df383da6b67605d31b84c417915baa - NONE/- text/plain
22/09/2018 01:16:55      3 192.168.100.123 TCP_MEM_HIT/200 6521 GET http://s1ral-cvmfs.openhtc.io/cvmfs/sft.cern.ch/data/75/a4e5370a790c6e4668a765f522e3d1387ff35a - NONE/- text/plain

so it really does work, unfortunately. From what I see here it has removed most of the advantage of the local proxy.
28) Message boards : Sixtrack Application : The Sixtrack Application (Message 5353)
Posted 10 Feb 2018 by m
Post:
Task appeared to complete OK but upload failed.

lhcathome-dev 10/02/2018 2:06:06 am
Temporarily failed upload of condor#vccondorce02.cern.ch#sixtrack#1518169336#22240.0_2_r650826464_0: transient HTTP error
29) Message boards : Number crunching : Server problem again (Message 5348)
Posted 31 Jan 2018 by m
Post:
I see this, too:-

From CMS

and from LHCb
30) Message boards : News : New native Linux ATLAS application (Message 4777)
Posted 4 Mar 2017 by m
Post:
Just run a couple of jobs. Both failed. The second still downloaded >200MB.

Looking at one of the jobs, the stderr that's included with the task result isn't all of the stderr.txt from the slot. In the bit that's missing, there is:-

<title>400 Bad Request</title>
</head><body>
<h1>Bad Request</h1>
<p>Your browser sent a request that this server could not understand.<br />
Reason: You're speaking plain HTTP to an SSL-enabled server port.<br />
Instead use the HTTPS scheme to access this URL, please.<br />
<blockquote>Hint: <a href=\"https://pandaserver.cern.ch:25443/\"><b>https://pandaserver.cern.ch:25443/</b></a></blockquote></p>
</body></html>

Are we still having ssl problems? Have I put the symlinks in the right place? Different distributions put libs in different places.
31) Message boards : News : New native Linux ATLAS application (Message 4775)
Posted 3 Mar 2017 by m
Post:
David,

For this you need to apply the workaround mentioned earlier in this thread

Where does your script expect to find these libs? Mine are in
/lib/x86_64-linux-gnu which is where I put the symlinks. There are (32bit?) ones in /lib/i386-linux-gnu and there apparently are apps that expect them to be in /usr/lib or maybe /usr/lib/x86_64-linux-gnu .
32) Message boards : News : New native Linux ATLAS application (Message 4764)
Posted 3 Mar 2017 by m
Post:
Jobs fail after a few minutes as described below.
Practically no CPU use apart from short bursts by python at intervals.

At the start of the pilotlog.txt file is this:-

2017-03-03 00:06:36|11945|SiteInformat| !!WARNING!!2999!! $X509_CERT_DIR is not set and default location /etc/grid-security/certificates does not exist

which looks suspicious. I also grabbed a copy of runtime_log_err.txt as the filename looked promising, but it appears to be the same as pilotlog.txt.
33) Message boards : News : New native Linux ATLAS application (Message 4754)
Posted 2 Mar 2017 by m
Post:
I picked up this task last night
by accident. Not using he BOINC manager. It only ran for a
few seconds and, on ending, took down the default VNC server
(Vino) which I was using to control the host. Had to restart
everything to regain control so it looks as though it's
killing more than it should. Don't know why it failed to
run - no HITS file. This is Ubuntu 12.04.
34) Message boards : News : New native Linux ATLAS application (Message 4737)
Posted 27 Feb 2017 by m
Post:
The problem here is incompatible SSL libraries.
The python that we are using from CVMFS expects to use the SSL library libssl.so.10 but I guess you have a different version on openSUSE. This is one of the disadvantages of CVMFS - packaging all the required software in there means it restricts which operating systems it can run on.

There must be quite a few versions in use in the different disributions. The system here - which is supported by CVMFS, they say - has libssl.so.0.9.8 and 1.0.0 (unless they're hidden away somewhere) - a long way from 10. Does this mean it's back to VBox? or can we simply add the required version? If so, where should it be placed so that your Python can find it.

also...
bash is normally the default shell for users. But many scripts use /bin/sh which on Ubuntu is redirected to dash shell. You can see what your system uses with "ls -l /bin/sh".

You're right

~$ ls -l /bin/sh
lrwxrwxrwx 1 root root 4 Jun 21 2015 /bin/sh -> dash

~$
35) Message boards : News : New native Linux ATLAS application (Message 4735)
Posted 27 Feb 2017 by m
Post:

Sometimes with success (why?), sometimes with validation error.
.

I got the idea that validation was based on the run time,
i.e. if the task didn't crash within (say) 12 mins, it was good to go, as it were
with further validation checks done out of sight of BOINC.
Doesn't sound right to me but the idea came from somewhere... and it does seem to behave rather like that.
36) Message boards : News : New native Linux ATLAS application (Message 4731)
Posted 27 Feb 2017 by m
Post:
I found that the WU were not running properly on Ubuntu machines due to one of our scripts assuming bash shell (from what I see dash is the default on Ubuntu). I've fixed the script and will send some new WU now.

Task ran for ca 12min and failed to validate. This is Ubuntu 12.04
and bash is installed... never heard of dash.

$ bash --version
GNU bash, version 4.2.25(1)-release (x86_64-pc-linux-gnu)
Copyright (C) 2011 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>


I'll try another, but after that I must wait until tonight.

EDIT. Failed, although it validated; I think I still got the old version but that will have to do for now.
37) Message boards : News : New native Linux ATLAS application (Message 4721)
Posted 25 Feb 2017 by m
Post:
A couple of hours later it was OK, so whilst it may have been a timeout
(didn't think about that, thanks for the reminder).
I suspect a couple of other hosts running Atlas with the 200M downloads
may have something to do with it (they wait for the free data time, too).
That's why I no longer run Atlas regularly.

~$ sudo cvmfs_config probe
Probing /cvmfs/atlas.cern.ch... OK
Probing /cvmfs/atlas-condb.cern.ch... OK
Probing /cvmfs/grid.cern.ch... OK
~$ sudo cvmfs_config chksetup
OK
~$


Hope there's work tonight...
38) Message boards : News : New native Linux ATLAS application (Message 4718)
Posted 25 Feb 2017 by m
Post:
After waiting for my unmetered data period to start, installed cvmfs only to find that there isn't any work... so can't try it out... serves me right.
However, running chksetup produces this:-

$ cvmfs_config chksetup
Warning: failed to access http://cvmfs-stratum-one.cern.ch/cvmfs/atlas.cern.ch/.cvmfspublished through proxy DIRECT
Warning: failed to use Geo-API with cvmfs-stratum-one.cern.ch


Is this a problem?

Probe works OK:-

$ cvmfs_config probe
Probing /cvmfs/atlas.cern.ch... OK
Probing /cvmfs/atlas-condb.cern.ch... OK
Probing /cvmfs/grid.cern.ch... OK


Running $ sudo cvmfs_talk host.probe
gives:-
atlas.cern.ch:
unknown command
atlas-condb.cern.ch:
unknown command
grid.cern.ch:
unknown command


and

$ sudo cvmfs_talk host.probe.geo
gives:-
atlas.cern.ch:
Seems like CernVM-FS is not running in /var/lib/cvmfs/shared (not found: /var/lib/cvmfs/shared/cvmfs_io.atlas.cern.ch)

atlas-condb.cern.ch:
Seems like CernVM-FS is not running in /var/lib/cvmfs/shared (not found: /var/lib/cvmfs/shared/cvmfs_io.atlas-condb.cern.ch)

grid.cern.ch:
Seems like CernVM-FS is not running in /var/lib/cvmfs/shared (not found: /var/lib/cvmfs/shared/cvmfs_io.grid.cern.ch)


The three files are there but 0 bytes.
39) Message boards : Theory Application : A new 32bit image is available (Message 4589)
Posted 29 Dec 2016 by m
Post:
I've just tried (again) to get it to run on a 32 bit host (WXP) VB5.0.12
Still fails like this:-

2016-12-29 00:37:35 (3276): Guest Log: [INFO] Requesting an X509 credential from vLHC@home
2016-12-29 00:37:45 (3276): Guest Log: [INFO] Requesting an X509 credential from vLHC@home-dev
2016-12-29 00:38:15 (3276): Guest Log: [ERROR] Could not get an x509 credential
2016-12-29 00:38:15 (3276): Guest Log: [ERROR] The x509 proxy creation failed.
2016-12-29 00:38:15 (3276): Guest Log: [INFO] Shutting Down.
2016-12-29 00:38:15 (3276): VM Completion File Detected.
2016-12-29 00:38:15 (3276): VM Completion Message: The x509 proxy creation failed.


.
40) Message boards : Number crunching : Disable Blank Console Screensaver (Message 4461)
Posted 6 Dec 2016 by m
Post:
+1, and please do the same to LHC@home, too.


Previous 20 · Next 20


©2024 CERN