1) Message boards : ATLAS Application : Testing CentOS 7 vbox image (Message 6742)
Posted 4 Oct 2019 by rbpeake
Post:
It’s great!
In my case, the ALT-F2 takes a couple of tries to make it work.
2) Message boards : ATLAS Application : Testing CentOS 7 vbox image (Message 6629)
Posted 13 Sep 2019 by rbpeake
Post:
I keep getting this failure message:

9/13/2019 10:51:20 AM | lhcathome-dev | Task 8ukMDmqecSvnShfckohDCDFpABFKDmABFKDm9l7ZDmABFKDmVXE4Wo_1 postponed for 86400 seconds: VM Hypervisor failed to enter an online state in a timely fashion.
3) Message boards : ATLAS Application : Tasks testing new pilot version (Message 6392)
Posted 29 May 2019 by rbpeake
Post:
...We are putting some tasks here which use a brand new version of the ATLAS "pilot": this is the tool which controls the execution of the task from start to finish....

What practical effect will this have? Will it increase processing efficiency for the user?
Thanks!
4) Message boards : CMS Application : New version 49.00 (Message 6258)
Posted 26 Mar 2019 by rbpeake
Post:
Kindly ignore.
5) Message boards : Number crunching : VBox issues (Message 3547)
Posted 8 Jun 2016 by rbpeake
Post:
Rom just released vboxwrapper 26186 at the weekend to support VirtualBox 5.1 on Windows. The Theory app was updated so that the release on Monday would go with that version but the CMS app wasn't updated. It has been done now so please try again.


Does this apply as well to the production work at vLHC?
6) Message boards : CMS Application : Error rate going up (Message 3325)
Posted 12 May 2016 by rbpeake
Post:
Just curious if this app will require this much hand-holding in the future, or will it be much more resilient and reliable when it gets out of beta? Just seems it has been very sensitive to running off the rails for a long time.
7) Message boards : Theory Application : Suspend/Resume Theory (Message 3288)
Posted 6 May 2016 by rbpeake
Post:
For CMS, there are no new jobs following an exit and resume.
8) Message boards : Theory Application : Suspend/Resume Theory (Message 3268)
Posted 5 May 2016 by rbpeake
Post:
Disconnecting the internet for 10min causes:

After resume the job finishes.
The running.log and the last finished x.log are the same.

No new Job is started.(for at least 1 hour)

No "Job finished" entry in stderr.txt

05/05/16 15:01:38 condor_write(): Socket closed when trying to write 365 bytes to <188.184.187.167:9618>, fd is 11
05/05/16 15:01:38 Buf::write(): condor_write() failed
05/05/16 15:01:38 Failed to send job exit status to shadow
05/05/16 15:01:38 JobExit() failed, waiting for job lease to expire or for a reconnect attempt

No new job started for CMS as well.
9) Message boards : Theory Application : New version with app_config.xml (Message 3083)
Posted 28 Apr 2016 by rbpeake
Post:

Recap;

...-Default app_config downloads with Theory if none already present (however, it might not take effect until Host or Boinc are restarted, or manual "Read config files"?)


Yes, I restarted BOINC, and then received this notification from BOINC Manager:

    vLHCathome-dev: Notice from BOINC
    Your app_config.xml file refers to an unknown application 'ALICE'. Known applications: 'LHCb', 'CMS', 'Theory', 'ATLAS'
    4/28/2016 4:12:17 PM

10) Message boards : Theory Application : New version with app_config.xml (Message 3072)
Posted 28 Apr 2016 by rbpeake
Post:
Sure. There should be a prominent entry in the FAQs.

http://lhcathome.web.cern.ch/faq

In terms of priority, it is more important that we don't lock up new volunteer's machines by swamping them with tasks than trying to maximize then number of tasks we can run by default.

We hope that those volunteers with powerful machines and who would like to donate more can follow the FAQ.

If any of you can think of a better way to do this please let us know. Later tonight I will create a new version of the CMS app that also has this file to see if it breaks anything.


Sounds like a good plan. Agree to not swamping new volunteer's machines! People will probably also ask in the forum and then can be guided to the FAQ.

Thanks!
11) Message boards : Theory Application : New version with app_config.xml (Message 3070)
Posted 28 Apr 2016 by rbpeake
Post:
As this gets sorted out it should be made easily clear on a post for the final version the name of the file to be modified, its location in the BOINC file directory, and directions for what part of the file to change. Users who may not be very technically adept may nonetheless know how many units they can run on their machine based on the amount of RAM they have.

Thanks!
12) Message boards : LHCb Application : v0.05 task doing something (Message 3038)
Posted 26 Apr 2016 by rbpeake
Post:
Is there a way to get this app working more efficiently? It uses a lot of non-CPU time, which seems inefficient.

Thanks!
13) Message boards : LHCb Application : No jobs? (Message 3037)
Posted 26 Apr 2016 by rbpeake
Post:
Wrong location, sorry.
14) Message boards : CMS Application : No Tasks (Message 2777)
Posted 15 Apr 2016 by rbpeake
Post:
No Boinc-cms tasks on the server status page availabe.

Will the server default to the other applications that have work available, to maximize donor efficiency?

Thanks.
15) Message boards : LHCb Application : Debugging LHCb failed jobs (Message 2759)
Posted 14 Apr 2016 by rbpeake
Post:
Hi Cinzia,

... I am getting a Permission denied error when it is trying to 'touch' the shutdown file in the shared area so the jobs keep running longer than they should.


I am getting the same message.
16) Message boards : News : New App Version For Linux and Windows (Message 1947)
Posted 10 Feb 2016 by rbpeake
Post:
Using "Show VM Console" starts OK, but after some seconds and switching between screens leads to fault and then crash of the app and Computation Error.
17) Message boards : News : Graceful Shutdown Now Implemented (Message 1900)
Posted 6 Feb 2016 by rbpeake
Post:
How are you managing to get new jobs? I recently get the message that I need a huge amount of memory.
18) Message boards : Number crunching : Current issues (Message 1891)
Posted 5 Feb 2016 by rbpeake
Post:
There is CMS work now at vLHC, and it is working properly.
19) Message boards : News : Graceful Shutdown Now Implemented (Message 1820)
Posted 2 Feb 2016 by rbpeake
Post:
F5 works again with the second run. Definitely a glitch! 😉
20) Message boards : News : Graceful Shutdown Now Implemented (Message 1815)
Posted 2 Feb 2016 by rbpeake
Post:
Can you look at the logs in boincmgr with the (misleading) "Show graphics" buton?

Here is an excerpt from the boot log:
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Restoring chunk tables... done
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Restoring inode generation... done
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Restoring open files counter... done
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing saved glue buffer
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing chunk tables
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing saved inode generation info
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing open files counter
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Activating Fuse module


Nothing in the cron-stderr log.

From the cron-stdout log:

    type : RFC 3820 compliant impersonation proxy
    strength : 1024 bits
    path : /tmp/x509up_u500
    timeleft : 129:59:57 (5.4 days)
    09:23:06 -0500 2016-02-02 [INFO] Downloading glidein
    09:23:11 -0500 2016-02-02 [INFO] Running glidein (check logs)



Next 20


©2020 CERN