41) Message boards : Number crunching : exceeded disk limit (Message 364)
Posted 11 May 2015 by Ben Segal
Post:
Any ideas why this bug affected CMS and not ATLAS? Or did it?

Ben
42) Message boards : Number crunching : Job queue empty!! (Message 284)
Posted 23 Apr 2015 by Ben Segal
Post:
ERROR:root:No message received! Nothing to do! . . .

I'm not sure I understand... Where is this error message coming from and what job queue are you saying is empty? I see plenty of tasks available on the server status page and my hosts haven't had any problem getting work, but maybe that's not what you are referring to.

CP is referring to the CMS job queue which feeds jobs into the VM from CERN. The error message he sees is on the 5th VM console output (shown by the "Show VM Console" button, followed by CTRL-ALT-F5 to select the 5th screen). Right now there is a problem at the CMS end which the admins know about and will fix asap.

The BOINC tasks are what you see on the server status page and there's no shortage of these - each BOINC task runs 24 hours and just starts a new VM each time.

Patience...
43) Message boards : News : VBox Wrappers Updated to 26165 (Message 253)
Posted 14 Apr 2015 by Ben Segal
Post:
Hi Ivan, the new wrappers since quite a while do not use snapshots. So that should not be the problem.
44) Message boards : Number crunching : Project Name (Message 232)
Posted 7 Apr 2015 by Ben Segal
Post:
Would it be possible to keep all LHC related project under the one related name?
CMS-dev in this case does not suggest any relation to LHC. I suggest it would be more in line if you called it /LHC-dev or /LHC-test. Once the site meets the requirement and app needs than it could be changed to another LHC sub project.
As it is the project needs an invitation and is considered as a test project. As such starting a ne sub project from the result of the test would be normal as long as you announce the start of the new project name.

Hi Peter,

There are discussions going on at CERN right now on unifying the support for our BOINC projects as much as possible. We already have plans to add CMS-dev (as well as Beauty@home) to the vLHC@home project as sub-applications when they advance from beta to production level. This will be easy to do as the lower level architectures of both these projects are based on that of Test4Theory, the original vLHC project. (Not so for ATLAS at the moment, however).

Stay tuned for more news about all this, and thanks for your post.
45) Message boards : News : Vbox Wrapper Updates (Message 202)
Posted 27 Mar 2015 by Ben Segal
Post:
I'm running vboxwrapper v26159 anonymous now.
For MAC that version is also compiled/available.
Not for Linux, maybe not needed cause Linux probably don't suffer from the same error condition.

Thanks CP, we'll probably upgrade to that very soon. Laurence will decide.
46) Message boards : News : VBox Wrappers Updated to 26158 (Message 193)
Posted 26 Mar 2015 by Ben Segal
Post:
Even more good news: a more vicious test also succeeded. I exited the BOINC manager while the task was in Running state (as was the VM of course). The VM was saved correctly and upon relaunching BOINC both the task and the VM returned to Running state.

I like that!

So far no snapshots have appeared, by the way, after 20 minutes or so of running. Good riddance...
47) Message boards : News : VBox Wrappers Updated to 26158 (Message 192)
Posted 26 Mar 2015 by Ben Segal
Post:
New version now available. Added both flags (at Ben's request):

enable_vm_savestate_usage/
disable_automatic_checkpoints/

As I understand it, this means you won't get the periodic checkpointing of the VM.

[Edit] Had to remove the angle brackets as otherwise the tags disappeared in my browser![/Edit]

Bravo Ivan, bravo Rom !!

It works well on Mac. I will leave it running and bash it some moreā€¦.

Ben
48) Message boards : News : VBox Wrappers Updated to 26158 (Message 187)
Posted 26 Mar 2015 by Ben Segal
Post:
Hi Ivan (and Rom),

Just tested the new feature on my Mac but it didn't work. The boot of CMS code worked fine. After a BOINC suspend, the VM went into Paused state (and could be resumed OK), but after an exit of the BOINC manager the VM was left in PowerOff state, (not "Saved" state) and hence rebooted when BOINC was restarted.

Ben


Can you abort the task so I can look at the stderr text?

----- Rom


OK, done!
49) Message boards : News : VBox Wrappers Updated to 26158 (Message 185)
Posted 26 Mar 2015 by Ben Segal
Post:
Hi Ivan (and Rom),

Just tested the new feature on my Mac but it didn't work. The boot of CMS code worked fine. After a BOINC suspend, the VM went into Paused state (and could be resumed OK), but after an exit of the BOINC manager the VM was left in PowerOff state, (not "Saved" state) and hence rebooted when BOINC was restarted.

Ben
50) Message boards : News : A Message To All Our Volunteers (Message 168)
Posted 24 Mar 2015 by Ben Segal
Post:
Ok so now I got 3 WUs successfully ending on my iMac (and another one currently crunching) and I can see all seems to go well so what you say is that I can stop CMS for the moment and let other projects go on instead ?

Yes, thanks a lot for helping us to debug on MacOSX.

We will announce when we need serious crunching power for new CMS jobs.
51) Message boards : News : VBox Wrappers Updated to 26157 (Message 160)
Posted 23 Mar 2015 by Ben Segal
Post:
Sorry, I misunderstood an email from Rom Walton, I have put back the enable_cern_dataformat tag in the job XML file.

Yes, and this is now working on Mac too, with the latest wrapper !!!
52) Message boards : News : MAC VBox Wapper Updated to 26156 (Message 148)
Posted 22 Mar 2015 by Ben Segal
Post:
I have just updated the VBox wrapper for MAC to version 26156. Please let us know if you have any problems.

It now boots the VM which is a good step forward. But now gives the error message:

ERROR: root: No floppy drive found! Could not get BOINC credential!

I will send you a screen shot with the traceback by email as I can't copy and paste from the RDP screen and no web files are yet available with the error logs.
53) Message boards : News : A Message To All Our Volunteers (Message 141)
Posted 21 Mar 2015 by Ben Segal
Post:
Hi

Thanks a lot for the information.

I'm a bit skeptical though : we are helping you starting an infrastructure for a new project, however when it works well you will move it to an existing infrastructure (vLHC) as "another project", so what's the point ? Am I missing something ?

Hi Jerome, what is happening is that CERN would like to consolidate technical support and user interaction for related BOINC projects as much as possible under one roof. It's not cast in concrete and we will see how such a combined platform works out over the next few months.
54) Message boards : News : VBox wrapper problems (Message 118)
Posted 20 Mar 2015 by Ben Segal
Post:
They are macs. Could that have something to do with the short run time?

Yes, there are known bugs with the Mac and Linux 26155 vboxwrapper which we are currently debugging. This causes rapid task termination and of course eats these tasks so please hold off for now, OK?
55) Message boards : News : New Release (v46) (Message 105)
Posted 19 Mar 2015 by Ben Segal
Post:
I immediately got "computation error" after switching to the latest version. I had previously removed the old VM by hand. This "computation error" persists even after a project reset!

Hi Ben,

What happens, if you upgrade to a newer VBOX-version with extension pack installed?
I ran one shortened task without any intervention of my site with success.

http://boincai05.cern.ch/CMS-dev/result.php?resultid=25660

Aha, all is explained. I am on a Mac. We have hit the known bug with the vboxwrapper upgrade on MacOSX and Linux, the one that forced us to revert to 26079 for these systems after the upgrade on T4T.

My omission not to advise Laurence and Hendrik about this. We will revert to 26079 on CMS too for MacOSX and Linux. The new wrapper works fine on Windows.

I have sent the stderr details to Rom and Charlie for their debugging.
56) Message boards : News : New Release (v46) (Message 103)
Posted 19 Mar 2015 by Ben Segal
Post:
I immediately got "computation error" after switching to the latest version. I had previously removed the old VM by hand. This "computation error" persists even after a project reset!
57) Message boards : Number crunching : Configuring Virtual Machines (Message 101)
Posted 18 Mar 2015 by Ben Segal
Post:
I am abit confused here regarding VM's. Do we as users of BOINC need to download and configure VM's or is it automatically done by your wrapper and we do not have to do anything..?

For example, I wish to run ATLAS as well as another LHC program using VM, do I have to do anything else bar selecting in BOINC to run those apps?

You do not need to configure VM's and in fact you must not! Just install VirtualBox and the Extension Pack and each project does the rest.
58) Message boards : Number crunching : A std::exception was thrown. (Message 90)
Posted 17 Mar 2015 by Ben Segal
Post:
This is probably because the CMS job queue is empty - be patient (:-))
59) Message boards : News : New Release (v45) (Message 89)
Posted 17 Mar 2015 by Ben Segal
Post:
Jerome, looks like the CMS job queue is empty. Nothing wrong with your setup.
60) Message boards : Number crunching : Missing links on account page (Message 74)
Posted 16 Mar 2015 by Ben Segal
Post:
Where are we in getting the aforementioned links fixed?

In addition the MCPLOTS Stats links to the wrong user. I am user ID 164 in this project but the link is to the user ID 164 of VLHCathome. Just for the record my VLHCathome ID is 21804.

Both these problems are now fixed. Thanks for the heads-up!


Previous 20 · Next 20


©2024 CERN