Message boards : News : VBox Wrappers Updated to 26158
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,880,796
RAC: 642
Message 184 - Posted: 26 Mar 2015, 11:56:25 UTC

Hi, thanks for waiting. Unfortunately I got side-tracked for a while due to CERN's changing the rules since the last time I renewed my contract...
I've changed the vboxwrapper to V.26158, so you can resume tasks if you want. It seems to run fine on Windows and Linux. Please run a test WU, esp. if you're on a Mac. (Maybe I can leverage this project to get a Mac myself!)
I've added the new feature to the vbox_job.xml file which will attempt to use 'savestate' instead of 'poweroff' when gracefully shutting down.
I haven't added another new feature , which will prevent snapshots from being created by vboxwrapper, because it wasn't immediately obvious to me how the volunteer overrides this, or sets his own checkpoint schedule -- no doubt this will be clarified to me in the very near future. I'm wary of just disabling checkpointing at this stage.
ID: 184 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Send message
Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 185 - Posted: 26 Mar 2015, 12:13:02 UTC - in response to Message 184.  
Last modified: 26 Mar 2015, 12:13:35 UTC

Hi Ivan (and Rom),

Just tested the new feature on my Mac but it didn't work. The boot of CMS code worked fine. After a BOINC suspend, the VM went into Paused state (and could be resumed OK), but after an exit of the BOINC manager the VM was left in PowerOff state, (not "Saved" state) and hence rebooted when BOINC was restarted.

Ben
ID: 185 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rom Walton (BOINC)

Send message
Joined: 20 Mar 15
Posts: 14
Credit: 5,132
RAC: 0
Message 186 - Posted: 26 Mar 2015, 12:17:07 UTC - in response to Message 185.  

Hi Ivan (and Rom),

Just tested the new feature on my Mac but it didn't work. The boot of CMS code worked fine. After a BOINC suspend, the VM went into Paused state (and could be resumed OK), but after an exit of the BOINC manager the VM was left in PowerOff state, (not "Saved" state) and hence rebooted when BOINC was restarted.

Ben


Can you abort the task so I can look at the stderr text?

----- Rom
ID: 186 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Send message
Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 187 - Posted: 26 Mar 2015, 12:24:48 UTC - in response to Message 186.  

Hi Ivan (and Rom),

Just tested the new feature on my Mac but it didn't work. The boot of CMS code worked fine. After a BOINC suspend, the VM went into Paused state (and could be resumed OK), but after an exit of the BOINC manager the VM was left in PowerOff state, (not "Saved" state) and hence rebooted when BOINC was restarted.

Ben


Can you abort the task so I can look at the stderr text?

----- Rom


OK, done!
ID: 187 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1182
Credit: 816,328
RAC: 288
Message 188 - Posted: 26 Mar 2015, 13:28:04 UTC - in response to Message 184.  

......
I've added the new feature <enable_vm_savestate_usage/> to the vbox_job.xml file which will attempt to use 'savestate' instead of 'poweroff' when gracefully shutting down.
....

This new tag is not in the new CMS_26_03_2015.xml I got with a new task.

Pse have a closer look, ivan.
ID: 188 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,880,796
RAC: 642
Message 189 - Posted: 26 Mar 2015, 14:17:02 UTC - in response to Message 188.  

I said I'd screw up...
I'd put it in the wrong file at first, edited it out, and then forgot to put it in the right file!
Watch this space.
ID: 189 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,880,796
RAC: 642
Message 190 - Posted: 26 Mar 2015, 14:44:23 UTC - in response to Message 189.  
Last modified: 26 Mar 2015, 14:50:33 UTC

New version now available. Added both flags (at Ben's request):

enable_vm_savestate_usage/
disable_automatic_checkpoints/

As I understand it, this means you won't get the periodic checkpointing of the VM.

[Edit] Had to remove the angle brackets as otherwise the tags disappeared in my browser![/Edit]
ID: 190 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1182
Credit: 816,328
RAC: 288
Message 191 - Posted: 26 Mar 2015, 15:17:15 UTC - in response to Message 190.  

..
As I understand it, this means you won't get the periodic checkpointing of the VM.
..

Rom choose for the name disable_automatic_checkpoints.
It's a bit confusion, because checkpoints are still made, but during a checkpoint no VM-snapshot is made.

In fact the name disable_automatic_snapshots would have been better.
ID: 191 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Send message
Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 192 - Posted: 26 Mar 2015, 15:19:21 UTC - in response to Message 190.  

New version now available. Added both flags (at Ben's request):

enable_vm_savestate_usage/
disable_automatic_checkpoints/

As I understand it, this means you won't get the periodic checkpointing of the VM.

[Edit] Had to remove the angle brackets as otherwise the tags disappeared in my browser![/Edit]

Bravo Ivan, bravo Rom !!

It works well on Mac. I will leave it running and bash it some moreā€¦.

Ben
ID: 192 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Send message
Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 193 - Posted: 26 Mar 2015, 15:40:11 UTC - in response to Message 192.  

Even more good news: a more vicious test also succeeded. I exited the BOINC manager while the task was in Running state (as was the VM of course). The VM was saved correctly and upon relaunching BOINC both the task and the VM returned to Running state.

I like that!

So far no snapshots have appeared, by the way, after 20 minutes or so of running. Good riddance...
ID: 193 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rom Walton (BOINC)

Send message
Joined: 20 Mar 15
Posts: 14
Credit: 5,132
RAC: 0
Message 194 - Posted: 26 Mar 2015, 15:52:55 UTC

I'm curious what happens during random power failures, reboots, and vboxsvc going nuts.

----- Rom
ID: 194 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1182
Credit: 816,328
RAC: 288
Message 195 - Posted: 26 Mar 2015, 18:05:35 UTC - in response to Message 194.  
Last modified: 26 Mar 2015, 18:09:38 UTC

I'm curious what happens during random power failures, reboots, and vboxsvc going nuts.

----- Rom

For your curiosity with running BOINC and running VM's:

After soft reset and sudden power off: VM's are in PowerOff mode and after starting BOINC the VM's are booted.

Killing VBoxSVC: VBoxHeadless.exe's are starving after a while and the VM's are in PowerOff mode.
The wrappers keeps on running, but after suspend and resume the VM's are booted.
But wrappers pausing and resuming longer leads to computation errors.
ID: 195 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
m
Volunteer tester

Send message
Joined: 20 Mar 15
Posts: 243
Credit: 886,442
RAC: 223
Message 199 - Posted: 27 Mar 2015, 2:11:09 UTC
Last modified: 27 Mar 2015, 2:54:48 UTC

Win7 VBox 4.3.26-98988.
VM pauses and restarts OK when tasks switch and when
manually suspended.
BUT rebooting the host after ca 13 minutes running
(VM appeared to be running OK) caused VM to reboot.
Is this OK?. BOINC checkpoint interval is set to 10
mins in mgr.. Perhaps I didn't wait long enough.
This throws up a couple of questions:-
1. Is checkpointing now done? If so can someone tell me how, please.
Edit:- I, for one am going to need checkpointing to get
much useful work done.
2. Where can I find release notes for the wrappers?

Edit:-Rebooted host again after ca45 mins. VM rebooted but elapsed time
shown by mgr didn't reset.
ID: 199 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1182
Credit: 816,328
RAC: 288
Message 200 - Posted: 27 Mar 2015, 6:43:21 UTC - in response to Message 199.  
Last modified: 27 Mar 2015, 6:45:11 UTC

BUT rebooting the host after ca 13 minutes running
(VM appeared to be running OK) caused VM to reboot.
Is this OK?

That's expected when the machine is rebooted without stopping BOINC first.

1. Is checkpointing now done? If so can someone tell me how, please.

Checkpointing is done for the task progress. Every 10 minutes. The first time the wrapper is started and after a wrapper restart it could be up to 20 minutes.
The task state is saved every time a task is suspended ("Leave applications in memory" ticked off) and when BOINC is stopped.
ID: 200 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
m
Volunteer tester

Send message
Joined: 20 Mar 15
Posts: 243
Credit: 886,442
RAC: 223
Message 203 - Posted: 27 Mar 2015, 10:44:31 UTC - in response to Message 200.  
Last modified: 27 Mar 2015, 11:02:12 UTC

Thanks, CP.


That's expected when the machine is rebooted without stopping BOINC first.

1. Is checkpointing now done? If so can someone tell me how, please.

Checkpointing is done for the task progress. Every 10 minutes. The first time the wrapper is started and after a wrapper restart it could be up to 20 minutes.
The task state is saved every time a task is suspended ("Leave applications in memory" ticked off) and when BOINC is stopped.


Does this mean that, in order to save the VM job in progress (as opposed to the BOINC task), BOINC must be stopped before the host is shut down? My hosts run overnight and are normally shut down each day by a Win scheduled task (or Linux cron job) so I could stop BOINC first, but a normal user wouldn't do this.

Edit. How do I find out what changes/bugfixes are in each wrapper release?
ID: 203 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1182
Credit: 816,328
RAC: 288
Message 205 - Posted: 27 Mar 2015, 12:00:32 UTC - in response to Message 203.  

Does this mean that, in order to save the VM job in progress (as opposed to the BOINC task), BOINC must be stopped before the host is shut down? My hosts run overnight and are normally shut down each day by a Win scheduled task (or Linux cron job) so I could stop BOINC first, but a normal user wouldn't do this.

1. A BOINC cruncher is not a normal user. ;-)
2. A BOINC cruncher using VBox is not a normal user at all. ;-)
3. It's always good to stop all user processes/save your work before shutdown.

You could add stopping BOINC to your scheduled task with the command "path"\boinccmd --quit
ID: 205 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : News : VBox Wrappers Updated to 26158


©2024 CERN