VBox wrapper problems
Author | Message |
---|---|
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
After upgrading to version 26155 of the VBox wrappers, we have experienced some problems. Rather than reverting to a working state, we are going to push forward and help debug them. We hope that this way our development project can help those in production. Cheers, Laurence |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 3 |
Hi Laurence, Who absorbed all the tasks? This morning there were more than 4,000; now there are 0 (zero). Without tasks there can be no testing. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 3 |
"Who absorbed all the tasks?" I think Zombie did ;) All tasks went through in 14-40 seconds and were validated OK ?? |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Can Zombie reduce the number of machines? We really appreciate the support but at the moment we don't need the scale as the jobs are just sample jobs. The cycles would be better used in other projects for now. |
Send message Joined: 12 Sep 14 Posts: 1069 Credit: 334,882 RAC: 0 |
Have just sent 20K work units. |
Send message Joined: 26 Feb 15 Posts: 26 Credit: 5,042,431 RAC: 2,452 |
I just checked. The tasks I crunched were only across a handful of machines. The number of machines is not the problem, it's the extremely short run time. Any of us could have burned through them. I just had the luck of the BOINC back-off algorithm. Reno, NV Team: SETI.USA |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 3 |
Sorry, this time I burned 82 tasks into errors. I forgot to change vm_cache to vm_image in app_info. The current task is running fine and I set the duration to 3 hours: http://boincai05.cern.ch/CMS-dev/results.php?hostid=37 I am running with the vboxwrapper_26155_windows_x86_64.pdb for possible debug information. |
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 861,475 RAC: 3 |
"I just checked. The tasks I crunched were only across a handful of machines. The number of machines is not the problem, it's the extremely short run time. Any of us could have burned through them. I just had the luck of the BOINC back-off algorithm." The question is: why are the tasks running for such a short time on those machines? As far as I can see, only your machines are affected. The results contain the line "VM Completion File Detected", which causes the task to end. Your machines are still burning through the tasks. Please set No New Work on all of your machines except one and try to find out what's going on. That would help the project. |
Send message Joined: 26 Feb 15 Posts: 26 Credit: 5,042,431 RAC: 2,452 |
|
Send message Joined: 12 Sep 14 Posts: 65 Credit: 544 RAC: 0 |
"They are macs. Could that have something to do with the short run time?" Yes, there are known bugs with the Mac and Linux 26155 vboxwrapper which we are currently debugging. This causes rapid task termination and of course eats these tasks, so please hold off for now, OK? |
Send message Joined: 26 Feb 15 Posts: 26 Credit: 5,042,431 RAC: 2,452 |
"They are macs. Could that have something to do with the short run time?" I have turned them off for now. Let us know when you want to test the fixes. Edit: Although this doesn't address any other macs attached. A better solution is to remove the mac app until you are ready to have people run it again. Reno, NV Team: SETI.USA |
Send message Joined: 15 Feb 15 Posts: 10 Credit: 16,387 RAC: 0 |
I get the message "VM Hypervisor failed to enter an online state in a timely fashion". Should I let it run, or can I abort it? |
Send message Joined: 6 Mar 15 Posts: 19 Credit: 142,109 RAC: 0 |
"Although this doesn't address any other macs attached. A better solution is to remove the mac app until you are ready to have people run it again." I'm only running two Macs but I agree with Z67 that removing the app is the cleaner solution. I only burned through 150 WUs before I saw this. S. |
Send message Joined: 26 Feb 15 Posts: 26 Credit: 5,042,431 RAC: 2,452 |
"Although this doesn't address any other macs attached. A better solution is to remove the mac app until you are ready to have people run it again." If this also affects Linux, then that app should also be deprecated. Reno, NV Team: SETI.USA |
©2024 CERN