Message boards : Number crunching : Vritual Box E_ACCESSDENIED Error
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1064
Credit: 319,556
RAC: 73
Message 3557 - Posted: 11 Jun 2016, 21:37:03 UTC

Now that we are running in production we are experiencing all the failure modes out in the wild. Around 61% of the failing tasks are VMs that fail to boot and are killed by the heartbeat mechanism. There was not enough information in the stderr logs to understand the issues so Rom is working on some improvements to the vboxwrapper. A new version was deployed here on Friday and we already have some result in :).

http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201302
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201380
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201377
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201185
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=200884

The error message from VirtualBox is:

ERROR [COM]: aRC=E_ACCESSDENIED (0x80070005) aIID={f30138d4-e5ea-4b3a-8858-a059de4c93fd} aComponent={MachineWrap} aText={The object functionality is limited}, preserve=false aResultDetail=0


Does anyone have any ideas about what the causes may be?
ID: 3557 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 3558 - Posted: 12 Jun 2016, 8:54:37 UTC - in response to Message 3557.  

One possible solution might be to reinstall Vbox with admin rights.


Please uninstall vbox first and after reboot use "ccleaner" to clean up registry.
The reinstall vbox.

If that works, please post here.
ID: 3558 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile PDW

Send message
Joined: 20 May 15
Posts: 217
Credit: 5,010,018
RAC: 8,271
Message 3559 - Posted: 12 Jun 2016, 9:50:58 UTC

Do you have any of those errors appearing on machines with lots of memory ?

All your examples could easily be close to the edge and drop off when they try to start !
ID: 3559 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 3560 - Posted: 12 Jun 2016, 10:31:57 UTC
Last modified: 12 Jun 2016, 10:42:07 UTC

This one had only fails for several months.
Last valid result in January.


http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201302
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201380
http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=201377

This is hardly suitable to demonstrate the case.
ID: 3560 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1064
Credit: 319,556
RAC: 73
Message 3561 - Posted: 12 Jun 2016, 10:52:29 UTC - in response to Message 3558.  

Rasputin42,

Thanks for the suggestion. I have sent PMs to those involved. Hopefully they can try this and will get back to us with the result.
ID: 3561 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1064
Credit: 319,556
RAC: 73
Message 3562 - Posted: 12 Jun 2016, 10:54:01 UTC - in response to Message 3559.  

The hosts involved in the tasks posted have either 4 or 6GB. Should be enough for one task.
ID: 3562 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1064
Credit: 319,556
RAC: 73
Message 3563 - Posted: 12 Jun 2016, 11:04:59 UTC - in response to Message 3560.  

Rasputin42,

Looking at the stderr_txt for the completed tasks shows that the VMs did not start and were just idling. Now with the heartbeat mechanism they will error out after 10 mins as a failure with no credit being assigned.
ID: 3563 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile PDW

Send message
Joined: 20 May 15
Posts: 217
Credit: 5,010,018
RAC: 8,271
Message 3565 - Posted: 12 Jun 2016, 11:42:29 UTC - in response to Message 3562.  

The hosts involved in the tasks posted have either 4 or 6GB. Should be enough for one task.

But how many are they running either from here or other projects (that can demand a lot of memory) ?
ID: 3565 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Vritual Box E_ACCESSDENIED Error


©2024 CERN