Message boards : Number crunching : A std::exception was thrown.
Message board moderation

To post messages, you must log in.

AuthorMessage
zombie67 [MM]
Avatar

Send message
Joined: 26 Feb 15
Posts: 26
Credit: 4,101,356
RAC: 0
Message 87 - Posted: 17 Mar 2015, 21:07:04 UTC

A number of my guests running under Linux seem to be stalling with a Fatal exception. Are we maybe taxing the incoming server a bit much? Or is this actually somewhat normal processing? :)

Exception Message:
A std::exception was thrown.
Can not get data (Additional Information: [frontier.c:799]: No more servers/proxies. Last error was: Request 26 on chan 1 failed at Tue Mar 17 15:45:46 2015: -6 [fn-socket.c:112]: connect to 128.142.156.169 timed out after 5 seconds) ( CORAL : "coral::FrontierAccess::Statement::execute" from "CORAL/RelationalPlugins/frontier" )

CMS_8510_1426111757.078819_0

http://boincai05.cern.ch/CMS-dev/result.php?resultid=25309

Whenever I peek in at these systems, the server is partially idling, which is unusual with them running BOINC to consume all those spare cycles. The VM Guests with this type of error on the console seem to be waiting for something and not doing any actual work. With the virtualbox guest only consuming about 10% of a core.

Just wanting to share.
Reno, NV
Team: SETI.USA
ID: 87 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Send message
Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 90 - Posted: 17 Mar 2015, 22:27:43 UTC - in response to Message 87.  

This is probably because the CMS job queue is empty - be patient (:-))
ID: 90 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 91 - Posted: 18 Mar 2015, 0:25:03 UTC - in response to Message 87.  

A number of my guests running under Linux seem to be stalling with a Fatal exception. Are we maybe taxing the incoming server a bit much? Or is this actually somewhat normal processing? :)

Exception Message:
A std::exception was thrown.
Can not get data (Additional Information: [frontier.c:799]: No more servers/proxies. Last error was: Request 26 on chan 1 failed at Tue Mar 17 15:45:46 2015: -6 [fn-socket.c:112]: connect to 128.142.156.169 timed out after 5 seconds) ( CORAL : "coral::FrontierAccess::Statement::execute" from "CORAL/RelationalPlugins/frontier" )
Just wanting to share.

Thanks. This may be that the proxies we use for "frontier" (experiment conditions database as far as I understand it) are down or overloaded. From what I see, we're hardwired into using proxies at Rutherford Appleton Lab in Oxfordshire; this may not be an optimal ultimate configuration, we shall probably have to move to a more-intelligent regional-based distribution.
Not sure about Ben's guess that we're out of WUs, there seem to be enough there (unless someone has just refilled the cache).
ID: 91 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : A std::exception was thrown.


©2024 CERN