Message boards : LHCb Application : Debugging LHCb failed jobs
Message board moderation

To post messages, you must log in.

AuthorMessage
Cinzia

Send message
Joined: 3 Mar 16
Posts: 10
Credit: 33,623
RAC: 0
Message 2585 - Posted: 5 Apr 2016, 7:33:09 UTC

Dear all,

we are experiencing jobs failures and we would like to understand what is going wrong. We can see that, most of the failed jobs, are executed by the machines
with the following model name:

AMDAthlon(tm)IIX4630Processor
AMDAthlon(tm)IIX4635Processor
AMDA8-5500APUwithRadeon(tm)HDGraphics

We would like to know more about what you guys see while executing the jobs. Thank you for your contribution.

Cinzia
ID: 2585 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile PDW

Send message
Joined: 20 May 15
Posts: 217
Credit: 5,211,392
RAC: 8,905
Message 2641 - Posted: 11 Apr 2016, 16:08:24 UTC - in response to Message 2585.  

Hi Cinzia,

I assume you want users with those machines to let you know what is going wrong rather than what is going right.

I don't have any of those but at the moment though I am getting a Permission denied error when it is trying to 'touch' the shutdown file in the shared area so the jobs keep running longer than they should.
ID: 2641 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rbpeake

Send message
Joined: 15 Apr 15
Posts: 38
Credit: 227,251
RAC: 0
Message 2759 - Posted: 14 Apr 2016, 17:37:52 UTC - in response to Message 2641.  

Hi Cinzia,

... I am getting a Permission denied error when it is trying to 'touch' the shutdown file in the shared area so the jobs keep running longer than they should.


I am getting the same message.
ID: 2759 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Cinzia

Send message
Joined: 3 Mar 16
Posts: 10
Credit: 33,623
RAC: 0
Message 2885 - Posted: 21 Apr 2016, 10:19:27 UTC - in response to Message 2759.  

We will try to fix it.

Thanks for reporting.

Cheers
Cinzia
ID: 2885 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : LHCb Application : Debugging LHCb failed jobs


©2024 CERN