Message boards : Number crunching : Job queue empty!!

Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1129
Credit: 7,874,101
RAC: 154
Message 462 - Posted: 7 Jul 2015, 9:59:15 UTC - in response to Message 461.  
Last modified: 7 Jul 2015, 10:02:49 UTC

Okay, so back to the point...
It seems that the CERN job queue (databridge, copilot, whatever) has been unable to supply jobs for some weeks.
Do you guys want us to continue running BOINC jobs, or leave off until an announcement?

OK, I've had some feedback. Changes are in the pipeline, and a major contributor will be back from holidays next week. For the present, there's not much point in running any jobs except to test out how the project runs on your equipment and to report back any anomalies.
Note the expectation is to eventually merge with other LHC projects, and a new unified web-site has been set up, so we will eventually close down this development project and move across there. You might also want to check out the latest news from BOINC itself.
I'll try to keep some information flowing as it comes to me.
Phil

Joined: 9 Apr 15
Posts: 57
Credit: 230,221
RAC: 0
Message 463 - Posted: 7 Jul 2015, 19:04:11 UTC - in response to Message 462.  

Okay, so back to the point...
It seems that the CERN job queue (databridge, copilot, whatever) has been unable to supply jobs for some weeks.
Do you guys want us to continue running BOINC jobs, or leave off until an announcement?

OK, I've had some feedback. Changes are in the pipeline, and a major contributor will be back from holidays next week.

Aah, that's fine. We're only a tiny testing project, and holiday absences are to be expected!

For the present, there's not much point in running any jobs except to test out how the project runs on your equipment and to report back any anomalies.

Fair enough, BOINCers are good at that.

You might also want to check out the latest news from BOINC itself.

Don't know if that'll have a negative impact, no impact or even a positive one! Time will tell...

I'll try to keep some information flowing as it comes to me.


Thanks Ivan.
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1129
Credit: 7,874,101
RAC: 154
Message 680 - Posted: 19 Aug 2015, 12:43:05 UTC

We're about to run out of jobs again. I need to check a few things on the condor queues so we might sit empty for an hour or two. I'll try to get something running again later this afternoon (London time). Thanks for your patience.
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1129
Credit: 7,874,101
RAC: 154
Message 698 - Posted: 19 Aug 2015, 16:35:42 UTC - in response to Message 680.  

We're about to run out of jobs again. I need to check a few things on the condor queues so we might sit empty for an hour or two. I'll try to get something running again later this afternoon (London time). Thanks for your patience.

I'm hanging off on this overnight -- it looks like there are enough jobs being resubmitted by the Condor queue to keep things humming.
Richard Haselgrove

Joined: 4 May 15
Posts: 64
Credit: 55,584
RAC: 0
Message 699 - Posted: 19 Aug 2015, 17:00:12 UTC - in response to Message 698.  

We're about to run out of jobs again. I need to check a few things on the condor queues so we might sit empty for an hour or two. I'll try to get something running again later this afternoon (London time). Thanks for your patience.

I'm hanging off on this overnight -- it looks like there are enough jobs being resubmitted by the Condor queue to keep things humming.

My Host 380 is looking decidedly yellow!
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1129
Credit: 7,874,101
RAC: 154
Message 715 - Posted: 19 Aug 2015, 21:09:43 UTC - in response to Message 699.  

We're about to run out of jobs again. I need to check a few things on the condor queues so we might sit empty for an hour or two. I'll try to get something running again later this afternoon (London time). Thanks for your patience.

I'm hanging off on this overnight -- it looks like there are enough jobs being resubmitted by the Condor queue to keep things humming.

My Host 380 is looking decidedly yellow!

Sorry old chap, I don't understand your banter!
Are you saying we have run out?
Unfortunately, Everything Everywhere is making a terrible hash of my home broadband this week (since Easter, but especially bad this week) so I'm not confident of being able to log in at work and make a meaningful intervention.
What seemed to be happening earlier was that Condor wasn't getting the message that we were successfully uploading results to stage-out, so it blithely re-scheduled jobs for another run, and then a third -- from earlier experience that is where it gives up. From the point of view of completeness, that gives failed jobs another two chances to be run to completion. From an efficiency point of view it's a bit -- you know!
Although I wouldn't bet money that Laurence, Andrew or Daniele has found a cure and we're now out of jobs.
I'll try to look, but I can't promise I can do anything until tomorrow morning at work.
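
To make the retry pattern above concrete, here is a rough Python sketch. It is only a toy model under stated assumptions, not Condor's real scheduling logic: the cap of three runs is taken from the behaviour described above, and `count_runs`, `outcomes`, `ack_received` and `max_runs` are made-up names for illustration.

```python
def count_runs(outcomes, ack_received, max_runs=3):
    """Count how often a job is run under a simple re-run policy.

    outcomes     -- list of bools, True if that attempt ran to completion
    ack_received -- True if the scheduler learns that results were staged out
    max_runs     -- assumed cap on runs (three, per the observation above)
    """
    runs = 0
    for succeeded in outcomes[:max_runs]:
        runs += 1
        # The scheduler only stops re-running once it knows the job succeeded.
        if succeeded and ack_received:
            break
    return runs


# A job that succeeds first time, with and without the acknowledgement.
print(count_runs([True, True, True], ack_received=True))
print(count_runs([True, True, True], ack_received=False))
```

In this model a job that finished fine first time is still run three times when the acknowledgement never arrives, which is the efficiency cost mentioned above, while a job that genuinely failed gets its two extra chances either way.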
Richard Haselgrove

Joined: 4 May 15
Posts: 64
Credit: 55,584
RAC: 0
Message 717 - Posted: 19 Aug 2015, 21:24:25 UTC - in response to Message 715.  

Sorry old chap, I don't understand your banter!

Sorry, reference to my BoincView network monitoring software. It flags tasks with 'low CPU efficiency' in yellow (among sundry other colour-coded alerts).

In the case of CMS, 'low CPU efficiency' correlates to 'low CPU usage', which basically means no CMS jobs running in the VM. At the time I posted, I assumed that simply meant the system had run out of jobs. Then I went upstairs and checked the VM console and logs - and found it possibly meant glitches with the new Agent scripts. See follow-up posts in the News area.
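
For anyone unfamiliar with the yellow flag, the idea can be sketched in Python as below. This is a guess at the general principle, not BoincView's actual code: it assumes 'CPU efficiency' means CPU time divided by elapsed wall-clock time, and the 0.5 threshold and the names `cpu_efficiency`, `flag_colour` and `LOW_EFFICIENCY_THRESHOLD` are hypothetical.

```python
LOW_EFFICIENCY_THRESHOLD = 0.5  # hypothetical cut-off, not BoincView's real value


def cpu_efficiency(cpu_seconds, elapsed_seconds):
    """Fraction of elapsed wall-clock time the task spent on the CPU."""
    if elapsed_seconds <= 0:
        return 0.0  # nothing has run yet, so treat it as zero efficiency
    return cpu_seconds / elapsed_seconds


def flag_colour(cpu_seconds, elapsed_seconds, threshold=LOW_EFFICIENCY_THRESHOLD):
    """Return "yellow" for a low-efficiency task, otherwise "none"."""
    if cpu_efficiency(cpu_seconds, elapsed_seconds) < threshold:
        return "yellow"
    return "none"


# A VM that has been up for an hour but used only five seconds of CPU.
print(flag_colour(5, 3600))
```

A VM with no CMS job inside it accumulates elapsed time but almost no CPU time, so it drops below any sensible threshold and gets flagged whether the cause is an empty queue or a script glitch.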
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1129
Credit: 7,874,101
RAC: 154
Message 721 - Posted: 20 Aug 2015, 7:54:42 UTC - in response to Message 717.  

Sorry old chap, I don't understand your banter!

Sorry, reference to my BoincView network monitoring software. It flags tasks with 'low CPU efficiency' in yellow (among sundry other colour-coded alerts).

In the case of CMS, 'low CPU efficiency' correlates to 'low CPU usage', which basically means no CMS jobs running in the VM. At the time I posted, I assumed that simply meant the system had run out of jobs. Then I went upstairs and checked the VM console and logs - and found it possibly meant glitches with the new Agent scripts. See follow-up posts in the News area.

OK, I'll be at work in 30 or 40 minutes to check on things then.


©2024 CERN