Message boards : CMS Application : No Tasks

ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5240 - Posted: 7 Nov 2017, 10:36:33 UTC - in response to Message 5239.  

No tasks, again. Please increase from 10 to 50.

https://lhcathomedev.cern.ch/lhcathome-dev/server_status.php


It's the same at LHC@home. I've sent an alert.
ID: 5240 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5241 - Posted: 7 Nov 2017, 11:04:24 UTC - in response to Message 5240.  

Problem identified -- a side-effect of the change to a new WMAgent. I'm getting new tasks again now.
ID: 5241 · Rating: 0
Rasputin42
Volunteer tester
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 5243 - Posted: 8 Nov 2017, 6:29:28 UTC

Same again??????

No tasks, again. Please increase from 10 to 50.

https://lhcathomedev.cern.ch/lhcathome-dev/server_status.php
ID: 5243 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5244 - Posted: 8 Nov 2017, 8:25:08 UTC - in response to Message 5243.  

No, it looks like a WMAgent component went down and the Condor job queue drained. I've submitted a new batch and informed the WMS maintainers.
ID: 5244 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5251 - Posted: 23 Nov 2017, 8:40:04 UTC

We have another WMAgent problem, so the job queue is dry. LHC@home is apparently down for some maintenance so I can't report it there. Hopefully CERN will be on top of it very shortly, but consider setting No New Tasks.
ID: 5251 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5252 - Posted: 23 Nov 2017, 10:08:46 UTC - in response to Message 5251.  

We are up again.
ID: 5252 · Rating: 0
Magic Quantum Mechanic
Joined: 8 Apr 15
Posts: 734
Credit: 11,558,298
RAC: 1,931
Message 5254 - Posted: 23 Nov 2017, 15:48:05 UTC - in response to Message 5252.  

NO CMS tasks again

Switching over to Theory
Mad Scientist For Life
ID: 5254 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5256 - Posted: 23 Nov 2017, 17:25:18 UTC - in response to Message 5254.  

NO CMS tasks again

Switching over to Theory

Yeah, there was an intervention which didn't go smoothly and they had to roll back to the old server. I'll try to find out when they will try again. Jobs are flowing again now, for the time being.
ID: 5256 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5257 - Posted: 24 Nov 2017, 9:03:41 UTC - in response to Message 5256.  

NO CMS tasks again

Switching over to Theory

Yeah, there was an intervention which didn't go smoothly and they had to roll back to the old server. I'll try to find out when they will try again. Jobs are flowing again now, for the time being.

From what I can see, the bit that affected us has been finished, and I don't see any upcoming interventions that will impact upon us.
ID: 5257 · Rating: 0
Rasputin42
Volunteer tester
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 5264 - Posted: 11 Dec 2017, 18:40:17 UTC

NO TASKS AGAIN; AGAIN; AGAIN....

Do I really always have to remind you, or will you eventually notice yourself?

https://lhcathomedev.cern.ch/lhcathome-dev/server_status.php
ID: 5264 · Rating: 0
Magic Quantum Mechanic
Joined: 8 Apr 15
Posts: 734
Credit: 11,558,298
RAC: 1,931
Message 5265 - Posted: 11 Dec 2017, 20:01:28 UTC - in response to Message 5264.  

There has been a CMS server problem since yesterday, Rasp.

All the tasks we did have would just crash after about 33 minutes with that "[ERROR] Condor exited after 731s without running a job." problem.

Ivan sent them a message (same thing over at LHC CMS).
Mad Scientist For Life
ID: 5265 · Rating: 0
Rasputin42
Volunteer tester
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 5266 - Posted: 11 Dec 2017, 21:05:59 UTC - in response to Message 5265.  

Thanks, Magic.
ID: 5266 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5267 - Posted: 12 Dec 2017, 9:11:12 UTC

We have a problem with the new WMAgent. I've sent some jobs via the old one, but for some reason the -dev project isn't picking them up. LHC@home had a related difficulty: the automatic disabling of task creation when there are no jobs doesn't work with the new website. However, I got jobs flowing with the old agent before Laurence manually disabled task creation. I fear the next month or so will be fraught as people take holidays, etc.
ID: 5267 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5268 - Posted: 12 Dec 2017, 9:33:54 UTC

We have tasks again here, too.
ID: 5268 · Rating: 0
Rasputin42
Volunteer tester
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 5270 - Posted: 12 Dec 2017, 19:07:57 UTC - in response to Message 5268.  
Last modified: 12 Dec 2017, 19:09:54 UTC

The server status page (SSP) shows no tasks to send.
ID: 5270 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5271 - Posted: 13 Dec 2017, 11:11:53 UTC - in response to Message 5270.  

Task creation has been tickled into life again, with a queue of 100 rather than 10.
ID: 5271 · Rating: 0
Rasputin42
Volunteer tester
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 5272 - Posted: 13 Dec 2017, 21:20:45 UTC - in response to Message 5271.  

... with a queue of 100 rather than 10.


Which I suggested a long time ago.

But people are not listening.

Thanks
ID: 5272 · Rating: 0
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1128
Credit: 7,870,419
RAC: 595
Message 5273 - Posted: 13 Dec 2017, 21:23:44 UTC - in response to Message 5272.  

... with a queue of 100 rather than 10.


Which I suggested a long time ago.

But people are not listening.

Thanks

Yes, I know you did. Sometimes people listen eventually. :-)
ID: 5273 · Rating: 0
Rasputin42
Volunteer tester
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 5275 - Posted: 15 Dec 2017, 18:44:22 UTC - in response to Message 5273.  

The queue is down to 10.
Maybe it is time to intervene?
ID: 5275 · Rating: 0
Crystal Pellet
Volunteer tester
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 2,009
Message 5276 - Posted: 16 Dec 2017, 9:29:28 UTC - in response to Message 5275.  

The queue is down to 10.
Maybe it is time to intervene?

A queue of 10 is enough for the LHC development project. There are only a few users here, and when no new versions have to be tested, why should you crunch here and not on the production project? In the last 24 hours, only 6 users were active on CMS on the dev project.
ID: 5276 · Rating: 0


©2024 CERN