Message boards :
CMS Application :
Stageout failures
Message board moderation
Previous · 1 · 2 · 3
Author | Message |
---|---|
Send message Joined: 20 Jan 15 Posts: 1139 Credit: 8,310,612 RAC: 37 |
The CMS jobs graph shows rising number of crab jobs, but the errors are staying the same(or even falling). It's just recovery from when we lost Condor on Tuesday night, I think. The CRAB jobs are going just to LHC@Home now (-dev and Laurence's cluster are munching on the WMAgent jobs, which show up as "unknown" for some unknown reason). Many hosts would have quota-ed out due to continued failure, so they will be gradually getting their quota back, while those that fell back to Test4Theory wouldn't return to us for the 18-20 hours their tasks run. We're only now back up to the 750-800 running CRAB jobs that we had before the disturbance. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Tasks have stopped (no cpu activity) and the following in the stderr.log: CMS jobs graphs not accessible,also. INFO:root:Beginning report processing for step logArch1 |
Send message Joined: 20 Jan 15 Posts: 1139 Credit: 8,310,612 RAC: 37 |
Tasks have stopped (no cpu activity) and the following in the stderr.log: Things look OK here at the moment, tho' there seems to have been a glitch at LHC@Home overnight, which appears to have led to a spike of "echo" failures once the problem cleared. I'll keep checking for a while, but I discovered when I came in this morning that the University is actually closed today, so I'll probably declare it POET'S Day[1] soon. [1] Traditional Aussie approach to any Friday, not just the one before Christmas -- P*ss Off Early, Tomorrow's Saturday. |
©2025 CERN