Message boards : CMS Application : New version 49.00
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1064
Credit: 325,950
RAC: 249
Message 6247 - Posted: 25 Mar 2019, 10:45:44 UTC

This new version updates the CVMFS cache. It will reduce the amount of data downloaded during each VM start.
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 6250 - Posted: 25 Mar 2019, 12:41:49 UTC - in response to Message 6247.  

But that is after having to d/l a 1.14GB .vdi

ISP's must love doing that.
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 6252 - Posted: 25 Mar 2019, 13:10:48 UTC - in response to Message 6247.  

It will reduce the amount of data downloaded during each VM start.
The first cmsRun started within 7 minutes up-time.
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 6253 - Posted: 25 Mar 2019, 16:05:22 UTC - in response to Message 6250.  

But that is after having to d/l a 1.14GB .vdi

ISP's must love doing that.

The one that I have is a bit bigger than that:
-rw-r--r-- 1 eesridr users 3008364544 Mar 25 11:38 CMS_2019_03_25.vdi
However, a two-core task that's been running for over 4 hours has only downloaded 151 MiB, compared to about 1 GB for the old version. Swings and roundabouts...
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 6254 - Posted: 25 Mar 2019, 17:31:16 UTC - in response to Message 6253.  
Last modified: 25 Mar 2019, 17:33:39 UTC

But that is after having to d/l a 1.14GB .vdi

ISP's must love doing that.

The one that I have is a bit bigger than that..

The 1.14GB was the vdi.gz compressed file (1224983110 bytes).
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 6255 - Posted: 25 Mar 2019, 20:41:40 UTC - in response to Message 6254.  

But that is after having to d/l a 1.14GB .vdi

ISP's must love doing that.

The one that I have is a bit bigger than that..

The 1.14GB was the vdi.gz compressed file (1224983110 bytes).

Ah, OK, I didn't realise it was sent compressed -- I'm slightly surprised it compresses so well.
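For reference, the two file sizes quoted in this thread give the compression ratio directly. A quick check in Python, assuming the 3008364544-byte listing is the unpacked image on disk and the 1224983110-byte figure is the vdi.gz that is actually transferred:

```python
# Sizes as quoted in the thread (bytes).
UNCOMPRESSED_VDI = 3008364544   # ls -l output for CMS_2019_03_25.vdi
COMPRESSED_VDI_GZ = 1224983110  # vdi.gz size reported by Crystal Pellet

GIB = 1024 ** 3  # binary gigabyte


def to_gib(n_bytes):
    """Convert a byte count to GiB."""
    return n_bytes / GIB


ratio = COMPRESSED_VDI_GZ / UNCOMPRESSED_VDI

print(f"compressed:   {to_gib(COMPRESSED_VDI_GZ):.2f} GiB")
print(f"uncompressed: {to_gib(UNCOMPRESSED_VDI):.2f} GiB")
print(f"ratio:        {ratio:.1%} of the original size")
```

So the "1.14GB" figure is in binary units, and the download is roughly 41% of the size of the image on disk.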
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 6256 - Posted: 25 Mar 2019, 23:09:26 UTC

........one of these days


maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 6257 - Posted: 26 Mar 2019, 0:20:33 UTC

2" 50' for this Download in Germany.
rbpeake

Send message
Joined: 15 Apr 15
Posts: 38
Credit: 227,251
RAC: 0
Message 6258 - Posted: 26 Mar 2019, 1:27:22 UTC - in response to Message 6253.  
Last modified: 26 Mar 2019, 1:35:28 UTC

Kindly ignore.
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 6259 - Posted: 26 Mar 2019, 6:54:05 UTC

Index of /logs:
finished_0.log is only 39 bytes of text: "(Running job output should appear here.)"
The following logs are all 4.8 MByte, as usual. It is now at No. 4.
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 6260 - Posted: 26 Mar 2019, 9:47:01 UTC - in response to Message 6259.  

Index of /logs:
finished_0.log is only 39 bytes of text: "(Running job output should appear here.)"
The following logs are all 4.8 MByte, as usual. It is now at No. 4.

Yes, looks like a small bug in the indexing. finished_0.log is created at startup and is supposed to be overwritten by the log of the first job. Looking at the timings of some job logs, the first log is written to _1, the second to _2 and so on.
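A minimal sketch of the off-by-one being described. The function names and the idea of a job counter starting at 1 are hypothetical and not taken from the actual CMS scripts; only the finished_N.log naming and the observed behaviour come from this thread.

```python
def intended_log_name(job_number):
    """Job numbers start at 1; the first job should replace finished_0.log."""
    return f"finished_{job_number - 1}.log"


def observed_log_name(job_number):
    """What the log timings suggest happens: the first job's log lands in _1."""
    return f"finished_{job_number}.log"


for job in (1, 2, 3):
    print(job, intended_log_name(job), observed_log_name(job))
```

With the observed naming, nothing ever overwrites finished_0.log, so it keeps its placeholder text.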
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 28 Jul 16
Posts: 467
Credit: 389,411
RAC: 449
Message 6261 - Posted: 26 Mar 2019, 13:42:20 UTC

Tue 26 Mar 2019 14:29:45 CET | LHC@home | Started download of CMS_2019_03_25.vdi
Tue 26 Mar 2019 14:29:57 CET | LHC@home | Finished download of CMS_2019_03_25.vdi

Download finished after 12 s if the vdi is already in the local proxy's cache.
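The 12 s figure follows from the two timestamps in the log. As a rough sketch, the same arithmetic also gives an effective transfer rate, assuming the file served by the proxy is the 1224983110-byte compressed image mentioned earlier in the thread (if the proxy delivers something else, the rate will differ):

```python
from datetime import datetime

FMT = "%H:%M:%S"
started = datetime.strptime("14:29:45", FMT)   # "Started download" line
finished = datetime.strptime("14:29:57", FMT)  # "Finished download" line
elapsed_s = (finished - started).total_seconds()

# Assumption: the transfer is the compressed vdi.gz, size quoted in this thread.
ASSUMED_TRANSFER_BYTES = 1224983110
rate_mb_s = ASSUMED_TRANSFER_BYTES / elapsed_s / 1e6

print(f"{elapsed_s:.0f} s, about {rate_mb_s:.0f} MB/s")
```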
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 6262 - Posted: 26 Mar 2019, 14:27:56 UTC

The first task ran successfully and the third is running now, but the second had this error:
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2762680
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 6264 - Posted: 26 Mar 2019, 15:40:10 UTC - in response to Message 6262.  

The first task ran successfully and the third is running now, but the second had this error:
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2762680

That looks like a transient local error -- something went wrong trying to set up the VM, but I don't see enough info to be more precise. Was your machine perhaps near its memory or processing capacity with other tasks?
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 6265 - Posted: 26 Mar 2019, 15:45:09 UTC
Last modified: 26 Mar 2019, 15:55:51 UTC

Ivan,
this third task shows no RDP console in BOINC Manager. Graphics work fine in Firefox.
On this BOINC Manager only one Einstein task is running at the same time.
Memory: 23 GByte in use, 9 GByte free in Windows.
I was not in front of the screen when the first task finished and the second died.
26.03.2019 14:43:09 | lhcathome-dev | Computation for task CMS_2228323_1553438293.163516_0 finished
26.03.2019 14:43:09 | lhcathome-dev | Sending scheduler request: To report completed tasks.
26.03.2019 14:43:09 | lhcathome-dev | Reporting 1 completed tasks
26.03.2019 14:43:09 | lhcathome-dev | Requesting new tasks for CPU
26.03.2019 14:43:10 | lhcathome-dev | Scheduler request completed: got 1 new tasks
26.03.2019 14:43:12 | lhcathome-dev | Starting task CMS_2146187_1553327273.079524_0
26.03.2019 14:44:11 | lhcathome-dev | Computation for task CMS_2146187_1553327273.079524_0 finished
26.03.2019 14:45:39 | lhcathome-dev | Sending scheduler request: To report completed tasks.
26.03.2019 14:45:39 | lhcathome-dev | Reporting 1 completed tasks
26.03.2019 14:45:39 | lhcathome-dev | Requesting new tasks for CPU
26.03.2019 14:45:41 | lhcathome-dev | Scheduler request completed: got 1 new tasks
26.03.2019 14:45:43 | lhcathome-dev | Starting task CMS_2209463_1553412188.064708_0

CMS_2146187_1553327273.079524_0 is the second task, the one that died.
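How quickly the second task died can be read off the log above. A small sketch that pairs the "Starting task" and "Computation for task ... finished" lines, using only the two relevant lines copied from the log (the helper name task_runtimes is made up for this example):

```python
from datetime import datetime

LOG = """\
26.03.2019 14:43:12 | lhcathome-dev | Starting task CMS_2146187_1553327273.079524_0
26.03.2019 14:44:11 | lhcathome-dev | Computation for task CMS_2146187_1553327273.079524_0 finished
"""

START = "Starting task "
DONE_PREFIX = "Computation for task "
DONE_SUFFIX = " finished"


def task_runtimes(log_text):
    """Return {task_name: seconds between its start line and its finish line}."""
    starts, runtimes = {}, {}
    for line in log_text.splitlines():
        if line.count("|") < 2:
            continue  # not a BOINC event log line
        stamp, _project, message = (part.strip() for part in line.split("|", 2))
        when = datetime.strptime(stamp, "%d.%m.%Y %H:%M:%S")
        if message.startswith(START):
            starts[message[len(START):]] = when
        elif message.startswith(DONE_PREFIX) and message.endswith(DONE_SUFFIX):
            name = message[len(DONE_PREFIX):-len(DONE_SUFFIX)]
            if name in starts:
                runtimes[name] = (when - starts[name]).total_seconds()
    return runtimes


print(task_runtimes(LOG))
```

The task ran for just under a minute, which fits a failure during VM setup rather than during the job itself.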
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 6268 - Posted: 26 Mar 2019, 22:19:53 UTC - in response to Message 6265.  

I know it's no consolation to you, but this is beyond my level of knowledge. In fact I have a few machines I've never even got VirtualBox to run on (in one case I suspect it's the corporate Kaspersky antivirus install which managed to worm its way into my Win7 machine despite my best efforts to avoid it). I just updated my Linux Mint kernel at home, which killed my previous VirtualBox; I've managed to install a copy from a repository, and the extras, and the Guest gubbins, but when I try to look at the VM consoles in boincmgr I get nothing and a message on my screen:
execvp(rdesktop-vrdp, localhost:53594) failed with error 2!
I think error 2 is ENOENT (no such file or directory), but working out which file is missing is difficult -- strace is way too verbose...
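A quick way to check this without strace, as a sketch: errno 2 is indeed ENOENT, and for execvp that most often means the program itself could not be found on PATH (a missing script interpreter is another possibility). The program name rdesktop-vrdp is taken from the error message above; the helper missing_programs is made up for this example.

```python
import errno
import os
import shutil

# errno 2 on Linux (and Windows) is ENOENT.
print(errno.errorcode[2], "-", os.strerror(2))


def missing_programs(names):
    """Return the names that cannot be found on the current PATH."""
    return [name for name in names if shutil.which(name) is None]


# An empty list means the program was found; otherwise it is listed as missing.
print(missing_programs(["rdesktop-vrdp"]))
```

If rdesktop-vrdp shows up as missing, the VirtualBox package installed from the repository probably does not ship it or does not put it on PATH.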
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 6270 - Posted: 27 Mar 2019, 5:45:25 UTC

https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=2762100

This is a task from the previous version that I was letting finish while waiting for the new version to d/l. It waited until it was finished and then called it a computer error.
And it wasn't my computer.

2019-03-26 21:13:45 (600): Guest Log: [ERROR] Condor ended by dragons.

2019-03-26 21:13:45 (600): Guest Log: [INFO] Shutting Down.

2019-03-26 21:13:45 (600): VM Completion File Detected.
2019-03-26 21:13:45 (600): VM Completion Message: Condor ended by dragons

So I will abort the other one since I'd rather not run 18 hours for no reason, and dragons and penguins are not allowed where I am.
maeax

Send message
Joined: 22 Apr 16
Posts: 660
Credit: 1,720,327
RAC: 2,947
Message 6271 - Posted: 27 Mar 2019, 5:58:11 UTC

Ivan,
this can be checked by Laurence, if he has time for it.
It is good to see CMS running again with your help, thank you.
My third task crashed at 5 UTC with the dragon error, as Magic wrote:

2019-03-27 02:59:22 (3116): Status Report: CPU Time: '20471.562500'
2019-03-27 04:43:15 (3116): Status Report: Job Duration: '64800.000000'
2019-03-27 04:43:15 (3116): Status Report: Elapsed Time: '48003.650917'
2019-03-27 04:43:15 (3116): Status Report: CPU Time: '23960.250000'
2019-03-27 05:00:07 (3116): Guest Log: [INFO] Condor exited with return value N/A.

2019-03-27 05:00:07 (3116): Guest Log: [INFO] Shutting Down.

2019-03-27 05:00:07 (3116): Guest Log: [ERROR] Condor ended by dragons.

2019-03-27 05:00:07 (3116): Guest Log: [INFO] Shutting Down.

2019-03-27 05:00:07 (3116): VM Completion File Detected.
2019-03-27 05:00:07 (3116): VM Completion Message: Condor ended by dragons.
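To put the Status Report numbers in perspective, here is a small conversion, assuming all three values are in seconds (the 64800 figure matches the 18 hours mentioned earlier in the thread). Reading "Job Duration" as the configured run-time limit is also an assumption.

```python
# Values copied from the Status Report lines above, assumed to be seconds.
JOB_DURATION_S = 64800.0
ELAPSED_S = 48003.650917
CPU_TIME_S = 23960.25


def hours(seconds):
    """Convert seconds to hours."""
    return seconds / 3600


print(f"job duration: {hours(JOB_DURATION_S):.1f} h")
print(f"elapsed:      {hours(ELAPSED_S):.1f} h")
print(f"CPU time:     {hours(CPU_TIME_S):.1f} h")
print(f"CPU/elapsed:  {CPU_TIME_S / ELAPSED_S:.1%}")
```

So the VM had been up for a bit over 13 of its 18 hours when Condor exited, with CPU time at about half of the elapsed time.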
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 6272 - Posted: 27 Mar 2019, 8:32:46 UTC

I also have version 49.00 stopping with "Condor ended by dragons" now, and looking at Ivan's 40-core Xeon I see he is also getting many of those "Condor ended by dragons" errors.

And now the server is telling me 3/27/2019 1:18:16 AM | lhcathome-dev | This computer has finished a daily quota of 1 tasks

So no more work on this one AND the other 2 hosts are still d/ling that new vdi.......so I sure hope in the morning I don't find that it has to be updated to another new version and that I will have to do that all over again.

The server status says Simulation 100 unsent 52 in progress and make_work_app boincai05 Not Running

1:30am .........
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 6273 - Posted: 27 Mar 2019, 9:29:55 UTC - in response to Message 6272.  
Last modified: 27 Mar 2019, 9:30:54 UTC

Yes, sorry about that. Laurence says: "I have fixed the issue but it will only be picked up by new VMs".
So, any VMs started before 0600 GMT should probably be aborted. Note the jobs seem OK, it's only your BOINC credit that will be affected.