Message boards :
News :
Server upgrade
Message board moderation
Author | Message |
---|---|
Send message Joined: 12 Sep 14 Posts: 42 Credit: 111,031 RAC: 0 |
We will migrate the lhcathome-dev project to a Centos7 server and upgrade the BOINC server components. The lhcathome-dev project server will be unreachable for a while later today during the upgrade. |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 565 |
Thank you for the info, so we can stop work now. |
Send message Joined: 12 Sep 14 Posts: 42 Credit: 111,031 RAC: 0 |
Actually the upgraded server code gives an error with the scheduler: No start tag in scheduler reply . Will try to fix it, otherwise we'll revert back to the previous setup. |
Send message Joined: 8 Apr 15 Posts: 780 Credit: 12,150,930 RAC: 2,185 |
Well I sure hope this brings vLHC-dev back to normal again since most of the members have left us here. I even switched several of mine over to LHC but still have 2 here running the CMS and Theory tasks I have loaded. Finished the previous version of LHCb tasks but I rather not d/l the new vdi for now just to save this months data transfer so I can go longer than just one week before I use all 80GB total for the month from my satellite connection (45-50MBps) I have a feeling those vdi d/l/s along with VB tasks really eat that data transfer up so I am testing my other 32 cores running SixTracks without the ethernet cables plugged in.......I know it works great with the Einstein GPU tasks. But the main thing is getting this site back to normal since it is getting close to 10 weeks without keeping it all up to date. Mad Scientist For Life |
Send message Joined: 12 Sep 14 Posts: 42 Credit: 111,031 RAC: 0 |
Not sure if I follow you MAGIC, you are using the URL: https://lhcathomedev.cern.ch/lhcathome-dev/ for this dev project? Anyway, we are trying again the server update today, so for about 30 minutes, the server will be unavailable. There might be a couple such interventions on the dev project today, Tuesday 15th of August. |
Send message Joined: 8 Apr 15 Posts: 780 Credit: 12,150,930 RAC: 2,185 |
Not sure if I follow you MAGIC, you are using the URL: Well Nils we have been talking about this problem here since June 7th and on several threads yet we never get anything done about it and all I get is one of you saying you don't understand and then I type out a long step by step explanation and still nothing here. Once again.......look at the stats pages......they are all wrong and look at my account where it shows My Computers and there you will see it says I have not used any computer here for over 30 days. And e even when you check my Computers on the *Show: All computers* tab it will say the last contact here was in JUNE. Now how do you think that is possible when I am the one who has the highest average (RAC) and Total here? Once again I have to post here that I have been running 6 or 7 computers here 24/7 since the beginning on April 8th 2015 and the Total credit 3,068,798 did not happen by not running any computers here for months. BUT if you also check the RAC stats page it has *Paul* still at the top of the page since the stats pages have not changed for 10 WEEKS and that former member has not done ONE tasks here since June 7th. And once again let me remind you that this site has not even changed that user of the day for 10 WEEKS Most of the regulars here left because nothing ever gets done even when we write long posts here describing all the problems. There are only about 5 members still here and Ivan is one of them and I have always been the one doing most of the work here yet......as I have said about 10 times now.....this website has not been doing the simple job of keeping the stats up to date for 10 weeks. We can't even check how our computers are doing unless we go and check each one instead of just getting on ONE and going to our accounts to see what our computers are doing so we know if they have tasks running or if there is any problems. Now sure if I was just running one computer for Cern it wouldn't be all that bad but I am running NINE and I have been doing this 24/7 since 2004 and am one of the very few that also does all the Alpha-beta tests. And of course I use the right URL for vLHC-dev as does the other members here and the ones that left. One thing for sure is that I won't type this out again and next time I will just give the link to this post and maybe all the other ones about this. The only thing we get updated here is the stats below our avatar and that does not do us any good when it comes to checking all of our computers work from the accounts. Here you can find the many times I wasted here explaining this and was even asked to by other members since it is like this for all that are left here. https://lhcathomedev.cern.ch/lhcathome-dev/forum_user_posts.php?userid=192 Mad Scientist For Life |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 565 |
Nils, thank you for the Server-upgrade. The statistic of the -dev is now in boincstats.com again. |
Send message Joined: 13 Apr 15 Posts: 138 Credit: 2,969,210 RAC: 12 |
...... although ... even though I let it run and return a Benchmark task just now, (credited), this host is still below the 30 day cut-off and shows last contact as 15 June 8¬( , while 2 others above the cut-off successfully updated straight away. |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 565 |
Hi Ray, have for the -dev project after the last task was finished made a reset, removed it and made a new connect of the -dev project. Maybe this will help for a new connect-day. |
Send message Joined: 13 Apr 15 Posts: 138 Credit: 2,969,210 RAC: 12 |
Yes, Maeax, That would work but would require the download of all the .vdi's again so Magic would take about a week to do that on his 10 machines and clockwork network. And if a host is doing Sixtrack on the Production site for more than 30 days, it would again be seen as "inactive" here and the same "last contact" lock would occur. I don't knows where Nils et al might look but the fix must be server-side rather than host-side. Now that the stats are being exported again, this might now be a purely cosmetic issue but I don't know if it might have other implications down the line. |
Send message Joined: 8 Apr 15 Posts: 780 Credit: 12,150,930 RAC: 2,185 |
Ray is correct. https://lhcathomedev.cern.ch/lhcathome-dev/hosts_user.php?userid=192 The most important part is still the same. And last night and today I can't connect with the server 8/16/2017 10:18:11 AM | lhcathome-dev | Server error: feeder not running Mad Scientist For Life |
Send message Joined: 12 Sep 14 Posts: 42 Credit: 111,031 RAC: 0 |
Sorry about the feeder issue. We applied another update to the BOINC server code to test new job scheduling features, and missed a DB update in the process. This should now be ok. |
Send message Joined: 20 Jan 15 Posts: 1139 Credit: 8,181,211 RAC: 2,023 |
Sorry about the feeder issue. We applied another update to the BOINC server code to test new job scheduling features, and missed a DB update in the process. This should now be ok. I just got two new jobs, and server status is (mostly) all green, so it looks OK again. |
Send message Joined: 13 Apr 15 Posts: 138 Credit: 2,969,210 RAC: 12 |
... and the host that had been being ignored has updated it's "last contact" to the manual "phone home" I did just now 8¬). 5 of Magic's also have recent contacts again so the tap with a hammer seems to have done the trick. |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 565 |
Yes Ray, that's the real life with this hammer. Is it possible for the Admins to activate the processor-page under statistics and sort them inside those headers? This new -dev Server is so fast at the moment, whow. Thank you. |
Send message Joined: 22 Apr 16 Posts: 677 Credit: 2,002,766 RAC: 565 |
The -dev-Server is back. Thank you Cern-IT. |
Send message Joined: 20 Jan 15 Posts: 1139 Credit: 8,181,211 RAC: 2,023 |
The -dev-Server is back. Oh, goody! I'd better tickle my -dev machine to ask for tasks. |
Send message Joined: 8 Apr 15 Posts: 780 Credit: 12,150,930 RAC: 2,185 |
I just tried to get a couple on one of my 8-core pc's and this is what I got Stderr output <core_client_version>7.6.33</core_client_version> <![CDATA[ <message> app_version download error: couldn't get input files: <file_xfer_error> <file_name>vboxwrapper_26198ab7_windows_x86_64.exe</file_name> <error_code>-224 (permanent HTTP error)</error_code> <error_message>permanent HTTP error</error_message> </file_xfer_error> <file_xfer_error> <file_name>Theory_2017_05_29.xml</file_name> <error_code>-224 (permanent HTTP error)</error_code> <error_message>permanent HTTP error</error_message> </file_xfer_error> </message> ]]> I am about to see if it does the same thing on the other two 8-cores sitting next to that one. Mad Scientist For Life |
Send message Joined: 12 Sep 14 Posts: 42 Credit: 111,031 RAC: 0 |
Thanks for pointing this out. We still have work units in the DB that points to the former vLHCathome-dev. Download should be working again now. |
©2024 CERN