Message boards : Number crunching : Current issues
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,945,813
RAC: 2,949
Message 1904 - Posted: 6 Feb 2016, 22:20:45 UTC - in response to Message 1898.  

Any news on fixing the server?

Not from me, but I guess you realise I'm not involved in that side of the business.
I'm sort of back in harness again -- manually submitted a new certificate proxy today because the current batch was going to run past the 7-day default. I have to work on the report of wishlists and their fulfilment now, for a meeting on Wednesday, so please add any comments you want to the "constructive suggestions wanted" thread, preferably before Monday.
I'll be working in parallel with my real job -- I learnt a few things I didn't realise about templated C++ code on Friday, but I got it working in the end -- so I may be slow to react for a while yet.
So, here's hoping for a productive week coming up!
ID: 1904 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1905 - Posted: 6 Feb 2016, 22:59:40 UTC - in response to Message 1904.  

Thanks Ivan.
Maybe your guys should compare server settings with the vLHC guys(their server is working).
They are all working for the same "club".

You are at least responding. A few lines about the progress, every now and then,
is not too much to ask, is it.?
Otherwise people loose interest and go somewhere else ( with their processing capabilities).
ID: 1905 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,945,813
RAC: 2,949
Message 1911 - Posted: 8 Feb 2016, 22:38:35 UTC - in response to Message 1888.  

Yes, I'm getting the same message.

It's all a bit strange. I tried playing around with the allowed disk space in boincmgr and an increase of 50 GB(!) brought the "required" amount down, another 50 GB and it was down to 2 GB needed, another 50 GB and it wanted 9 GB again...
ID: 1911 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1912 - Posted: 8 Feb 2016, 23:27:36 UTC - in response to Message 1911.  

Hi Ivan,
It was a nice try, but the reported boinc space is misreported anyway.

No word of any progress. That is pretty sad.
ID: 1912 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1913 - Posted: 9 Feb 2016, 2:24:59 UTC
Last modified: 9 Feb 2016, 3:24:50 UTC

I noticed, that the server allways sends 10 tasks at once. That happens, even though the event log says " no tasks sent".
The tasks do not show up in boinc, because of the message:

Message from server: CMS Simulation needs 9536.54MB more disk space. You currently have 0.21 MB available and it needs 9536.74 MB.

If i adjust the disk space available from 50.89GB to 50.88GB it says:

CMS Simulation needs 14.10MB more disk space. You currently have 9522.64 MB available and it needs 9536.74 MB.

This looks to me like a simple decimal place error.
We should get 1 task and get 10.
Disk space required is about 10* the expected size of 900 something MB.
ID: 1913 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rom Walton (BOINC)

Send message
Joined: 20 Mar 15
Posts: 14
Credit: 5,132
RAC: 0
Message 1914 - Posted: 9 Feb 2016, 2:29:39 UTC
Last modified: 9 Feb 2016, 2:30:00 UTC

Out of curiosity, what does the sched_request_boincai05.cern.ch_CMS-dev.xml file show as far as free disk space?

My sched_request_boincai05.cern.ch_CMS-dev.xml:
<host_info>
    ...
    <d_total>1999871410176.000000</d_total>
    <d_free>1583612952576.000000</d_free>
    ...
</host_info>


I'm curious if the client is misreporting the amount of free disk space to the project.

----- Rom
ID: 1914 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rom Walton (BOINC)

Send message
Joined: 20 Mar 15
Posts: 14
Credit: 5,132
RAC: 0
Message 1915 - Posted: 9 Feb 2016, 2:35:06 UTC - in response to Message 1914.  
Last modified: 9 Feb 2016, 2:35:25 UTC

Out of curiosity, what does the sched_request_boincai05.cern.ch_CMS-dev.xml file show as far as free disk space?

My sched_request_boincai05.cern.ch_CMS-dev.xml:
<host_info>
    ...
    <d_total>1999871410176.000000</d_total>
    <d_free>1583612952576.000000</d_free>
    ...
</host_info>


I'm curious if the client is misreporting the amount of free disk space to the project.

----- Rom


Nevermind, I just received the error message too.

2/8/2016 9:23:26 PM | CMS-dev | Message from server: CMS Simulation needs 3007.27MB more disk space.  You currently have 6529.48 MB available and it needs 9536.74 MB.


----- Rom
ID: 1915 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1916 - Posted: 9 Feb 2016, 2:51:51 UTC - in response to Message 1914.  

Do not think so.

    <d_free>422110560256.000000</d_free>

ID: 1916 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1917 - Posted: 9 Feb 2016, 2:59:39 UTC
Last modified: 9 Feb 2016, 3:22:59 UTC

If i increase the boinc disk space from
50.91 to 50.92GB (needed to dial that in carefully)

The message goes from
need 8.23MB to
have 2.01MB.

So if i increase the disk space by 10MB the needed memory changes by 10MB until it comes to a critical point, where it flips over.

It is impossible to change the disk space for boinc to meet the required 9534.73MB.

It has to be some sort of decimal place shift somewhere.
ID: 1917 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Send message
Joined: 13 Feb 15
Posts: 1185
Credit: 849,977
RAC: 1,116
Message 1918 - Posted: 9 Feb 2016, 12:36:16 UTC - in response to Message 1917.  

So if i increase the disk space by 10MB the needed memory changes by 10MB until it comes to a critical point, where it flips over.

Available to BOINC 28,39GB

With disk_max_used_gb 40.616600 requesting work reply =
CMS-dev 09 Feb 13:25:39 Message from server: CMS Simulation needs 0.01MB more disk space. You currently have 9536.73 MB available and it needs 9536.74 MB.

With disk_max_used_gb 40.616700 requesting work reply =
CMS-dev 09 Feb 13:25:50 Message from server: CMS Simulation needs 9536.65MB more disk space. You currently have 0.09 MB available and it needs 9536.74 MB.
ID: 1918 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Yeti
Avatar

Send message
Joined: 29 May 15
Posts: 147
Credit: 2,842,484
RAC: 0
Message 1919 - Posted: 9 Feb 2016, 17:41:38 UTC

This sounds to me as the good old times:

Do you remember, what an 8 Bit integer is ?

Do you remember, if you add 1 to an 8 Bit integer value that is 32767 ?

For me it looks like an variable-Overflow ...
ID: 1919 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,945,813
RAC: 2,949
Message 1920 - Posted: 9 Feb 2016, 19:40:37 UTC - in response to Message 1919.  

This sounds to me as the good old times:

Do you remember, what an 8 Bit integer is ?

Do you remember, if you add 1 to an 8 Bit integer value that is 32767 ?

For me it looks like an variable-Overflow ...


Good point, but I think you mean 16-bit, signed. :-)
ID: 1920 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Yeti
Avatar

Send message
Joined: 29 May 15
Posts: 147
Credit: 2,842,484
RAC: 0
Message 1924 - Posted: 9 Feb 2016, 20:42:02 UTC - in response to Message 1920.  
Last modified: 9 Feb 2016, 20:42:13 UTC

Good point, but I think you mean 16-bit, signed. :-)

Yeah, it is so far away but it seems you are right ;-)
ID: 1924 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile PDW

Send message
Joined: 20 May 15
Posts: 217
Credit: 5,871,767
RAC: 16,520
Message 1931 - Posted: 10 Feb 2016, 12:26:59 UTC - in response to Message 1924.  

I tried to get another job and got ten in my account, none on the PC and error message about space (CMS Simulation needs 8868.90MB more disk space. You currently have 667.84 MB available and it needs 9536.74 MB), there is over 100Gb free.

I removed the project as per Laurence's suggestion and the in progress tasks got marked as Abandoned.

I attached to the project again and requested work. Every time I request work I get another ten tasks added to the In Progress tally but nothing on the Ubuntu PC. I do not get any error messages about not having enough disk space. I now have 120 in progress according to my account.

So the good news is the warning message about space has gone, the bad news is that the limit of 20 tasks per day doesn't stop me being assigned much more than that. The really bad news is that I still cannot get any work downloaded to run.
ID: 1931 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1932 - Posted: 10 Feb 2016, 12:50:41 UTC

I have one running!

I suspended all other projects and i got a cms task and it started.
I also left the boinc disk usage blank.


Did you guys change something on the server?
ID: 1932 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile PDW

Send message
Joined: 20 May 15
Posts: 217
Credit: 5,871,767
RAC: 16,520
Message 1934 - Posted: 10 Feb 2016, 13:02:21 UTC - in response to Message 1932.  

I have 2 (on separate PCs) :-)
ID: 1934 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 12 Sep 14
Posts: 1067
Credit: 329,589
RAC: 87
Message 1935 - Posted: 10 Feb 2016, 14:03:01 UTC - in response to Message 1932.  

The problem seems to have been related to the job templates used to create the tasks. These have been fixed and new tasks created.
ID: 1935 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Yeti
Avatar

Send message
Joined: 29 May 15
Posts: 147
Credit: 2,842,484
RAC: 0
Message 1936 - Posted: 10 Feb 2016, 14:06:00 UTC

Yeah, I got one !
ID: 1936 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 20 Jan 15
Posts: 1129
Credit: 7,945,813
RAC: 2,949
Message 1942 - Posted: 10 Feb 2016, 15:52:41 UTC - in response to Message 1932.  
Last modified: 10 Feb 2016, 15:53:49 UTC

I have one running!

I suspended all other projects and i got a cms task and it started.
I also left the boinc disk usage blank.


Did you guys change something on the server?

Yes, some templates apparently got corrupted -- Laurence fixed it and posted in News in "Zombie Tasks".
ID: 1942 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Send message
Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 1943 - Posted: 10 Feb 2016, 16:04:58 UTC - in response to Message 1942.  

Thanks, Ivan, i saw it.

Is there any way to get the vLHC guys to put on CMS tasks?

They have been asked over an over again, no answer.
ID: 1943 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : Number crunching : Current issues


©2024 CERN