Message boards : CMS Application : New Version v47.60

Laurence
Project administrator
Project developer
Project tester
Joined: 12 Sep 14
Posts: 1064
Credit: 325,950
RAC: 249
Message 4187 - Posted: 17 Oct 2016, 14:51:50 UTC

This new version enables Web Proxy Auto-Discovery (WPAD), which means that CVMFS traffic should be directed to either CERN or FNAL, depending on where you are.
ID: 4187 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 4188 - Posted: 17 Oct 2016, 15:53:16 UTC - in response to Message 4187.  

This new version enables Web Proxy Auto-Discovery (WPAD), which means that CVMFS traffic should be directed to either CERN or FNAL, depending on where you are.

By the way, FNAL stands for Fermi National Accelerator Laboratory, which is near Chicago, USA.
ID: 4188 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Joined: 28 Jul 16
Posts: 467
Credit: 389,411
RAC: 449
Message 4189 - Posted: 17 Oct 2016, 16:29:22 UTC - in response to Message 4187.  

My host still gets v47.30 vbox64_mt_mcore_cms as described here.

See: https://lhcathome.cern.ch/vLHCathome-dev/result.php?resultid=274756
ID: 4189 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Magic Quantum Mechanic
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 4190 - Posted: 17 Oct 2016, 18:30:48 UTC

Batavia, Illinois

http://www.fnal.gov/
ID: 4190 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 4191 - Posted: 17 Oct 2016, 19:29:02 UTC - in response to Message 4189.  

My host still gets v47.30 vbox64_mt_mcore_cms as described here.

I got CMS Simulation v47.40 (vbox64_mt_mcore) windows_x86_64

However application 47.60 (vbox64_mt_mcore_cms) is available.
ID: 4191 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Joined: 28 Jul 16
Posts: 467
Credit: 389,411
RAC: 449
Message 4192 - Posted: 17 Oct 2016, 20:10:17 UTC

Although the application table shows only v47.60 (vbox64_mt_mcore_cms), I still get v47.30.
ID: 4192 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Magic Quantum Mechanic
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 4193 - Posted: 17 Oct 2016, 22:58:16 UTC - in response to Message 4192.  

You will probably notice when you get the new v47.60 vbox64_mt_mcore_cms, since you will have to download the 640.23 MB .vdi file.
Mad Scientist For Life
ID: 4193 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>Le_Pommier] Jerome_C2005

Joined: 17 Mar 15
Posts: 51
Credit: 602,329
RAC: 6
Message 4224 - Posted: 24 Oct 2016, 20:02:25 UTC

All failing on my Mac with "206 (0x000000CE) EXIT_INIT_FAILURE" error.

Example.
ID: 4224 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Magic Quantum Mechanic
Joined: 8 Apr 15
Posts: 738
Credit: 11,558,798
RAC: 1,847
Message 4225 - Posted: 24 Oct 2016, 20:38:03 UTC - in response to Message 4224.  

All failing on my Mac with "206 (0x000000CE) EXIT_INIT_FAILURE" error.

Example.



It looks like we have Condor problems here, just like over at vLHC.

I just got one of those "[ERROR] Could not ping HTCondor" messages when I tried to run a task on this PC. A few minutes later it was gone, and now I see that was done by the server because I had the previous version that I got yesterday. So now I am going to wait a while for that .vdi to download.

In your case you do have the new version and it is not making contact, so I guess they need to go feed that Condor again.
Mad Scientist For Life
ID: 4225 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ivan
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 20 Jan 15
Posts: 1129
Credit: 7,870,629
RAC: 576
Message 4226 - Posted: 24 Oct 2016, 21:15:55 UTC - in response to Message 4225.  

All failing on my Mac with "206 (0x000000CE) EXIT_INIT_FAILURE" error.

Example.



It looks like we have Condor problems here, just like over at vLHC.

I just got one of those "[ERROR] Could not ping HTCondor" messages when I tried to run a task on this PC. A few minutes later it was gone, and now I see that was done by the server because I had the previous version that I got yesterday. So now I am going to wait a while for that .vdi to download.

In your case you do have the new version and it is not making contact, so I guess they need to go feed that Condor again.

No, RAL has shut down our servers over the "Dirty COW" Linux kernel bug. Apparently the mitigation suggested by Red Hat is not sufficient for the level of paranoia the RAL sysadmins have developed, so we wait for a proper kernel patch to be released. Keep watching the News; I'll let you know as soon as I hear the servers have been restarted. (Our nominal chap in control of our servers is on holidays ATM, but heard the news and let us know; the more central bods who made the decision may not even know we exist.)
ID: 4226 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>Le_Pommier] Jerome_C2005

Joined: 17 Mar 15
Posts: 51
Credit: 602,329
RAC: 6
Message 4229 - Posted: 25 Oct 2016, 8:34:52 UTC

Thanks for the info.

When I'm grown up maybe I'll know what RAL, ATM holidays and a dirty cow are :)
ID: 4229 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 4230 - Posted: 25 Oct 2016, 10:07:31 UTC - in response to Message 4229.  

When I'm grown up maybe I'll know what RAL, ATM holidays and a dirty cow are :)

Let's help you grow a bit:
RAL --> Rutherford Appleton Laboratory
ATM --> At The Moment
dirty cow --> Explaining Dirty COW
ID: 4230 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>Le_Pommier] Jerome_C2005

Joined: 17 Mar 15
Posts: 51
Credit: 602,329
RAC: 6
Message 4236 - Posted: 25 Oct 2016, 21:08:03 UTC

Do you mean vLHC is hosted by an "external" laboratory? I would have thought this was "inside" LHC infrastructure, somehow...

Thanks for the nice video. ATM I probably only understood a portion of it, but it was nice ;)

So the "good news" is that the issue has nothing to do with either the application itself or my machine, so that's good enough for me!
ID: 4236 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ben Segal
Volunteer moderator
Volunteer developer
Volunteer tester

Joined: 12 Sep 14
Posts: 65
Credit: 544
RAC: 0
Message 4237 - Posted: 26 Oct 2016, 6:24:54 UTC - in response to Message 4236.  

Do you mean vLHC is hosted by an "external" laboratory? I would have thought this was "inside" LHC infrastructure, somehow...

...

The vLHC BOINC servers are at CERN, but the Condor servers which supply jobs can be elsewhere. In this case, CMS jobs are being sent from RAL, where Ivan is partly based.
ID: 4237 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 4243 - Posted: 27 Oct 2016, 18:09:37 UTC
Last modified: 27 Oct 2016, 18:37:08 UTC

Thu 27 Oct 2016 08:05:20 PM CEST | vLHCathome-dev | [cpu_sched_debug] enforce: result CMS_23154_1477523974.478450_0 can't run, too big 3000.00MB > 1871.86MB


I used to be able to run 4 tasks. Where does it get the 3000 MB figure from?
The official memory requirement in the CMSXXX.xml file is 2048.

(BOINC 7.6.33)

EDIT: BOINC says about 6000 MB of swap is used, which I do not have (I have NO swap file at all), and I set the swap setting in BOINC to the minimum (1%).

Any suggestions?
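
For anyone who wants to check the same thing on their own host, here is a minimal Python sketch that pulls the two figures out of a `[cpu_sched_debug]` line like the one quoted above and reports the shortfall. The regular expression and the function name `parse_too_big` are assumptions based on that single log line; they are not taken from the BOINC client source, and other client versions may word the message differently.

```python
import re

# Pattern inferred from the single log line quoted in this post (an
# assumption, not taken from the BOINC client source).
TOO_BIG = re.compile(
    r"result (?P<task>\S+) can't run, too big "
    r"(?P<needed>[\d.]+)MB > (?P<allowed>[\d.]+)MB"
)

def parse_too_big(line):
    """Return (task, needed_mb, allowed_mb, shortfall_mb), or None if no match."""
    m = TOO_BIG.search(line)
    if m is None:
        return None
    needed = float(m.group("needed"))
    allowed = float(m.group("allowed"))
    return m.group("task"), needed, allowed, round(needed - allowed, 2)

line = ("Thu 27 Oct 2016 08:05:20 PM CEST | vLHCathome-dev | [cpu_sched_debug] "
        "enforce: result CMS_23154_1477523974.478450_0 can't run, "
        "too big 3000.00MB > 1871.86MB")
print(parse_too_big(line))
```

The first figure is what the task asks for and the second is what the client is currently willing to give it, so the gap here is a little over 1100 MB.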
ID: 4243 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Joined: 28 Jul 16
Posts: 467
Credit: 389,411
RAC: 449
Message 4244 - Posted: 28 Oct 2016, 8:14:28 UTC

After I switched back to the non-HTTPS master URL I got a CMS task from version v47.60 (vbox64_mt_mcore_cms).

Settings:
no app_config.xml
Max # CPUs -> 3 (prefs on the website)


Result from stderr.txt:
2016-10-28 09:53:54 (26625): Setting Memory Size for VM. (3000MB)
2016-10-28 09:53:54 (26625): Setting CPU Count for VM. (1)


Shall I cancel the WU?
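
A minimal Python sketch for checking this on other tasks: it reads the memory size and CPU count out of stderr.txt and compares the CPU count with the website preference. The two patterns are inferred from the lines quoted above and are an assumption; other wrapper versions may phrase these messages differently. The name `vm_settings` is hypothetical.

```python
import re

# Patterns inferred from the two stderr.txt lines quoted in this post
# (assumption: other wrapper versions may word these messages differently).
MEMORY = re.compile(r"Setting Memory Size for VM\. \((\d+)MB\)")
CPUS = re.compile(r"Setting CPU Count for VM\. \((\d+)\)")

def vm_settings(stderr_text):
    """Return the memory (MB) and CPU count reported in stderr, None if absent."""
    mem = MEMORY.search(stderr_text)
    cpu = CPUS.search(stderr_text)
    return {
        "memory_mb": int(mem.group(1)) if mem else None,
        "cpus": int(cpu.group(1)) if cpu else None,
    }

stderr_text = """\
2016-10-28 09:53:54 (26625): Setting Memory Size for VM. (3000MB)
2016-10-28 09:53:54 (26625): Setting CPU Count for VM. (1)
"""
settings = vm_settings(stderr_text)
requested_cpus = 3  # "Max # CPUs" from the website preferences in this post
print(settings, "preference ignored:", settings["cpus"] != requested_cpus)
```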
ID: 4244 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
tullio
Joined: 17 Aug 15
Posts: 62
Credit: 296,695
RAC: 0
Message 4245 - Posted: 28 Oct 2016, 9:56:30 UTC

All tasks fail on this host except Benchmarks.
Tullio
ID: 4245 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Help desk expert
Joined: 28 Jul 16
Posts: 467
Credit: 389,411
RAC: 449
Message 4247 - Posted: 28 Oct 2016, 10:17:46 UTC - in response to Message 4245.  

All tasks fail on this host except Benchmarks.
Tullio

I tested only CMS.
For that app, you got an outdated version, v47.40.

If your host is attached via the new HTTPS master URL you may detach and reattach via the old URL http://lhcathomedev.cern.ch/vLHCathome-dev/

Then check if you get the most recent CMS version v47.60 (vbox64_mt_mcore_cms).
ID: 4247 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rasputin42
Volunteer tester

Joined: 16 Aug 15
Posts: 966
Credit: 1,211,816
RAC: 0
Message 4248 - Posted: 28 Oct 2016, 10:18:31 UTC - in response to Message 4245.  
Last modified: 28 Oct 2016, 10:20:11 UTC

Settings for CPUs per task in the account preferences are being ignored.
It just uses 1 core.

Overriding with app_config.xml works.
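
For reference, a minimal sketch of what such an override might look like, written as a short Python script that generates the XML. The actual file used here was not posted, so the short application name "CMS" and the value of 2 cores are assumptions for illustration; only the plan class comes from this thread. `app_version`, `app_name`, `plan_class` and `avg_ncpus` are standard app_config.xml elements, and the helper name `build_app_config` is hypothetical.

```python
import xml.etree.ElementTree as ET

def build_app_config(app_name, plan_class, ncpus):
    """Build a minimal app_config.xml tree that sets the CPU count per task."""
    root = ET.Element("app_config")
    version = ET.SubElement(root, "app_version")
    ET.SubElement(version, "app_name").text = app_name
    ET.SubElement(version, "plan_class").text = plan_class
    ET.SubElement(version, "avg_ncpus").text = str(ncpus)
    return root

# "CMS" as the short application name and 2 cores are assumptions for
# illustration; the plan class is the one named in this thread.
config = build_app_config("CMS", "vbox64_mt_mcore_cms", 2)
xml_text = ET.tostring(config, encoding="unicode")
print(xml_text)
```

The printed XML would be saved as app_config.xml in the project's folder inside the BOINC data directory, and the client then needs to re-read its config files (or be restarted) before the change takes effect.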
ID: 4248 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Crystal Pellet
Volunteer tester

Joined: 13 Feb 15
Posts: 1178
Credit: 810,985
RAC: 1,800
Message 4250 - Posted: 28 Oct 2016, 14:45:29 UTC

The VM did not restore from the saved snapshot after resume, but booted from scratch instead: http://lhcathomedev.cern.ch/vLHCathome-dev/result.php?resultid=280024

The next task, with the setting at 2 cores, creates a single-core VM with 3000 MB RAM.
ID: 4250 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote