21) Message boards : Number crunching : Current issues (Message 1891)
Posted 5 Feb 2016 by rbpeake
Post:
There is CMS work now at vLHC, and it is working properly.
22) Message boards : News : Graceful Shutdown Now Implemented (Message 1820)
Posted 2 Feb 2016 by rbpeake
Post:
F5 works again with the second run. Definitely a glitch! 😉
23) Message boards : News : Graceful Shutdown Now Implemented (Message 1815)
Posted 2 Feb 2016 by rbpeake
Post:
Can you look at the logs in boincmgr with the (misleading) "Show graphics" buton?

Here is an excerpt from the boot log:
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Restoring chunk tables... done
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Restoring inode generation... done
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Restoring open files counter... done
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing saved glue buffer
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing chunk tables
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing saved inode generation info
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Releasing open files counter
    Tue Feb 2 09:22:26 2016: grid.cern.ch: Activating Fuse module


Nothing in the cron-stderr log.

From the cron-stdout log:

    type : RFC 3820 compliant impersonation proxy
    strength : 1024 bits
    path : /tmp/x509up_u500
    timeleft : 129:59:57 (5.4 days)
    09:23:06 -0500 2016-02-02 [INFO] Downloading glidein
    09:23:11 -0500 2016-02-02 [INFO] Running glidein (check logs)

24) Message boards : News : Graceful Shutdown Now Implemented (Message 1812)
Posted 2 Feb 2016 by rbpeake
Post:
F5 is not working, bot Windows task manager shows the CPU is working. Is this a glitch?
25) Message boards : News : Constructive suggestions please (Message 1792)
Posted 1 Feb 2016 by rbpeake
Post:
Also the efforts to build and maintain this is only worth it if it makes a difference to the experiment and as over the next 5-10 years the projections on computing supply and demand differ by a factor of 10, this project may turn out to be quite helpful.

I think the potential and value of BOINC has been proven by Atlas@home, where BOINC has consistently been one of the largest if not the largest production contributor to Atlas simulations. http://atlasathome.cern.ch/atlas_job.php
Also, and as a result I believe, the level of support for BOINC by Atlas has been excellent.

So I would expect BOINC to be eventually as important to CMS.
26) Message boards : Number crunching : Expect errors eventually (Message 1499)
Posted 3 Dec 2015 by rbpeake
Post:
Just curious to see your timing goal to start production work? These latest 250 unit jobs seem to be going well for me.

Thanks!
27) Message boards : News : No new jobs (Message 1172)
Posted 2 Oct 2015 by rbpeake
Post:
So we are officially contributing results to the larger simulation database?
28) Message boards : Number crunching : issue of the day (Message 798)
Posted 21 Aug 2015 by rbpeake
Post:
Ivan, are any of your work units for your presentation running yet?
29) Message boards : News : Agent Fixed (Message 747)
Posted 20 Aug 2015 by rbpeake
Post:
Doesn't seem to use much CPU (13%). Is that a sign of a problem?

Is that on your host machine or in the ALT+F3 VM console?
Of course, if it's an 8-core machine, Task Manager will show 12 or 13% for a full core's usage...

That was in BoincTasks v. 1.67, but for the ALT+3, mostly zero, up to one moment 20.5%. Seems like no work is being done.
30) Message boards : News : Agent Fixed (Message 742)
Posted 20 Aug 2015 by rbpeake
Post:
Doesn't seem to use much CPU (13%). Is that a sign of a problem?
31) Message boards : Number crunching : Heads up! Looking to make a major challenge next Wednesday; a call for volunteers (Message 642)
Posted 17 Aug 2015 by rbpeake
Post:
I have no doubt that this will be a successful test for CMS since ATLAS tested BOINC results for months before accepting BOINC as part of their regular production team.
32) Message boards : Number crunching : Some questions (Message 594)
Posted 16 Aug 2015 by rbpeake
Post:
Ok seems that the problem with the console is Windows 10 related because I cant connect to the consoles from vLHCathome, too.

I have the same issue with my 2 Windows 10 boxes. Hopefully that is one of the things Oracle is fixing so that Virtual box is Windows 10 compatible.
33) Message boards : News : Agent Update (Message 502)
Posted 4 Aug 2015 by rbpeake
Post:
Seems to be stuck on:
Condor started in background, now waiting on process 952
34) Message boards : News : Welcome to the CMS development project (Message 444)
Posted 25 Jun 2015 by rbpeake
Post:
And the project is still just running test units, not production work?

Thanks.
35) Message boards : Number crunching : New testers, please post here (Message 421)
Posted 29 May 2015 by rbpeake
Post:
Hi, I am a fresh and new user ;-)

Downloaded the first WU and has already started to crunch.

I looked inside the VM (as learned by vLHC) and it is saying something like: Begin processing the 6th record. Run 1, Event 906, ...

So, I think all is running as it should ?

Cheers, Yeti


Well I know you have plenty of cores especially right now with no Atlas tasks.


Am I correct that this CMS is just doing test work and not "real" project work? I would add more cores if I knew we were working on the latter.

Thanks!
36) Message boards : Number crunching : task postponed 86400.000000 sec: VM Hypervisor failed to enter an online state in a timely fashion. (Message 261)
Posted 17 Apr 2015 by rbpeake
Post:
Got this message:

2015-04-17 08:33:51 (6620): Preference change detected
2015-04-17 08:33:51 (6620): Setting CPU throttle for VM. (100%)
2015-04-17 08:33:51 (6620): Checkpoint Interval is now 1000 seconds.
2015-04-17 09:25:52 (6620): VM state change detected. (old = 'running', new = 'paused')
2015-04-17 09:31:39 (6620): Stopping VM.
2015-04-17 09:31:42 (6620): Successfully stopped VM.
2015-04-17 10:59:44 (7952): vboxwrapper (7.5.26165): starting
2015-04-17 10:59:44 (7952): Feature: Checkpoint interval offset (208 seconds)
2015-04-17 10:59:44 (7952): Detected: VirtualBox COM Interface (Version: 4.3.26)
2015-04-17 10:59:44 (7952): Detected: Minimum checkpoint interval (600.000000 seconds)
2015-04-17 10:59:44 (7952): Starting VM. (boinc_4065a39949c5a323, slot#4)
2015-04-17 11:04:46 (7952): Successfully started VM. (PID = '6412')
2015-04-17 11:04:46 (7952): Reporting VM Process ID to BOINC.
2015-04-17 11:09:47 (7952): VM is no longer is a running state. It is in 'saved'.
2015-04-17 11:09:47 (7952): VM state change detected. (old = 'poweroff', new = 'saved')
2015-04-17 11:09:47 (7952): NOTE: VM failed to enter an online state within the timeout period.
2015-04-17 11:09:47 (7952): This might be a temporary problem and so this job will be rescheduled for another time.
2015-04-17 11:09:47 (7952): Powering off VM.
37) Message boards : Number crunching : exceeded disk limit (Message 259)
Posted 16 Apr 2015 by rbpeake
Post:
I am also getting a bunch of these ATLAS errors now:

Stderr output
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<message>
Maximum disk usage exceeded
</message>
<stderr_txt>
2015-04-16 18:04:25 (12728): vboxwrapper (7.5.26110): starting
2015-04-16 18:04:25 (12728): Feature: Checkpoint interval offset (391 seconds)
2015-04-16 18:04:26 (12728): Detected: VirtualBox 4.3.26r98988
2015-04-16 18:04:26 (12728): Detected: Minimum checkpoint interval (900.000000 seconds)
2015-04-16 18:04:26 (12728): successfully copied 'init_data.xml' to the shared directory.
2015-04-16 18:04:34 (12728): Create VM. (boinc_d5363a11d7601e3a, slot#1)
2015-04-16 18:04:34 (12728): Updating drive controller type and model for desired configuration.
2015-04-16 18:04:35 (12728): Setting CPU Count for VM. (1)
2015-04-16 18:04:35 (12728): Setting Memory Size for VM. (2048MB)
2015-04-16 18:04:35 (12728): Setting Chipset Options for VM.
2015-04-16 18:04:35 (12728): Setting Boot Options for VM.
2015-04-16 18:04:36 (12728): Setting Network Configuration for NAT.
2015-04-16 18:04:36 (12728): Disabling USB Support for VM.
2015-04-16 18:04:36 (12728): Disabling COM Port Support for VM.
2015-04-16 18:04:36 (12728): Disabling LPT Port Support for VM.
2015-04-16 18:04:37 (12728): Disabling Audio Support for VM.
2015-04-16 18:04:37 (12728): Disabling Clipboard Support for VM.
2015-04-16 18:04:37 (12728): Disabling Drag and Drop Support for VM.
2015-04-16 18:04:38 (12728): Adding storage controller to VM.
2015-04-16 18:04:38 (12728): Adding virtual ISO 9660 disk drive to VM. (vm_isocontext.iso)
2015-04-16 18:04:38 (12728): Adding VirtualBox Guest Additions to VM.
2015-04-16 18:04:38 (12728): Adding virtual cache disk drive to VM. (vm_cache.vdi)
2015-04-16 18:04:39 (12728): Adding network bandwidth throttle group to VM. (Defaulting to 1024GB)
2015-04-16 18:04:39 (12728): Enabling network access for VM.
2015-04-16 18:04:39 (12728): forwarding host port 52085 to guest port 80
2015-04-16 18:04:39 (12728): Enabling remote desktop for VM.
2015-04-16 18:04:40 (12728): Enabling shared directory for VM.
2015-04-16 18:04:40 (12728): WARNING: Stale VirtualBox VM Log used.
2015-04-16 18:04:40 (12728): WARNING: Stale VirtualBox VM Log Not Found.
2015-04-16 18:04:40 (12728): WARNING: Stale VirtualBox VM Log used.
2015-04-16 18:04:40 (12728): WARNING: Stale VirtualBox VM Log Not Found.
2015-04-16 18:04:40 (12728): Starting VM.

</stderr_txt>
]]>

Guess I will need to clean up my VM, will that help?
38) Message boards : Number crunching : exceeded disk limit (Message 258)
Posted 16 Apr 2015 by rbpeake
Post:
For my first CMS unit, when it completed I noticed it caused one my three running ATLAS tasks to error-out with a "computation error". The ATLAS output message is as follows:

Stderr output
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<message>
Maximum disk usage exceeded
</message>
<stderr_txt>

</stderr_txt>
]]>


Previous 20


©2024 CERN