Message boards :
Theory Application :
Device errors
Message board moderation
Author | Message |
---|---|
Send message Joined: 13 Feb 15 Posts: 1188 Credit: 857,561 RAC: 33 |
A 'new?' vdi (and xml) file was sent, but had the same name (Theory_2016_04_11) and more important the same device errors for device sda2 |
Send message Joined: 12 Sep 14 Posts: 1067 Credit: 334,882 RAC: 0 |
The same name but different file extensions. Will rebuild the image (vdi) tomorrow. |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
The same name but different file extensions. Will rebuild the image (vdi) tomorrow. Are you defragmenting the image before you put them on the server? |
Send message Joined: 12 Sep 14 Posts: 1067 Credit: 334,882 RAC: 0 |
No the image is not de-fragmented. As it is built from scratch, this should not be required. A new version is now available with a new image. CernVM have been upgrade from v1.18-13 to v2.3-0. |
Send message Joined: 12 Sep 14 Posts: 1067 Credit: 334,882 RAC: 0 |
Have the device errors disappeared with the new image? |
Send message Joined: 16 Aug 15 Posts: 966 Credit: 1,211,816 RAC: 0 |
Yes. Console F1 and F3 are working. Nothing on F2, F4 and F5. "Show graphics" Masterlog Starterlog Startlog stderr.log stdout.log 1st job finished after only 23min. Startlog 04/22/16 17:56:11 Create_Process succeeded, pid=6312 04/22/16 17:56:33 Process exited, pid=6312, status=1 04/22/16 17:56:33 ReliSock::put_file_with_permissions(): Failed to stat file '/var/lib/condor/execute/dir_6308/dat': No such file or directory (errno: 2, si_error: 1) 04/22/16 17:56:33 DoUpload: (Condor error code 13, subcode 2) STARTER at 10.0.2.15 failed to send file(s) to <188.184.187.167:9618>: error reading from /var/lib/condor/execute/dir_6308/dat: (errno 2) No such file or directory; SHADOW failed to receive file(s) from <84.118.78.114:49876> 04/22/16 17:56:33 JICShadow::notifyJobTermination(): Sending mock terminate event. 04/22/16 17:56:34 JIC::transferOutput() failed, waiting for job lease to expire or for a reconnect attempt 04/22/16 17:56:34 Returning from CStarter::JobReaper() 04/22/16 17:56:34 Got SIGQUIT. Performing fast shutdown. 04/22/16 17:56:34 ShutdownFast all jobs. 04/22/16 17:56:34 condor_write(): Socket closed when trying to write 203 bytes to <188.184.187.167:9618>, fd is 11 04/22/16 17:56:34 Buf::write(): condor_write() failed 04/22/16 17:56:34 Failed to send job exit status to shadow 04/22/16 17:56:34 JobExit() failed, waiting for job lease to expire or for a reconnect attempt I have this kind of message on all jobs finished. |
©2024 CERN