One of the switches on fimm has failed. Because the file servers were connected to this switch, the filesystems went down.
We have now re-routed the file servers and the frontend to one of the other switches and changed the switch interconnect. Most of fimm is now back up, including the filesystems, but jobs have failed because of the missing filesystems.
Author Archives: lsz075
Fimm file system hang
The file system on fimm hung around 14:00 today.
Update 15:00: File systems should now run as normal.
Scheduled downtime on fimm/filesystems
Due to a necessary re-configuration of the power in the machine room, there will be a downtime of fimm cluster plus filesystems (/migrate, /bcmhsm, /bjerknes*, as well as fimm filesystems /work, /work2, /home).
All running jobs will of course be killed when the nodes are shut down.
The planned downtime is on Monday November 12th from 09:00 to 14:00. It *may* be somewhat shorter if all goes well.
Questions can be sent to support-uib@notur.no
We are sorry for the inconvenience this may cause you.
Update, Mon. 12th 14:50: fimm is now back up again. In addition to the power downtime we did a necessary kernel and gpfs upgrade.
Login node on fimm will be upgraded today
The login node on fimm will be upgraded today, during a short period between 4 pm and 6 pm.
Users will therefore not be able to log in during the upgrade.
Running jobs on fimm will not be affected by this upgrade.
Update 16:53: The updated fimm login is now up and running. (Downtime 10 min).
Problems should be reported to hpc-support@hpc.uib.no
IBM regatta p690 “tre” decommissioned
As previously announced, the IBM p690 Regatta system "tre" is now decommissioned.
Questions regarding access to files etc. should be sent promptly to support-uib@notur.no
Power failure
A power failure on fimm and tre occurred around 19:00.
Update 23:44: Most machines are up; however, some filesystems are still down.
fimm cluster is upgraded
The scheduled maintenance of the fimm cluster is now (mostly) complete. Please note the following changes:
- Cluster is now running Rocks 4.3 which is based on CentOS 4.5
- Login to fimm.bccs.uib.no now ends up on one of the compute nodes acting as a login node. Currently this is called compute-1-14.
- Compilers are upgraded to Intel 10.0 and PGI 7.0
- Totalview is upgraded to 8.2
- MPI libraries are upgraded and located in /local
- Several libraries and programs in /local are upgraded
All jobs that were waiting on the old queue need to be submitted again into the new queue after the upgrade.
Send questions to support-uib@notur.no
Password file on fimm nodes corrupted
The password file on the fimm nodes has been corrupted so no new jobs will run. We are currently fixing the problem.
Update 14:51: Most nodes have now been reinstalled. Users should be able to submit jobs again.
Scheduled maintenance / upgrade of fimm
There will be an upgrade of fimm from Rocks 4.1 to Rocks 4.3 on Wed. Sep. 12th. Expected downtime is from 08:00 to 14:00. The new system will have an updated OS, compilers and software, and will be integrated with the grid-related activities.
This notice may be updated with more information at a later time.
Important notice: tre.bccs.uib.no will be taken out of service
The IBM p690 Regatta tre.bccs.uib.no / tre.ii.uib.no will be shut down and decommissioned on the morning of Monday October 1st 2007 at 08:00.
All jobs must be finished, and all data and personal files must be copied off the machine before this time. The only exception is data on external disks such as /migrate, /net/bcmhsm and /net/bjerknes*.
Any questions regarding this can be sent to support-uib@notur.no.