The login node on fimm will for a short period between 4 pm and 6 pm be upgraded today.
Users will therefore not be able to login.
Running jobs on fimm will not be affected by this upgrade.
Update 16:53: The updated fimm login is now up and running. (Downtime 10 min).
Problems should be reported to hpc-support@hpc.uib.no
IBM regatta p690 “tre” decommissioned
As previously announced, the IBM p690 Regatta system "tre" is now decommissioned.
Questions regarding access to files etc. must quickly be sent to
support-uib@notur.no
Questions regarding access to files etc. must quickly be sent to
support-uib@notur.no
Power failure
Power failure on fimm and tre occured around 19:00.
Update 23:44: Most machines are up, however some filesystems are still down.
Update 23:44: Most machines are up, however some filesystems are still down.
fimm cluster is upgraded
The scheduled maintenance of the fimm cluster is now (mostly) complete. Please note the following changes:
- Cluster is now running Rocks 4.3 which is based on CentOS 4.5
- Login to fimm.bccs.uib.no now ends up on one of the compute nodes acting as a login node. Currently this is called compute-1-14.
- Compilers are upgraded to Intel 10.0 and PGI 7.0
- Totalview is upgraded to 8.2
- MPI libraries are upgraded and located in /local
- Several libraries and programs in /local is upgraded
All jobs that were waiting on the old queue need to be submitted again into the new queue after the upgrade.
Send questions to support-uib@notur.no
- Cluster is now running Rocks 4.3 which is based on CentOS 4.5
- Login to fimm.bccs.uib.no now ends up on one of the compute nodes acting as a login node. Currently this is called compute-1-14.
- Compilers are upgraded to Intel 10.0 and PGI 7.0
- Totalview is upgraded to 8.2
- MPI libraries are upgraded and located in /local
- Several libraries and programs in /local is upgraded
All jobs that were waiting on the old queue need to be submitted again into the new queue after the upgrade.
Send questions to support-uib@notur.no
Password file on fimm nodes corrupted
The password file on the fimm nodes has been corrupted so no new jobs will run. We are currently fixing the problem.
Update 14:51: Most nodes have now been reinstalled. Users should be able to submit jobs again.
Update 14:51: Most nodes have now been reinstalled. Users should be able to submit jobs again.
Scheduled maintenance / upgrade of fimm
There will be a upgrade of fimm from Rocks 4.1 to Rocks 4.3 on Wed. Sep. 12th. Expected downtime is from 08:00 to 14:00. The new system will have updated OS, compilers and software and will be integrated with the grid-related activities.
This notice may be updated with more information at a later time.
This notice may be updated with more information at a later time.
Important notice: tre.bccs.uib.no will be taken out of service
The IBM p690 Regatta tre.bccs.uib.no / tre.ii.uib.no will be shut down and decommissioned in the morning of Monday October 1st 2007 at 08:00.
All jobs must be finished, and all data and personal files must be copied out of the machine before this time. The only exception would of course be data on external disk like /migrate, /net/bcmhsm and /net/bjerknes*.
Any questions regarding this can be sent to support-uib@notur.no.
All jobs must be finished, and all data and personal files must be copied out of the machine before this time. The only exception would of course be data on external disk like /migrate, /net/bcmhsm and /net/bjerknes*.
Any questions regarding this can be sent to support-uib@notur.no.
GPFS hang on node “en”
Node en had a GPFS hang. GPFS restartet. Downtime approx. 30 min.
Hang of node “en”
Node "en" of the regattas has a hang. We are investigating.
Update 09:45: Node rebooted. No jobs lost.
Update 09:45: Node rebooted. No jobs lost.
/home/fimm crashed
The filesystem /home/fimm crashed for unknown reason on fimm. We are currently investigating.
Users will not be able to log in until this is fixed.
Update 16.40: The filesystem is now up and working again.
Users will not be able to log in until this is fixed.
Update 16.40: The filesystem is now up and working again.