There was an interrupt in storage connection to /home filesystem. This rendered /home to be read-only, we are working to fix this problem ASAP.
Update 16:08 We are running fsck on /home, access is still closed
Update 19:50 Fsck has finished booting machine
Update 20:18 Machine is back online
Downtime
Hexagon: rebooted because of important security update
We had to reboot Hexagon because of important security update. Machine will be up in an hour. Our apologies for inconvenience.
Fimm and Grunch down time 5th April
We will have maintenance in machine room on 5th of
April. Electrician will work on power line in machine room which
requirers electricity to be switched off completely.
Therefor fimm.bccs.uib.no cluster and grunch server will be
shutdown for 3 hours. We have reserved cluster for maintenance which
means jobs submitted to cluster which can not be finished by that time
will not run, and jobs which is already running but will not be able to
finish by that time will be killed.
Maintenance will start from 09:00 AM in the morning. We would advice
you to save all your work on fimm.bccs.uib.no and
grunch.bccs.uib.no by that time.
We are sorry for inconvenience, and appreciate your understanding.
April. Electrician will work on power line in machine room which
requirers electricity to be switched off completely.
Therefor fimm.bccs.uib.no cluster and grunch server will be
shutdown for 3 hours. We have reserved cluster for maintenance which
means jobs submitted to cluster which can not be finished by that time
will not run, and jobs which is already running but will not be able to
finish by that time will be killed.
Maintenance will start from 09:00 AM in the morning. We would advice
you to save all your work on fimm.bccs.uib.no and
grunch.bccs.uib.no by that time.
We are sorry for inconvenience, and appreciate your understanding.
grunch maintenance is over
Finally grunch maintenance is over.
* Firmware is updated to latest.
* OS is updated to CentOS 6.4.
* grunch is added to new fimm cluster.
Now grunch user can use software which is installed on fimm with "module" command.
Please let me know if you have problem to login or if you need more software to be installed.
* Firmware is updated to latest.
* OS is updated to CentOS 6.4.
* grunch is added to new fimm cluster.
Now grunch user can use software which is installed on fimm with "module" command.
Please let me know if you have problem to login or if you need more software to be installed.
Hexagon: /work-common crash of one of osses
One of OSSes has crashed this night leaving /work-common unavailable.
We are working on to fix it ASAP.
Update 10:25 we've disabled ost25, access to /work-common/shared/bjerknes is not possible, we will try to resolve access ASAP.
Update 12:53 the failed ost25 has problems with the RAID controller. We are expecting Dell technician to replace this controller tomorrow.
Update 28Jan 12:02 the RAID controller was replaced, ost25 is back in the system and access to /work-common/shared/bjerknes should be restored
We are working on to fix it ASAP.
Update 10:25 we've disabled ost25, access to /work-common/shared/bjerknes is not possible, we will try to resolve access ASAP.
Update 12:53 the failed ost25 has problems with the RAID controller. We are expecting Dell technician to replace this controller tomorrow.
Update 28Jan 12:02 the RAID controller was replaced, ost25 is back in the system and access to /work-common/shared/bjerknes should be restored
Hexagon: unexpected interrupt
We had issue with our automatic shutdown system which stopped hexagon. We are working to bring the machine up ASAP.
Update: 17:45 Machine is up. Our apologies for inconvenience.
Update: 17:45 Machine is up. Our apologies for inconvenience.
Hexagon: cooling issues
Hexagon went down because of problems with cooling.
Update: 12:45 System is up.
Update: 12:45 System is up.
Hexagon: rebooted because of important security update
We had to apply important security update on Hexagon.
It was rebooted at 15:47 and was up at 16:05. All running jobs were terminated. Our apologies for inconvenience.
It was rebooted at 15:47 and was up at 16:05. All running jobs were terminated. Our apologies for inconvenience.
Hexagon: power failure – thunderstorm
Machine went down at night around 23:00 because of severe thunderstorms.
We are working to bring it up ASAP.
Update: 09:30 Machine is up.
We are working to bring it up ASAP.
Update: 09:30 Machine is up.
Fimm: maintenance
Dear fimm cluster user :
We will have scheduled down time for cluster fimm.bccs.uib.no. on 25th
Of Oct at 08:00 am. cluster is reserved for this downtime.
Downtime will last until 08:00 29/10/2013
At the end of maintenance old fimm cluster will demolished and new
fimm cluster will be in operation.
On new fimm cluster /fimm/home and /fimm/work file system will be
lustre file system.
We will have internal and external 10GB network connection.
We will only transfer your home file system but *NOT* work file system.old work file system will be nfs mounted on new cluster after
maintenance. you have to copy only necessary files from old work to your new work file system.
Software installation on new cluster is ongoing process, we have
installed most basic software already , we will install rest of the
software and create proper module as requested.
Let us know if you have any further question.
Update: 10:00
Maintenance started , and we have started last rsync process to get your home folder synchronized.
Update: 28/10/2013
We will not be able to open up fimm by 08:00/29/10/2013, we have to extend down time until 16:00/29/10/2013.
sorry for inconvenience.
Update: 16:24_29/10/2013
We have extended maintenance until 12:00_ 30/10/2013.
sorry for inconvenience.
Update: 30/10/2013
Fimm cluster is open for users now. we have limited amount of software installed,we will continue installation of the software.
some of the compute nodes are still not install , this is also ongoing process.
Old work directory is only mounted on login node under /old_work.
Pleas let us know if you have any question. we will keep progress updated.
We will have scheduled down time for cluster fimm.bccs.uib.no. on 25th
Of Oct at 08:00 am. cluster is reserved for this downtime.
Downtime will last until 08:00 29/10/2013
At the end of maintenance old fimm cluster will demolished and new
fimm cluster will be in operation.
On new fimm cluster /fimm/home and /fimm/work file system will be
lustre file system.
We will have internal and external 10GB network connection.
We will only transfer your home file system but *NOT* work file system.old work file system will be nfs mounted on new cluster after
maintenance. you have to copy only necessary files from old work to your new work file system.
Software installation on new cluster is ongoing process, we have
installed most basic software already , we will install rest of the
software and create proper module as requested.
Let us know if you have any further question.
Update: 10:00
Maintenance started , and we have started last rsync process to get your home folder synchronized.
Update: 28/10/2013
We will not be able to open up fimm by 08:00/29/10/2013, we have to extend down time until 16:00/29/10/2013.
sorry for inconvenience.
Update: 16:24_29/10/2013
We have extended maintenance until 12:00_ 30/10/2013.
sorry for inconvenience.
Update: 30/10/2013
Fimm cluster is open for users now. we have limited amount of software installed,we will continue installation of the software.
some of the compute nodes are still not install , this is also ongoing process.
Old work directory is only mounted on login node under /old_work.
Pleas let us know if you have any question. we will keep progress updated.