Scheduled maintenance

There will be a scheduled downtime on fimm and associated services starting on Thursday 13th Dec. The length of the downtime is uncertain at this time, but we expect to be up again on Tuesday 18th Dec.

The downtime is due to a major cooling system upgrade in the machineroom. We expect to need a second, smaller, downtime to finalize this upgrade in January.

Further information will be posted here when we know more.

Update, 12th 14:45: Because of the cooling failure (see other note above) the work on the cooling upgrade will start earlier.

Update, 13th 09:10: The frontend will now be unavailable until upgrade is completed.

Update, 17th 19:00: The expected startup of the cooling system on Tuesday has been delayed. We estimate the startup of the power and cooling systems around 12:00 Wednesday.

Update, 19th 12:00: The expected startup of the cooling system has been delayed for a couple of hours.

Update, 19th 16:30: Finally fimm is up and running again.

Due to a necessary re-configuration of the power in the machine room, there will be a downtime of fimm cluster plus filesystems (/migrate, /bcmhsm, /bjerknes*, as well as fimm filesystems /work, /work2, /home).
All running jobs will of course be killed when the nodes are shut down.

The planned downtime is on Monday November 12th from 09:00 to 14:00. It *may* be somewhat shorter if all goes well.

Questions can be sent to support-uib@notur.no

We are sorry for the inconvenience this may cause you.

Update, Mon. 12th 14:50: fimm is now back up again. In addition to the power downtime we did a necessary kernel and gpfs upgrade.

The scheduled maintenance of the fimm cluster is now (mostly) complete. Please note the following changes:

- Cluster is now running Rocks 4.3 which is based on CentOS 4.5
- Login to fimm.bccs.uib.no now ends up on one of the compute nodes acting as a login node. Currently this is called compute-1-14.
- Compilers are upgraded to Intel 10.0 and PGI 7.0
- Totalview is upgraded to 8.2
- MPI libraries are upgraded and located in /local
- Several libraries and programs in /local is upgraded

All jobs that were waiting on the old queue need to be submitted again into the new queue after the upgrade.

Send questions to support-uib@notur.no

We will need to do a scheduled maintenance (firmware upgrade) of the disksystem for /net/bcmhsm (for users from BCCR symlinked from /migrate) and /net/bjerknes1. Note that /net/bcmhsm is mounted as /bcmhsm on fimm.

/net/bcmhsm and /net/bjerknes1 will be unavailable on Monday 15. from 09:00 to 11:00 (if all goes well possibly earlier)

Update (11:00): /net/bcmhsm and /net/bjerknes1 is now up again. The downtime was also used to apply a security update on the backup-server (where /net/bcmhsm is).

The taperobot is getting 2 new tapedrives installed and will be unavailable from 09:45 to approx. 11:00 16. Feb.
Files in /migrate (and /net/bcmhsm) will be unavailable.
This entry will be updated with more information later.

11:20 Update: The upgrade takes somewhat longer than planned.

12:45 Update: The upgrade is complete and filesystem back.