We are going to stop /bcmhsm for a maintenance at 10:00 on Thursday November 3rd. The /bcmhsm will not be available for few hours.
Update: 03.11 10:10 Filesystem is back online
We are going to stop /bcmhsm for a maintenance at 10:00 on Thursday November 3rd. The /bcmhsm will not be available for few hours.
Update: 03.11 10:10 Filesystem is back online
We have updated following main software on fimm:
PGI/11.8
GCC/4.6.1
intel/12.1.6_233
openmpi/1.4.4 compiled with pgi/11.8 gcc/4.6.1
netcdf/4.1.3 compiled with pgi/11.8 gcc/4.6.1
HDF5/1.8.7 compiled with pgi/11.8 gcc/4.6.1
szip/2.1 compiled with pgi/11.8 gcc/4.6.1 intel/12.1.6_233
zlib/2.3.1 compiled with pgi/11.8 gcc/4.6.1 intel/12.1.6_233
We have also implemented PrgEnv-pgi and PrgEnv-gcc on fimm which will work same as hexagon, it is a software environment set which helps you to load right set of the software.
We keep rest of the software updated.
We have updated following main software on fimm:
PGI/11.8
GCC/4.6.1
intel/12.1.6_233
openmpi/1.4.4 compiled with pgi/11.8 gcc/4.6.1
netcdf/4.1.3 compiled with pgi/11.8 gcc/4.6.1
HDF5/1.8.7 compiled with pgi/11.8 gcc/4.6.1
szip/2.1 compiled with pgi/11.8 gcc/4.6.1 intel/12.1.6_233
zlib/2.3.1 compiled with pgi/11.8 gcc/4.6.1 intel/12.1.6_233
We have also implemented PrgEnv-pgi and PrgEnv-gcc on fimm which will work same as hexagon, it is a software environment set which helps you to load right set of the software.
We keep rest of the software updated.
There is an issue with part of the /work filesystem on Hexagon. We are investigating.
Update Tuesday 09:30, Still diagnosing the issue. No known fix-time as of now.
Update Tuesday 10:00, Machine goes down for maintenance.
Update Tuesday 13:30, Part of filesystem has been e2fsck checked.
Update Tuesday 14:00, Machine up again after maintenance.
We are going to change physical location of HSM server and HSM storage. Therefore downtime for /migrate and /bcmhsm will take place at Friday October 7th, from 12:00 till 14:00.
Update: The downtime have to be extended by half an hour.
Due to fimm.bccs.uib.no cluster core switch firmware update we will take down both internal and external core switch for maintenance tomorrow from 13:00~15:00, actual down time can be shorter then this.
All running job will be killed.
We are sorry for inconvenience and short notice.
We will keep you updated.
10:30 Fimm login node is blocked.
16:00 Both internal and external switch is updated to new firmware.
17:10 maintenance is finished. fimm cluster is operational.
The backend machine of Fimm crashed and has ongoing problems.
This means the queueing system and most other services are not avaliable.
13.08.2011, 10:00, service is back.
Because of an urgent security update the login node will be down for 1 hour.
Hexagon has updated software and libraries.
For full release notes from Cray, look at:
http://docs.cray.com/books/S-9401-1107//S-9401-1107.pdf
Note that some of the software listed there is not supported/will not be installed on hexagon.
Updates:
xt-mpt 5.3.0 -> 5.3.0
Petsc 3.1.05 -> 3.1.08
TPSL 1.0.01 -> 1.1.00
libfast 1.0.8 -> 1.0.9
ATP 1.2.0 -> 1.2.1
lgdb 1.3 -> 1.4
(xt-)gcc 4.5.3 -> 4.6.0
PGI 11.5.0 -> 11.6.0
xt-asyncpe 4.9 -> 5.00
Hexagon is down from 17:55 due to power-spike from thunderstorm. We are diagnosing and restarting the machine.
Update 20:25, machine is back up after disabling a failed module.