One of the file servers for /work on Hexagon crashed. We are working on the issue.
Author Archives: saerda
Hexagon downtime for /work filesystem maintenance
Hexagon will have planned maintenance on 15th August from 08:00.
Currently the /work filesystem is running at reduced performance due to a broken storage controller.
During the maintenance, we will replace the broken storage controller in the storage system where the /work filesystem resides. Due to the high risk of data loss, we urge all /work filesystem users to back up their important, non-reproducible data.
Please keep in mind that /work is a scratch filesystem and is not backed up.
After the maintenance we expect the /work filesystem to be back at full performance.
We appreciate your understanding.
Update 15.08.2018 11:00
Hexagon maintenance is over. We have successfully replaced the broken controller. The work file-system is back to its expected performance.
Work file-system crash on Hexagon
The work file-system crashed on Sunday afternoon. We managed to bring it online again late Sunday. Jobs that were running on the work file-system crashed and have to be resubmitted.
Hexagon stop
Due to a problem with the shared file-system we have to stop Hexagon.
All running jobs will be killed.
15:50: Hexagon is up.
Hexagon: shared filesystem crashed
The shared filesystem on Hexagon crashed around 14:00 today.
We are working on the issue.
Hexagon Crashed
Hexagon crashed today around 09:30. We are working on resolving the problem and bringing Hexagon back up.
12:45 Update: Hexagon is up, but we have a hardware problem with the file server that provides the work filesystem.
Work filesystem crashed again on Hexagon and Grunch
The work filesystem has crashed again on Hexagon. We are having a severe problem with the work filesystem on Hexagon and Grunch. We are working to find the root cause of the problem. In the meantime, the work filesystem will be unstable on Hexagon. We will keep all users updated on our progress.
We are sorry for the inconvenience and appreciate your understanding.
Hexagon work crashed
The Hexagon work filesystem is down due to a crashed Lustre MDS server. We are working on the issue.
Update 15:00: The Hexagon work filesystem is back online. Jobs that were running during the crash probably died. We are looking into the root cause of the problem.
Hexagon stopped due to power loss
Hexagon stopped yesterday due to a loss of electric power. This morning Hexagon is online again.
Fimm cluster /home and /work filesystem crash
This morning the /fimm filesystem crashed on fimm.hpc.uib.no. This made the /fimm and /work filesystems inaccessible to users and caused the fimm login node to hang.
We were able to bring it back online quickly, but we are still investigating the cause of the problem.
Jobs that were running during the crash were all killed.
We are sorry for the inconvenience.
