cyclone.hpc.uib.no lost its external interface late yesterday, leaving users unable to access cyclone.
The problem is resolved and the cause is under investigation.
We apologize for the inconvenience.
During the last week, we have experienced a problem with leo.hpc.uib.no, our NFS server: it has been crashing due to a Lustre bug triggered by an unknown NFS-related cause.
After some debugging, we have made some changes to our Lustre configuration, which look promising so far.
leo.hpc.uib.no has been running without a problem for the last 2 days.
We will keep monitoring the system and will post here if anything else happens.
Please don't hesitate to contact us if you encounter any problems with the NFS and Samba exports from leo.hpc.uib.no.
We apologize for the inconvenience.
The machine room will undergo power maintenance on February 3rd.
The following servers/services will be down during this time:
Hexagon.hpc.uib.no
Grunch.hpc.uib.no
Cyclone.hpc.uib.no
Leo.hpc.uib.no
Everything under /shared/ and /Data will not be accessible. NFS and SMB exports will be offline.
The maintenance will start at 08:00 and is expected to finish at 14:00. We kindly ask you to save all your work on the servers mentioned above and log out safely before they go down.
We will keep you updated on this page.
Dear Hexagon users,
The work filesystem crashed again last night due to hardware errors on some compute nodes and service nodes. To clear the errors, we have to shut down Hexagon and restart it.
Update 12:16: Hexagon has been restarted and is back online.
Dear shared filesystem users,
Today around 16:00, the /shared filesystem automatically mounted itself read-only due to a bug in the version of the Lustre filesystem we are running.
This made the whole /shared filesystem read-only.
We had to unmount the /shared filesystem and eliminate the error to avoid the bug.
We apologize for any inconvenience and appreciate your understanding.
One of the file servers for /work on Hexagon crashed. We are working on the issue.
The work filesystem crashed Sunday afternoon; we managed to bring it back online late Sunday. Jobs that were running on the work filesystem have crashed and must be resubmitted.
Due to a problem on the shared filesystem, we have to stop Hexagon.
All running jobs will be killed.
15:50: Hexagon is up.
The shared filesystem on Hexagon crashed around 14:00 today.
We are working on the issue.