Update 10:40: Access to cyclone is reopened now. Delay was caused by missing kernel modules and old Lustre packages.
Cyclone will be rebooted at 09:00 sharp to apply new filesystem settings.
Update 2018-12-03 12:36:
Dear Hexagon User,
We must reboot Hexagon due to repeated errors on the interconnect.
We will update this case when Hexagon is up and functional again.
Update 12_11 21:30:
The migration is over: we managed to bring up the Lustre filesystem with the new MDS server. The /shared and /work filesystems are mounted on cyclone.hpc.uib.no and grunch.hpc.uib.no. Hexagon is up and running again. Samba and NFS exports are also running on Leo.hpc.uib.no.
Update 12_11 15:00 :
The migration is still ongoing; we will keep you posted.
Update 02_11 09:30 :
Due to the delayed delivery of physical parts, we have to postpone our downtime to the 12th of November. The corresponding node reservation on Hexagon is also postponed to the 12th of November.
Thank you for your understanding!
Dear HPC User,
The metadata server for the /shared file system has to be replaced/upgraded and therefore it must be unmounted from all the clients.
This will result in scheduled downtime for the Hexagon, Grunch and Cyclone machines. We will start at 08:00 on the 5th of November and expect to be ready by the end of the working day.
Thank you for your understanding!
Update 14-06-2018 11:15: Access is now limited to UiB and IMR campus only.
Starting on the 14th of June, access to Hexagon will be limited to the campus network.
Please find more details at docs.hpc.uib.no/wiki/Getting_started.
Update 23.05.2018 15:19: The file system issues were resolved and the file systems were mounted back on both Hexagon and Grunch. Access is reopened now.
There is a scheduled downtime for Hexagon and the /shared file system on Wednesday, the 23rd of May. The downtime will start at 09:00 and we expect to have the systems back by 16:00 the same day.
Our apologies for any inconvenience this downtime may cause you.
It looks like that after the reboot of the machine on Friday, not all data nodes hosting /work picked up the proper settings, and the /work filesystem is temporarily slow. We are working to resolve this issue as soon as possible.
Update 10:00: This issue is now resolved.
There are issues with the cooling in the machine room. We have contacted the building maintenance team and hope to bring the machine up within a few hours.
Update 14:00: The cooling problems are over. The machine is back online.
Interactive job submission is working again. (qsub -I)
Since last night there has been a problem with one of our storage systems serving /work; we are looking into the problem.
Update 02.12.2014 13:00: Issue has been remediated, /work should be OK now.
module swap cray-mpich cray-mpich2
module swap cray-libsci cray-libsci/12.2.0
module load craype-barcelona

The xtpe-interlagos module is replaced by craype-interlagos.
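Existing job scripts that still load the retired module can be updated with a simple search-and-replace. A minimal sketch (the job_modules.sh file name and its contents are illustrative assumptions, not an actual script from the system):

```shell
# Create a sample job-script fragment that still uses the old module name
# (job_modules.sh is a hypothetical file for illustration)
cat > job_modules.sh <<'EOF'
module swap cray-mpich cray-mpich2
module load xtpe-interlagos
EOF

# Replace the retired xtpe-interlagos module with its craype-interlagos successor
sed -i 's/xtpe-interlagos/craype-interlagos/' job_modules.sh
cat job_modules.sh
```

After the substitution the script loads craype-interlagos instead of the removed xtpe-interlagos module.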