New versions of Totalview and Chapel have been installed on Hexagon:
- Totalview 2016.06.22
- Chapel 1.14.0
Please note that they are default now. If you want to load previous defaults, please specify versions:
- totalview/8.15.7
- chapel/1.11.0
New versions of Totalview and Chapel have been installed on Hexagon:
Please note that they are default now. If you want to load previous defaults, please specify versions:
Hexagon is down because the high speed network went down. We are working to get the issue fixed and boot the machine.
Update: 2016-09-28 22:45 System is up again.
/work and /work-common filesystems will be unavailable on Grunch on 18th of October starting from 09:00 o'clock. This downtime is part of the scheduled maintenance advertised at
http://syslog.hpc.uib.no/2016/09/21/hexagon-planned-maintenance-18-10-19-10/.
Length of downtime is up to 8 hours for /work-common and up to 2 days for /work.
Please make sure that by this time there are no jobs using /work or /work-common, to avoid data-loss and/or data corruption.
We will keep you updated here.
Update: 2016-10-19 11:07 /work-common is back online and re-mounted on grunch.
During the maintenance we will carry out filesystem upgrade, firmware upgrades as well service the hardware.
The job submission system has reservation in place, thus jobs which are not able to finish before maintenance start, will not be started.
Update: 2016-10-18 09:35 Maintenance has started, slightly delayed due to traffic jam.
Update: 2016-10-19 12:00 Maintenance has been finished, Hexagon is up and accessible again.
We have problem with queuing system on hexagon and we are working on it. We will get back with more details later.
Update 13:15: Queue system is recovered. A handful of services on which the queue system and supporting tools are relying were in a "brain-split" or hanging state.
Dear Fimm cluster and Grunch server users:
Fimm cluster and grunch server will have downtime 25th August from 09:00 to 16:00.
During this downtime we will perform hardware firmware update, internal and external switch firmware update.
For Grunch server except hardware firmware update we will also enable quota on grunchfs.
We will also update slurm version to 16.05.4 on fimm.hpc.uib.no.
Both fimm.hpc.uib.no and grunch.bccs.uib.no will not be accessible during this downtime.
We will keep all process updated on this page.
Please contact hpc-support@hpc.uib.no if you have any questions.
26/08/2016 09:40 Update: Firmware update is done on internal and external switches. Slurm is updated on fimm.hpc.uib.no and Grunch firmware is also updated. Currently we are working on quota on grunchfs, which needs to scan whole file system, this will take some time before we can make gruncfs available for users on grunch server.
26/08/2016 10:40 Update: Grunch quota is enabled, and grunch server is online again.
PGI 16.05, CCE 8.5.1 and CrayPat 6.4.1 are installed on Hexagon as non default versions.
As the newer CCE and CrayPat installations are rather experimental, to access version 8.5.1 of the Cray compiler and 6.4.1 version of the CrayPat one needs to run:
module use /opt/cray/pe/modulefiles
NB. No extra steps are required to access PGI 16.05
We have added additional 74TB storage capacity to /work-common.
We lost one of our core internal switch this morning, unfortunately grunch is on this switch and lost connection to fimm and hexagon. Switch will be replaced early tomorrow hopefully. Meanwhile we will try to find different solution to resume connectivity.
The deadline for applying for NOTUR applications for period 2016.2 is 26th of August 2016. Please see https://www.sigma2.no/content/call-e-infrastructure-resources-20162 for more information.