Author Archives: Lóránd Szentannai

We will have planned maintenance on Hexagon, starting on May 22nd at 09:00. The maintenance is expected to last one day.
During the maintenance we will carry out software and firmware upgrades as well as service the hardware.

The job submission system has a reservation in place, so jobs that cannot finish before the maintenance starts will not be started.
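
As a rough illustration of how the reservation affects a job, the check below compares a job's requested walltime against the maintenance start. It is only a sketch: the function name `can_start` is hypothetical, the year is taken from the update timestamps in this post, and the real scheduler may apply additional rules.

```python
from datetime import datetime, timedelta

# Start of the maintenance reservation (May 22nd, 09:00); year assumed from the updates below.
MAINTENANCE_START = datetime(2017, 5, 22, 9, 0)


def can_start(now: datetime, walltime: timedelta) -> bool:
    """Return True if a job starting at `now` would end no later than the maintenance start.

    Hypothetical helper: it only mirrors the rule stated in the announcement,
    not the scheduler's actual implementation.
    """
    return now + walltime <= MAINTENANCE_START


if __name__ == "__main__":
    submit_time = datetime(2017, 5, 21, 9, 0)
    # A 12-hour job fits before the maintenance window, a 48-hour job does not.
    print(can_start(submit_time, timedelta(hours=12)))
    print(can_start(submit_time, timedelta(hours=48)))
```

In practice this means that requesting a shorter walltime can allow a job to run before the maintenance, while a job requesting more time than remains will wait in the queue until the maintenance is over.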

/work-common will be unavailable during the maintenance period and will be unmounted from Grunch and Fimm.

UPDATES:
  • 2017-05-22 09:00: Maintenance has started.
  • 2017-05-22 14:16: /work-common is available again and remounted on Grunch.
  • 2017-05-22 15:59: Maintenance has finished and access to Hexagon is re-opened.

login5 ran out of memory yesterday (27.02.2017) around 18:16 and took about 15 minutes to recover.

During this time the compute nodes were unable to contact the application scheduler running on login5 and some jobs might have crashed.
A typical error message for this case is: "aprun: Apid nnnnnnn: close of the compute node connection after app startup barrier".
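
To check whether any of your jobs hit this error, you could search your job output files for the message. The sketch below is one way to do that. It assumes that the `nnnnnnn` placeholder stands for a numeric application ID and that job output is stored as plain-text files in one directory; the function name `find_affected_logs` and the file pattern are hypothetical.

```python
import re
from pathlib import Path

# The error message quoted above; "nnnnnnn" is assumed to be a numeric application ID.
APRUN_BARRIER_ERROR = re.compile(
    r"aprun: Apid \d+: close of the compute node connection after app startup barrier"
)


def find_affected_logs(log_dir, pattern="*"):
    """Return the files under `log_dir` matching `pattern` that contain the error message."""
    affected = []
    for path in sorted(Path(log_dir).glob(pattern)):
        if not path.is_file():
            continue
        # Replace undecodable bytes so that stray binary output does not abort the scan.
        text = path.read_text(errors="replace")
        if APRUN_BARRIER_ERROR.search(text):
            affected.append(path)
    return affected


if __name__ == "__main__":
    # Scan the current directory; adjust the directory and pattern to your own job output files.
    for log in find_affected_logs("."):
        print(log)
```

Jobs whose output contains the message most likely need to be resubmitted.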

We apologise for any inconvenience caused.

Four cabinets went down due to power issues caused by the storm. Storage controllers for /work-common are also affected.
Hexagon was started without the /work-common filesystem.

We are working to fix the issues with the filesystem controllers and to get the filesystem back into production as soon as possible.

Update 2016-12-27 14:50: The problems with the /work-common storage controllers have been mitigated and the filesystem is back online. Hexagon had to be rebooted today at 14:15. All systems are up and functional again.

UNINETT Sigma2 is organizing a Software Developer Course for the new HPC-system.

We are pleased to inform you that there will be a second HPC course this autumn in Trondheim, on 30 November - 1 December.
Registration is open at 
https://response.questback.com/uninett/hpctrainingseminar

Please refer to the announcement on www.sigma2.no for further details.