fimm was down Aug. 3. from 08:00 to 12:45 for scheduled maintenance.
Kernel and gpfs update, switch firmware update and satablade (disk) firmware update completed.
Scheduled maintenance
Scheduled downtime on fimm
Fimm will be down Wednesday Aug. 3. 08:00-14:00 for kernel upgrades, firmware update on switches, gpfs update and some minor fixes.
Maintenance summary
Regatta node TO and TRE had downtime from 08:00 to 12:45
for update of firmware.
Regatta node EN had downtime from 08:00 to 16:00
for update of firmware and change of 32GB memory module.
This node had problem booting from root-disks after hardware changes.
Moving the disks to TO and back again made EN bootable (unclear why).
Linux cluster FIRE had downtime from 08:00 to 16:00 due to dependancy on disks on EN.
for update of firmware.
Regatta node EN had downtime from 08:00 to 16:00
for update of firmware and change of 32GB memory module.
This node had problem booting from root-disks after hardware changes.
Moving the disks to TO and back again made EN bootable (unclear why).
Linux cluster FIRE had downtime from 08:00 to 16:00 due to dependancy on disks on EN.
Scheduled downtime on TRE+FIRE
The regatta cluster TRE will be down Tuesday June 14 08:00-14:00 for firmware upgrades, and replacement of a failed memory module on one of the nodes. Running jobs will be killed, and will have to be resubmitted after the maintenance stop.
Also the linux cluster FIRE will be down this periode, because it's depending on the regatta as file server.
Also the linux cluster FIRE will be down this periode, because it's depending on the regatta as file server.
Scheduled downtime on fimm
Fimm will be down monday May 9th. 08:00-12:00 for firmware upgrades on the SATABlade disk solution, and possibly other minor changes. This is to fix the bug that triggered the disk crashes on March 30th.
http://www.parallaw.uib.no/syslog/56
http://www.parallaw.uib.no/syslog/56
Linux cluster “FIRE” is being reinstalled
The linux cluster fire.bccs.uib.no will be down tuesday/wednesday March 29./30.. All nodes will be re-installed with the Rocks linux cluster distribution. This is a major system upgrade, so most software will have to be rebuilt to run on fire after the upgrade.
Maintenance summary
Fast Write Cache batteries on ssa0, ssa1 and ssa2 on node TO were replaced while a plumber worked on the cooling water. No problems.
Downtime on the regatta and linux cluster:
20040701 08:00-10:10 = 2 hours, 10 minutes
Downtime on the regatta and linux cluster:
20040701 08:00-10:10 = 2 hours, 10 minutes
Scheduled maintenance thursday July 1., 08:00-12:00
The regatta and linux cluster will be down for maintenance thursday july 1. 08:00-12:00. Running jobs thursday morning will be killed.
The maintenance that will be done is to replace disk cache memory batteries, and do some work on the cooling system in the machine room.
The maintenance that will be done is to replace disk cache memory batteries, and do some work on the cooling system in the machine room.
/migrate available again
The /migrate filesystem is now back online.
Downtime: 20040122 08:28- 12:30 = 4 hours 2 minutes.
Downtime: 20040122 08:28- 12:30 = 4 hours 2 minutes.
Maintenance stop for /migrate
The /migrate filesystem will be unavailable thursday January
22. because of maintenance on the tape storage system.
We will unmount /migrate around 08:00 thursday morning, and
bring it back online as soon as we're finished, but it might
take all day. Any processes accessing /migrate this morning
will be killed.
22. because of maintenance on the tape storage system.
We will unmount /migrate around 08:00 thursday morning, and
bring it back online as soon as we're finished, but it might
take all day. Any processes accessing /migrate this morning
will be killed.