Status
Future events
2024-02-19 Lucia: Planned maintenance (7:00-19:00)
2024-01-15 Manneback: Planned maintenance week
Current issues
None. If you notice something wrong, please notify us.
Past events
2023-11-20 14:00 Lucia: The virtual machine hosts are up and running, all virtual machines and their services are available.
2023-11-20 08:36 Lucia: multiple virtual machine hosts of Lucia crashed impacting multiple services. As a consequence, it might be difficult to connect to Lucia and from Lucia to remote hosts, submit jobs, etc.
2023-10-12 08:36 NIC5: The CECI common file system gateway of NIC5 has been rebooted. Access to all /CECI
partitions has been restored.
2023-10-12 00:49 NIC5: The CECI common file system gateway of NIC5 failed. As a consequence, access to all /CECI
partitions was lost. Jobs using one of these partitions may have failed.
2023-10-02 10:00-12:00 NIC5 and CECI websites: inaccessible due to a networking issue
2023-09-24 11:00 Hercules: Home filesystem back online.
2023-09-23 13:56 Hercules: Home filesystem unavailable preventing login.
2023-09-20 14:45 Lemaitre3: The BeeGFS global scratch /scratch
is back online after replacement of the failing hardware
2023-09-20 13:20 Lemaitre3: The BeeGFS global scratch /scratch
is currently unavailable.
2023-09-17 17:45 Lemaitre3 and gwceci.cism.ucl.ac.be: Network connectivity has been restored.
2023-09-16 16:45 Lemaitre3 and gwceci.cism.ucl.ac.be: UCLouvain HPC infra inaccessible due to a networking issue.
2023-09-05 16:04 Hercules2: workaround implemented to mitigate the slowdowns
2023-09-05 16:04 Hercules2: Cluster stability issues detected due to defective network device
2023-08-10 11:08 NIC5: NIC5 is up and running again
2023-08-10 09:00 NIC5: Login node memory replacement and reboot
2023-08-06 16:04 NIC5: Hardware memory problem on login node detected
Legend
Everything is running as expected.
The system status is degraded. Some functionalities might be missing, or less performant.
The system is unavailable ; we are working to make it functional again.
The system is undergoing planned maintenance operations.
Beginning of the event/issue
Resolution of the event/issue
Information and status update
Future announcements and "save the date" info