Hello Euler Users and PIs,
Euler is due for its yearly OS upgrade, and Euler's login/compute nodes will need to be offline for part of the process. That outage will begin on Monday, June 3rd, at 10am US Central Time. Login nodes will be returned to service as quickly as possible and
compute nodes will follow over the subsequent days. The cluster is expected to be fully returned to service by the end of the day on Friday, June 7th.
Thanks in large part to the storage changes implemented over the last year, we no longer need to have a full outage in order to upgrade those machines, so all of your files will still be accessible (via Globus) throughout the maintenance period.
The following changes will be implemented during that time:
OS upgrade to Fedora Server 40
-
A few nodes tagged in Slurm with the
euler-next feature will be available by Wednesday, May 15th for testing.
-
If you'd like one of your nodes to be included in that group so you can test your projects in advance, please send your request in an Euler support ticket.
Multi-factor Authentication (MFA)
-
MFA will be mandatory for all eligible users!
-
If you are unsure whether you have MFA set up for your CAE account, you may contact the CAE helpdesk for assistance.
Filesystem Quotas
-
Users who exceed the allowable data storage limit on Euler (default: 1.5 TB with a burst capacity of 2.5 TB) will become
unable to write additional data.
-
For this period, the limit will be set at 4× burst capacity, but it will be lowered slightly each month until the fall semester to better match the defaults.
NOTE: If you are currently storing too much data and do not have permission to do so, please open a support ticket to discuss your options and alternatives. Both CAE and the Research Cyberinfrastructure team at DoIT provide certain amounts of data storage
to researchers for free.
Notable Environment Module Updates
NOTE: The conda/miniforge module will continue to be available as an unencumbered alternative for research use.
In addition to the changes mentioned above, the following ongoing changes will continue:
Filesystem Encryption
-
Euler is in the process of encrypting its entire storage backend so as to provide better protection of research data at rest.
-
Currently, all metadata and about ¼ of file storage is encrypted; the operation is expected to reach 100% coverage around the beginning of the fall semester.
Node Retirements
-
Most nodes with Intel Haswell-era CPUs (haswell feature in Slurm) have passed their useful lifetime and will be removed this year.
-
Most nodes with NVIDIA Pascal GPUs (pascal feature in Slurm) have passed their useful lifetime and will be removed this year.
NOTE: If you own a system on Euler which meets these criteria, you will be contacted about retiring these systems this summer. If you expect to hear from us but don't, there may be special circumstances for that system. Please reach out for confirmation in
the Fall.
If you have an upcoming deadline which would be affected by this outage, please let us know ASAP so I can work with you to mitigate the impact of the outages as much as possible.
As always, please contact
euler-...@engr.wisc.edu to create a new support ticket if you have questions or concerns.
Regards,
Colin Vanden Heuvel