Dear CloudLab Team,
I need urgent help with a likely experiment-network/fabric issue in my Utah CloudLab experiment mrashid2-296178 (profile hpc_lustre_2_15_5, project DIRR).
I am running a 27-node Lustre deployment. The shared filesystem is hasanfs, mounted at /mnt/hasanfs, and it contains time-critical data. I have now lost complete access to this shared storage.
The apparent failure point is er114.utah.cloudlab.us, which is the Lustre MGS/MDT host for this filesystem. Its experiment-network interface ens1f0 (10.10.1.1/24) is down with NO-CARRIER, and lnetctl shows 10.10.1.1@tcp as down. Normal SSH to the node still works over the management network, but the storage network is not functioning.
Impact:
This appears to be outside the guest OS. I only attempted safe, non-destructive host-side checks and interface recovery steps, but the interface still has no carrier. I have intentionally avoided destructive recovery actions because preserving the data is my highest priority.
Could you please investigate the experiment-network path for er114.utah.cloudlab.us ens1f0 as soon as possible? This is a severe and time-sensitive outage affecting access to critical data.
I can provide exact command outputs if needed.
Best regards,
Hasan
--
You received this message because you are subscribed to a topic in the Google Groups "cloudlab-users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/cloudlab-users/n9IWrvupPUM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to cloudlab-user...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/cloudlab-users/51cf3aeb-194a-471b-9f7f-df739378c6a8n%40googlegroups.com.
You received this message because you are subscribed to the Google Groups "cloudlab-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cloudlab-user...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/cloudlab-users/CAPPB1itWHoNOym%3D%2BxMsYcUJaHjuwD4de5e_t1iCW%3D0WOHAyosg%40mail.gmail.com.
To view this discussion visit https://groups.google.com/d/msgid/cloudlab-users/CA%2BDvoUrocLgNxzuBeU4j8JUSUb0GwgNdxTQu%2B%2BMTMeh7W3ZdxA%40mail.gmail.com.