Unable to reboot a clemson r7525 node

27 views
Skip to first unread message

Xuchuan Luo

unread,
Apr 8, 2026, 1:28:14 PM (14 days ago) Apr 8
to cloudlab-users
Hi,

I fail to reboot an r7525 node (clgpu017) in my experiment (https://www.cloudlab.us/status.php?uuid=43f58e0d-820e-414e-92c7-49aea0e1832f#).
Could you please have a look at it?

Xuchuan Luo

unread,
Apr 8, 2026, 1:30:11 PM (14 days ago) Apr 8
to cloudlab-users
The node (clgpu017) is sometimes in a "ready" status, but I still cannot log in to it.

Mike Hibler

unread,
Apr 8, 2026, 2:48:12 PM (14 days ago) Apr 8
to cloudla...@googlegroups.com
We are going to need to reload the Mellanox BF2 smart NIC card on this
machine to get it working again. That will involve reloading the image
on the hard drive with our "reset" image. Is there anything on the node
currently that you need to backup?

Otherwise I will proceed with the reload/reflash.
> --
> You received this message because you are subscribed to the Google Groups
> "cloudlab-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an email
> to cloudlab-user...@googlegroups.com.
> To view this discussion visit https://groups.google.com/d/msgid/cloudlab-users/
> cf3f9514-a410-4b2f-8ece-51521c0ca76bn%40googlegroups.com.

Xuchuan Luo

unread,
Apr 8, 2026, 2:53:22 PM (14 days ago) Apr 8
to cloudlab-users
There’s nothing on the node that needs to be backed up.
Please go ahead with the reload. Thank you.

Mike Hibler

unread,
Apr 8, 2026, 2:59:06 PM (14 days ago) Apr 8
to cloudla...@googlegroups.com
Okay. Please don't login or reboot the node until you hear from me
that everything is done.
> 5b0116a9-37a7-4f61-b8c2-b6e685484909n%40googlegroups.com.

Mike Hibler

unread,
Apr 8, 2026, 4:22:53 PM (13 days ago) Apr 8
to cloudla...@googlegroups.com
clgpu017 is back up and working.

On Wed, Apr 08, 2026 at 12:59:02PM -0600, Mike Hibler wrote:
> Okay. Please don't login or reboot the node until you hear from me
> that everything is done.
>
> On Wed, Apr 08, 2026 at 11:53:21AM -0700, Xuchuan Luo wrote:
> > There???s nothing on the node that needs to be backed up.
> > Please go ahead with the reload. Thank you.
> >
> > ???2026???4???9???????????? UTC+8 02:48:12<Mike Hibler> ?????????
> >
> > We are going to need to reload the Mellanox BF2 smart NIC card on this
> > machine to get it working again. That will involve reloading the image
> > on the hard drive with our "reset" image. Is there anything on the node
> > currently that you need to backup?
> >
> > Otherwise I will proceed with the reload/reflash.
> >
> > On Wed, Apr 08, 2026 at 10:30:11AM -0700, Xuchuan Luo wrote:
> > > The node??(clgpu017)??is sometimes in a "ready" status, but I still cannot
> > log in
> > > to it.
> > >
> > > ???2026???4???9???????????? UTC+8 01:28:14<Xuchuan Luo> ?????????
> > >
> > > Hi,
> > >
> > > I fail to reboot an r7525 node (clgpu017) in my experiment (https://
> > > www.cloudlab.us/status.php?uuid=43f58e0d-820e-414e-92c7-49aea0e1832f#).
> > > Could you please have a look at it?
> > >
> > > --
> > > You received this message because you are subscribed to the Google Groups
> > > "cloudlab-users" group.
> > > To unsubscribe from this group and stop receiving emails from it, send an
> > email
> > > to cloudlab-user...@googlegroups.com.
> > > To view this discussion visit https://groups.google.com/d/msgid/
> > cloudlab-users/
> > > cf3f9514-a410-4b2f-8ece-51521c0ca76bn%40googlegroups.com.
> >
> >
> > --
> > You received this message because you are subscribed to the Google Groups
> > "cloudlab-users" group.
> > To unsubscribe from this group and stop receiving emails from it, send an email
> > to cloudlab-user...@googlegroups.com.
> > To view this discussion visit https://groups.google.com/d/msgid/cloudlab-users/
> > 5b0116a9-37a7-4f61-b8c2-b6e685484909n%40googlegroups.com.
>
> --
> You received this message because you are subscribed to the Google Groups "cloudlab-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to cloudlab-user...@googlegroups.com.
> To view this discussion visit https://groups.google.com/d/msgid/cloudlab-users/20260408185902.GH39017%40flux.utah.edu.
Reply all
Reply to author
Forward
0 new messages