I checked the BIOS to make sure nothing was reconfigured. Then I powered it
off and back on, and...it seems to be getting full BW again. So I have
canceled taking it out of service in case you want to and can extend the
experiment.
On Tue, Oct 07, 2025 at 08:28:55AM -0600, Mike Hibler wrote:
> Go ahead and leave the experiment til it expires. I have scheduled it to go
> out of service in case we don't get a chance to look at it before then.
>
> Thanks for pointing this out and sorry we have not looked at it yet!
>
> On Tue, Oct 07, 2025 at 07:14:14AM -0700, Bijan Tabatabai wrote:
> > Hi Aleks,
> >
> > I'm just checking in since the CloudLab experiment that I am seeing this issue
> > in expires today. Would it be helpful to extend the experiment to allow
> > CloudLab staff to look at the machine more? Or should I just let the experiment
> > expire?
> >
> > Thanks,
> > Bijan
> >
> > On Wednesday, October 1, 2025 at 4:12:14???PM UTC-5 Bijan Tabatabai wrote:
> >
> > Hi Aleks,
> >
> > Thanks for the prompt response.
> >
> > > You say that you tried reloading the machine and the poor bandwidth
> > performance persisted past that?
> > Yes. For clarity, I just reloaded it again and ran the following commands
> >
> > $ sudo apt update
> > $ sudo apt upgrade
> > $ sudo apt install numactl??
> > $??wget
https://downloadmirror.intel.com/834254/mlc_v3.11b.tgz
> > $??tar xvf mlc_v3.11b.tgz
> > $??numactl -m 0 ./Linux/mlc --max_bandwidth
> >
> > MLC again reported 6.5GB/s of bandwidth.
> >
> > For the record, I experienced a similar problem with c6320 machines last
> > spring. I think then I just released the offending machines, assuming
> > whatever cleanup routines Cloudlab has would fix the issue.
> >
> > Bijan
> >
> > On Wednesday, October 1, 2025 at 3:45:25???PM UTC-5
ajma...@gmail.com wrote:
> >
> > Hi Bijan,
> >
> > My first suspicion would have been a failing or failed DIMM, as that
> > would have knocked the node into an unbalanced memory configuration
> > which is known to tank memory bandwidth performance. ??However, I
> > checked the iDRAC of that node and didn't see anything noteworthy in
> > the logs or anything else that would suggest a hardware issue. ??Nothing
> > really out of the ordinary in the OS either, from a cursory glance.
> > ??You say that you tried reloading the machine and the poor bandwidth
> > performance persisted past that?
> >
> > Best,
> > ??- Aleks
> > On Wednesday, October 1, 2025 at 2:25:01???PM UTC-6
bija...@gmail.com
> To view this discussion visit
https://groups.google.com/d/msgid/cloudlab-users/20251007142855.GA39225%40flux.utah.edu.