Two xl170 nodes do not appear to be working properly:
The first node is hp073: cpupower doesn't work properly. If I invoke "sudo cpupower frequency-set -g performance", I get this error back:
Setting cpu: 0
Error setting new values. Common errors:
- Do you have proper administration rights? (super-user?)
- Is the governor you requested available and modprobed?
- Trying to set an invalid policy?
- Trying to set a specific frequency, but userspace governor is not available,
   for example because of hardware which cannot be set to a specific frequency
   or because the userspace governor isn't loaded?
I think I've seen this error in the past, and it was because of a BIOS problem.
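A minimal sketch of a check that might help narrow this down, assuming the standard Linux cpufreq sysfs layout under /sys/devices/system/cpu. The function name check_cpufreq and its optional second argument (an alternate sysfs root) are hypothetical, introduced only for illustration:

```shell
# Hypothetical helper: print the cpufreq driver and governors for one CPU.
# $1 = CPU number (default 0)
# $2 = sysfs root (default /sys); an assumption added so the check can be
#      pointed at a copied or mock tree.
check_cpufreq() {
    cpu="${1:-0}"
    root="${2:-/sys}"
    dir="$root/devices/system/cpu/cpu$cpu/cpufreq"

    # If this directory is missing, the kernel has typically not registered
    # a scaling driver for this CPU, so no governor can be set.
    if [ ! -d "$dir" ]; then
        echo "cpu$cpu: cpufreq directory missing"
        return 1
    fi

    # Report the active driver, the current governor, and the governors
    # the kernel says are available.
    for f in scaling_driver scaling_governor scaling_available_governors; do
        if [ -r "$dir/$f" ]; then
            echo "$f: $(cat "$dir/$f")"
        else
            echo "$f: (not readable)"
        fi
    done
}
```

Running "check_cpufreq 0" on hp073 and on a working xl170 node and comparing the two outputs would show whether hp073 is missing the cpufreq directory entirely or merely lacks the performance governor; the former would be consistent with a BIOS-level cause, though it would not prove one.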
The second node is hp040: I'm not sure exactly what the problem is here, but my Homa benchmarks slow down by a factor of 5 if I include this node in the cluster. The slowdown goes away if I leave the node out.
I have tried power-cycling both of these nodes, but it didn't fix either problem. I have also terminated the experiments containing these nodes, so they are "free" (at least at the moment).
-John-