Yuan Liang
unread,Oct 1, 2025, 4:06:33 PM (7 days ago) Oct 1Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to cloudlab-users
Hello,
I’m a beginner trying to learn the DAOS storage system on CloudLab, using the Rocky Linux 9 image on sm110p nodes. My goal is to run DAOS with NVMe so I can perform benchmark tests (e.g., IO500).
I understand that DAOS normally requires IOMMU to be enabled for NVMe, but on CloudLab if I edit GRUB to enable IOMMU the node fails to reboot (goes into shutdown/recovery). So I’ve been trying the recommended workaround of running DAOS without IOMMU, which seems to work only when the node exposes NVMe devices directly.
For example:
On sm110p-10s10615, NVMe drives show up directly (Samsung PM9A1/PM9A3/980PRO), and I was able to get the DAOS server running successfully.
On other nodes such as sm110p-10s10604 and sm110p-10s10617, the drives appear behind a VMD controller, and DAOS fails during server startup because it cannot validate the NVMe devices.
My questions:
Is there a way to use DAOS with NVMe on sm110p nodes that expose drives through VMD, given that I cannot enable IOMMU?
If not, is the only option to keep trying until I get an sm110p node that exposes NVMe directly?
Any advice or clarification would be very helpful. I’m still learning the system, so apologies if I’m missing something obvious.
Thanks in advance!