Problem with Configuring NVMe-oF over RDMA on d750

41 views
Skip to first unread message

Haoyu Li

unread,
Dec 7, 2022, 11:34:00 PM12/7/22
to cloudlab-users
Hi,

I'd like to setup NVMe-oF over RDMA on d750 nodes. However, when trying to discover the target node, I encountered the error below:

root@node-2:~# nvme discover -t rdma -a 10.10.1.2 -s 4420
Failed to write to /dev/nvme-fabrics: Connection reset by peer

root@node-2:~# dmesg
[ 2745.805437] nvme nvme1: Connect rejected: status 8 (invalid service ID).
[ 2745.812142] nvme nvme1: rdma connection establishment failed (-104)


I configured the target successfully, which is node-1, by following this tutorial. Before that, I installed Broadcom driver from here.

Also, I validated RoCE network as this page. It turned out that the `ib_write_bw` test could pass with the public IP instead of the private IP (10.10.1.2). When using the private IP, I got:

root@node-2:~# ib_write_bw -d bnxt_re0 -F --report_gbits -p 1800 -s 1048576 -q 16 10.10.1.2
Couldn't connect to 10.10.1.2:1800
Unable to open file descriptor for socket connection Unable to init the socket connection


Does anyone have any thoughts about this issue? Any reply is appreciated!

Best,
Haoyu

Haoyu Li

unread,
Dec 8, 2022, 12:28:49 AM12/8/22
to cloudlab-users
I think I’ve figured it out: iommu need to be turned on first according to this Cisco doc. So, before setting up NVMe-oF, add 'intel_iommu=on’ to ‘GRUB_CMDLINE_LINUX_DEFAULT’ in /etc/default/grub and run:

root@node-1:~# update-grub
root@node-1:~# reboot

The ‘ib_write_bw’ test can also pass with the private IP now.

Best,
Haoyu

--
You received this message because you are subscribed to a topic in the Google Groups "cloudlab-users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/cloudlab-users/yTsbSw7eXb4/unsubscribe.
To unsubscribe from this group and all its topics, send an email to cloudlab-user...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/cloudlab-users/4e23d511-1b97-44d5-b2d3-99652e61f1d4n%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages