Hi Jan,
thanks for sharing info on this nice system.
I guess using EDR InfiniBand in the server instead FDR would not be an option?
There is currently no built-in interface bonding in BeeGFS, but a workaround
that some people use is to run two instances of the beegfs-storage service
(multi-mode) on a server.
For this example, I will assume that you have 4 storage targets (without loss of
generality). The first beegfs-storage instance would export the first two
storage targets and would set ib0 as primary interface in its connInterfacesFile
in beegfs-storage.conf. The second beegfs-storage instance would export the
other two targets and would set ib1 as primary interface in its connInterfacesFile.
To have the routing working correctly, the IP addresses (yes, IP addresses,
because we are using rdma_cm, which establishes native RDMA connections based on
IP addresses) of those two IB interfaces would need to be in different subnets,
e.g.
i...@192.168.0.1/24 and
i...@192.168.1.1/24. And very important, the clients
also need to have an IP in both subnets, even though they might have only a
single IB interface, e.g. client01
i...@192.168.0.101/24 and
ib0:0/
192.168.1.101/24 (so you would just add the second IP as another virtual
interface ib0:0).
Best regards,
Sven