Chief Research Engineer HPC Email: to...@simula.no
Simula Research Laboratory - HPC department Mobile: +47 918 33 670
Kristian Augusts gate 23, 0164 Oslo, Norway
--
You received this message because you are subscribed to the Google Groups "beegfs-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to fhgfs-user+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/fhgfs-user/d4e4458f-9053-48f9-8ede-86ca9bafcaben%40googlegroups.com.
Chief Research Engineer HPC Email: to...@simula.no
Simula Research Laboratory - HPC department Mobile: +47 918 33 670
Kristian Augusts gate 23, 0164 Oslo, Norway
Hi Guan,
We have 50 nodes containing 3000+ CPU cores, 100+ A100 GPU cards, ~250T NVMe, and ~3P storage. Do you think it is worth in investing in IB network?Zhuang
I also have some questions on hardware setup. Any input is very much appreciated.We are planning on building on 4 all-NVMe beegfs nodes, each with 2*Intel 5317 processors, 256G DDR4 ECC memory, 2*2T NVMe(for metadata), 8*8T NVMe(for storage), and 2 Mellanox 100Gbps CX-5 NIC adaptors.My question is,Should I distribute metadata targets across four nodes? Or should I put all metadata targets on one node?If I enable RDMA, how much memory is actually needed for production envionment?Is Intel 5317 good enough in this setup?Should I assign two NIC adaptors to two NUMA nodes?Thank you!Zhuang