Hi Rob,
Yes, those questions make sense. From what I understand, MIG
should essentially split the GPU so that they behave as separate
cards. Hence two different users should be able to use two
different MIG instances at the same time and also a single job
could use all 14 instances. The result you observed suggests that
MIG is a feature of the driver i.e lspci shows one device but
nvidia-smi shows 7 devices.
I haven't played around with this myself in slurm but would be
interested to know the answers.
Laurence
/| | \/ | Yair Yarom | System Group (DevOps) [] | The Rachel and Selim Benin School [] /\ | of Computer Science and Engineering []//\\/ | The Hebrew University of Jerusalem [// \\ | T +972-2-5494522 | F +972-2-5494522 // \ | ir...@cs.huji.ac.il // |
You don't often get email from ir...@cs.huji.ac.il.
Learn why this is important
|