Hello Tim,
In the last year, I figured out the A100 MIG feature behavior with Slurm Workload Manager. At that time, it required non-default
DEVFS mode in kernel config to constraint the MIG device via Slurm cgroup. After the setting, A100 MIG works well to me so I suppose
it should NOT be blocking issue except you need to have the configuration.
My testing NVIDIA driver version was at 450.51.06, and the mode was not default at that time but the NVIDIA documents said the DEVFS
mode will be default in the future so that you should check the current newest docs if you mind the kernel setting.
The procedure how we can configure the DEVFS mode to A100 was written to my blog post(*1). It's so sorry that was in Japanese but
hopefully, the setting scripts and web links to NVIDIA official documents would be helpful for you. Perhaps, google translation too.
1:
https://medium.com/nttlabs/nvidia-a100-mig-as-linux-device-66220ca16698
Best,
--------------------------------------------
露崎 浩太 (Kota Tsuyuzaki)
kota.tsu...@hco.ntt.co.jp
NTTソフトウェアイノベーションセンタ
分散処理基盤技術プロジェクト
0422-59-2837
---------------------------------------------