I'd definitely select a raw audio format and not none of the compressed but also the smallest raw sonobus allows. The raw even though more mbps is uncompressed and your internet line with have some compression built in so it's not being compressed again. Using the plugin version of sonobus
might be more direct but if you can run sonobus along side and hardwire back in via adat or spdif to another device then this could give more stable results to.
On top of that possibly adjust the MTU ands having a look for your os version and operating system tweaks.
Even the slot your pci-e cards are in can make a difference, so using an adjacent 4x pci-e to your graphics cards 16x pci can split the lanes used by the cards into to, giving less to one you want full lanes used.
USB controllers uses lanes too so consider using hubs on one usb port rather than filling every port in the back of your machine. Like keyboard, mouse, webcam could sit on one port and usb a separate port exclusive for any usb audio devices and affordable usb3 is starting to pop up now.
It might be you try all that and still have the issues but it could be better. IF you get all the rest right then when it comes to your internet, if your still getting issues then slowing it down a touch or trying to buffer (which does create latency) to smoothen the transition from computer to internet.
A bit like how you increase buffers on an audio interface, it stops all the clicks and pops but at the gain of latency but reason it stops the click and pops is the computer has more time to handle the audio hence the delay when upping buffers.
I use 2 interfaces here and sometimes use 2 with sonobus but I have a few studio setups here so some allow for loopback at a low latency but on the machines not as powerful i use 2 audio interfaces which sometimes can yield a better result.