@amitesh,
Those are great slides, and introduce what is used by a lot of vendors out there.
That being said, you will have to admit that, **with respect to W3C specs** (which is the scope of riccardo's question), the choice of web socket for the signaling transport, and the set up of TURN server the way you did, if only, are arbitrary.
@riccardo,
I think asking the question in the scope of the w3c specs is not the right thing to do. With that in mind, both philipp ad amitesh answers are as-good-as-possible answers to your problem.
The average size (in number of users) of video conference is 3.4. Some even say that the main use case for webrtc is 1-1 (
http://bloggeek.me/5-things-webrtc/). Full mesh approaches can handle easily 10 people on desktop and 4 on mobile. Are you sure you need more than that?
Again, there are a lot of API/SDK out there that implement variations of the concept presented in amitesh slides (the full mesh concept philipp pointed you to), and you just have to go out, do your homework and some shopping around :-)
HTH
Alex.