I'm attempting to reproduce the demo from this Medium blog post. I can easily generate the embeddings and train a linear SVM to classify faces in still images. However, I'd like to do the same on video frames (as shown in the post's video of Will Ferrell and Chad Smith on Jimmy Fallon). I've looked at pyannote, which seems promising. I could also try to modify OpenFace demo #1 to run on video input (e.g., an mp4 file) instead of real-time webcam frames. I was hoping to get some suggestions before starting. Thanks.
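
For the video-file option, this is a rough sketch of what I have in mind, not working code from the post or from OpenFace. It assumes OpenCV's `cv2.VideoCapture` can decode the mp4. `classify_faces` is a hypothetical placeholder for my existing embedding + SVM step, and `iter_frames`, `classify_video`, and `every_n` are names I made up for illustration.

```python
def iter_frames(capture, every_n=1):
    """Yield (index, frame) for every `every_n`-th frame.

    `capture` is any object whose read() returns (ok, frame),
    which is the interface cv2.VideoCapture provides.
    """
    index = 0
    while True:
        ok, frame = capture.read()
        if not ok:  # end of file or decode failure
            break
        if index % every_n == 0:
            yield index, frame
        index += 1


def classify_video(path, classify_faces, every_n=5):
    """Run a per-frame classifier over a video file.

    `classify_faces` is a hypothetical callable (frame -> predictions)
    standing in for the embedding + linear SVM step.
    """
    import cv2  # imported lazily so iter_frames works without OpenCV

    capture = cv2.VideoCapture(path)
    if not capture.isOpened():
        raise IOError("could not open video: %s" % path)
    try:
        return [(i, classify_faces(frame))
                for i, frame in iter_frames(capture, every_n)]
    finally:
        capture.release()
```

Sampling every few frames rather than all of them is only a guess at keeping the runtime manageable; `every_n=1` would process every frame.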