I am interested in obtaining Audioset scores and class names from YT8M VGGish embeddings.
In your demonstration, you employed a WAV file as input for a pre-trained YAMNet model, which provided classes from the Audioset Ontology. Is there a potential method to utilize a VGGish input for the YAMNet model?
Thank you kindly in advance!
Maros