I have some questions regarding an ivector diarization setup.
As I don't have access to SWBD / SRE data, I made my own recipe (based on the callhome_diarization scripts) to use tedlium data as training set for ivector extractor and PLDA scoring.
(automatically labeled with the rule 1 recording = 1 speaker, assuming it's true for >95% of ted recordings)
It works quite good in general, but it seems a bit too sensitive in some cases. By that I mean, it sometimes cuts an intervention in several parts, assigning different speaker labels, when it shouldn't (and no noticeable significant change in volume, pitch or anything may disturb it).
1) First, is there some publically available (with 16k sampling) evaluation set for diarization? It would really help in tuning such a setup to have some baselines to refer to.
2) I reckon the tedlium corpus is not the obvious choice for such a task, but it would be good to have such a setup working with easily available data. May the issue of some speakers being split up too much come from a lack of certain types of speakers in the data? E.g. more male than female, certain accents, or specific tones.
3) The callhome setup seems to take a whole lot of data from SWBD and SRE corpora. Do we really need this much data or, would a significant amount of small samples (a few minutes) of different speakers would be enough for the PLDA part?
4) May the ivector extractor itsefl be a plausible cause of such issues of bad clustering for certain speakers?
5) Would playing with the ivector window size be of some help in tuning the setup?
6) Finally, it seems having a very long recording (over an hour), with a significant amount of speakers (over 5) and some perturbations (noise, laughter, crosstalk) is quite messing with the clustering part.
Is that a normal behaviour of the clustering approach? And would the PLDA threshold need some different tuning for different types of files? (length, amount of speakers, etc.)
That's a lot of questions, but I hope it will help others in tackling the diarization topic.
Thanks in advance!
François
--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/cdf32d02-afc6-461f-b44a-919744c0f772%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/44c08742-15d1-433c-8d81-a2ee8d15c1ff%40googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/40867419-58a3-44ed-be0b-61617f21e652%40googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/a81c43bf-7e36-481a-a1ec-fb23fece85a5%40googlegroups.com.