"VoxCeleb Xvector System 1a" pretrained model - PLDA scoring has Less Variability between Enrolled and Test Speakers

181 views
Skip to first unread message

Immu Ali

unread,
Mar 26, 2019, 9:55:57 AM3/26/19
to kaldi-help
Hi dan ,

Previously i used "SRE16 Xvector Model 1a" pretrained model and the PLDA scoring results are Good i.e high variation between the test and Enrolled Speakers 
 
Currently I'm using "VoxCeleb Xvector System 1a" pre-trained model and Found out that , there's less variability between Enrolled and Test Speaker , So , I checked the Recipe "run.sh" Script and Found out that in PLDA stage i.e, "stage-11" we are passing test speaker xvector two times  during PLDA scoring whereas in SRE16\V2 Recipe "run.sh" we are giving 1.test speaker xvector and 2.Enroll Speaker Xvector .

So , How Exactly PLDA Classifier Works in VoxCeleb\v2 Recipe by passing test Speaker xvectors 2 times ?   

Dummy Question : Why we are not giving Enroll Speaker Xvector in Voxceleb\v2 Recipe? 


Thanks in advance.

David Snyder

unread,
Mar 26, 2019, 10:05:28 AM3/26/19
to kaldi-help
It's nothing to worry about. It's just due to the way the evaluation is defined in the trials file.

In SRE16, it's something along these lines:

enroll-spk1 test-utt1 target
enroll-spk1 test-utt2 nontarget
enroll-spk2 test-utt1 nontarget
...

In Voxceleb it's something like this:

test-utt1 test-utt2 nontarget
testutt1 test-utt3 target
test-utt2 test-utt3 nontarget
...

So, the Voxceleb trials use the same list of utterances for both the first and second columns of the trials file, which is while the test utterances are passed into the PLDA scoring script twice. However, in SRE16, the evaluation is split into an enroll and test portion.
Reply all
Reply to author
Forward
0 new messages