Development and Evaluation set are again split into samples that
are used to enroll client models, and probe samples to be tested
against all client models.
Hope this can help.
An example of protocol could be found here:
https://github.com/bioidiap/spear/tree/master/protocols/banca/G
Best regards,
Elie
--
-- You received this message because you are subscribed to the Google Groups bob-devel group. To post to this group, send email to bob-...@googlegroups.com. To unsubscribe from this group, send email to bob-devel+...@googlegroups.com. For more options, visit this group at https://groups.google.com/d/forum/bob-devel or directly the project website at http://idiap.github.com/bob/
---
You received this message because you are subscribed to the Google Groups "bob-devel" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bob-devel+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
-- ------------------- Dr. Elie Khoury Post Doctorant Biometric Person Recognition Group IDIAP Research Institute (Switzerland) Tel : +41 27 721 77 23
Hey Manuel,Thank you. I see...So, what if we train UBM with another database wih enough data, but with a different microphone and totally different speakers? It will be a very different model in comparison with the test dataset?What about T matrix? If the noises and channel characteristics in another database we use is different from the test dataset? What I'm trying to say is:1. If we use another database which is big enough, is it better to train just UBM on it, or UBM and T?
2. Is it better to train T on the same database that we want to test? to have the same kind of channel variability?
And ...3. A) In I-vector approach, we don't have any enrollment step. Yes?
B) But in JFA, we have an enrollment step. Correct?
4. A) We don't use any labels while training UBM and T in I-vector. We just use labels to train PLDA and in LDA. Correct?
B) But we use labels in training step in JFA. Am I right?
Well, Thank you Elie, you're so helpful.I think my UBM is trained good enough, but I still have a problem with my T, because I just use half my data to train my T, the other half is used for test. But it seems T matrix doesn't work well. I don't want to use UBM training data to train T. to emphasis channel variability, I think it's better just from my database. not other databases to train T.What do you think?1. Should I add my UBM training to my current T training?
2. Is that fine if I use exactly the same data to train UBM and T?
On Friday, October 10, 2014 11:36:58 AM UTC+2, Elie Khoury wrote:On 10/10/2014 11:29 AM, Harry wrote:
> 5. If we don't use labels to train T, can we use test data to train T?
> Or even UBM?
Many "No"s!!! A test set is meant to simulate a real world scenario
where you have only access to one utterance at once, and you're system
is already pre-trained!!
If your UBM training data is from the same database (i.e. same recording conditions), I don't see a reason not to add it to the TV training data.