--
-- You received this message because you are subscribed to the Google Groups bob-devel group. To post to this group, send email to bob-...@googlegroups.com. To unsubscribe from this group, send email to bob-devel+unsubscribe@googlegroups.com. For more options, visit this group at https://groups.google.com/d/forum/bob-devel or directly the project website at http://idiap.github.com/bob/
---
You received this message because you are subscribed to the Google Groups "bob-devel" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bob-devel+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
At present, I'm still trying to determine suitability of SPEAR over other tools for my purposes. My foremost question at the moment is: understanding that SPEAR is currently used mostly for conducting research experiments, what are the aspects of the toolkit which may be inefficient, limiting, missing or otherwise deficient for use in more practical scenarios? I say 'more practical scenarios' because I'm not planning on using it to grant/deny access to any production system but gather metadata on audio to identify who speaks and when. So the consequences of failure aren't huge. Basically, in this instance I'm wondering if the 'research tool' focus is a disclaimer to protect against someone relying too much on SPEAR or whether there is some tangible other aspect / functionality that its missing that I haven't yet foreseen the need for. Otherwise, SPEAR seems to be quite high in terms of its recognition accuracy in the given experiments, yes? Again, even assessing this has been difficult as its not been easy to get clear definitions (> 1 short sentence) of what EER / HTER mean in a practical sense! :-/About efficiency, you tell me.As far as I understood you are dealing with speaker identification 1-N. bob.bio.spear is a tool for speaker verification 1-1. What I'm trying to say is that bob.bio.spear does not have functions to index audio template identities. If you want to use it for identification, it is necessary to do a brutal force search in your dataset.
About efficiency again, another important variable is the size of your input data and the algorithm that you will use. bob.bio.spear has a lot of GMM based algorithms implemented, such as GMM itself, ISV, JFA, iVector (including some stuff that researchers usually do on top of it (PLDA, WCCN, Whitening, LDA..)). Which one will you use?
About the accuracy, it is really difficult to give an opinion about this. There are so many variables to consider. Just a hint, if your background models are very uncorrelated with your target scenario, you can expect a very low accuracy.
Hope I have answered your questions
At present, I'm still trying to determine suitability of SPEAR over other tools for my purposes. My foremost question at the moment is: understanding that SPEAR is currently used mostly for conducting research experiments, what are the aspects of the toolkit which may be inefficient, limiting, missing or otherwise deficient for use in more practical scenarios? I say 'more practical scenarios' because I'm not planning on using it to grant/deny access to any production system but gather metadata on audio to identify who speaks and when. So the consequences of failure aren't huge. Basically, in this instance I'm wondering if the 'research tool' focus is a disclaimer to protect against someone relying too much on SPEAR or whether there is some tangible other aspect / functionality that its missing that I haven't yet foreseen the need for. Otherwise, SPEAR seems to be quite high in terms of its recognition accuracy in the given experiments, yes? Again, even assessing this has been difficult as its not been easy to get clear definitions (> 1 short sentence) of what EER / HTER mean in a practical sense! :-/About efficiency, you tell me.As far as I understood you are dealing with speaker identification 1-N. bob.bio.spear is a tool for speaker verification 1-1. What I'm trying to say is that bob.bio.spear does not have functions to index audio template identities. If you want to use it for identification, it is necessary to do a brutal force search in your dataset.
About efficiency again, another important variable is the size of your input data and the algorithm that you will use. bob.bio.spear has a lot of GMM based algorithms implemented, such as GMM itself, ISV, JFA, iVector (including some stuff that researchers usually do on top of it (PLDA, WCCN, Whitening, LDA..)). Which one will you use?
About the accuracy, it is really difficult to give an opinion about this. There are so many variables to consider. Just a hint, if your background models are very uncorrelated with your target scenario, you can expect a very low accuracy.
You received this message because you are subscribed to a topic in the Google Groups "bob-devel" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/bob-devel/P9T25kaAmA0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to bob-devel+unsubscribe@googlegroups.com.