Nicolas Lubimov
unread,Nov 18, 2011, 4:52:08 AM11/18/11You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to
Hello!
I have one technical question about construction of Universal
Background Model (UBM). I'm trying to build the speaker recognition
system that uses UBM, and in several articles on the same topic I've
met the statement that UBM needs for very big training size (about
100h of speech), so it is not so trivial to train this model - it is
time and memory consuming operation. I have parallelized my procedure,
it has became a bit faster, but not so satisfactory. May be it is not
the right way of trying to use the whole dataset (in my case it is
about 30 million MFCC points)? I guess that the people have another
way to get UBM faster, for example, by randomly selecting small
subset... But I'm afraid that this method could affect on the quality
of final model, isn't it?