Hi,
First of all, sorry about the delay. I personally have been involved in several exams and research group meetings for the past few week.
Second, are you referring to the
Sirius paper? I personally have investigated the reasons for low accuracy of OpenEphyra, but it is a short report for a university course.
Third, we rely on kaldi for ASR: we choose
DNN models based on fisher, but are aware that there are other options. To personalize the ASR results, you could look into how online decoding is done in the current implementation, and figure out how personalized models could be introduced. I suggest
this as a starting point, but if you are using other implementations instead of kaldi, please let me know, either through email or post below.
Finally, we are using MongoDB to store data for
OpenEphyra (knowledge base),
command center (user information), and image matching. If you choose to integrate Lucida into your work, you only need to make sure the formats do not conflict with the existing ones shown above. Otherwise, the formats are of your choice. Regarding MondoDB, I find it very easy to work with due to its "NoSQL" nature. However, whether it is a perfect fit for your problem requires further thinking.
Thanks!
Yunsheng Bai