How to create lexicon.txt using automated script.

1,621 views
Skip to first unread message

dhamaraiselvi sekar

unread,
Mar 5, 2018, 1:05:05 AM3/5/18
to kaldi...@googlegroups.com
Hi,

I need ur help, I am willing to create a lexicon.txt file using automated scripts.

Any tool is available in Kaldi.
Please help me out.

Thanks & Regards,
Dhamaraiselvi Sekar

Uniphore Software Systems
Senior Software Scientist – I
08C, 8th Floor, IIT Madras Research Park, Near Tidel Park
Kanagam Road, Taramani, Chennai – 600 113 India
Tel 
+91.44.30655800 | Fax +91.44.30655900|
Email: dhamaraiselvi@uniphore.com
Website: www.uniphore.com

Daniel Povey

unread,
Mar 5, 2018, 1:24:04 AM3/5/18
to kaldi-help
This question is too vague, I won't answer other than to say you should understand the lexicon format, and if the language is phonetically spelled then each letter will correspond to its own phone.  Read the HTK book to understand the basic ideas of speech recognition.

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/CA%2BHmEOQRVeMgkqO-%3DNuVBL%3D8%3DfR9F6kgTVqSF9_2YdhRFesSNg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Alim Misbullah

unread,
Mar 5, 2018, 1:26:52 AM3/5/18
to kaldi-help
Hi,

You can consider to see this link:


But, the link is for generating English word pronunciation.

Thanks.
Alim 

dhamaraiselvi sekar

unread,
Mar 6, 2018, 2:06:08 AM3/6/18
to kaldi...@googlegroups.com
Hi,

Still, now I created lexicon.txt manually.

I some tool like CNTK, Sequitor,lmtool, lex tool and Other tools for creating the dictionary.
 Eg:
lexicon.txt
a    a
a    axx
a    ee
a4    ee f o r
aa    aa
aadhaar    aa dx aa r
aadhaar    aa dx h aa r
aadil    aa dx i l
aadinath    aa dx i n aa t h
aaditya    aa d i t y aa
aadmi    aa dx m ii
aah    aa h
aahs    aa h s
aai    aa ei
aaj    aa j
aakash    aa k aa sh

Now, I am willing to write an automated script to create a dictionary using any one of the tools.

Before that, I need to know Kaldi have any inbuild scripts for the dictionary creation.

I found g2p.py script inside egs/librispeech/s5/local/g2p/train_g2p.sh
I don't have any idea how it will create model.

Thanks & Regards,
Dhamaraiselvi Sekar

Uniphore Software Systems
Senior Software Scientist – I
08C, 8th Floor, IIT Madras Research Park, Near Tidel Park
Kanagam Road, Taramani, Chennai – 600 113 India
Tel 
+91.44.30655800 | Fax +91.44.30655900|
Email: dhamaraiselvi@uniphore.com
Website: www.uniphore.com


--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages