How can I create .lab files from .txt files?


Thanh Hà Nguyễn

May 18, 2021, 11:47:46 PM

to MFA Users

Hi, I am a new user. I am trying to generate a Vietnamese dictionary and align my dataset using MFA.

I tried the Mandarin dataset from example 2 on the MFA docs webpage and was able to create a Mandarin dictionary. But the Mandarin example dataset already has .lab files in it. My Vietnamese dataset only has .txt files (grapheme transcriptions) for the .wav files.

How exactly can I generate phoneme transcriptions from those .txt files?

I've tried this command:

mfa g2p vietnamese_g2p /vietnamese_example /vietnamese_dict/vietnamese_dict.txt

but it does not seem to generate anything:

Generating pronunciations from G2P model
Generating transcriptions for the 0 word types found in the corpus...
Generating pronunciations...
Processed 0 in 1.5735626220703125e-05 seconds
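
For anyone hitting the same "0 word types" output, here is a minimal Python sketch of the conversion asked about above. It assumes that a .lab file is just the plain-text (grapheme) transcript stored under the same base name as its .wav, so it copies each .txt to a .lab and does not produce phonemes itself. The function name `txt_to_lab` is made up for illustration and is not part of MFA. It also returns the set of word types it saw, which may help check whether the corpus path passed to `mfa g2p` actually contains readable transcripts.

```python
from pathlib import Path


def txt_to_lab(corpus_dir):
    """Write a .lab copy of every .txt transcript that has a matching .wav.

    Hypothetical helper, not an MFA function. Returns the list of .lab
    paths written and the set of lowercased word types found.
    """
    corpus = Path(corpus_dir)
    written = []
    word_types = set()
    for txt_path in sorted(corpus.rglob("*.txt")):
        # Skip text files that are not transcripts of a recording.
        if not txt_path.with_suffix(".wav").exists():
            continue
        # Collapse line breaks and repeated spaces into single spaces.
        text = " ".join(txt_path.read_text(encoding="utf-8").split())
        lab_path = txt_path.with_suffix(".lab")
        lab_path.write_text(text + "\n", encoding="utf-8")
        written.append(lab_path)
        word_types.update(text.lower().split())
    return written, word_types


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        labs, words = txt_to_lab(sys.argv[1])
        print(f"Wrote {len(labs)} .lab files, {len(words)} word types")
```

If this reports zero files for the directory given to `mfa g2p`, the path itself is a likely suspect; note that `/vietnamese_example` in the command above is an absolute path at the filesystem root.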
