Hi, I am new user. I am trying to generate a Vietnamese dictionary and align my dataset using MFA.
I tried with Mandarin dataset as in example 2 in the MFA docs webpage and be able to create a Mandarin dict. But the Mandarin example dataset already has .lab files in it. My Vietnamese dataset only has .txt files (grapheme transcriptions) of .wav files
How exactly can I generate phoneme transcriptions from those .txt files?
I've tried this command:
mfa g2p vietnamese_g2p /vietnamese_example /vietnamese_dict/vietnamese_dict.txt
but it seem to not generating anything:
Generating pronunciations from G2P model
Generating transcriptions for the 0 word types found in the corpus...
Generating pronunciations...
Processed 0 in 1.5735626220703125e-05 seconds