Dictionary format problem when using


li...@metaserve.org

Jun 13, 2022, 9:28:46 PM
to MFA Users
I'm using MFA 2.0.0rc7 to train a g2p model for Shoshoni, and then using that model with a word list as input to generate a dictionary. The small dictionary file I'm using to train the g2p model conforms to the specified pronunciation dictionary format and uses Unicode IPA characters in its pronunciations. However, the g2p-generated dictionary does not conform to the expected MFA format. Instead of entries that look like the examples in (a), I get entries like those in (b):

(a) 
ama a m a
en ɨ n
hii h iː

(b)
ama ('a', 'm', 'a')
en ('ɨ', 'n')
hii ('h', 'iː')

I'm not sure where the parentheses, single quotes, and commas are coming from, since they are not part of MFA's dictionary specification. Despite this odd notation, the dictionary appears to work as input for the mfa train process. However, the notation (specifically, the single quotes and commas) is carried through to the aligned TextGrids. Does anyone know why this is happening or how to fix it?
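
The entries in (b) look like the printed form of Python tuples, so as a stopgap they could probably be converted back to the layout in (a) with a small script. The sketch below is only that: the function names and file paths are hypothetical (not part of MFA), and it assumes each line is a word, then whitespace, then a single tuple-style pronunciation, with no probability columns.

```python
import ast


def repair_line(line: str) -> str:
    """Turn "ama ('a', 'm', 'a')" into "ama a m a"; return other lines unchanged."""
    stripped = line.strip()
    parts = stripped.split(None, 1)  # word, then the rest of the line
    if len(parts) != 2:
        return stripped
    word, pron = parts
    if not (pron.startswith("(") and pron.endswith(")")):
        return stripped  # already looks like format (a)
    try:
        phones = ast.literal_eval(pron)  # safely parse the tuple literal
    except (ValueError, SyntaxError):
        return stripped
    if isinstance(phones, str):  # "('a')" parses to a plain string
        phones = (phones,)
    if not isinstance(phones, tuple) or not all(isinstance(p, str) for p in phones):
        return stripped
    # Assumption: a single space between word and phones, as in example (a).
    return f"{word} {' '.join(phones)}"


def repair_dictionary(src_path: str, dst_path: str) -> None:
    """Rewrite a g2p-generated dictionary (hypothetical paths) line by line."""
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            if line.strip():
                dst.write(repair_line(line) + "\n")
```

For example, repair_dictionary("g2p_output.txt", "g2p_output_fixed.txt") would write a cleaned copy and leave the original file untouched. This only works around the symptom in the dictionary file; it does not explain why the tuples are being written in the first place.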

Lisa Johnson