Character level n-grams and arpa file generation

21 views
Skip to first unread message

Sidharth Kamboj

unread,
Mar 25, 2015, 4:22:28 PM3/25/15
to berkeleyl...@googlegroups.com
Hi there, 

I want to create character level n-grams and generate the arpa file. Do I need to preprocess my input file and insert spaces between all the characters so that the word level n-gram method can be used or is there some method available in the Berkeleylm package that generates the character level n-grams.?
Thanks. 

Adam Pauls

unread,
Mar 25, 2015, 5:00:28 PM3/25/15
to berkeleyl...@googlegroups.com
Yes, you would need to preprocess, nothing special is implemented.

--
You received this message because you are subscribed to the Google Groups "berkeleylm-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to berkeleylm-disc...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages