Text analytics in R for Hebrew

51 views
Skip to first unread message

Michel Depiesse

unread,
Aug 24, 2016, 4:52:15 AM8/24/16
to Israel R User Group
Shalom,

I am looking for R text analytic packages for Hebrew.

But tm does not speak Hebrew.

tm designer says : 

tm itself is language agnostic as far as possible. As
long as the encoding is UTF-8 tm probably supports all languages.
However, you might need some specialized functions for tokenization or
stemming and custom stopword lists. Currently, there is no such specific
support for Hebrew (see e.g. help("stopwords", package = "tm") and
SnowballC::getStemLanguages() for explicitly supported languages).


Do you know something else ?

Toda rabba,
Michael
Antwerp


Reply all
Reply to author
Forward
0 new messages