Hai everybody.
I got a small error, while I was working in R with the Telugu text file saved in UTF-8 format, R is not showing the desired output when i prompt for read lines for some of the files.(its not loading correctly, the function which i gave was :
corpus.file<-scan(select.list(
dir()), what="char", sep="\n", encoding="UTF-8")
Read 1 item)
second, how to give the command to know each word boundary in the Telugu text file.( when i followed the book and gave the commands , R gave the results in some other format)
pls help me to understand in detail.
I have attached one of me sample text.
Pls find the .text attached.
Kindly guide me to go further.
Thanking you.