student new to stylo

54 views
Skip to first unread message

feb...@nyu.edu

unread,
Dec 13, 2018, 2:17:49 AM12/13/18
to computationalstylistics
hello all,

i'm looking for some help using stylo. i'm a student and very much a beginner. i run the program on a mac and consistently get the error message "invalid multibyte string" when running rolling.delta() and rolling.classify() which don't feature a [ ] UTF-8 checkbox like stylo() does. im unsure whether this is a problem with the parameters i've set once the rolling.delta and rolling.classify windows open up or if the texts need to be reformatted. any help is much appreciated! 

Joanna Byszuk

unread,
Dec 13, 2018, 6:15:02 AM12/13/18
to feb...@nyu.edu, computationalstylistics
Hi,

thanks for your interest in using stylo! If you're a beginner, as a general remark I'd recommend taking a look at one of our tutorials, a slideshow presenting absolute basics or HOW TO paper - particularly useful when you start changing parameters without using GUI - note examples in the margin.

Regarding your questions:
1) as of 0.6.9 stylo version, UTF-8 is the default expected encoding - if the program finds texts in a different encoding it should read them as well, but the major change is you no longer need to tick UTF-8 checkbox. However, if your files are encoded in a different way, especially if your corpus is in a language using non-Latin scripts or diacritics, R may have a problem with proper delimitation of characters and words. So best - keep your texts in UTF-8. Also, if text files include any emoticons etc. this may cause problems too.
2) as for passing parameters in rolling.delta() and rolling.classify() - of the two only the first one has GUI at the moment (and it is a bit underdeveloped as the function is rarely used since the introduction of the more popular rolling.classify). To pass the parameters include them in the brackets of the function, e.g. rolling.classify(encoding= "UTF-8").

Hope this helps a bit, if the problem persists please write more details about the data you are trying to analyze (number and length of texts /note that default sample size in rolling functions is 5000 words/, language, encoding) and we should be able to help more precisely.

Best wishes,
Joanna Byszuk


czw., 13 gru 2018 o 08:17 <feb...@nyu.edu> napisał(a):
hello all,

i'm looking for some help using stylo. i'm a student and very much a beginner. i run the program on a mac and consistently get the error message "invalid multibyte string" when running rolling.delta() and rolling.classify() which don't feature a [ ] UTF-8 checkbox like stylo() does. im unsure whether this is a problem with the parameters i've set once the rolling.delta and rolling.classify windows open up or if the texts need to be reformatted. any help is much appreciated! 

--
You received this message because you are subscribed to the Google Groups "computationalstylistics" group.
To unsubscribe from this group and stop receiving emails from it, send an email to computationalstyl...@googlegroups.com.
Visit this group at https://groups.google.com/group/computationalstylistics.
For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages