contractions in MOR


Ying

Jun 4, 2014, 10:07:12 AM
to chib...@googlegroups.com

Dear Chibolts,

I was trying to run MOR and POST on a CHAT file (attached). I used the commands below.

mor +t*CHI MEV001_E_retell.cha +1
post +t*CHI MEV001_E_retell.cha +1

There was a problem with contractions such as couldn't. I got error messages like this:
From file <MEV001_E_retell.cha>
*** ERROR 1: In file "MEV001_E_retell.cha"
  in item:    ?|couldn
  Can't find conversion for: ?|couldn
*** ERROR 1: In file "MEV001_E_retell.cha"
  in item:    ?|t
  Can't find conversion for: ?|t

What should I do to avoid these errors?

Thanks!
Ying
MEV001_E_retell.cha

Leonid Spektor

Jun 4, 2014, 2:22:39 PM
to chib...@googlegroups.com
Ying,

You are using a bad version of the eng grammar. I would recommend that you get the latest version of the "eng" grammar and CLAN from the CHILDES server. If you really want to keep using your version of the grammar, then please locate the file called "connl.cut" in the "eng" grammar folder and rename it to anything else, for example "connl-hold.cut".

Leonid.


Leonid Spektor

Jun 4, 2014, 3:35:30 PM
to chib...@googlegroups.com
Ying,

I also noticed that your files use a special single-quote character, for example in the word "couldn’t". Because of that, MOR doesn't know how to interpret the word, and you get the result "?|couldn eq|eq2 ?|t". The single-quote character should be the plain apostrophe ('). The special single-quote character is Unicode U+2019 (right single quotation mark), and the normal one that MOR expects is Unicode U+0027 (apostrophe). Essentially, you need to replace every ’ character with a ' character in your data files.
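For anyone who needs to do this replacement across many files, here is a minimal Python sketch. It is not part of CLAN; the function names are made up for illustration, and it assumes the transcript is saved as UTF-8 (which CHAT files normally are). It swaps U+2019 for U+0027 and leaves line endings untouched.

```python
from pathlib import Path

CURLY_APOSTROPHE = "\u2019"  # right single quotation mark, as in couldn’t
PLAIN_APOSTROPHE = "\u0027"  # plain apostrophe, the character MOR expects


def normalize_apostrophes(text: str) -> str:
    """Return text with every U+2019 replaced by U+0027."""
    return text.replace(CURLY_APOSTROPHE, PLAIN_APOSTROPHE)


def fix_file(path: Path) -> int:
    """Rewrite one file in place; return how many characters were replaced.

    Assumption: the file is UTF-8. newline="" keeps the original
    line endings exactly as they were.
    """
    with open(path, encoding="utf-8", newline="") as handle:
        original = handle.read()
    count = original.count(CURLY_APOSTROPHE)
    if count:
        with open(path, "w", encoding="utf-8", newline="") as handle:
            handle.write(normalize_apostrophes(original))
    return count
```

Calling `fix_file(Path("MEV001_E_retell.cha"))` would fix the attached file in place. Keep a backup copy first, since the file is overwritten.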

Leonid.

Ying Lu

Jun 4, 2014, 4:20:27 PM
to chib...@googlegroups.com
Oh, I didn't realize the bad version issue~ Thanks Leonid for the advice!


Ying Lu

Jun 5, 2014, 10:39:25 AM
to chib...@googlegroups.com
Dear Leonid,

Thank you very much for pointing out the special single-quote characters! In fact, the errors were due to those characters rather than the bad version of the eng grammar :-) I will be more careful about typing the characters in the future!

Best!

Ying


Leonid Spektor

Jun 5, 2014, 12:55:49 PM
to chib...@googlegroups.com
Ying,

The errors were due to the bad grammar. We are experimenting with a different way of disambiguating the results of the MOR command. For test purposes, POST was modified to work in two different modes. The new test mode uses much shorter morphemic forms than the normal mode. But for POST to work correctly, both POST training and POST analysis must be done in the same mode. The test mode is triggered by the presence of a "connl.cut" file in the root grammar folder. Even if you do not get any error messages after changing all special single-quote characters to simple single-quote characters, the results of the POST command might not be correct, because POST analysis will be using the new test mode while the POST database file "post.db" was created using the old mode. The test mode is not ready for public use yet, so the best thing to do is to get the new eng grammar from the CHILDES web site, or at the very least to rename the "connl.cut" file to some other name to disable POST's test mode.
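The rename can of course be done by hand in the file browser. As a rough sketch for scripting it, the Python below renames "connl.cut" if it is present. The function name and the `grammar_dir` argument are hypothetical, and the default new name is just the example name suggested earlier in this thread.

```python
from pathlib import Path


def disable_post_test_mode(grammar_dir: Path, new_name: str = "connl-hold.cut") -> bool:
    """Rename connl.cut inside grammar_dir so POST no longer sees it.

    Returns True if the file was renamed, False if it was not there.
    grammar_dir is wherever your "eng" grammar folder lives (an
    assumption of this sketch; adjust it to your own setup).
    """
    marker = grammar_dir / "connl.cut"
    if not marker.is_file():
        return False
    target = grammar_dir / new_name
    if target.exists():
        # Refuse to overwrite an earlier renamed copy.
        raise FileExistsError(f"{target} already exists")
    marker.rename(target)
    return True
```

Renaming rather than deleting keeps the file around in case it is needed again later.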

Leonid.

Ying Lu

Jun 5, 2014, 1:26:20 PM
to chib...@googlegroups.com
Dear Leonid,

I got the new eng grammar and changed the ' character. Now I can run MOR without an error. Thank you very much for your helpful info!

Best!
Ying

