mor tagging is as i-PL

17 kyke
Slaan oor na eerste ongeleesde boodskap

Janet Y Bang

ongelees,
30 Jul. 2019 19:32:462019-07-30
aan chib...@googlegroups.com

Hello, 


We are running the following code to get word types: 

freq +sm;*,o% @


We noticed that for some instance of "is", that mor is treating it as is-PL: 

*MOT: I don't think I know what a pegasus is .
%mor: pro:sub|I mod|do~neg|not v|think pro:sub|I v|know pro:int|what det:art|a n|pegasus n|i-PL .

Do you have suggestions for how we might be able to fix this automatically? 


Thank you,

Janet




--

Janet Y. Bang, Ph.D.

Postdoctoral Fellow

Department of Psychology

Stanford University


jb...@stanford.edu





Janet Bang

ongelees,
30 Jul. 2019 21:04:522019-07-30
aan chib...@googlegroups.com
Hello,

I apologize, I don't believe there is anything wrong with the mor library. We had included the letter "i" as a noun in order to capture the following sentence: 

Here "i" is referring to the letter.
MOT: she know's all her i's

In this example, "i" was coming up as an unknown word, and we were not able to do i@l's. Is there a way for us to transcribe this line to capture this as a plural of the letter i? Or some way to include this letter to our internal library to capture this as a different type? 

Thank you,
Janet


--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/DM5PR02MB32759CF15B15B27B87E5D5C3D7DC0%40DM5PR02MB3275.namprd02.prod.outlook.com.

Leonid Spektor

ongelees,
30 Jul. 2019 22:00:582019-07-30
aan chib...@googlegroups.com
Janet,

To get plural of “i” you need to trascribe it like this i@l-s.


Leonid.

Janet Bang

ongelees,
31 Jul. 2019 00:07:232019-07-31
aan chib...@googlegroups.com
Great! Thank you!

Brian MacWhinney

ongelees,
31 Jul. 2019 10:16:492019-07-31
aan ChiBolts
Dear Janet,

For MOR to work properly on this, you need to transcribe this as i@l-s. It is ugly, but for the plurals of letters, this is the way it must be done.

-- Brian MacWhinney, Teresa Heinz Professor of Cognitive Psychology, Computational Linguistics, and Modern Languages, CMU
> To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/CAC5V4hiKC3fMRpFjEGMrASH0wVJ4iLyCaZs_Yz-zrOh5WDF5Mg%40mail.gmail.com.

Janet Bang

ongelees,
21 Jun. 2020 15:47:572020-06-21
aan ChiBolts
Hello, 

We had a couple of questions for how to capture plurals in Spanish transcription:

1. Plural of proper nouns

In the example below, the plural form of the proper noun in English is noted with an "-s", but in Spanish this convention does not seem to work.  Is there a way to capture the plural form on the main line in Spanish? I tried using an "&PL" on the main line which seems to work, but wasn't sure if there was a more preferred option?

English only
*FAT: you know those blue Explorer-s ?
%mor: pro:per|you v|know pro:dem|those adj|blue n:prop|Explorer-PL 

Spanish only
*MOT: hay Mickey_Mouse-s aquí.
%mor: v|habe-3S&PRES&SPEC=have ?|Mickey_Mouse-s adv|aquí=here

Spanish with &PL
*MOT: hay Mickey_Mouse&PL aquí.
%mor: v|habe-3S&PRES&SPEC=have n:prop|Mickey_Mouse&PL adv|aquí=here


2. Plural letters in Spanish (also with two different special lexical forms): 

In the example below, the "s@s@l-s" is meant to denote that the mother pronounces the letter "s" in English (@s), but we also wanted to tag this as a letter (@l), and she uses the plural form. When we run MOR it doesn't seem to recognize how to break up this form, whereas in English we are able to get the breakdown to a plural letter. Is there a way we can tag the "s" as 1) second language 2) letter 3) plural?

English only
*MOT: [- eng] look here we have the s@l-s .
%mor: cop|look adv|here pro:sub|we v|have det:art|the n:let|s-PL 

Spanish/English
*MOT: mija, aquí las tienes las s@s@l-s.
%mor: co:voc|mija cm|cm adv|aquí=here pro:obj|la&f-PL=she v|tene-2S&PRES=have det:art|el&f-PL=the ?|s@s-s 

Thank you!
Janet




Brian MacWhinney

ongelees,
21 Jun. 2020 16:59:122020-06-21
aan ChiBolts, Janet Bang
Dear Janet,
    Most of the Spanish corpora have a %mor line in which the plural marker is coded as -PL, just as it is in English.  Spanish has very few irregular plurals, so &PL is uncommon.  Given the availability of tagging through MOR for Spanish, I would not recommend trying to pull off plurals from the main line.  
   As you note, we still rely on the -s tag for the plural of letters and proper nouns.  If you transcribe as s@l-s, you get n:let|s-PL which seems okay to me.  But I see that Mickey_Mouse-s comes out as n:prop|Mickey_Mouse–s which is not so great.  I can try to fix that, but this is a pretty rare case.

--Brian 

Janet Bang

ongelees,
21 Jun. 2020 17:21:212020-06-21
aan Brian MacWhinney, ChiBolts
Hi Brian, 

Ah I see, I was just concerned that the part of speech tag was a ?, but we are focusing on the lemma so I guess this doesn't matter in that case since the morphemes are still being separated the way we want. 

Thanks!
Janet
Antwoord almal
Antwoord skrywer
Stuur aan
0 nuwe boodskappe