Comparing headwords via regular expressions.


Aaron Broadwell

Nov 10, 2017, 2:21:07 PM
to FLEx list
Colleagues --
In the course of exploring possible derivational morphology, I would like to compare entries of verbs that end in the syllables -co or -to to similar entries of verbs without this final syllable.

So, for instance, if I have entries for verbs like aboto or aboco, I would like to look at the entry of a verb like abo.

I can work out how to do this manually, with two windows.  One window has the headwords filtered for final -co or -to, and in the other window, I scroll through entries looking for anything similar.

But I wonder if there is some more elegant way to do this with regular expressions.  I want (approximately) to compare headwords matching a pattern (\w+) to those matching \1[tc]o.  The problem is that I am not sure whether there is a way to make the search apply across headwords.

I would be grateful for suggestions.

Aaron Broadwell

Daniel Kaufman

Nov 12, 2017, 10:32:50 AM
to FLEx list
Hi Aaron, 

I don't think there's a way to search across headwords with regular expressions in FLEx, but if you can export just your headwords into a single text file, then, in principle, the following regular expression would do the trick:

(\w+\b) ?(?=(?:.*\s)*(?:\1(?:co|to)\b))

But it is memory intensive and can thus fail or take a very long time on a large file. Perhaps there is an easier way, but I'm not enough of a regexpert to figure that out. 
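For what it's worth, once the headwords are in a plain list, the same comparison can be sketched without a backreference at all. The following is only an illustration in Python, not something FLEx provides: the function name find_suffix_pairs and the sample words are made up, and it assumes the headwords have already been exported (for example, one per line in a text file) and read into a list. It strips a final -co or -to from each headword and checks whether the remainder is itself a headword, which avoids the memory cost of a long lookahead.

```python
def find_suffix_pairs(headwords, suffixes=("co", "to")):
    """Return (base, derived) pairs where derived == base + one of the suffixes.

    `headwords` is any iterable of strings; blank entries are ignored.
    This is a hypothetical helper for illustration, not part of FLEx.
    """
    # A set gives fast membership tests and drops duplicate headwords.
    words = {w.strip() for w in headwords if w.strip()}
    pairs = []
    for word in sorted(words):
        for suffix in suffixes:
            base = word[: -len(suffix)]
            # Keep the pair only if the stripped form is itself a headword.
            if word.endswith(suffix) and base in words:
                pairs.append((base, word))
    return pairs


# Made-up sample data based on the forms mentioned in this thread.
sample = ["abo", "aboto", "aboco", "mito", "kala"]
for base, derived in find_suffix_pairs(sample):
    print(base, derived)
```

With a real export, the list could be built with something like open("headwords.txt", encoding="utf-8").read().splitlines(), where the file name is again just a placeholder.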

best,
Dan

Jeff Shrum

Nov 13, 2017, 11:37:50 PM
to flex...@googlegroups.com

Aaron,

 

I am just thinking that Phonology Assistant (PA) may work for this.  While you are not doing phonology, you are sorting on environments.  I believe that PA can export sort results for use in phonology papers.  It is something to consider, since it would not require exporting your data into a different format such as a spreadsheet.

 

 

Jeff Shrum

SIL International

Language Technology Consultant

Dallas, TX USA

Extension: 2030

Cell: 903-371-0297

--
You are subscribed to the publicly accessible group "FLEx list".
Only members can post but anyone can view messages on the website.
To change your status, please write to flex_d...@sil.org.
You can join this group by going to http://groups.google.com/group/flex-list.
---
You received this message because you are subscribed to the Google Groups "FLEx list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.
To post to this group, send email to flex...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/9ab17a91-f8b0-4b05-bffb-7847ec2253ce%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Jonathan Dailey

Nov 14, 2017, 5:26:57 PM
to FLEx List
What if you filter on verbs and then sort?  Shouldn't that show you the data if it is like the above?


Aaron Broadwell

Nov 20, 2017, 8:13:01 PM
to FLEx list
Thank you all for these suggestions!

Aaron


On Friday, November 10, 2017 at 2:21:07 PM UTC-5, Aaron Broadwell wrote: