Is there a way to export a concordance?

129 views
Skip to first unread message

John Brownie

unread,
Feb 17, 2021, 4:02:37 AM2/17/21
to FLEx list
I'm working on analysis of a particular word, and have a concordance
search that gives me useful information. It would be nice if I could
export the concordance in some way so that I can manipulate it in ways
not possible in FLEx, such as sorting by the target word and following
or preceding words. I don't see anything like that, but I'll ask anyway.

Can I export the Concordance Results list to some sort of file, text,
CSV, XML, HTML, or something?

John
--
John Brownie
Mussau-Emira language, New Ireland Province, Papua New Guinea
Kouvola, Finland

Beth-docs Bryson

unread,
Feb 18, 2021, 12:39:58 PM2/18/21
to flex...@googlegroups.com
Unfortunately not.

There are Jira issues for two important things that are obvious needs with a concordance function:

- Be able to export in table format, so you could open it in Excel or some other spreadsheet program. (I can’t remember if there are requests for other formats.)
- Make the “sort” function (that is, clicking on the column header) sort based on the target word, not the first word of the preceding context.

Both of those are obvious things that anyone working with a concordance would need, but they never made it to the top of the priority list, partly because many of the releases focused on a specific “area” of FLEx (e.g., “Send/Receive”, “Dictionary Configuration”, and for the future: “User Roles and Permission” or “Access History in S/R"), and they didn’t get to the Texts & Words area yet.

-Beth
> --
> "FLEx list" messages are public. Only members can post.
> flex_d...@sil.org http://groups.google.com/group/flex-list.
> --- You received this message because you are subscribed to the Google Groups "FLEx list" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/b4c7f619-2fba-bef4-6ae6-65961fc2989d%40sil.org.

William Hanna

unread,
Feb 18, 2021, 4:26:07 PM2/18/21
to flex...@googlegroups.com
It would be a wonderful addition to be able to do better sorting in a
concordance. I would use it every day.

Bill

Ken Manson

unread,
Feb 18, 2021, 5:19:46 PM2/18/21
to flex...@googlegroups.com
There is a work around.

In the Texts&Words Area | Concordance | Concordance results

Click the cursor in the top left cell and drag to the bottom right cell.
Holding the mouse left-click.
On the keyboard Ctrl+c and paste it into your spreadsheet.

Ken
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/a0ae5889-25e6-6a7b-1c7c-506fe9c2e48d%40sil.org.

Kari Valkama

unread,
Feb 22, 2021, 7:48:06 AM2/22/21
to 'Kari Valkama' via FLEx list
Hi Ken, 

Doing your suggestion gets me whole texts, not the concordance. 

See below the concordance of the word torro
Ambeq 1.31 Den pissen wattu, § tonna puramo Lajanaq ngkalasii indona sola kakanna, § namale ngkalasiipi ambeqna. Polei lako bola Lajanaq, § ngkuanni ambeqna kua, § "Den dea pura kusaeq jio baraqbah." "Malepo kamuq dau nnalai," § "sanga boqjoq gajanaq, § nateqda kukua male nnalai." Nakua ambeqna, § "Umbori nanei joo dea § pura musaeq nenaq?" "Angkumale nnalai duang bongiq karuen." Nabalii Lajanaq kua, § "Jio ulu baraqbah, § jiong bala pao, § dikapuq belopa, § duang ujun sola." Ia tonna duang bongiqmo puranna nakuan ambeqna Lajanaq 

Sometimes it has worked, but now it does not work.

And when it worked, it took a lot of time to scroll down, if you have 1000 lines or more of concordance. And it takes a long time for it to copy. 

Yours, 
Kari


John Brownie

unread,
Feb 22, 2021, 10:23:12 AM2/22/21
to flex...@googlegroups.com
Ken, thank you! That does help by getting the reference and sentences. I still need to split up the sentences around the key word, but that's relatively minor work, and it does get me closer to what I want. I only have 49 occurrences, so it's not as big a problem as Kari has.

A thought is to then export to a text-only form (probably tab-delimited, since there are commas in the baseline) and do a search/replace to break up the sentences. It would be tricky if the key word appears more than once in the sentence, though, probably requiring manual adjustment.

John

'Kari Valkama' via FLEx list wrote on 22/2/21 14:47:

John Brownie

unread,
Feb 22, 2021, 10:36:51 AM2/22/21
to flex...@googlegroups.com
On further examination, I realise that I am getting the whole paragraph,
which I should have noticed in the actual FLEx concordance window. It
makes it a bit more difficult, as I would expect more chance of multiple
occurrences per paragraph, but I can still work with it.

Ken Manson

unread,
Feb 23, 2021, 12:28:02 AM2/23/21
to flex...@googlegroups.com
Hi John,

Export your interlinear texts to html (web page) and use AntConc (http://www.laurenceanthony.net/software/antconc/). It's a powerful program (runs on Windows, OS X, Linux). It's a stand-alone program, doesn't need to be installed.

Happy concordancing
Cheers
Ken

-----Original Message-----
From: flex...@googlegroups.com <flex...@googlegroups.com> On Behalf Of John Brownie
Sent: Tuesday, 23 February 2021 2:37 AM
To: flex...@googlegroups.com
Subject: Re: [FLEx] Is there a way to export a concordance?

--
"FLEx list" messages are public. Only members can post.
flex_d...@sil.org
http://groups.google.com/group/flex-list.
---
You received this message because you are subscribed to the Google Groups "FLEx list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/728f562a-816a-fb54-fff2-becec4546ac6%40sil.org.

Kari Valkama

unread,
Feb 23, 2021, 3:30:11 AM2/23/21
to 'Kari Valkama' via FLEx list
Hi all,

That seems to be my problem also, I get the whole paragraph, which means that there is a lot of clean up to do, especially since my paragraphs are long.

Would it be a big change in the code to change it from paragraph to sentence?

Yours,
Kari
> --
> "FLEx list" messages are public. Only members can post.
> flex_d...@sil.org http://groups.google.com/group/flex-list.
> --- You received this message because you are subscribed to the Google Groups "FLEx list" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/728f562a-816a-fb54-fff2-becec4546ac6%40sil.org.

John Brownie

unread,
Feb 23, 2021, 7:01:08 AM2/23/21
to flex...@googlegroups.com
I had a look at AntConc, but it looks as though plain text rather than
HTML is what it's aimed at analysing. For my current purpose, studying
an invariant word, this works fine. If I wanted to study a morpheme
which has significant variation in form and overlap with other forms, it
wouldn't work so well.

Alexander Rice

unread,
Feb 23, 2021, 10:35:41 AM2/23/21
to flex...@googlegroups.com
John,

AntConc allows you specify tags to show/hide, by default it will hide anything in angle brackets < >, you can configure this in global settings tab, then select 'tags' and unhide anything in angle brackets. Then you could do a concordance search of the html exports.

So if you had a morpheme with a lot form variation you would do a concordance search for <td class="itx_morph_cf">X </td>, "X" being the lexical entry from shared by all the variants of the morpheme.

Or you could use the morph gloss if all the variants have the same gloss:  <td class="itx_morph_gls">X </td>, "X" being the gloss

--
"FLEx list" messages are public. Only members can post.
flex_d...@sil.org
http://groups.google.com/group/flex-list.
---
You received this message because you are subscribed to the Google Groups "FLEx list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.

--
"FLEx list" messages are public. Only members can post.
flex_d...@sil.org
http://groups.google.com/group/flex-list.
---
You received this message because you are subscribed to the Google Groups "FLEx list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.

John Brownie

unread,
Feb 24, 2021, 10:04:30 AM2/24/21
to flex...@googlegroups.com
Alexander Rice wrote on 23/2/21 17:35:
> AntConc allows you specify tags to show/hide, by default it will hide
> anything in angle brackets < >, you can configure this in global
> settings tab, then select 'tags' and unhide anything in angle
> brackets. Then you could do a concordance search of the html exports.
>
> So if you had a morpheme with a lot form variation you would do a
> concordance search for <td class="itx_morph_cf">X </td>, "X" being the
> lexical entry from shared by all the variants of the morpheme.
>
> Or you could use the morph gloss if all the variants have the same
> gloss:  <td class="itx_morph_gls">X </td>, "X" being the gloss
It doesn't seem to be very helpful when I try this with a subset of my
data. The problem is that it's really hard to work out where the hit is
given all the verbiage of the HTML. It seems to me that this only really
gets me the information that it is there without the ability to extract
the sentence aligned with the key term in any straightforward manner.
Reply all
Reply to author
Forward
0 new messages