.png?part=0.1&view=1)
.png?part=0.2&view=1)
--
"FLEx list" messages are public. Only members can post.
flex_d...@sil.org
http://groups.google.com/group/flex-list.
---
You received this message because you are subscribed to a topic in the Google Groups "FLEx list" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/flex-list/KpF8nbc-ndI/unsubscribe.
To unsubscribe from this group and all its topics, send an email to flex-list+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/54fc9e62-6686-4f77-b4c9-5697f82e71can%40googlegroups.com.
Dear Tim and all,
Some points that I find useful in my workflows:
-- Instead of multiple transcription lines,
put all but one (which is ideally in IPA or similar) into
"translation" -- e.g. add another writing system for the
vernacular language into Analysis writing systems, and add a
tier for "Literal translation" in that writing system.
(I avoid using Notes because they behave differently from
translation lines.)
Note that in recent versions of FLEx, you can add Custom tiers
at sentence level -- perfect for storing the original text in
different writing systems. It also exports/imports OK to/from
flextext.
-- You're right that all changes involving
splitting/merging sentences should be done BEFORE you import
from ELAN into FLEx.
If you happen to have done some splitting/mergin in FLEx, your
flextext export will have lost time and speaker info on the
affected sentences. However this can be fixed manually in the
flextext file, e.g. using a plain text editor like Notepad++. It
needs some time and attention but is far easier than
re-importing and re-analyzing the entire text (unless you have
too many fixes to do).
When your text is time-aligned, each
sentence (<phrase> element in flextext) should have
several attributes, e.g.:
<phrase guid="e6f7b5ec-0a7d-40a8-8acb-665b6af36058"
begin-time-offset="60760" end-time-offset="62750" speaker="MPT"
media-file="15762501-d42c-4c82-bdf4-ad15fc821a82">
The time offsets are in milliseconds; here they point to a 2 sec
fragment from 01:00.760 to 01:02.750.
When the time-alignment is broken in FLEx,
only the "guid" will remain:
<phrase guid="e6f7b5ec-0a7d-40a8-8acb-665b6af36058">
You'll need to look up the timestamps (in
milliseconds) in ELAN, and then just copy the string with all
the attributes from an undamaged <phrase> element and put
the right values for "begin-time-offset" and "end-time-offset".
Don't forget to check the time values on the preceding/following
<phrase> -- the border adjacent to the one you fixed can be
wrong too.
-- Since FLEx cannot import any word- or
morph-level info, it has indeed little sense to repeat
ELAN-FLEx-ELAN roundtrip more than once for any given text. So,
do a maximum of transcription, editing and other preprocessing
in ELAN -- gloss in FLEx -- do all the rest in ELAN. In
particular, pay attention to have no sentence-final punctuation
(. ! ?) in the middle of annotations before exporting from ELAN
-- otherwise FLEx will split the sentence and you'll have to fix
the time/speaker attributes as described above.
-- If you have your participants each in
his/her own set of tiers in ELAN, and if you set the
"Participant" property correctly for all the tiers, you don't
need to store the participant in the Note field.
You can still put it into Note if you're more comfortable to see it in FLEx (otherwise it is there invisible), but you don't have to, since it will be stored in the "speaker" attribute with each <phrase>, and ELAN will take it from there.
-- Finally, the inconvenience when going
from FLEx back to ELAN, as you said, is that you don't have your
main transcription for the whole sentence anymore (you've got
the segnum tier instead). While this behaviour is unlikely to
change in FLEx, you can copy your new *-word-txt-* tier to a new
one, specifying segnum as parent and choosing e.g. phrase-item
as type (with Symbolic Association as stereotype). This will
concatenate all words and punctuation in a sentence, separated
with a whitespace. And it is just one command to get something
readable again. Not ideal, but you can further use
search&replace e.g. to remove extra spaces before commas.
Hope that helps,
Alexandre
You received this message because you are subscribed to the Google Groups "FLEx list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/CAFKa-3dCHjVNo3KeexgWLi56rMQxMzx%3DtpYtsL%2Bvsbk8NpGAPA%40mail.gmail.com.
Very useful, thanks !
De : 'Alexandre Arkhipov' via FLEx list <flex...@googlegroups.com>
Envoyé : mardi 6 avril 2021 16:33
À : flex...@googlegroups.com
Objet : Re: [FLEx] Re: Multi-language, multi-participant ELAN-FLEx-ELAN experiences sought
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/4d5dce00-8040-d81f-dbf3-114df9497d33%40mail.ru.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/4d5dce00-8040-d81f-dbf3-114df9497d33%40mail.ru.
Hi Tim,
Are you aware whether it is possible to change a transcription (so not the word or morph level information, but the basic transcription) in ELAN and then, after import, re-gloss only that particular changed item in FLEx without issue?
Generally, not this way.
If you reimport a text with even slight changes into FLEx (or
without any changes), all the words will have lost their analysis
(unless it happens differently if you try to merge the two
versions of the text during import, but I never tried that --
maybe someone can correct me). More precisely, the words that did
not change will have their analysis (from the old version of the
text) shown as a suggestion, which then will need to be confirmed.
So one option would perhaps be for your consultants to save their
changes in ELAN (and mark them in some way, e.g. in the comments),
and the person who's in charge of the FLEx project to go through
these changes and replicate them in FLEx.
--> The issue seems to be that if participant "A" is not the first participant in the recording (i.e., the recording starts with participant B), the after ELAN-FLEx-ELAN export, ELAN shows participant "B" as participant "A" and vice versa, have you observed that? I was advised to make a 'dummy segment' in the beginning of each recording with participant "A" if participant "A" is not the first speaker.
Well, I didn't notice that, but the dummy segment is probably a good solution.
Best,
Alexandre
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/CAFKa-3ftC4AbAR4x1A__cV4W%3DrjUH4Y82TnAEOJm_G%2BhLOs6ig%40mail.gmail.com.