Dear Leonid,
No worries. Thank you so much for your input.
Within our study, utterances are being split based on communication units that consist of the main clause and its dependent clauses. However, some utterances have been split due to interruptions by the administrator (e.g. prompts, affirmations) have been re-combined using the "+/." and "+," markers between utterances as it forms a communication unit when combined.
*CHI: one sunny sunday morning the cat +/.
*CHI: +, is at [x 3] the beach .
Based on the CHAT manual, it seemed like it was possible to combine these utterances using the MLU function: "An advantage of using +/. instead of +... is that programs
like MLU are able to piece together the two segments and treat it as a single utterance when
a segment with +/. is followed by +, on the next utterance."
Due to the population that we are testing, we intend to use MLU-3 instead of MLU for our analysis. Following the instructions from CLAN, we would have to run MAXWD on these files first: "The second CLAN analysis we will perform computes the mean length in morphemes
of each child’s five longest utterances. To do this, we will run MAXWD on the five files
in the ne20 folder and then MLU on the output of MAXWD. By default, MAXWD runs
on the %mor line, rather than the main line.
maxwd +t*CHI +g1 +c5 +d1 *.cha"
For our analysis, we have used these commands for MLU-3:
1. maxwd +t*CHI +g1 +c3 +d1 +f +s+xxx *.cha
2. mlu +s+xxx *.cex
However, in the event of split utterances (due to interruptions), this output 3 longest utterance will be extracted instead:
*CHI: cat is trying to catch the butterfly.
*CHI: the [x 3] boy is trying to &-s say next time cannot catch the butterfly +/. (The utterance combined to this, which is the next child utterance within the transcript, has not been combined automatically.)
*CHI: +, is at [x 3] the beach . (The utterance combined to this, which is the previous child utterance within the transcript, has not been combined automatically. Refer to above example for the combined utterance.)
(Note: Analysis tiers were excluded, this is just an example.)
As shown, this leaves us with partial utterances which would not be accurate for an MLU-3 analysis.
Hope this clarifies the situation, and I hope that you could advise if there is a way to resolve this situation.