Hello all,
I have a corpus that I want to analyze concerning a specific pattern, illustrated as follows:
"て嬉しい"
the て is usually a connector, connecting the previous word and the adjective 嬉しい.
I want to create a frequency word list of the word that is preceded and thus connected by て (e.g. 会えて嬉しい). In this case I would be interested in 会え[て], or its plain form.
The corpus does hold complete sentences however, so it would be important it does not regard the complete string before て嬉しい but only the word right before て, which could consist of one or multiple letters.
Is there a way to create a query for that?
Something that might give me a list similar to this (I don't mind, whether it shows the word including て, without て / the word stem or if it would even show the whole part 会えて嬉しい):
Token Count
会え(て嬉しい) 25
見れ(て嬉しい) 16
I tried to query simply like that (although this is not ideal, as it does not really select for words), but if I include て it will not give me results, as I have tagged the corpus with TagAnt in Japanese and I guess I would need to include that somehow in my query.
If I just search for 嬉しい and set the span to 5L I will get a list, that could help me somehow, but would require a lot of manual checking.
Thank you so much.
Best
--
You received this message because you are subscribed to the Google Groups "AntConc-Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to antconc+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/antconc/017cc460-a2fc-4282-8939-80571080e02an%40googlegroups.com.