Affixes, etc. in agglutinating languages


Graham Ranger

Mar 19, 2025, 3:03:03 AM
to AntConc-Discussion
Hello,
I'm working on a highly agglutinating (agglutinative?) language. I'd like to be able to identify the lemmata, which are often hidden inside prefixes and suffixes. Is there a method to identify recurring sequences of letters inside a word? I'm thinking of an equivalent to the n-grams tool, only for characters, but I cannot seem to tweak the token definitions, etc. to produce this.
Many thanks in advance for any help you might be able to give me.
Best regards,
Graham.
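
[Editorial sketch, not part of the original message.] The character-level n-gram idea described in the question can be approximated outside AntConc with a few lines of Python. This is a minimal sketch under stated assumptions, not an AntConc feature: the function names `char_ngrams` and `count_char_ngrams` are hypothetical, words are assumed to be runs of `\w` characters, and the Turkish sample forms are illustrative only and are not the language under discussion.

```python
from collections import Counter
import re


def char_ngrams(word, n):
    """Return every contiguous run of n characters inside a single word."""
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def count_char_ngrams(text, min_n=3, max_n=5):
    """Count character n-grams inside words, never across word boundaries."""
    counts = Counter()
    # Assumption: a "word" is any run of \w characters; adjust for the
    # orthography of the language being studied.
    for word in re.findall(r"\w+", text.lower()):
        for n in range(min_n, max_n + 1):
            counts.update(char_ngrams(word, n))
    return counts


if __name__ == "__main__":
    # Illustrative toy input (Turkish word forms), not real corpus data.
    sample = "evler evlerim evlerimiz evlerimizde"
    for gram, freq in count_char_ngrams(sample).most_common(5):
        print(gram, freq)
```

Sequences that recur across many word forms are candidates for stems or affixes, but frequency alone does not separate the two, so the output would still need manual inspection.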

Laurence Anthony

Mar 19, 2025, 4:17:43 AM
to ant...@googlegroups.com
Hi Graham,

I'm working on full AI integration into AntConc, which might be a perfect fit for this kind of problem. Can you give me three different examples of inputs and expected outputs for different use cases?

Laurence.

###############################################################
Laurence ANTHONY, Ph.D.
Professor of Applied Linguistics
Faculty of Science and Engineering
Waseda University
3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
E-mail: antho...@gmail.com
WWW: http://www.laurenceanthony.net/
###############################################################


Graham Ranger

Mar 19, 2025, 4:29:25 AM
to AntConc-Discussion
Thanks Laurence! It's a team project... I hope to be back shortly with potential input / output examples after I've been able to discuss things with colleagues.
Best,
Graham.