Using AWP with Arabic texts

60 views
Skip to first unread message

Alex Strick van Linschoten

unread,
Feb 6, 2017, 3:26:27 AM2/6/17
to AntWordProfiler-Discussion
I've newly discovered this tool thanks to a video I watched yesterday. I'm interested in getting an accurate profile of Arabic-language texts and their difficulty levels, but I'm not sure if there are any word profiles available for Arabic. 

Has anyone using this forum / programme had any experience of making Arabic frequency profiles?

Thanks,

Alex

Laurence Anthony

unread,
Feb 7, 2017, 10:13:00 AM2/7/17
to antword...@googlegroups.com
Hi Alex,

If you have a general Arabic corpus, you can use AntConc (or another more Arabic-oriented) corpus tool to create a frequency list from it.

Then, if you split the results into frequency bands (saving your data in the UTF-8 encoding), you can then load these into AntWordProfiler as separate frequency lists and it should work fine.

I suggest you try it out by making a set of three very simple level lists and testing the program. For English, it would be something like:

the
of
a

cat
dog
mouse

big
fat
tall

And then, you can load a test file like below:
the fat cat sat on the big mat

I hope that helps.

Laurence.



###############################################################
Laurence ANTHONY, Ph.D.
Professor
Center for English Language Education in Science and Engineering (CELESE)
Faculty of Science and Engineering
Waseda University
3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
E-mail: antho...@gmail.com
WWW: http://www.laurenceanthony.net/
###############################################################

--
You received this message because you are subscribed to the Google Groups "AntWordProfiler-Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to antwordprofiler+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Alex Strick van Linschoten

unread,
Feb 12, 2017, 11:37:23 AM2/12/17
to antword...@googlegroups.com

Thank you for this suggestion. I will see if I can find an Arabic corpus to work with.

Alex

You received this message because you are subscribed to a topic in the Google Groups "AntWordProfiler-Discussion" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/antwordprofiler/ytwbvelVsSk/unsubscribe.
To unsubscribe from this group and all its topics, send an email to antwordprofil...@googlegroups.com.

Laurence Anthony

unread,
Feb 12, 2017, 11:40:56 AM2/12/17
to antword...@googlegroups.com
You're welcome!

Laurence.

###############################################################
Laurence ANTHONY, Ph.D.
Professor
Center for English Language Education in Science and Engineering (CELESE)
Faculty of Science and Engineering
Waseda University
3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
E-mail: antho...@gmail.com
WWW: http://www.laurenceanthony.net/
###############################################################

To unsubscribe from this group and all its topics, send an email to antwordprofiler+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages