Arabic Word Frequency

131 views
Skip to first unread message

Fouad Shammary

unread,
Apr 22, 2024, 9:03:40 AM4/22/24
to sig...@googlegroups.com
السلام عليكم ورحمة الله, 
هل يوجد هناك مكان فيه كلمات اللغة العربية مع عدد تكرارها (word frequency) أو ما يشابه ذلك؟

Is there an available public dataset for Arabic word and their frequencies or something similar?

مع الشكر الجزيل,

Abdelhakim Freihat

unread,
Apr 22, 2024, 10:34:56 AM4/22/24
to Fouad Shammary, sig...@googlegroups.com

--
You received this message because you are subscribed to the Google Groups "SIGARAB: Special Interest Group on Arabic Natural Language Processing" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigarab+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/sigarab/CAH6AA6%3D0_Y-bvu_hESv1GUuzXxfXj4B5AgPi-Jtp9NO2yRLgPA%40mail.gmail.com.

Fouad Shammary

unread,
Apr 22, 2024, 3:55:02 PM4/22/24
to Abdelhakim Freihat, sig...@googlegroups.com
Thank you Abdelhakim. Much appreciated!

Mohamed H.

unread,
Apr 23, 2024, 8:31:26 AM4/23/24
to sig...@googlegroups.com

waalaykumsalam wa rahmatullah wa barakatuhu,

Dear brother Fouad,

There is a dataset on word frequencies for Hadith done by Maxim Romanov:

https://maximromanov.github.io/2016/05-30.html

The output/dataset is in PDF, but he outlines an algorithm to reproduce the results.

I hope this is what you are looking for.

Shukran,
Mohamed

--
You received this message because you are subscribed to the Google Groups "SIGARAB: Special Interest Group on Arabic Natural Language Processing" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigarab+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/sigarab/CAH6AA6%3D0_Y-bvu_hESv1GUuzXxfXj4B5AgPi-Jtp9NO2yRLgPA%40mail.gmail.com.

Mohamed H.

unread,
Apr 23, 2024, 8:33:25 AM4/23/24
to sig...@googlegroups.com

Oh and I forgot to add my own Arabic frequency data of Quranic verbs contextualized:

https://www.dragomen.org/projects/85-percent-quran/verbs/

Shukran,

Salam Khalifa

unread,
Jun 29, 2024, 5:35:06 AM6/29/24
to Abdelhakim Freihat, Fouad Shammary, sig...@googlegroups.com
Hi Fouad,

Here you will find the CAMeL Arabic Frequency Lists a comprehensive set of frequency lists for MSA, Dialectal Arabic, and CA.

Best,
Salam

On Mon, Apr 22, 2024 at 6:34 PM Abdelhakim Freihat <abdel....@gmail.com> wrote:
Reply all
Reply to author
Forward
0 new messages