[Resource] Arabic WordNet 4.0 Released - 109,823 synsets (CC BY 4.0)

Salah Abdo

4:39 AM
to sig...@googlegroups.com
Dear SIGARAB community,

I am happy to announce the release of Arabic WordNet 4.0, a
comprehensive lexical resource for Arabic NLP research.

Key Features:
- 109,823 synsets (100% OEWN coverage)
- 124,653 lexical entries
- 166,643 senses
- 265,676 synset relations
- 97.2% ILI coverage
- WN-LMF 1.4 format
- CC BY 4.0 license
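
For readers who want to check figures like these against a WN-LMF file, here is a minimal sketch using only the Python standard library. It assumes the usual WN-LMF element names (`LexicalEntry`, `Sense`, `Synset`, `SynsetRelation`) and the `ili` attribute on `Synset`. The function name `lmf_stats` and the tiny inline sample are hypothetical and are not taken from the release.

```python
import xml.etree.ElementTree as ET

# Tiny hand-written WN-LMF-style sample. The IDs and content are
# hypothetical placeholders, NOT data from Arabic WordNet 4.0.
SAMPLE = """
<LexicalResource>
  <Lexicon id="example-arb" label="Example" language="arb" version="0.1">
    <LexicalEntry id="example-arb-e1">
      <Lemma writtenForm="كتاب" partOfSpeech="n"/>
      <Sense id="example-arb-e1-s1" synset="example-arb-0001-n"/>
    </LexicalEntry>
    <LexicalEntry id="example-arb-e2">
      <Lemma writtenForm="منشور" partOfSpeech="n"/>
      <Sense id="example-arb-e2-s1" synset="example-arb-0002-n"/>
    </LexicalEntry>
    <Synset id="example-arb-0001-n" ili="i00001" partOfSpeech="n">
      <SynsetRelation relType="hypernym" target="example-arb-0002-n"/>
    </Synset>
    <Synset id="example-arb-0002-n" ili="" partOfSpeech="n"/>
  </Lexicon>
</LexicalResource>
"""


def lmf_stats(xml_text: str) -> dict:
    """Count the main WN-LMF elements and compute ILI coverage."""
    root = ET.fromstring(xml_text)
    synsets = root.findall(".//Synset")
    # A synset counts as ILI-linked when its `ili` attribute is non-empty.
    with_ili = sum(1 for s in synsets if s.get("ili"))
    return {
        "synsets": len(synsets),
        "lexical_entries": len(root.findall(".//LexicalEntry")),
        "senses": len(root.findall(".//Sense")),
        "synset_relations": len(root.findall(".//SynsetRelation")),
        "ili_coverage": with_ili / len(synsets) if synsets else 0.0,
    }


stats = lmf_stats(SAMPLE)
print(stats)
```

For a full release file, `ET.parse(path).getroot()` or `ET.iterparse` would replace `ET.fromstring`. The path is left out here because the file name inside the repository is not stated in this thread.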

Methodology:
Built with the expand approach: the Open English WordNet synset
structure is kept, and the Arabic translations were generated with
AI assistance (Google Gemini 3 Pro Preview).

Links:
- GitHub: https://github.com/Salah-Sal/arabic-wordnet-v4
- DOI: https://doi.org/10.5281/zenodo.18335226


This is roughly 11 times the synset coverage of the previous Arabic
WordNet in OMW (9,916 synsets).

Derived from Open English WordNet (CC BY 4.0), based on Princeton
WordNet 3.0.

Feedback and contributions welcome via GitHub issues.

Best regards,
Salah Abdo
Salah.A...@gmail.com

Nizar Habash

5:00 AM
to Salah Abdo, sig...@googlegroups.com
Thanks Salah for sharing this. Very useful.

A few questions:
(1) Is there a written report on the creation process?
(2) How does this work relate to the original Arabic WordNet or to the more recent efforts by Freihat et al. (2024)?
(3) Is there a quality evaluation of the generated resource? LLMs are good, but they hallucinate, as you know. It would be helpful to quantify the error rate. For example, what is the degree of overlap with the original Arabic WordNet? Or what does a manual check of 1,000 synsets show?
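
As an illustration of the two checks suggested in (3), here is a minimal sketch, not a description of any evaluation that has actually been run. It assumes synsets can be identified by some shared key such as ILI identifiers. The function names (`sample_synsets`, `wilson_interval`, `overlap`) are hypothetical, and the Wilson score interval is one common choice for putting bounds on an error rate estimated from a sample.

```python
import math
import random


def sample_synsets(synset_ids, k=1000, seed=0):
    """Draw a reproducible random sample of synset IDs for manual review."""
    ids = list(synset_ids)
    rng = random.Random(seed)
    return rng.sample(ids, min(k, len(ids)))


def wilson_interval(errors, n, z=1.96):
    """Wilson score interval for an error rate of `errors` out of `n` checked.

    z=1.96 corresponds to an approximate 95% confidence level.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p = errors / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


def overlap(old_ids, new_ids):
    """Fraction of the old resource's IDs that also appear in the new one."""
    old, new = set(old_ids), set(new_ids)
    return len(old & new) / len(old) if old else 0.0


# Placeholder numbers for illustration only; these are NOT measured results.
to_review = sample_synsets(range(109_823), k=1000)
low, high = wilson_interval(errors=50, n=len(to_review))
print(f"error rate bounds: {low:.3f} to {high:.3f}")
```

The overlap helper only measures shared identifiers. Comparing the Arabic lemmas attached to those shared synsets would be a separate step.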

Best
Nizar



--
Nizar Habash
Professor of Computer Science
New York University Abu Dhabi
https://www.nizarhabash.com/ 