Masader Search: search +1000 datasets using natural language text

7 views
Skip to first unread message

Zaid Alyafeai

unread,
Jan 18, 2026, 7:09:20 AM (24 hours ago) Jan 18
to SIGARAB: Special Interest Group on Arabic Natural Language Processing
I am excited to share with you an initial version of an improved search mechanism for over 1000 datasets in Masader. The tool can be used to find datasets using a simple search prompt like "datasets published after 2022" or "language modeling datasets with more than 150k tokens", etc. 


PS: This is an experimental version of the tool. Any comments, suggestions, etc. are welcome. 

Zaid 
Reply all
Reply to author
Forward
0 new messages