Announcing the release of the Name-Based Ethnicity Classification System

Kalmasoft

Apr 21, 2026, 2:22:14 AM
to SIGARAB: Special Interest Group on Arabic Natural Language Processing
Hello SIGARAB

We are pleased to announce the Name-Based Ethnicity Classification System, a specialized hybrid, data-driven system designed to identify, categorize, and geolocate individuals by analyzing their personal names (first, middle, and last). It operates as part of a larger suite of name-processing and database tools and uses large-scale proprietary databases to infer ethnicity, religion, culture, name origin, and race. As a last-resort fallback, the system uses a trained model to predict the same classes.
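For readers who want a concrete picture of the "database first, trained model as fallback" flow described above, here is a minimal Python sketch. It is not Kalmasoft's implementation: the toy dictionary, the placeholder labels, and the names `normalize`, `toy_model`, and `classify` are all assumptions made up for illustration.

```python
from typing import Callable, Dict, Tuple

# Hypothetical stand-in for a curated name database (assumption, not real data):
# maps a normalized name to a placeholder class label.
TOY_DATABASE: Dict[str, str] = {
    "garcia": "label_a",
    "tanaka": "label_b",
}


def normalize(name: str) -> str:
    """Trim whitespace and case-fold so lookups ignore case differences."""
    return name.strip().casefold()


def toy_model(name: str) -> str:
    """Placeholder for a trained classifier; a real model would go here."""
    return "unknown"


def classify(
    name: str,
    database: Dict[str, str] = TOY_DATABASE,
    fallback: Callable[[str], str] = toy_model,
) -> Tuple[str, str]:
    """Return (label, source): try the database first, then the fallback model."""
    key = normalize(name)
    label = database.get(key)
    if label is not None:
        return label, "database"
    # Last resort: no database hit, so ask the model for the same kind of label.
    return fallback(key), "model"


if __name__ == "__main__":
    print(classify("  Garcia "))
    print(classify("Xyz"))
```

Returning the source alongside the label lets a caller tell a database hit apart from a model prediction, which is useful if the two are trusted to different degrees.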

English documentation:
https://kalmasoft.com/KNAP/Kalmasoft_Personal_Names_Classification_System.pdf

Arabic documentation:
https://kalmasoft.com/KNAP/Kalmasoft_Personal_Names_Classification_System_Arabic.pdf

Standout features:

1.     Multi-pass processing and a hybrid methodology to manage the high linguistic variability inherent in global personal names. This approach allows the system to refine its classification through successive layers of analysis.

2.     Multilevel statistical inference with a double-factor scoring reference.

3.     Multilingual and multi-script support: handles names that vary significantly across scripts and character systems.

4.     Support for several types of complex naming conventions, covering name structures beyond simple “surname and given name” pairs.

Regards

Kalmasoft

