Alternative Spellings

23 views
Skip to first unread message

mustaph...@gmail.com

unread,
May 31, 2014, 7:04:28 AM5/31/14
to ope...@googlegroups.com
Hey.. I took a look at the legislators.csv file in of the nouweb repository... I think it's essential to have an "alternative_spelling" field for the proper names. It solves a big problem because people transliterate differently in their from arabic.

Example:
first_name: Nayla (canonical)
first_name_alternative_spellings: Neila, Neyla, Naila;
last_name: Tueni (canonical)
last_name_alternative_spellings: Twayni, Twayny, Tueny, Twaini, Twainy

Then in the search or api calls any combination should land on the canonical result.

Marc Farra

unread,
May 31, 2014, 7:13:50 AM5/31/14
to ope...@googlegroups.com
Good point. I'll post it as an issue to the github repo https://github.com/openleb/nouwweb/issues 

Marc Farra

unread,
Jun 7, 2014, 6:38:43 AM6/7/14
to ope...@googlegroups.com
Mustapha, I was thinking about this one and I think the best idea right now would be to start a doc to get all the alternative spellings, unless someone already has that data. What do you think?


On Saturday, May 31, 2014 2:04:28 PM UTC+3, Mustapha Hamoui wrote:
Reply all
Reply to author
Forward
0 new messages