Example: Albuquerque, NM, USA can be written in many ways, including:
% Just plain "Albuquerque".
% "Albuquerque, USA": city and country, no state name
% "Albuquerque, Nuevo Mexico": city and alternate name for state
% "Albuquerque, Bernalillio County, New Mexico, USA": includes county
% "Alburquerque, Comte de Bernalillio, USA": alternate city name,
ASCII-fied alternate county name.
% "Albuquerque, NM": city name and state postal code
% "Abq, NM": commonly used abbreviation
% "The Duke City": city nickname (I don't think geonames has this)
% "Albuquerque, Land of Enchantment, USA": state nickname (I don't
think geonames has this)
Basically, this would be a giant JOIN, combining all names for
Albuquerque with all names for the county it's in (but allowing for
empty string as county), with all names for the state (again allowing
for the empty string), etc.
As a final step, I'd remove non-alpha characters (including spaces) and
lowercase everything for normalized text searching (eg,
"albuquerquenmusa").
This seems like an obvious thing to do. Has anyone done it or working
on something like this?
--
We're just a Bunch Of Regular Guys, a collective group that's trying
to understand and assimilate technology. We feel that resistance to
new ideas and technology is unwise and ultimately futile.
Marc