Hi,
For my NLP project, I need to write a program to match two sets of names.
For example: I need to identify below matchings.
Barack Obama - Democrat Barack Obama
Barack Obama - Mr. Obama
Barack Obama - Senator Barack Obama
I thought of first stripping off the Prefixes like "Democrat", "Senator" etc and then using some variant of edit distance algorithm.
Is there a way nltk can tell me that Democrat, Senator etc are not part of the name, so that I can strip them off.
Thank you
Bala
PS: Is there a good opensource name matching software available.