In addition to the helpful suggestions made above, here are some R-specific pointers:
— stringr is an extremely helpful package with which to do most of the string manipulation actions (whitespace removal, tokenisation, regex matching) recommended above.
— you may also need a package that helps you compute ‘distances’ between the strings you are comparing. stringdist is one such package. However, with Indian names, I found some of the phonetic distance algorithms (rogerroot, soundex) in the phonics package much more helpful.
Hope this helps! Good luck!
Madhu