Hi all,
We are excited announce a couple of developments for dedupe.
1. We have built a web interface for dedupe:
http://dedupe.datamade.us/
Right now, this is just for deduplicating spreadsheets using a pretty simple data model. However, it doesn't require any installation or programming ability, so that's pretty awesome.
This code is still pretty raw.
2. Major update for dedupe library
We finally have merged in Nikit Saraf's valuable contributions for linking record sets. Other goodies include parallel processing, a better user interaction model, and lots of performance improvements.
Read more about it :
http://datamade.us/blog/dedupe-0-5/