Dedupe 1.0


Forest Gregg

Aug 18, 2015, 5:35:42 PM
to open-source-...@googlegroups.com
Hi all,

We are releasing dedupe 1.0 today. 

Despite the big version number update, there are no breaking changes in this version. Dedupe has reached a point of completeness and stability that warrants a 1.0 release.

This version does bring some new features. Namely, fields of type String, or of a type derived from String, can now use Dirko Coetsee's Hidden Alignment Conditional Random Field comparison. This distance measure can give you more accurate results, but it is significantly slower than the default edit distance.
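
As a rough sketch of what opting in might look like: the snippet below assumes the CRF comparison is enabled per field with a 'crf' key in the field definition, which is how I recall the variable definition docs describing it, so please check the CHANGELOG before relying on it. The field names and the crf_fields helper are made up for illustration and are not part of dedupe.

```python
# Hypothetical field definitions; the field names are illustrative only.
# Assumption: a String field opts in to the CRF comparison via a 'crf' key.
fields = [
    {'field': 'name', 'type': 'String', 'crf': True},  # slower CRF comparison
    {'field': 'address', 'type': 'String'},            # default edit distance
]


def crf_fields(definitions):
    """Return the names of fields that opt in to the CRF comparison.

    This helper is not part of dedupe; it only inspects the dictionaries.
    """
    return [d['field'] for d in definitions if d.get('crf', False)]


if __name__ == '__main__':
    print(crf_fields(fields))
    # With dedupe installed, the definitions would then be handed to the
    # library, e.g. dedupe.Dedupe(fields).
```

Because the CRF measure is slower, it probably makes sense to turn it on only for the fields where the default edit distance is giving poor matches.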

There have also been performance improvements, and you can get the details for everything here: https://github.com/datamade/dedupe/blob/master/CHANGELOG.md

Best,

Forest