# Show predicates
print(deduper.blocker.predicates)
print(deduper.blocker.index_fields)
1/10 positive, 0/10 negative
Do these records refer to the same thing?
(y)es / (n)o / (u)nsure / (f)inished / (p)revious
INFO:dedupe.training:Final predicate set:
INFO:dedupe.training:(SimplePredicate: (commonIntegerPredicate, s_road), SimplePredicate: (commonTwoTokens, scity))

2/10 positive, 0/10 negative
Do these records refer to the same thing?
(y)es / (n)o / (u)nsure / (f)inished / (p)revious
INFO:dedupe.training:Final predicate set:
INFO:dedupe.training:(SimplePredicate: (commonIntegerPredicate, s_road), SimplePredicate: (commonTwoTokens, scity))
INFO:dedupe.training:(SimplePredicate: (commonTwoTokens, nameclean), SimplePredicate: (wholeFieldPredicate, s_road))

6/10 positive, 3/10 negative
Do these records refer to the same thing?
(y)es / (n)o / (u)nsure / (f)inished / (p)revious
INFO:dedupe.training:Final predicate set:
INFO:dedupe.training:(SimplePredicate: (commonTwoTokens, nameclean), SimplePredicate: (tokenFieldPredicate, szip5))
INFO:dedupe.training:(SimplePredicate: (oneGramFingerprint, szip5), TfidfNGramCanopyPredicate: (0.8, s_road))
INFO:dedupe.training:(SimplePredicate: (commonIntegerPredicate, s_road), SimplePredicate: (commonTwoTokens, scity))

8/10 positive, 4/10 negative
Do these records refer to the same thing?
(y)es / (n)o / (u)nsure / (f)inished / (p)revious
INFO:dedupe.training:Final predicate set:
INFO:dedupe.training:(SimplePredicate: (commonThreeTokens, s_road), SimplePredicate: (oneGramFingerprint, szip5))
INFO:dedupe.training:(SimplePredicate: (commonTwoTokens, nameclean), SimplePredicate: (tokenFieldPredicate, szip5))

19/10 positive, 9/10 negative
Do these records refer to the same thing?
(y)es / (n)o / (u)nsure / (f)inished / (p)revious
INFO:dedupe.training:Final predicate set:
INFO:dedupe.training:(SimplePredicate: (twoGramFingerprint, s_house_number), TfidfNGramCanopyPredicate: (0.4, s_road))
INFO:dedupe.training:(SimplePredicate: (commonThreeTokens, nameclean), SimplePredicate: (tokenFieldPredicate, szip5))
INFO:dedupe.training:(SimplePredicate: (commonIntegerPredicate, s_road), SimplePredicate: (commonTwoTokens, scity))
INFO:dedupe.training:(SimplePredicate: (commonTwoTokens, nameclean), SimplePredicate: (commonTwoTokens, scity))

25/10 positive, 14/10 negative
Do these records refer to the same thing?
(y)es / (n)o / (u)nsure / (f)inished / (p)revious
INFO:dedupe.training:Final predicate set:
INFO:dedupe.training:(SimplePredicate: (tokenFieldPredicate, sstate), SimplePredicate: (wholeFieldPredicate, s_house_number))
INFO:dedupe.training:(LevenshteinCanopyPredicate: (3, nameclean), SimplePredicate: (tokenFieldPredicate, szip5))
INFO:dedupe.training:(TfidfNGramCanopyPredicate: (0.8, s_po_box), TfidfTextCanopyPredicate: (0.4, nameclean))
...
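
For anyone reading the log above: as I understand it, each parenthesised pair in the "Final predicate set" is a compound rule, and two records are only compared if they share a block key under both parts of at least one rule. Below is a rough, self-contained sketch of that idea. The two helper functions are simplified stand-ins that I wrote for illustration, not dedupe's own implementations, and the sample records are made up.

```python
# Simplified stand-ins for two predicate functions named in the log.
# Assumption: these only mimic the general idea; they are NOT dedupe's code.

def whole_field(value):
    """Block key is the entire field value (empty values produce no key)."""
    return {value} if value else set()


def common_two_tokens(value):
    """Block keys are each pair of adjacent tokens in the field."""
    tokens = value.split()
    return {" ".join(pair) for pair in zip(tokens, tokens[1:])}


def compound_keys(record, parts):
    """AND several simple predicates: one key per combination of sub-keys."""
    keys = {""}
    for func, field in parts:
        sub_keys = func(record[field])
        # If any part yields no sub-keys, the compound rule yields no keys.
        keys = {f"{k}|{s}" for k in keys for s in sub_keys}
    return keys


def share_block(a, b, parts):
    """True if the two records share at least one compound block key."""
    return bool(compound_keys(a, parts) & compound_keys(b, parts))


# Hypothetical records using two of the field names from the log.
a = {"nameclean": "acme tool supply", "s_road": "main st"}
b = {"nameclean": "acme tool supply co", "s_road": "main st"}
c = {"nameclean": "acme tool supply", "s_road": "oak ave"}

# Mirrors the shape of: (commonTwoTokens, nameclean), (wholeFieldPredicate, s_road)
rule = [(common_two_tokens, "nameclean"), (whole_field, "s_road")]

print(share_block(a, b, rule))
print(share_block(a, c, rule))
```

Here `a` and `b` land in the same block (shared name tokens and identical road), while `a` and `c` do not, because the road differs even though the names match. That would be why the learned rules change as more pairs are labelled: the trainer is searching for combinations that cover the labelled matches while keeping blocks small.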