[ANN] dedup: A fast stream deduplication package.

141 views
Skip to first unread message

Klaus Post

unread,
Oct 22, 2015, 10:54:57 AM10/22/15
to golang-nuts
Hi!


I have just released a package that will allow you to deduplicate streams, which allows you to remove duplicate data across gigabytes and even terabytes of data, at speeds >1GB/s.

For an introduction and example benchmarks I have pu up a blog post: https://blog.klauspost.com/fast-stream-deduplication-in-go

If you just want to see the package, you can go the github project page: https://github.com/klauspost/dedup


It is currently in a "stable" state, but still lacks some corner case tests, which I will be adding in the coming days. It is however not extensively battle-tested yet, so if you are interested in that, please let me know.

I do not plan do modify the current API and file format unless there is a fundamental problem I cannot fix without doing so.


Comments, feedback, questions, suggestions, problems are all very welcome.



/Klaus
Reply all
Reply to author
Forward
0 new messages