Hello,
I am Jordi Montes, Computer Scientist from Barcelona. The last year I been studying a new algorithm called Recordinality. Recordinality counts the number of records (more generally, k-records) in a stream for giving a Cardinality estimation. It depends in the underlying permutation of the first occurrences of distinct values, so it's a very different from the other estimators.
I discovered this library about 6 months ago and I want to contribute to it. I have found the time for this project and I want to spend the next days doing it. I have clone the project and ran maven. I have implemented the clas Recordinality (inside cardinality) which extends from ICardinality and Serialize. I have written some tests inspired by the tests written for TestHyperloglog class.
However, I have some questions about how to integrate Recordinality in your code:
- I would need that someone check my work because I have dudes about the serialize process.