Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Finding population size
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  2 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Jason Kolb  
View profile  
 More options Jun 27 2012, 11:44 am
From: Jason Kolb <jason.k...@gmail.com>
Date: Wed, 27 Jun 2012 08:44:26 -0700 (PDT)
Local: Wed, Jun 27 2012 11:44 am
Subject: Finding population size

Hi there,

First of all, thanks for making this project available, it's very
interesting. I have two (hopefully quick) questions:

Is there any primer for how to use it? I'm trying to figure it out by
looking at the unit tests but there seems to be several ways to use it and
I'm a little lost in that regard.

Secondly, is there any way to find the actual population size using the
HyperLogLog estimator? I'd like to not only see the cardinality but also
the population size. I can always subclass it and record the count
separately (it'd be a simple counter, count++ type of thing), but it seems
like there's some more direct method that I'm missing.

Thanks!
Jason


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Matt Abrams  
View profile  
 More options Jun 28 2012, 10:44 am
From: Matt Abrams <abra...@gmail.com>
Date: Thu, 28 Jun 2012 10:44:03 -0400
Local: Thurs, Jun 28 2012 10:44 am
Subject: Re: Finding population size
Hi Jason -

I'm glad you are interested in using stream-lib.  In order to help
give you some guidance it would be helpful to know what your intended
use case is?  Are you looking at the bloom filters, counters, top
estimators?

For your second question.  What do you mean by population size?  If
you are just talking about the number of objects you've offered to the
counter then your idea of using count++ is fine.  If you do want to
subclass the counters to keep track of this separately then you will
also need to make sure to update the merge and serialization methods
so that the offer count is persisted and merged correctly if you are
working with multiple counters.

Matt


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »