On 22 Feb 2013, at 3:29 PM, Valentin Tablan <
v.ta...@gmail.com> wrote:
> Is there a way to (quickly) find the global count for a given term? I
> would rather not have to iterate the whole postings list to accumulate
> the counts.
OK. This can mean
0) The frequency (number of documents in which the term appears)
1) The occurrency (the number of occurrencies of the term in the whole collection).
If you want 0), of course IndexIterator has a .frequency() method. If you want 1), it's in .globcounts (but that's the old name--they're called .occurrencies by now), gamma-coded. You can use SemiExternalGammaList to access the data in a relatively speedy manner. I should add a cache to that stuff...
Ciao,
seba