Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

why a search engine don't or can't index all of a file when it's of large size?

1 view
Skip to first unread message

entrepreneur

unread,
Jun 16, 2006, 1:11:04 AM6/16/06
to
why a search engine don't or can't index all of a file when it's of
large size?

Trevor Jenkins

unread,
Jun 16, 2006, 9:57:01 AM6/16/06
to
On 16 Jun 2006 00:11:04 -0500, entrepreneur <mailtu...@gmail.com> wrote:

> why a search engine don't or can't index all of a file when it's of
> large size?

The answer to this (and also partly for your question about stopwords)
is simple: saving disk space. Well that used to the excuse for both things
but compressing the index and the position pointers within each inverted
entry will save more space than partial indexing or stopwording.

You should read Witten, Moffat and Bell's book "Managing Gigabytes" that
will provide you with enough technical background to answer these (and
many more) questions for yourself.

Regards, Trevor

<>< Re: deemed!

0 new messages