Message from discussion Blog Search side(bar) effects
From: Jeremy Hylton <jhyl...@gmail.com>
Date: Wed, 12 Nov 2008 18:49:18 -0800 (PST)
Local: Wed, Nov 12 2008 9:49 pm
Subject: Re: Blog Search side(bar) effects
On Nov 3, 9:39 am, "Karen J. Cravens" <karen.crav...@gmail.com> wrote:

> I have a search on my blog name, as I'm sure a lot of people do.
> Recently Blog Search has started picking up its appearance on
> blogrolls, which means I'm getting false positives every time someone
> posts to a blog that has me in their blogroll. And it's a contextual
> snippet, which means I can't even use it as a substitute for
> subscribing to that blog (which I often already am, meaning I'm
> getting doubled volume in Reader).

> I can see the point in searching the whole page, given the number of
> feeds that are partial, but if GBS isn't savvy enough to leave out the
> static content, it's of no use to me.

We have changed the way we index blog posts to include the full
content of the page.  We've had occasional complaints about the use of
the feed content, particularly the problem with partial feeds that you
mentioned.  The indexing change has improved the results for a lot of
queries, both because we have the full content of the page and because
we extract links that are missing from the feeds.  The downside of
this change is that we see more results that match only the blogroll
and other parts of the page that are common to all of a blog's posts.

We expected some problems from blogroll matches, but may have
underestimated the impact on searches using the link: operator or
where the query matches a blog or blogger's name.  We do expect to fix
the problem you're seeing.  We'll use the full page content, but
exclude the content that isn't really part of the post.  I'm not sure
if we'll be able to make the change before the end of the year, but we
are working on it and are pretty confident that it can be solved.
We'll post an update here when we've got a solution.
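
To make the idea of excluding "content that isn't really part of the post" concrete, here is a minimal sketch in Python. It is only an illustration of one simple heuristic (drop text blocks that recur on most pages of the same blog), not a description of how Blog Search does or will do it. The function name strip_shared_blocks, the pre-split "blocks" input, and the 0.8 threshold are all assumptions made up for this example.

```python
from collections import Counter


def strip_shared_blocks(pages, threshold=0.8):
    """Remove text blocks that recur on most pages of one blog.

    pages: list of pages, each given as a list of text blocks (strings).
    threshold: fraction of pages a block must appear on to be treated
    as static content (an arbitrary value chosen for illustration).
    """
    # With fewer than two pages there is nothing to compare against,
    # so return the blocks unchanged.
    if len(pages) < 2:
        return [list(blocks) for blocks in pages]

    # Count on how many pages each block occurs (at most once per page).
    page_counts = Counter()
    for blocks in pages:
        page_counts.update(set(blocks))

    cutoff = threshold * len(pages)
    # Keep only blocks that appear on fewer pages than the cutoff.
    return [
        [b for b in blocks if page_counts[b] < cutoff]
        for blocks in pages
    ]


# Hypothetical pages from one blog: the blogroll repeats on every post.
pages = [
    ["Blogroll: Alice, Bob, Karen", "Post about gardening"],
    ["Blogroll: Alice, Bob, Karen", "Post about cooking"],
    ["Blogroll: Alice, Bob, Karen", "Post about travel"],
]
cleaned = strip_shared_blocks(pages)
```

In this toy input the blogroll block appears on all three pages and is dropped, while each post's own text is kept, so a query on a name that appears only in the blogroll would no longer match every post. A real system would need a more robust notion of a "block" than exact string equality.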

Jeremy Hylton
Google BlogSearch

