Account Options

  1. Sign in
Google Groups Home
« Groups Home
Discussions > Crawling, indexing, and ranking > XHTML and ampersands interpretation
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  9 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
thuss  
View profile  
 More options Feb 7 2007, 6:14 pm
From: thuss
Date: Wed, 07 Feb 2007 15:14:39 -0800
Local: Wed, Feb 7 2007 6:14 pm
Subject: XHTML and ampersands interpretation
We are seeing Googlebot crawl URL's on our site without interpreting
the XHTML & as an & character. For example we do a proper XHTML
href on our pages (note the & entity reference):

<a href="http://www.greatschools.net/schools.page?
district=101&amp;state=CA&amp;lc=h">Some text</a>

which browsers then property interpret and take the user to:

http://www.greatschools.net/schools.page?district=101&state=CA&lc=h

However, we're seeing a lot of 500 errors in our logs and in the
webmaster console because Google is requesting the URL:

http://www.greatschools.net/schools.page?district=101&state=CA&am...

Anyone else running into this problem?

Thanks,
Todd


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
thuss  
View profile  
 More options Feb 7 2007, 6:18 pm
From: thuss
Date: Wed, 07 Feb 2007 15:18:50 -0800
Local: Wed, Feb 7 2007 6:18 pm
Subject: Re: XHTML and ampersands interpretation
Just to clarify, what we're seeing is that GoogleBot is requesting the
URL with &amp; between arguments instead of &.

On Feb 7, 3:14 pm, thuss wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
mve  
View profile  
 More options Feb 7 2007, 7:37 pm
From: mve
Date: Wed, 07 Feb 2007 16:37:15 -0800
Local: Wed, Feb 7 2007 7:37 pm
Subject: Re: XHTML and ampersands interpretation
You have a problem in your code:

<tr><td id="filters" colspan="2">
Filtered: High (<a href="/schools.page?
district=101&amp;state=CA&amp;lc=h">remove</a>)
</td></tr></table>

NOTE the &amp;state it should be: &state


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
RainboRick  
View profile  
 More options Feb 7 2007, 7:43 pm
From: RainboRick
Date: Wed, 07 Feb 2007 16:43:14 -0800
Local: Wed, Feb 7 2007 7:43 pm
Subject: Re: XHTML and ampersands interpretation
Just a guess, but you probably need to change the 'Content-type'
<meta> tag.  Your server is sending Content-Type "text/html" in the
response header for these URLs, and then you reinforce that with the
<meta> tag which comes AFTER your <!DOCTYPE>.  Try "text/xhtml" in the
<meta> tag, and set up code 301 redirects in your .htaccess file to
fix the ampersand problems you've already got in Google.

On Feb 7, 3:18 pm, thuss wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
mve  
View profile  
 More options Feb 7 2007, 7:48 pm
From: mve
Date: Wed, 07 Feb 2007 16:48:08 -0800
Local: Wed, Feb 7 2007 7:48 pm
Subject: Re: XHTML and ampersands interpretation
Well that didn't work ... here's what I am trying to "say":

   http://support.hashjoin.com/tp/1600.in_reply_to_xhtml_and_ampersands_...


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
cass-hacks  
View profile  
 More options Feb 8 2007, 12:49 am
From: cass-hacks
Date: Wed, 07 Feb 2007 21:49:25 -0800
Local: Thurs, Feb 8 2007 12:49 am
Subject: Re: XHTML and ampersands interpretation
Rainborick,
Wow!!  The subject of 301 comes up here a LOT but I would have never
thought of creating a 301 to fix a bug in a URL like that, no matter
where the error came from.

That's genius!!!

Or, am I just an idjit?  :-()

Craig

On Feb 8, 9:43 am, RainboRick wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
FredC  
View profile  
 More options Feb 8 2007, 4:08 am
From: FredC
Date: Thu, 08 Feb 2007 01:08:53 -0800
Local: Thurs, Feb 8 2007 4:08 am
Subject: Re: XHTML and ampersands interpretation
Is text/xhtml a legitimate MIME type? I've never seen it before. I
always thought XHTML used the text/html MIME.

Fred

On Feb 7, 6:43 pm, RainboRick wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
thuss  
View profile  
 More options Feb 8 2007, 12:49 pm
From: thuss
Date: Thu, 08 Feb 2007 09:49:45 -0800
Local: Thurs, Feb 8 2007 12:49 pm
Subject: Re: XHTML and ampersands interpretation
MVE, thanks for pointing out the error, you were spot on:

http://support.hashjoin.com/tp/1600.in_reply_to_xhtml_and_ampersands_...

One of our tag handlers is indeed doubly escaping the ampersands.

On Feb 8, 1:08 am, FredC wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
mve  
View profile  
 More options Feb 8 2007, 1:17 pm
From: mve
Date: Thu, 08 Feb 2007 10:17:40 -0800
Local: Thurs, Feb 8 2007 1:17 pm
Subject: Re: XHTML and ampersands interpretation
On Feb 8, 9:49 am, thuss wrote:

> MVE, thanks for pointing out the error, you were spot on:

Not a problem.  I am glad it helped.

 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »