Account Options

  1. Sign in
The old Google Groups will be going away soon.
Switch to the new Google Groups.
Google Groups Home
« Groups Home
Discussions > Crawling, indexing, and ranking > Not existing unreachable URLs (web crawl)
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  1 message - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
ReneW  
View profile  
 More options Jul 11 2007, 3:50 am
From: ReneW
Date: Wed, 11 Jul 2007 00:50:24 -0700
Local: Wed, Jul 11 2007 3:50 am
Subject: Not existing unreachable URLs (web crawl)
Since a couple of month I'm monitoring our new website http://www.myhotelsoftware.com/
to make sure everything gets indexed properly. Since the structure has
changed we get a lot of 'Pages not Found' which I mostly remove
manually with Google's "URL Removals" or if an old page is similar to
a new page I redirect this page with a 301 header.

Since a couple of weeks the Googlebot found some Unreachable URLs. I'm
still not sure what is causing this, but I'm investigating our server
to make sure there is nothing wrong on our side. But .. today and
yesterday some new Unreachable URLs showed up which are not valid
anyway (e.g. http://www.myhotelsoftware.com/ts/overview.html). Our
website uses the language as virtual directory. There are four
languages (en, us, nl, de) but the language "ts" that the Googlebot
tried to visit does not exist. If an invalid language is called, I
just show the English version of the page, so Googlebot will always
see a page, but it shouldn't go to this invalid language in the first
place. I thought that maybe there is some website pointing to this
wrong language, but after searching in Google I couldn't find any
website pointing to pages with the language 'ts'.

So, my question: Where could these wrong URL's come from?

Thanks for any help in advance!


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »