Since a couple of month I'm monitoring our new website
http://www.myhotelsoftware.com/ to make sure everything gets indexed properly. Since the structure has
changed we get a lot of 'Pages not Found' which I mostly remove
manually with Google's "URL Removals" or if an old page is similar to
a new page I redirect this page with a 301 header.
Since a couple of weeks the Googlebot found some Unreachable URLs. I'm
still not sure what is causing this, but I'm investigating our server
to make sure there is nothing wrong on our side. But .. today and
yesterday some new Unreachable URLs showed up which are not valid
anyway (e.g. http://www.myhotelsoftware.com/ts/overview.html). Our
website uses the language as virtual directory. There are four
languages (en, us, nl, de) but the language "ts" that the Googlebot
tried to visit does not exist. If an invalid language is called, I
just show the English version of the page, so Googlebot will always
see a page, but it shouldn't go to this invalid language in the first
place. I thought that maybe there is some website pointing to this
wrong language, but after searching in Google I couldn't find any
website pointing to pages with the language 'ts'.
So, my question: Where could these wrong URL's come from?
Thanks for any help in advance!