> Thank you for the pointers. I knew of a few empty links created by WP
> that I do not know how to delete, but I did not know of any dead
> ones.
> Where did you find the redirected url "...gallery" redirected to
> "...gallery/"?
> Gallery is new to me and I have not had a chance to get inside and
> tweak it yet.
> SL
> On May 11, 7:48 pm, webado wrote:
> > I'd say easier to just rename it.
> > Delete the old one from Webmaster Tools and submit the new name.
> > If your sitemap is specified in robots.txt don't forget to change that
> > url.
> > It probably got indexed due to a link to it being found somewhere.
> > There's nothing wrong with that as such, but I can see how it's
> > annoying.
> > Your robots.txt file needs a few more things to be disallowed.
> > Anything concerning register, login for forum and blog, reply in the
> > forum, post a comment, feeds, trackbaks, and whatever else is no
> > business of robots or represents duplciated content, should be
> > disallowed.
> > You do not need the last line you currently have:
> > Allow: /
> > By definition anyhting which is not disallowed is allowed.
> > You also need to take care of the canonical domain issue, where www
> > and non-www urls serve the same content. Pick one, make sure all yrou
> > navigation use it and 301 redirect all urls from the other form to the
> > one you picked.
> > See here:http://groups.google.com/group/only-validation/web/fix-canonical-issu...
> > You seem to have lots of broken links on your site. Use Xenu to check
> > it over. Xenu does not follow robots.txt so you have to tell it
> > specifically which url prefixes not to crawl.
> > you also have redriected urls:
> >http://www.dimfuzzies.com/gallery
> > redirected to:http://www.dimfuzzies.com/gallery/
> > status code: 301 (object permanently moved)
> > linked from page(s):
> > http://www.dimfuzzies.com/
> > http://www.dimfuzzies.com/2008/04/10/new-life/
> > http://www.dimfuzzies.com/category/site-information/
> > http://www.dimfuzzies.com/2008/04/
> >http://www.dimfuzzies.com/gallery/
> > redirected to:http://www.dimfuzzies.com/gallery/main.php
> > status code: 302 (object temporarily moved)
> > linked from page(s):
> > http://www.dimfuzzies.com/
> > http://www.dimfuzzies.com/2008/04/26/celebrate-national-astronomy-day...
> > http://www.dimfuzzies.com/2008/04/10/rocky-mountain-star-stare-2008/
> > etc.
> > The first one is because you left out the trailing slash and the
> > server does a 302 redirection automatically.
> > Robots should not meet redirections when they crawl. Use the final
> > destination at all times.
> > On May 11, 6:17 pm, cristina wrote:
> > > I do not know if it will work,
> > > and be careful to monitor how your site will be indexed
> > > after that,
> > > but a suggestion for stopping the sitemap URL to be indexed
> > > would be to add to the HTTP header of the
> > > sitemap file the noindex X-Robot-tag, seehttp://googleblog.blogspot.com/2007/07/robots-exclusion-protocol-now-...
> > > For an Apache server you can add a X-Robot-tag
> > > to the HTTP header of a file sitemap.xml like this
> > > <Files "sitemap.xml">
> > > Header set X-Robots-Tag "noindex"
> > > </Files>
> > > I repeat, keep checking if your site's search results
> > > are affected by this and remove the noindex
> > > directive for the sitemap from the .htaccess file
> > > if something goes wrong.
> > > Check HTTP headers of your site's URLs with
> > > HTTP header viewers.
> > > Cristina.
> > > On May 11, 9:28 pm, SLeach wrote:
> > > > My sitemap file "itself" is indexed. I want the sitemap to be read by
> > > > the crawlers, but I do not want the sitemap file to be indexed. How
> > > > do I block it from being indexed without blocking it from being
> > > > crawled?
> > > > ref: the sitemap.xls file athttp://www.dimfuzzies.com/
> > > > Thx, SL- Hide quoted text -
> > > - Show quoted text -- Hide quoted text -
> > - Show quoted text -