Thanks Christina -
I will check out Xenu on Monday. I am also blocking all images and PDFs in my robots.txt file. I am able to complete my crawls on these sites just fine using other sitemap-generating tools that have their own crawlers. GSite is the only one that is not successful.
Even though I am able to complete my crawls with the other sitemap generators, the fact that GSite is failing leaves me wondering if I have a site issue rather than a crawler issue. I am truly in a grey area, and I have not been able to find anyone who knows how to assist me. The scary thing is that I am not a programmer, yet I seem to know more about robots.txt and crawling than most of the programmers and developers I've been speaking to. This concerns me...
So currently, I am able to get two sitemap generators to crawl 10 individual sites and create sitemaps just fine. Both tools generate identical results for all sites. Based on that, I have uploaded my robots.txt file to my production site and have been testing it for several days. I am not seeing any errors reported in Google Webmaster Tools or Bing Webmaster Tools. Yet I still have no idea if things are working as they should.
Do you know if there is a way to verify whether Google and Bing/Yahoo are able to report the following:
1. Whether each engine is completing a crawl of my sites
2. How long each crawl is taking
3. How many pages were crawled
4. How many pages were indexed
I know Webmaster Tools has crawl stats and crawl errors interfaces, but they are vague...
Thanks Norm