GSiteCrawler

43 views
Skip to first unread message

franimaldoc

unread,
Sep 28, 2009, 6:00:40 PM9/28/09
to SOFTplus GSiteCrawler
Hi-

I've just spent months
a) learning to create a website
b) creating a website

and am now overwhelmed by the amount of information one must learn in
order to get people to see the website.

So I was delighted to find GSiteCrawler, a program that would help
with the whole search engine thing, free of charge.

The problem is, once I've filled in all the (I think) necessary blanks
in the GSiteCrawler Wizard, and clicked the Finish button,
nothing happens.
Well, not exactly nothing.
The bottom line flickers every 4-5 seconds, but nothing appears in the
Crawler lines.
What am I doing wrong, please?

Also, how do I get a Yahoo verification file? I have a Google file,
but had some difficulty finding the place to submit my website to a
place in Yahoo for which I didn't have to pay. Have already invested I-
am-embarrassed-to-admit-how-much money in this website, I'd rather not
buck up any more than I have to.
Any help would be greatly appreciated.

Thanks for your time,
Fran

webado2

unread,
Sep 28, 2009, 6:41:19 PM9/28/09
to SOFTplus GSiteCrawler
I would need your website url to check to see if it can be crawled.

As for submitting to Yahoo, find Yahoo SIte Explorer - that's where
you will find something similar to Google Webmaster Tools which iwll
explain how you can submit (it's free) and give you a verification
file and/or verification meta tag.

franimaldoc

unread,
Sep 29, 2009, 4:05:20 PM9/29/09
to SOFTplus GSiteCrawler
Oops, sorry, not clairevoyant, eh?

The website address is www.aboutpagearizona,com

And thanks for the Yahoo info

Fran

Christina S

unread,
Sep 29, 2009, 8:17:18 PM9/29/09
to gsitec...@googlegroups.com
1) The domain is very new, only registered a week ago. It takes time for
search engines to find it and crawl it and index it.
2) The are DNS problems with your domain:
http://www.intodns.com/aboutpagearizona.com
You should get that fixed - your hoster need to take care of this.
3) You have a couple of broken links:
Xenu found these:

http://www.aboutpagearizona.com/citizen_care%20.html
error code: 404 (not found), linked from page(s):
http://www.aboutpagearizona.com/page_community_resources.html

http://www.aboutpagearizona.com/houseforsale_pages/about_page/transportation.html
error code: 404 (not found), linked from page(s):
http://www.aboutpagearizona.com/page_schools.html

4) You are using index.html in naviagtion - you should use only "/" or the
domain root url as http://www.aboutpagearizona.com/ throughout.
5) Your server responds with 200 for both the www and the non-www versions
of the urls. You pick one form and 301 redirect the other to it. See here
hwo it's done usig the .htaccess file:
http://groups.google.com/group/only-validation/web/fix-canonical-issues-www-vs-non-www-and-more-on-apache-server

GSiteCrawler found these (the same as above actually):
Failed at 2009-09-29 20:11:
URL:
http://www.aboutpagearizona.com/houseforsale_pages/about_page/transportation.html
Error: HTTP-Error 404 Not Found
Linked from: http://www.aboutpagearizona.com/page_schools.html

Failed at 2009-09-29 20:11:
URL: http://www.aboutpagearizona.com/citizen_care .html
Error: HTTP-Error 404 Not Found
Linked from:
http://www.aboutpagearizona.com/page_community_resources.html

5) There's no problem running GSiteCrawler on your site, I just did. It
found 10 urls, of whihc 9 are distinct (due to index.html being among them
and a duplicate of the root url). Would be 12 were it not for the broken
links reported.



Christina
www.webado.net

franimaldoc

unread,
Oct 2, 2009, 1:24:51 AM10/2/09
to SOFTplus GSiteCrawler
Wow.
Thank you- that's a pretty intense amount of work you just did for me.
I'm not sure what some of it means though - if I talk to HostMonster
about the DNS thing and just give them that link to intodns, will they
know what to do with it? They've been really good with support so far,
and very knowledgeable, so I just want to make sure I'm bringing them
all I need.

I'm sorry it took me so long to get back to you - when you pointed out
the broken links to me, it made me take a step back and look at the
site, and realize it was pretty haphazardly structured. I fixed the
links you found, but I also rearranged the site to tighten it up a
bit, and while Dreamweaver updated the links, I haven't found
Dreamweaver to be infallible in its coding either. So I was wondering
what you used to check the links - I'll check out the changes, make
sure everything's linked properly.

I fixed all the index.html references- I never heard of doing it that
way. Why does that make a difference?

The last question I have for you ( I hope) is about site maps. Right
now the site map for the site only consists of the index.html page.
Shouldn't it outliine the whole site, and if so, how do I get it to do
so?

Thank you so much for your help on all of this.
I really appreciate you being willing to take time out of your busy
schedule to answer my questions.
Fran


On Sep 29, 5:17 pm, "Christina S" <web...@gmail.com> wrote:
> 1) The domain is very new, only registered a week ago. It takes time for
> search engines to find it and crawl it and index it.
> 2) The are DNS problems with your domain:http://www.intodns.com/aboutpagearizona.com
> You should get that fixed - your hoster need to take care of this.
> 3) You have a couple of broken links:
> Xenu found these:
>
> http://www.aboutpagearizona.com/citizen_care%20.html
> error code: 404 (not found), linked from page(s):
>  http://www.aboutpagearizona.com/page_community_resources.html
>
> http://www.aboutpagearizona.com/houseforsale_pages/about_page/transpo...
> error code: 404 (not found), linked from page(s):
>  http://www.aboutpagearizona.com/page_schools.html
>
> 4) You are using index.html in naviagtion - you should use only "/" or the
> domain root url ashttp://www.aboutpagearizona.com/throughout.
> 5) Your server responds with 200 for both the www and the non-www versions
> of the urls. You pick one form and 301 redirect the other to it. See here
> hwo it's done usig the .htaccess file:http://groups.google.com/group/only-validation/web/fix-canonical-issu...
>
> GSiteCrawler found these (the same as above actually):
> Failed at 2009-09-29 20:11:
>    URL:http://www.aboutpagearizona.com/houseforsale_pages/about_page/transpo...

Christina S

unread,
Oct 2, 2009, 7:40:38 AM10/2/09
to gsitec...@googlegroups.com
Hi Fran,
Hostmonster should at least understand what the problems reported in the
IntoDNS report are. How fast they can fix them is another matter. But they
are the only one who can fix them.

I checked for broken links using Xenu Link Sleuth - a free program from:
http://home.snafu.de/tilman/xenulink.html

The same thing was found by running GSiteCrawler actually.

Ideally a sitemap should list all the urls of the site. That's what
GSiteCrawler builds for you.
But to the extent that Google doesn't actually need a sitemap in order to
crawl and index a site, it doesn't matter really.
The sitemap becomes more important however if there are many deep links to
be discovered - Googlebot may not be too inclined to crawl too deeply for a
new site at first, so it may take longer until it finds them all.
Having a sitemap with only the homepage url is not very useful, no different
from only giving Google the website url and letting Googlebot find the rest.

The reason behind NOT using index.html in links to the homepage is that they
are duplicated urls: both http://www.example.com/ and
http://www.example.com/index.html would be the same page (for the site whose
homepage is in the file index.html). It's called canonicalization. Same as
having both the www and non-www versions of urls active (responding with
response code 200). They are duplicate urls. You pick one canonical form of
the domain (www or non-www), stick to it throughout the site navigation, and
if possible, implement 301 redirections at the server level from one form to
the one you prefer. It will consolidate and concentrate the site under one
canonical form, minimum number of unique page urls - thus maximum value to
each url.

Christina

franimaldoc

unread,
Oct 2, 2009, 2:22:30 PM10/2/09
to SOFTplus GSiteCrawler
Thank you for the explanations - very understandable.

I think I've screwed up, and I'm scared I now have a website that I've
spent a gejillion hours on that I've sicced the web spiders on that
doesn't work.
Even though I can navigate it online just fine.
I'm confused.

I downloaded Venu and tried to do a local link check of the whole
site, since I wasn't sure what the address was Xenu was looking for
remotely.
But I can't find the broken links and can't fathom how I screwed this
up so badly, trying to fix them.
Any help you could give me in understanding the report generated would
help to keep me from panicking here. The most comprehensive problem
seems to be the reference to ///c: on every page, and I don't know to
what that refers and how to fix it. The simplest thing seems to be the
objection to me using the same reference to small_spacer1 in multiple
div tags in the same file. Do I have to create a newly named css
reference in each of the different spots in which I'm currently using
small_spacer1. But that one seems to be the least of my problems.
I'm sorry to keep bothering you, but it seems the more I try to fix
this site, the more problems I generate, and I have too many hours in
it to just throw up my hands and give up, which is what I feel like
doing right now

Please help..


These are the results:
Xenu's broken link report

Created on October 02, 2009 at 10:56:51
Root URL: file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html
Table of contents

* Broken links, ordered by link
* Broken links, ordered by page
* List of redirected URLs
* List of valid URLs you can submit to a search engine
* Site Map of HTML pages with a Title
* Broken page-local links
* Orphan files
* Statistics for managers

[Xenu sez: visit Amazon.com!]
Broken links, ordered by link:

file:///C:
error code: 5 (access denied), linked from page(s):
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/citizen_care.html
error code: 2 (file not found), linked from page(s):
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/restaurants.html
error code: 2 (file not found), linked from page(s):
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html

http://www.helpinghandsagency.com/Scholarships.html
error code: 404 (not found), linked from page(s):
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html

14 broken link(s) reported

Return to Top
Broken links, ordered by page:

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html
file:///C:
\_____ error code: 5 (access denied)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html
file:///C:
\_____ error code: 5 (access denied)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/restaurants.html
\_____ error code: 2 (file not found)
http://www.helpinghandsagency.com/Scholarships.html
\_____ error code: 404 (not found)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html
file:///C:
\_____ error code: 5 (access denied)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html
file:///C:
\_____ error code: 5 (access denied)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/restaurants.html
\_____ error code: 2 (file not found)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html
file:///C:
\_____ error code: 5 (access denied)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/restaurants.html
\_____ error code: 2 (file not found)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html
file:///C:
\_____ error code: 5 (access denied)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html
file:///C:
\_____ error code: 5 (access denied)

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html
file:///C:
\_____ error code: 5 (access denied)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/restaurants.html
\_____ error code: 2 (file not found)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/citizen_care.html
\_____ error code: 2 (file not found)


14 broken link(s) reported

Return to Top
List of redirected URLs

Return to Top
List of valid URLs you can submit to a search engine:

http://www.houseforsale-page-arizona.com/
http://www.evrsoft.com/fastsubmit/
http://www.houseforsale-page-arizona.com/houseforsale_pages/houseforsale_pages.htm
http://www.lakeview.pageusd.org/
http://www.desertview.pageusd.org/
http://www.ms.pageusd.org/
http://www.hs.pageusd.org/
http://www.hs.pageusd.org/index.cfm?pID=1463
http://www.helpinghandsagency.com/index.html
http://houseforsale-page-arizona.com/
http://www.cityofpage.org/fire.shtml
http://www.cityofpage.org/police.shtml
http://www.ehow.com/videos-on_1380_train-drug-dog.html
http://www.classiclifeguard.com/
http://www.classiclifeguard.com/TheClassiclifeguard_Aircraft.htm
http://www.classiclifeguard.com/N181CG.htm
http://www.asu-nvg.com/Products.aspx
http://www.helpinghandsagency.com/Services.html?Service=Express
http://www.greatlakesav.com/
http://westwindairservice.com/
http://www.golflakepowell.com/
http://www.helpinghandsagency.com/Services.html?Service=DayProg
http://www.helpinghandsagency.com/Services.html?Service=LifeSkills
http://www.helpinghandsagency.com/Services.html?Service=AfterSchool
http://www.helpinghandsagency.com/Services.html?Service=Music
http://www.helpinghandsagency.com/Services.html?Service=ArtProg
http://www.helpinghandsagency.com/Services.html?Service=LunchProg
http://www.helpinghandsagency.com/Services.html?Service=Respite
http://www.cityofpage.org/senior.shtml
http://www.beehivecare.com/thedifference.php
http://www.bannerhealth.com/Locations/Arizona/Page+Hospital/_Page+Hospital.htm
http://cchcaz.org/CUC_prov.htm
http://cchcaz.org/lpmc.htm
http://www.manta.com/coms2/dnbcompany_gqdx95
http://www.pagechamber.com/

Return to Top
Site Map of valid HTML pages with a title:

Note: if you upload this Site Map to your web page, exclude its URL or
delete its page before checking, to prevent the software from checking
the Site Map itself!
About Page Arizona

* City of Page Arizona's Restaurants
* Page Arizona's Community Resources
* City of Page Arizona's School System
* City of Page Arizona Emergency Services
* City of Page Arizona - Transit System and Championship Golf
Course
* City of Page Arizona's Medical Services
* Page Arizona's Citizen Care copyright 2009
* City of Page Arizona's Businesses

Return to Top
Broken page-local links (also named 'anchors', 'fragment
identifiers'):

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/community_resources.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/schools.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/emergency_services.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/transportation.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/medical_services.html#small_spacer1
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#photo_right_bottom
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#photo_right_bottom
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/citizen_care.html#photo_right_bottom
anchor occurs multiple times

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html#small_spacer1
anchor occurs multiple times
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/business.html#small_spacer1
anchor occurs multiple times



71 bad local link(s) reported

Return to Top
Orphan files:

Note: Links that aren't spidered (e.g. webforms, dynamically generated
links, and links from within stylesheet files) will appear as orphans
in this list.

file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/.htaccess
(1.1 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/_notes/dwsync.xml
(1.7 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/_notes/dwsync.xml
(0.1 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/business/_notes/dwsync.xml
(0.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/citizen_care/_notes/dwsync.xml
(1.3 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/citizen_care/public_library.jpg
(25.3 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/emergency_services/_notes/dwsync.xml
(1.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/emergency_services/lg_emergency_images/lg-classic_scene_call.JPG
(64.3 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/medical/_notes/dwsync.xml
(1.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/recreation_and_entertainment/_notes/dwsync.xml
(0.7 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/recreation_and_entertainment/cultural_arts_building_cab.jpg
(28.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/recreation_and_entertainment/john_wesley_powell_museum.JPG
(36.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/recreation_and_entertainment/lg_rec/_notes/dwsync.xml
(0.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/recreation_and_entertainment/lg_rec/lg_18_hole_golf_course.jpg
(64.3 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/recreation_and_entertainment/oasis_on_the_rim_arts.jpg
(23.6 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/restaurants/_notes/dwsync.xml
(1.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/restaurants/pinciottis_italian.jpg
(29.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/schools/_notes/dwsync.xml
(1.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/schools/large_school_images/_notes/dwsync.xml
(1.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/schools/psd/_notes/dwsync.xml
(0.5 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/schools/psd/lakeview_elementary.psd
(113.8 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/schools/psd/middle_school.psd
(76.9 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/schools/psd/page_middle_school.psd
(114.1 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/transportation/_notes/dwsync.xml
(0.8 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/community_resources/transportation/bell206_long_ranger.JPG
(20.9 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/home_page_images/_notes/building_glen_canyon_dam.jpg.mno
(15.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/home_page_images/_notes/dwsync.xml
(1.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/_notes/dwsync.xml
(2.9 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/contact_owners_button_about_page.GIF
(1.5 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/_notes/dwsync.xml
(1.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/about_page_button_about_page.PSD
(157.5 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/areas_of_the_house_about_page.psd
(44.7 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/community_resources_button.psd
(161.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/contact_owners_button_about_page.psd
(159.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/home_page_button_about_page.psd
(157.8 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/of_the_house_for_sale_sign.PSD
(174.0 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/navigation/pds/restaurants_button.psd
(157.1 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/about_page_images/scenic/_notes/dwsync.xml
(0.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/_notes/dwsync.xml
(1.8 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/entertainment_and_recreation.html
(9.9 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/religiious_choices.html
(9.6 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/test.html
(10.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/css/_notes/dwsync.xml
(1.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/css/about_beehive.css
(3.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/css/text.css
(3.4 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/google283d3792aa2a7155.html
(0.3 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/gss.xsl
(12.9 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/log.txt
(0.5 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/pctlsp.log
(0.2 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/sitemap.xml
(0.6 KB)
file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/sitemap.xml.gz
(0.4 KB)


51 orphan(s) reported (1697.5 KB)

Return to Top
Statistics for managers
Correct internal URLs, by MIME type:
MIME type count % count Σ size Σ size (KB) % size min size max size Ø
size Ø size (KB) Ø time
text/html 9 URLs 12.68% 96726 Bytes (94 KB) 5.88% 6392 Bytes 14433
Bytes 10747 Bytes (10 KB) 0
text/css 6 URLs 8.45% 19854 Bytes (19 KB) 1.21% 3164 Bytes 3468 Bytes
3309 Bytes (3 KB)
6 URLs 8.45% 10140 Bytes (9 KB) 0.62% 1370 Bytes 2435 Bytes 1690
Bytes (1 KB)
image/jpeg 50 URLs 70.42% 1519054 Bytes (1483 KB) 92.30% 16275 Bytes
68683 Bytes 30381 Bytes (29 KB)
Total 71 URLs 100.00% 1645774 Bytes (1607 KB) 100.00%
All pages, by result type:
ok 108 URLs 96.43%
access denied 1 URLs 0.89%
file not found 2 URLs 1.79%
not found 1 URLs 0.89%
Total 112 URLs 100.00%

On Oct 2, 4:40 am, "Christina S" <web...@gmail.com> wrote:
> Hi Fran,
> Hostmonster should at least understand what the problems reported in the
> IntoDNS report are. How fast they can fix them is another matter. But they
> are the only one who can fix them.
>
> I checked for broken links using Xenu Link Sleuth - a free program from:http://home.snafu.de/tilman/xenulink.html
>
> The same thing was found by running GSiteCrawler actually.
>
> Ideally a sitemap should list all the urls of the site. That's what
> GSiteCrawler builds for you.
> But to the extent that Google doesn't actually need a sitemap in order to
> crawl and index a site, it doesn't matter really.
> The sitemap becomes more important however if there are many deep links to
> be discovered - Googlebot may not be too inclined to crawl too deeply for a
> new site at first, so it may take longer until it finds them all.
> Having a sitemap with only the homepage url is not very useful, no different
> from only giving Google the website url and letting Googlebot find the rest.
>
> The reason behind NOT using index.html in links to the homepage is that they
> are duplicated urls: bothhttp://www.example.com/andhttp://www.example.com/index.htmlwould be the same page (for the site whose
> homepage is in the file index.html). It's called canonicalization. Same as
> having both the www and non-www versions of urls active (responding with
> response code 200). They are duplicate urls. You pick one canonical form of
> the domain (www or non-www), stick to it throughout the site navigation, and
> if possible, implement 301 redirections at the server level from one form to
> the one you prefer. It will consolidate and concentrate  the site under one
> canonical form, minimum number of unique page urls - thus maximum value to
> each url.
>

Christina S

unread,
Oct 2, 2009, 7:09:25 PM10/2/09
to gsitec...@googlegroups.com
For Xenu you use your website url, not the local one.
For starters don,t ask it to check external links.
Just let it loose and check the output at the end.

Christina

franimaldoc

unread,
Oct 3, 2009, 4:31:46 AM10/3/09
to SOFTplus GSiteCrawler
Sorry - I panic too easily sometimes.
I dumped the whole site to paper and went over it with a fine tooth
comb, teased out all (I think) the knots.
I renamed all those spacers so I wasn't reusing them, made sure of all
the links - altho using "/" wouldnt work for me, I had to use the full
site name - and tidied it all up.
I got a clean bill of health (I think) from Xenu, altho there were
some comments I wasn't sure of.
Would you mind doing me the favor of running a check and seeing if any
of the comments will interfere with spider runs?

I sure would appreciate it, and I apologize again for the somewhat
schitzy tone of the previous message.
I just have so much time invested in this project, and I get a little
crazy, so close to being finished and having a whole new set of
hurdles appear.

So please forgive a novice - who really sucks at detail work, so you
can imagine the fun I've been having doing this.

Thanks again for your time,
Fran
> Created on October 02, 2009 at 10:56:51Root URL:file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html
> http://www.houseforsale-page-arizona.com/http://www.evrsoft.com/fastsubmit/http://www.houseforsale-page-arizona.com/houseforsale_pages/housefors...http://www.lakeview.pageusd.org/http://www.desertview.pageusd.org/http://www.ms.pageusd.org/http://www.hs.pageusd.org/http://www.hs.pageusd.org/index.cfm?pID=1463http://www.helpinghandsagency.com/index.htmlhttp://houseforsale-page-arizona.com/http://www.cityofpage.org/fire.shtmlhttp://www.cityofpage.org/police.shtmlhttp://www.ehow.com/videos-on_1380_train-drug-dog.htmlhttp://www.classiclifeguard.com/http://www.classiclifeguard.com/TheClassiclifeguard_Aircraft.htmhttp://www.classiclifeguard.com/N181CG.htmhttp://www.asu-nvg.com/Products.aspxhttp://www.helpinghandsagency.com/Services.html?Service=Expresshttp://www.greatlakesav.com/http://westwindairservice.com/http://www.golflakepowell.com/http://www.helpinghandsagency.com/Services.html?Service=DayProghttp://www.helpinghandsagency.com/Services.html?Service=LifeSkillshttp://www.helpinghandsagency.com/Services.html?Service=AfterSchoolhttp://www.helpinghandsagency.com/Services.html?Service=Musichttp://www.helpinghandsagency.com/Services.html?Service=ArtProghttp://www.helpinghandsagency.com/Services.html?Service=LunchProghttp://www.helpinghandsagency.com/Services.html?Service=Respitehttp://www.cityofpage.org/senior.shtmlhttp://www.beehivecare.com/thedifference.phphttp://www.bannerhealth.com/Locations/Arizona/Page+Hospital/_Page+Hos...http://cchcaz.org/CUC_prov.htmhttp://cchcaz.org/lpmc.htmhttp://www.manta.com/coms2/dnbcompany_gqdx95http://www.pagechamber.com/
>
> Return to Top
> Site Map of valid HTML pages with a title:
>
> Note: if you upload this Site Map to your web page, exclude its URL or
> delete its page before checking, to prevent the software from checking
> the Site Map itself!
> About Page Arizona
>
>     * City of Page Arizona's Restaurants
>     * Page Arizona's Community Resources
>     * City of Page Arizona's School System
>     * City of Page Arizona Emergency Services
>     * City of Page Arizona - Transit System and Championship Golf
> Course
>     * City of Page Arizona's Medical Services
>     * Page Arizona's Citizen Care copyright 2009
>     * City of Page Arizona's Businesses
>
> Return to Top
> Broken page-local links (also named 'anchors', 'fragment
> identifiers'):
>
> file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/index.html#small_spacer1
> anchor occurs multiple times
>
> file:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#small_spacer1
> anchor occurs multiple timesfile:///C:/Users/franimaldoc/Documents/local_sites/aboutpagearizona/community_resources/restaurants.html#underlined
> anchor occurs multiple times ...
>
> read more »

Christina S

unread,
Oct 3, 2009, 9:20:27 AM10/3/09
to gsitec...@googlegroups.com
Ok, the other comment that occurs several times is really about the use of :
<div id="underlined">

several times. A particular id is to be used only once per page.

You'd have seen this issue and others by running the w3 validator as well:
http://validator.w3.org/check?verbose=1&uri=http://www.aboutpagearizona.com/community_resources/restaurants.html

Start by fixing the first error found and revalidate. Currently even
reported errors are mixed up due to the very first one.

Christina
www.webado.net

----- Original Message -----
From: "franimaldoc" <frani...@gmail.com>
To: "SOFTplus GSiteCrawler" <gsitec...@googlegroups.com>
Sent: Saturday, October 03, 2009 4:31 AM
Subject: [GSiteCrawler] Re: GSiteCrawler



Reply all
Reply to author
Forward
0 new messages