Page not being indexed
From: tapirgal <she...@tapirback.com>
Date: Wed, 5 Mar 2008 08:10:59 -0800 (PST)
Local: Wed, Mar 5 2008 11:10 am
Subject: Page not being indexed
Hello,

This is Sheryl in Oregon (was oregontapir). I changed my Google
account, so I hope this works.

I have no complaints about how Google indexes most of my site, but I
can't seem to get it to find this page:

http://www.tapirback.com/images/animals/mammals/aquatic/pinnipeds/sea...

I even linked it from one of my blogs, which get indexed really fast
and seem to bring up related pages well.

Any ideas? The page has been up since last October. Thanks.

Sheryl


From: webado <web...@gmail.com>
Date: Wed, 5 Mar 2008 08:38:10 -0800 (PST)
Local: Wed, Mar 5 2008 11:38 am
Subject: Re: Page not being indexed
That page is not indexed at all. A site: query does not show it.

The highest level indexed is http://www.tapirback.com/images/animals/
which links to your particular page.

You have errors on that page and probably similar errors on all pages
of your site:
http://validator.w3.org/check?verbose=1&uri=http://www.tapirback.com/...

If your page does NOT use an XHTML doctype you must not close any tags
with /> at all.

In particular those meta and link tags in the head of the page. This
results in a premature end of the head and of the page, with content
not being processed further.

I recommend you validate all the pages on your site, starting with the
homepage.

It could be a lot, I agree.

This can validate the entire site but watch out for a lot of output:
 http://htmlhelp.com/tools/validator/

You may be able to fix things easily if you are using any kind of
templates or if you are able to use search and replace for all pages
you have on your PC.
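
As a rough way to find the affected pages before fixing them, the check described above can be sketched in Python. This is only an illustration, not a validator: the function name `find_self_closed_tags` is made up for this example, and it assumes an XHTML page can be recognised by "XHTML" appearing in its doctype.

```python
import re

# Assumption: an XHTML document is recognised by "xhtml" in its doctype.
DOCTYPE_RE = re.compile(r"<!DOCTYPE[^>]*>", re.IGNORECASE)
# Matches any tag written with a trailing "/>", e.g. <meta ... /> or <br />.
SELF_CLOSED_RE = re.compile(r"<([a-zA-Z][a-zA-Z0-9]*)\b[^<>]*/>")


def find_self_closed_tags(html):
    """Return (line_number, tag_name) pairs for "/>" tags in a non-XHTML page."""
    doctype = DOCTYPE_RE.search(html)
    if doctype and "xhtml" in doctype.group(0).lower():
        return []  # XHTML doctype: "/>" is expected there
    hits = []
    for number, line in enumerate(html.splitlines(), start=1):
        for match in SELF_CLOSED_RE.finditer(line):
            hits.append((number, match.group(1).lower()))
    return hits
```

Running it over each saved page and printing the hits gives a list of lines to look at. It does not replace a real validation run.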

On Mar 5, 11:10 am, tapirgal <she...@tapirback.com> wrote:


From: tapirgal <oregonta...@gmail.com>
Date: Wed, 5 Mar 2008 10:00:40 -0800 (PST)
Local: Wed, Mar 5 2008 1:00 pm
Subject: Re: Page not being indexed
Thanks, Webado. It would have been a while before I found that problem.
Actually, many, many of my 1300+ pages do validate, including the sea
lion page itself, but I wouldn't have realized that one or several
pages upstream had cut off the search. At least it validates on CSE
v 8.04.

The gift shop validates. It's the recreational areas (fun for me, but
less time to play with them) and my old tapir pages that I didn't
follow up on.  :/

Sheryl


From: tapirgal <she...@tapirback.com>
Date: Wed, 5 Mar 2008 23:40:18 -0800 (PST)
Local: Thurs, Mar 6 2008 2:40 am
Subject: Re: Page not being indexed
Well, it seems that my main mistake on these pages was using meta
tags that close with " />", which should be reserved for XHTML. In my
HTML 4.01 Transitional documents I should have used only the closing
bracket, without the space and slash.
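
For anyone with the same problem, the search and replace can be sketched roughly like this in Python. It is a minimal sketch rather than a tested migration tool: the helper name `strip_xhtml_slash` is invented here, it only touches the HTML 4.01 elements that have no end tag, and it would not cope with unusual markup such as "/>" inside an attribute value. Back up the pages before trying anything like it.

```python
import re

# HTML 4.01 elements declared EMPTY (they never take an end tag).
VOID_TAGS = (
    "area", "base", "basefont", "br", "col", "frame", "hr",
    "img", "input", "isindex", "link", "meta", "param",
)

# A void tag, its attributes, optional whitespace, then the XHTML-style "/>".
VOID_SELF_CLOSED_RE = re.compile(
    r"<(" + "|".join(VOID_TAGS) + r")\b([^<>]*?)\s*/>",
    re.IGNORECASE,
)


def strip_xhtml_slash(html):
    """Rewrite <meta ... /> as <meta ...> for void elements only."""
    return VOID_SELF_CLOSED_RE.sub(r"<\1\2>", html)


# Example: the slash and the space before it are dropped.
print(strip_xhtml_slash('<meta name="description" content="Sea lions" />'))
```

Other tags are left alone on purpose, since something like `<div />` points to a different mistake that needs a manual look.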

What I don't understand is why these pages validated cleanly with CSE
Validator Pro v8.04 with the latest updates. I thought that was a good
piece of software. Any comments on this software? I've been using it
on my whole site. A couple of years ago, when I had so many pages with
so many mistakes, I found it easier to work with than any of the
online validators I tried (such as the W3C page), and its corrections
were easier to understand. I kept using the program, which obviously
doesn't consider the current mistake a mistake. CSE also doesn't flag
<br /> as a mistake in HTML 4.01 Transitional.

Sheryl


From: webado <web...@gmail.com>
Date: Thu, 6 Mar 2008 07:05:56 -0800 (PST)
Local: Thurs, Mar 6 2008 10:05 am
Subject: Re: Page not being indexed
So the conclusion is CSE is not as good as you think it is.

You cannot replace the real validator.

On Mar 6, 2:40 am, tapirgal <she...@tapirback.com> wrote:


From: Tidal_Wave_One <goo...@wiersch.com>
Date: Thu, 6 Mar 2008 11:47:49 -0800 (PST)
Local: Thurs, Mar 6 2008 2:47 pm
Subject: Re: Page not being indexed

Hi Sheryl,

CSE HTML Validator is good software. Those issues you mention are
extremely unlikely to affect search engine rankings. Also, remember
that CSE HTML Validator is designed for practical use, not simply to
check for strict compliance like the W3C validator. This makes CSE
HTML Validator more useful for most people.

If you want CSE HTML Validator to report errors for "/>" in HTML, then
you can uncheck the "XML compatibility" option in the Validator Engine
Options. This option is checked by default because "/>" is not a problem
with today's browsers/user agents.

If you want results more like the W3C validator, please see this page:
http://www.htmlvalidator.com/htmlval/v80/docs/validate_to_w3c_standar...

Also see:
http://www.htmlvalidator.com/htmlval/whycseisbetter.html

I hope this helps.

--
Albert Wiersch
AI Internet Solutions
supp...@htmlvalidator.com
http://www.htmlvalidator.com/

On Mar 6, 1:40 am, tapirgal <she...@tapirback.com> wrote:


From: webado <web...@gmail.com>
Date: Thu, 6 Mar 2008 13:11:59 -0800 (PST)
Local: Thurs, Mar 6 2008 4:11 pm
Subject: Re: Page not being indexed
Albert, it is VERY important the head of a page NOT be broken. Using
XHTML syntax in a non-xhtml document (e.g. no xhtml doctype) breaks
the head in the worst way, resulting in the page content not being
well parsed if at all.

On 6 mar, 14:47, Tidal_Wave_One <goo...@wiersch.com> wrote:


From: Tidal_Wave_One <goo...@wiersch.com>
Date: Thu, 6 Mar 2008 13:26:23 -0800 (PST)
Local: Thurs, Mar 6 2008 4:26 pm
Subject: Re: Page not being indexed

It sounds like you're talking about strict standards compliance, but
what modern search engine or web browser do you know of that will not
be able to parse or handle "/>" at the end of a meta tag? I'm not
aware of any but would like to know if there is indeed one.

Thanks,
Albert
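
One way to try the question out on a single parser is a small Python test like the one below. It uses the standard library's `html.parser`, which is just one lenient parser and says nothing about how Googlebot or any particular browser behaves. The `Collector` class, the `parse` helper and the sample page are made up for this sketch.

```python
from html.parser import HTMLParser


class Collector(HTMLParser):
    """Record every start tag and every non-blank text run the parser reports."""

    def __init__(self):
        super().__init__()
        self.tags = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)

    def handle_data(self, data):
        if data.strip():
            self.text.append(data.strip())


def parse(html):
    """Feed a whole page to the parser and return (start tags, text runs)."""
    collector = Collector()
    collector.feed(html)
    collector.close()
    return collector.tags, collector.text


# Hypothetical HTML 4.01 page with an XHTML-style meta tag in the head.
page = (
    '<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">'
    '<html><head><meta name="description" content="Sea lions" />'
    '<title>Sea lions</title></head>'
    '<body><p>Body text</p></body></html>'
)
tags, text = parse(page)
print(tags)
print(text)
```

If the body tags and the body text still show up in the output, this particular parser did not stop at the "/>". A stricter SGML-style parser may read the same markup differently, so this is a data point and not proof either way.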

On Mar 6, 3:11 pm, webado <web...@gmail.com> wrote:


Discussion subject changed to "Only Validation Re: Page not being indexed" by webado
From: webado <web...@gmail.com>
Date: Thu, 6 Mar 2008 16:56:23 -0500
Local: Thurs, Mar 6 2008 4:56 pm
Subject: Re: Only Validation Re: Page not being indexed

Googlebot.

It won't complain. But the robot simply appears to skip over everything.
Seen lots of sites doing poorly, with pages dropping out of the index or
going supplemental (when that still existed officially). They often have
this in common: a broken head.

--
www.webado.net
Webhosting and Design

From: Tidal_Wave_One <goo...@wiersch.com>
Date: Thu, 6 Mar 2008 14:52:41 -0800 (PST)
Local: Thurs, Mar 6 2008 5:52 pm
Subject: Re: Only Validation Re: Page not being indexed

But a "broken head" to googlebot is probably something other
(something much more serious) than simply using "/>" to end a meta
tag. Do you have any supporting links that say that "/>" is
responsible for googlebot's skipping of info? I seriously doubt that
is the problem. I think it is far too speculative to simply blame it
on a "broken head" because of "/>". It is much more likely that it is
caused by more serious issues or just changes in how or why pages are
indexed.

Also, I think many pages have "broken heads" since many pages are
poorly written. But many of these "broken pages" still do well in the
search engines.

I'm not saying that validation is not important of course. It is
important for many reasons, but I think to mess up a search engine
ranking due to an "invalid" document requires a serious validation or
document structure issue and not simply using "/>".

On Mar 6, 3:56 pm, webado <web...@gmail.com> wrote:


From: webado <web...@gmail.com>
Date: Thu, 6 Mar 2008 15:17:53 -0800 (PST)
Local: Thurs, Mar 6 2008 6:17 pm
Subject: Re: Only Validation Re: Page not being indexed
I use my logic.

For a robot to parse pages semantically, all markup has to be good,
unbroken.

Modern Googlebots parse semantically.

Then there's anecdotal evidence of the broken meta tag causing pages
to go supplemental (back when that still existed). Hence the urban
legend that "submitting to sitemaps makes the site tank".

On 6 mar, 17:52, Tidal_Wave_One <goo...@wiersch.com> wrote:

