Discussions > Google webmaster tools > Most sites will not have all of their pages indexed.
From: baghdad
Date: Sun, 9 Mar 2008 22:21:05 -0700 (PDT)
Local: Mon, Mar 10 2008 1:21 am
Subject: Most sites will not have all of their pages indexed.
Hi,

This is the problem.

I submitted my sitemap and everything was OK until three weeks ago.

Since then my site does not appear in Google search, or its appearance has
decreased in a horrible manner.

I went into Webmaster Tools and found this:

Sitemap statistics:
Total URLs: 1009771
Indexed URLs: 0

Filename               Status   URLs submitted
sitemap_index.xml.gz   OK       1009771

But no pages are indexed.

I went to More details and saw this:

Sitemap summary
Most sites will not have all of their pages indexed.
Improve how Google crawls and indexes your site

I don't know why this happened or how to fix it.

Thanks.


 
Discussion subject changed to "NO URL PROVIDED - Re: Most sites will not have all of their pages indexed." by webado
From: webado
Date: Sun, 9 Mar 2008 22:43:10 -0700 (PDT)
Local: Mon, Mar 10 2008 1:43 am
Subject: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
URL?
Can't guess.

On Mar 10, 1:21 am, baghdad wrote:


 
From: baghdad
Date: Mon, 10 Mar 2008 00:46:47 -0700 (PDT)
Local: Mon, Mar 10 2008 3:46 am
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
 
From: webado
Date: Mon, 10 Mar 2008 06:10:22 -0700 (PDT)
Local: Mon, Mar 10 2008 9:10 am
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
The robots.txt file is a bit incorrect and quite confusing.

You have:

User-Agent: *
Allow: /
Allow: /vb/
Allow: /gallery/
Allow: /mag/
Disallow: /tmp/
Disallow: /abuse/
Disallow: /up/
Disallow: /attachment/
Disallow: /bill/
Disallow: /client/
Disallow: /download/
Disallow: /help/
Sitemap: http://www.baghdad4ever.net/vb/sitemap_index.xml.gz

The Sitemap line needs to be separated by one blank line from the last
line of robots directives. The Allow lines are pretty useless since
they do not override any of the Disallow parts. I would rewrite it as:

User-Agent: *
Disallow: /tmp/
Disallow: /abuse/
Disallow: /up/
Disallow: /attachment/
Disallow: /bill/
Disallow: /client/
Disallow: /download/
Disallow: /help/

Sitemap: http://www.baghdad4ever.net/vb/sitemap_index.xml.gz

Remember, however, that there are lots of URL prefixes you have to disallow
when you have a forum: anything to do with login, register, reply,
private message, etc.

Are you sure the forum should have over 1 million URLs? Is it
possible there is repetition there due to allowing everything to
be crawled?

I cannot even view one of the sitemaps fully without my browser
freezing.

With a vB forum I would think you can get a lot more out of letting
only the archives get indexed rather than the whole forum.

So I would modify the robots.txt file this way:

User-Agent: *
Disallow: /tmp/
Disallow: /abuse/
Disallow: /up/
Disallow: /attachment/
Disallow: /bill/
Disallow: /client/
Disallow: /download/
Disallow: /help/
Disallow: /vb/
Allow: /vb/archive/

Sitemap: http://www.baghdad4ever.net/vb/sitemap_index.xml.gz

When you generate the sitemap limit it to the folder /vb/archive/ .

You will want to submit the forum as a separate site, and submit it as
http://www.baghdad4ever.net/vb/archive/ .
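
To see how the Disallow/Allow pair above plays out, here is a minimal sketch of longest-match precedence, the rule Google documents for its crawler: the most specific matching path wins, and Allow wins a tie. The helper name is_allowed and the sample paths are made up for illustration. Wildcards are ignored, and other crawlers may resolve conflicts differently.

```python
# Rules copied from the robots.txt proposed above, as (kind, path prefix) pairs.
RULES = [
    ("disallow", "/tmp/"),
    ("disallow", "/abuse/"),
    ("disallow", "/up/"),
    ("disallow", "/attachment/"),
    ("disallow", "/bill/"),
    ("disallow", "/client/"),
    ("disallow", "/download/"),
    ("disallow", "/help/"),
    ("disallow", "/vb/"),
    ("allow", "/vb/archive/"),
]


def is_allowed(path, rules=RULES):
    """Hypothetical helper: True if the longest matching rule permits `path`.

    Assumes plain prefix rules only (no * or $ wildcards).
    """
    best_kind, best_len = "allow", -1  # no matching rule means crawling is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_kind, best_len = kind, len(prefix)
    return best_kind == "allow"


if __name__ == "__main__":
    # Sample paths are illustrative, not taken from the real site.
    for sample in ("/vb/archive/index.php", "/vb/showthread.php", "/index.html"):
        print(sample, is_allowed(sample))
```

Under this rule anything below /vb/archive/ stays crawlable while the rest of /vb/ is blocked, whatever order the two lines appear in.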

On Mar 10, 3:46 am, baghdad wrote:


 
From: baghdad
Date: Tue, 11 Mar 2008 00:21:53 -0700 (PDT)
Local: Tues, Mar 11 2008 3:21 am
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
Thanks, webado.

It is still the same:

Sitemap statistics:
Total URLs: 181959
Indexed URLs: 0

I don't know why this is happening.

In the past I had 800000 URLs indexed and everything was OK.

Why now?

Please help.


 
From: webado
Date: Tue, 11 Mar 2008 06:51:31 -0700 (PDT)
Local: Tues, Mar 11 2008 9:51 am
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
You have lots of pages indexed, as you can see in a site: query. That
information in Webmaster Tools refers specifically to URLs contained
in the sitemap being indexed as a result of submitting the sitemap.
Probably most of whatever has been indexed so far was discovered by
other means, not as a consequence of submitting the sitemap.

If the sitemap is faulty, too large, or too slow, it won't get downloaded
and/or processed. But you'd have a message to that effect in Webmaster
Tools when that happens.

In any case, the numbers and dates reported in Webmaster Tools usually lag
behind reality.

You do realize a single sitemap must not contain more than 50000 URLs
and the file itself must not be bigger than 10MB? If it is, you must
break it up into even smaller pieces and submit them as separate
sitemaps, each of which respects those guidelines, all bundled in one
or several sitemap indexes as needed.
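
A rough sketch of that splitting, assuming you already have the full list of URLs in memory. The function names (chunk_urls, sitemap_xml, sitemap_index_xml, within_size) are made up for this example, and the 50000 and 10MB figures are simply the limits quoted above.

```python
from xml.sax.saxutils import escape

MAX_URLS = 50000               # per-sitemap URL limit quoted above
MAX_BYTES = 10 * 1024 * 1024   # per-file size limit quoted above
NAMESPACE = "http://www.sitemaps.org/schemas/sitemap/0.9"


def chunk_urls(urls, size=MAX_URLS):
    """Split a list of URLs into consecutive lists of at most `size` items."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def sitemap_xml(urls):
    """Build the XML text of one sitemap file for the given URLs."""
    lines = ['<?xml version="1.0" encoding="UTF-8"?>',
             '<urlset xmlns="%s">' % NAMESPACE]
    lines += ["  <url><loc>%s</loc></url>" % escape(u) for u in urls]
    lines.append("</urlset>")
    return "\n".join(lines)


def sitemap_index_xml(sitemap_urls):
    """Build the XML text of a sitemap index listing the sitemap files."""
    lines = ['<?xml version="1.0" encoding="UTF-8"?>',
             '<sitemapindex xmlns="%s">' % NAMESPACE]
    lines += ["  <sitemap><loc>%s</loc></sitemap>" % escape(u) for u in sitemap_urls]
    lines.append("</sitemapindex>")
    return "\n".join(lines)


def within_size(xml_text, limit=MAX_BYTES):
    """True if the UTF-8 encoded text (before gzip) is no larger than `limit`."""
    return len(xml_text.encode("utf-8")) <= limit
```

With the 181959 URLs reported in this thread, chunk_urls gives 4 sitemap files; with 1009771 it gives 21. A chunk that still fails within_size would have to be split further.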

As concerns your current robots.txt, I guess you misunderstood what I
said. I said to disallow /vb/, allow /vb/archive/, and build the
sitemap for /vb/archive/ URLs ONLY.

I would make the robots.txt this way:

User-Agent: *
Disallow: /tmp/
Disallow: /abuse/
Disallow: /up/
Disallow: /attachment/
Disallow: /bill/
Disallow: /client/
Disallow: /download/
Disallow: /help/
Disallow: /vb/
Allow: /vb/archive/

Sitemap: http://www.baghdad4ever.net/vb/archive/sitemap_index.xml.gz

Move the sitemap index and its component sitemaps into the /vb/archive/
folder if possible. Otherwise move it all up to the root. But do not
keep it in the /vb/ folder itself because the robots.txt file
disallows it.

In any case make sure your sitemaps in the sitemap index contain ONLY
urls from /vb/archive/ .
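
One way to check that last point before submitting, as a small sketch: list every URL whose path falls outside /vb/archive/. The helper name outside_archive and the sample URLs are made up for illustration.

```python
from urllib.parse import urlsplit

ARCHIVE_PREFIX = "/vb/archive/"


def outside_archive(urls, prefix=ARCHIVE_PREFIX):
    """Return the URLs whose path does not start with `prefix`."""
    return [u for u in urls if not urlsplit(u).path.startswith(prefix)]


if __name__ == "__main__":
    # Illustrative URLs only; the page names are assumptions.
    sample = [
        "http://www.baghdad4ever.net/vb/archive/index.php",
        "http://www.baghdad4ever.net/vb/showthread.php?t=1",
    ]
    print(outside_archive(sample))
```

An empty result means every URL in the list sits under the archive folder.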

On Mar 11, 3:21 am, baghdad wrote:


 
From: webado
Date: Tue, 11 Mar 2008 06:54:53 -0700 (PDT)
Local: Tues, Mar 11 2008 9:54 am
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
And be patient. It will take time to catch up.
In the meantime make sure you are not breaking any webmaster
guidelines as concerns spam, paid links, reciprocal links and all
that.
Since it's a forum, it is quite possible that forum members are posting
posts full of spam, self-promo links and all that. If this happens
your site can get penalized.
How good and vigilant are your forum moderators at blocking spam and
removing bad posts? It's such a huge forum, apparently, that it must be hard
to keep it clean.

On Mar 11, 3:21 am, baghdad wrote:


 
From: baghdad
Date: Tue, 11 Mar 2008 13:35:46 -0700 (PDT)
Local: Tues, Mar 11 2008 4:35 pm
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
Hi,

Thanks for your help.

My forum has 40 moderators, and they remove any spam topic.

Also, I can't disallow /vb/ in robots.txt, because I use the vBSEO hack, which
creates the sitemap in the root of /vb/.

I think the problem comes from redirection. When you enter

baghdad4ever.net/vb

it redirects to

http://baghdad4ever.net/vb

without www. I think that is the problem.

I put some code in the .htaccess so that all scripts on my site get www
automatically, except vb, because it has its own .htaccess (which belongs to
the vBSEO hack).

I don't know how to solve that, and I don't know if it is the cause of this
problem.

I'm waiting for your reply, my friend.


 
From: webado
Date: Tue, 11 Mar 2008 13:43:32 -0700 (PDT)
Local: Tues, Mar 11 2008 4:43 pm
Subject: Re: NO URL PROVIDED - Re: Most sites will not have all of their pages indexed.
I would crawl using Xenu, and then generate a sitemap using
GSiteCrawler, so you are not restricted.

I think that as long as you allow crawling of the entire thing
you will be in trouble with duplication. And the most useful indexing
is for the archive, in my opinion.

On 11 mar, 16:35, baghdad wrote:


 