Hi.
I put a large robots.txt file (5000 lines) into root directory.
Clearly it was a mistake and I corrected it, using wildcard characters
now. This is what happened.
During the time my old, large robots.txt was online, the "robots.txt
analysis" tool was cutting the robots file size. It seems that the
robots file was simply too big (In the text box) for this tool.
Suddenly, according to the webmaster tool, urls were blocked, that
were not in the robots file. Some of these urls still appeared in the
google index, some without a "cached version" and some without a title
and description.
My questions are simple:
1. Does the "robots.txt analysis" tool really reflect what happens in
Google, since robot in google can be larger than 5000 lines?
2. Is it possible that the tool shows a certain URL as blocked altough
it was not blocked by google's engines?
4. If it was really blocked, for a 10-20 days, how long will it be
until it is reindexed properly? (page has PR 4)
Thank you,
Marc