<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing</id>
  <title type="text">Crawling, indexing, and ranking Google Group</title>
  <subtitle type="text">
  Exchange information about how Google finds and ranks pages and learn more about your site in the Google index.
  </subtitle>
  <link href="/group/Google_Webmaster_Help-Indexing/feed/atom_v1_0_msgs.xml" rel="self" title="Crawling, indexing, and ranking feed"/>
  <updated>2008-09-07T02:23:17Z</updated>
  <generator uri="http://groups.google.com" version="1.99">Google Groups</generator>
  <entry>
  <author>
  <name>Redhousepainter</name>
  <email/>
  </author>
  <updated>2008-09-07T02:23:17Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/9b31b360c643971f?show_docid=9b31b360c643971f</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/9b31b360c643971f?show_docid=9b31b360c643971f"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  Looks like I&#39;m in a mess! I don&#39;t even know where to start! &lt;br&gt; &lt;p&gt;In the case of... &lt;br&gt; &lt;p&gt;&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://phauxshow.com/gallery/jana-gouchev/Flirting+Between+Lost+&amp;+Fou&quot;&gt;[link]&lt;/a&gt;... &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://phauxshow.com/gallery/jana-gouchev/Flirting%20Between%20Lost%2&quot;&gt;[link]&lt;/a&gt;... &lt;br&gt; &lt;p&gt;the actual image on the server is... &lt;br&gt; &lt;p&gt;Flirting Between Lost &amp;amp; Found v1.jpg
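  A minimal sketch, assuming the only problem is how the file name is encoded in the link (the gallery software may build its URLs differently): Python's urllib.parse.quote percent-encodes the spaces and the ampersand in that file name. The helper name encode_name is hypothetical, and the ampersand is written as \x26 only so the snippet stays valid inside this XML feed.

```python
from urllib.parse import quote, unquote

# File name as given in the post; \x26 is the ampersand character.
FILE_NAME = "Flirting Between Lost \x26 Found v1.jpg"


def encode_name(name):
    """Percent-encode a single path segment (hypothetical helper)."""
    # safe="" also encodes "/" so the name cannot spill into another segment.
    return quote(name, safe="")


if __name__ == "__main__":
    encoded = encode_name(FILE_NAME)
    print(encoded)
    # Decoding the encoded form should give back the original file name.
    print(unquote(encoded) == FILE_NAME)
```

  If the link in the page matches the encoded form and still returns a 404, the cause is probably elsewhere (for example the file path on the server).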
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T02:10:56Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/66734a162e002d32?show_docid=66734a162e002d32</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/66734a162e002d32?show_docid=66734a162e002d32"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  I believe there is a Mac equivalent to Xenu, though maybe not as good. &lt;br&gt; &lt;p&gt;Somebody posted about it recently ... hmmmm... need to find that post. &lt;br&gt; &lt;p&gt;Ok, there is this: &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://peacockmedia.co.uk/index.php?view=article&amp;catid=7%3Aproducts&amp;id=4%3Aintegrity&amp;option=com_content&amp;Itemid=4&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;And here: &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://blog.westhost.com/2008/02/break-your-broken-link-problem-with-xenu/&quot;&gt;[link]&lt;/a&gt;
  </summary>
  </entry>
  <entry>
  <author>
  <name>Redhousepainter</name>
  <email/>
  </author>
  <updated>2008-09-07T02:04:11Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/d4c6d94e8465d25e?show_docid=d4c6d94e8465d25e</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/d4c6d94e8465d25e?show_docid=d4c6d94e8465d25e"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  Webado, &lt;br&gt; &lt;p&gt;I replied before I read your last post. I&#39;ll go through it now and &lt;br&gt; post again in a bit.
  </summary>
  </entry>
  <entry>
  <author>
  <name>Redhousepainter</name>
  <email/>
  </author>
  <updated>2008-09-07T02:02:29Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/2de4626016247052?show_docid=2de4626016247052</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/2de4626016247052?show_docid=2de4626016247052"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  Webado, &lt;br&gt; &lt;p&gt;Thanks for the invaluable information! &lt;br&gt; &lt;p&gt;I checked out Xenu. Is there a Mac equivalent? &lt;br&gt; &lt;p&gt;Also, I changed my robots.txt to what you recommended. I also have one &lt;br&gt; for the Zenphoto part of the site, in which I have... &lt;br&gt; &lt;p&gt;User-agent: * &lt;br&gt; Allow: /gallery/albums &lt;br&gt; Disallow: /gallery/cache &lt;br&gt; Disallow: /gallery/themes
  </summary>
  </entry>
  <entry>
  <author>
  <name>Snowman2468</name>
  <email/>
  </author>
  <updated>2008-09-07T01:54:37Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/291e47e7ec6a1434/5c6d1aa0f1f57672?show_docid=5c6d1aa0f1f57672</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/291e47e7ec6a1434/5c6d1aa0f1f57672?show_docid=5c6d1aa0f1f57672"/>
  <title type="text">Re: Reconsideration request - have we done enough ?</title>
  <summary type="html" xml:space="preserve">
  Thanks BBDeath - we&#39;ll be looking to tighten things up a lot more with &lt;br&gt; all of your good inputs.
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T01:51:40Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/58ab7eef653c2e90?show_docid=58ab7eef653c2e90</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/58ab7eef653c2e90?show_docid=58ab7eef653c2e90"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  OK, from the gallery folder and below you have 21 broken links (for &lt;br&gt; images), when crawling with Xenu: &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://home.snafu.de/tilman/xenulink.html&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;Broken links, ordered by link: &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://phauxshow.com/gallery/albums/pamela-nash/i/bleach%20abstract.jpg&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; error code: 404 (not found), linked from page(s):
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T01:29:48Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/097c866e1a43819b?show_docid=097c866e1a43819b</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/097c866e1a43819b?show_docid=097c866e1a43819b"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  Keep in mind that you will get 404s for any broken links appearing &lt;br&gt; elsewhere on the web. As long as your site has no such broken links &lt;br&gt; NOW, don&#39;t worry about them; they will drop out in time. Or try to &lt;br&gt; capture and 301 redirect them to the correct URL. &lt;br&gt; It looks like those malformed links got cached maybe from a temporary
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T01:26:19Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/93294cb5c6832e1a?show_docid=93294cb5c6832e1a</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/93294cb5c6832e1a?show_docid=93294cb5c6832e1a"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  With a Wordpress blog you need to block certain URLs in robots.txt. &lt;br&gt; For instance, anything from the folders prefixed with wp-. &lt;br&gt; So the robots.txt file should be at least: &lt;br&gt; &lt;p&gt;User-agent: * &lt;br&gt; Disallow: /wp-
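  A rough way to sanity-check rules like these before uploading them, sketched with Python's standard urllib.robotparser. The test paths and the helper name is_allowed are assumptions for illustration, and this parser's matching may differ in details from how Googlebot reads the file.

```python
from urllib.robotparser import RobotFileParser

# Rules quoted in the post above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-
"""


def is_allowed(path, agent="*"):
    """Return True if the rules above let the agent fetch the path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # Hypothetical paths: two Wordpress internals and two public pages.
    for path in ("/wp-admin/", "/wp-content/themes/", "/gallery/albums/", "/"):
        print(path, is_allowed(path))
```

  Anything whose path starts with /wp- should come back as blocked, while the gallery and the home page stay crawlable.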
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T01:21:53Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/3354eedde0dd22e1?show_docid=3354eedde0dd22e1</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/3354eedde0dd22e1?show_docid=3354eedde0dd22e1"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  Did you scan the site using Xenu? &lt;br&gt; &lt;p&gt;Also your robots.txt is not quite right. &lt;br&gt; &lt;p&gt;You have: &lt;br&gt; &lt;p&gt;User-Agent: * &lt;br&gt; Allow: / &lt;br&gt; Allow: /gallery/albums &lt;br&gt; Allow: /albums/ &lt;br&gt; Allow: /gallery/ &lt;br&gt; &lt;p&gt;Sitemap: &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://phauxshow.com/sitemap.xml.gz&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;It doesn&#39;t seem you intend to block anything, so this should get &lt;br&gt; rewritten as:
  </summary>
  </entry>
  <entry>
  <author>
  <name>Redhousepainter</name>
  <email/>
  </author>
  <updated>2008-09-07T01:06:02Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/c7ba42c1c8326006?show_docid=c7ba42c1c8326006</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/937679ef60d0096c/c7ba42c1c8326006?show_docid=c7ba42c1c8326006"/>
  <title type="text">Re: Wordpress, Zenphoto and a tonne of 404s</title>
  <summary type="html" xml:space="preserve">
  Thanks for looking! Some of the urls may no longer exist - however &lt;br&gt; there shouldn&#39;t be more than a dozen or so of those. I&#39;ve been working &lt;br&gt; on the problem for some time and have changed things around to try to &lt;br&gt; identify the issue. &lt;br&gt; &lt;p&gt;As it stands now Webmaster says I have 735 errors for urls and 743 not
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T00:45:11Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/419335affea2dfbe/dcb37d7bea2aefd8?show_docid=dcb37d7bea2aefd8</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/419335affea2dfbe/dcb37d7bea2aefd8?show_docid=dcb37d7bea2aefd8"/>
  <title type="text">Re: Multipule Questions</title>
  <summary type="html" xml:space="preserve">
  It&#39;s indexed: &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.google.com/search?sourceid=navclient&amp;ie=UTF-8&amp;rlz=1T4GGIH_enCA226CA226&amp;q=site:thepubliccompanyindex%2ecom&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;Whether you can find the site by any search terms you are trying or &lt;br&gt; not is a different story. That depends on how well optimised the site &lt;br&gt; is for whatever searches you want to rank for, how many backlinks you
  </summary>
  </entry>
  <entry>
  <author>
  <name>NACMI</name>
  <email/>
  </author>
  <updated>2008-09-07T00:41:49Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e0ffac1c41e7f8a2/d4e95aa854595737?show_docid=d4e95aa854595737</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e0ffac1c41e7f8a2/d4e95aa854595737?show_docid=d4e95aa854595737"/>
  <title type="text">Re: Problem with Text Only Cache</title>
  <summary type="html" xml:space="preserve">
  Thanks so much! I will try and validate it.
  </summary>
  </entry>
  <entry>
  <author>
  <name>JLH</name>
  <email/>
  </author>
  <updated>2008-09-07T00:36:46Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dea161fe117ac88b/4e3aa0ac0a34ed7a?show_docid=4e3aa0ac0a34ed7a</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dea161fe117ac88b/4e3aa0ac0a34ed7a?show_docid=4e3aa0ac0a34ed7a"/>
  <title type="text">Re: What could still be causing my penalty?</title>
  <summary type="html" xml:space="preserve">
  &amp;quot;go to a real forum like webmasterworld&amp;quot; &lt;br&gt; &lt;p&gt;Now that is funny.
  </summary>
  </entry>
  <entry>
  <author>
  <name>webado</name>
  <email/>
  </author>
  <updated>2008-09-07T00:33:16Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/adb0004b1b3eccf4/d0392925f36c3f92?show_docid=d0392925f36c3f92</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/adb0004b1b3eccf4/d0392925f36c3f92?show_docid=d0392925f36c3f92"/>
  <title type="text">Re: Googlebot killing one of my servers</title>
  <summary type="html" xml:space="preserve">
  If this account /~hlstatsx/ does not exist it should issue a different &lt;br&gt; response, not a 404 simply for the url not existing. &lt;br&gt; Maybe a 403 or a 500.
  </summary>
  </entry>
  <entry>
  <author>
  <name>Dan Sherman</name>
  <email/>
  </author>
  <updated>2008-09-07T00:31:22Z</updated>
  <id>http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/419335affea2dfbe/04970c8a51eacdee?show_docid=04970c8a51eacdee</id>
  <link href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/419335affea2dfbe/04970c8a51eacdee?show_docid=04970c8a51eacdee"/>
  <title type="text">Multipule Questions</title>
  <summary type="html" xml:space="preserve">
  First, I verified my site about a week ago. Today I was asked to verify &lt;br&gt; again. Do I have to do this every time I log in? If not, how frequently &lt;br&gt; do I have to do it? &lt;br&gt; &lt;p&gt;Second, Google says that they indexed my site ( &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://thepubliccompanyindex.com&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; ) on the 1st. They also say I&#39;m included in the Google index. However
  </summary>
  </entry>
</feed>
