<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<rss version="2.0">
  <channel>
  <title>SOFTplus GSiteCrawler Google Group</title>
  <link>http://groups.google.com/group/gsitecrawler</link>
  <description>Discussion group for the GSiteCrawler, a Windows tool used to crawl websites and automatically create Google Sitemap files (and much more).</description>
  <language>en</language>
  <item>
  <title>Sitemaps for folders</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/2096f03e38bfa53a/4ce2fc2e9b30c888?show_docid=4ce2fc2e9b30c888</link>
  <description>
  If you decide to separate your site into multiple sub-sites (maybe &lt;br&gt; based on sub-folders), then you can build a site-map for each such &lt;br&gt; sub-site, according to your own schedule. Again the same deal, each &lt;br&gt; site-map can be a site-map index with individual site-maps gzipped. &lt;br&gt; &lt;p&gt;Can you give me more information on how to do this?
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/2096f03e38bfa53a/4ce2fc2e9b30c888?show_docid=4ce2fc2e9b30c888</guid>
  <author>
  2snap...@gmail.com
  (MrMyckster)
  </author>
  <pubDate>Wed, 11 Nov 2009 10:22:59 UT
</pubDate>
  </item>
  <item>
  <title>Re: Heavy Site Map Issues</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/e74b49b9f1f9ad64/416428a043ec806e?show_docid=416428a043ec806e</link>
  <description>
  Hi Karthick, &lt;br&gt; &lt;p&gt;First of all, I have to say I have no first-hand knowledge of how to &lt;br&gt; manage such very large sites. The largest site I have has about 12,000 &lt;br&gt; URLs, easily managed by GSC, though it takes 3 hours or so to recrawl. &lt;br&gt; &lt;p&gt;GSiteCrawler has the option of making sitemap indexes for multiple &lt;br&gt; sitemaps.
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/e74b49b9f1f9ad64/416428a043ec806e?show_docid=416428a043ec806e</guid>
  <author>
  web...@gmail.com
  (webado)
  </author>
  <pubDate>Tue, 10 Nov 2009 13:35:24 UT
</pubDate>
  </item>
  <item>
  <title>Heavy Site Map Issues</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/e74b49b9f1f9ad64/039574d5ac8cd9df?show_docid=039574d5ac8cd9df</link>
  <description>
  Hi there, &lt;br&gt; &lt;p&gt;I am Karthick, working for Sify Technologies India. I work on the &lt;br&gt; domain sify.com, one of the premier portals in India. We have &lt;br&gt; decided to create a sitemap for our domain. We have many channels, like &lt;br&gt; sports, news, finance, and movies, and we plan to create a separate &lt;br&gt; sitemap for each channel. The issue here is there will be thousands of
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/e74b49b9f1f9ad64/039574d5ac8cd9df?show_docid=039574d5ac8cd9df</guid>
  <author>
  sifychen...@gmail.com
  (Karthick)
  </author>
  <pubDate>Tue, 10 Nov 2009 07:24:50 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Re: Issue with ftp upload using explicit SSL</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/c4d02eba5d557875?show_docid=c4d02eba5d557875</link>
  <description>
  I cannot check because I don&#39;t have an SSL cert anywhere. &lt;br&gt; &lt;p&gt;I fail to see how the upload function can change protocol - I thought that &lt;br&gt; was established at connect time. But what do I know ... &lt;br&gt; &lt;p&gt;----- Original Message ----- &lt;br&gt; To: &amp;quot;SOFTplus GSiteCrawler&amp;quot; &amp;lt;gsitecrawler@googlegroups.com&amp;gt; &lt;br&gt; Sent: Thursday, October 29, 2009 9:40 AM
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/c4d02eba5d557875?show_docid=c4d02eba5d557875</guid>
  <author>
  web...@gmail.com
  (Christina S)
  </author>
  <pubDate>Fri, 30 Oct 2009 05:23:54 UT
</pubDate>
  </item>
  <item>
  <title>Re: Issue with ftp upload using explicit SSL</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/bf5e515c0087f578?show_docid=bf5e515c0087f578</link>
  <description>
  I don&#39;t think the issue is with connecting to the server using ftpes, as &lt;br&gt; this works. We can see from the log that we connect correctly to the &lt;br&gt; server. &lt;br&gt; &lt;p&gt;Where it goes wrong is when we try to upload the file. At this point &lt;br&gt; the log indicates: &lt;br&gt; &lt;p&gt;534 Policy requires SSL. &lt;br&gt; Put failed: 534 Policy requires SSL.
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/bf5e515c0087f578?show_docid=bf5e515c0087f578</guid>
  <author>
  spen...@3ex.co.uk
  (spencer@3ex)
  </author>
  <pubDate>Thu, 29 Oct 2009 13:40:55 UT
</pubDate>
  </item>
  <item>
  <title>Re: Issue with ftp upload using explicit SSL</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/33d7bb3d7542becc?show_docid=33d7bb3d7542becc</link>
  <description>
  Sorry, I cannot tell; I don&#39;t use this for my server. &lt;br&gt; &lt;p&gt;But does your FTP address reflect the SSL? Are you using ftps://.... &lt;br&gt; for the host address?
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/33d7bb3d7542becc?show_docid=33d7bb3d7542becc</guid>
  <author>
  web...@gmail.com
  (webado)
  </author>
  <pubDate>Wed, 28 Oct 2009 12:37:05 UT
</pubDate>
  </item>
  <item>
  <title>Issue with ftp upload using explicit SSL</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/232c68f3bbdf045e?show_docid=232c68f3bbdf045e</link>
  <description>
  Hello, &lt;br&gt; &lt;p&gt;We are very new to GSiteCrawler, so please bear with us. &lt;br&gt; &lt;p&gt;We have to use explicit SSL to upload files to our website. &lt;br&gt; &lt;p&gt;When we try to upload the test file, we get the following error: &lt;br&gt; &lt;p&gt;FTP Connection 28/10/2009 11:54 &lt;br&gt; GSiteCrawler v1.23 rev. 286 &lt;br&gt; ----------------------------------------
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/a5fdbc60a79dbd92/232c68f3bbdf045e?show_docid=232c68f3bbdf045e</guid>
  <author>
  spen...@3ex.co.uk
  (spencer@3ex)
  </author>
  <pubDate>Wed, 28 Oct 2009 12:10:23 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Re: Recrawl question(s)</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/14affed641cb12e5/52010d45d980fc65?show_docid=52010d45d980fc65</link>
  <description>
  Many thanks. &lt;br&gt; &lt;p&gt;Joe &lt;br&gt; &lt;p&gt;MOTORHEAD extraordinaire &lt;br&gt; Professional Storage and Workspace Solutions &lt;br&gt; 79 Park Road - Chelmsford, MA - 01824 &lt;br&gt; Toll Free 800.618.8028 - Direct 978.618.2800 - Fax 978.418.0404 &lt;br&gt; Visit our web site at &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.MotorheadExtraordinaire.com&quot;&gt;[link]&lt;/a&gt; and &lt;br&gt; for our latest specials, &lt;br&gt; &amp;lt;&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;https://www.motorheadextraordinaire.com/create_account.php&quot;&gt;[link]&lt;/a&gt;&amp;gt;sign up
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/14affed641cb12e5/52010d45d980fc65?show_docid=52010d45d980fc65</guid>
  <author>
  motorheadextraordina...@gmail.com
  (Joe Germann)
  </author>
  <pubDate>Fri, 23 Oct 2009 12:15:17 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Recrawl question(s)</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/14affed641cb12e5/d1767bdc44dcb51a?show_docid=d1767bdc44dcb51a</link>
  <description>
  You can delete all the currently listed URLs from the URL List (option Delete &lt;br&gt; all non-manual urls) and start a fresh crawl after that. &lt;br&gt; &lt;p&gt;----- Original Message ----- &lt;br&gt; To: &amp;quot;SOFTplus GSiteCrawler&amp;quot; &amp;lt;gsitecrawler@googlegroups.com&amp;gt; &lt;br&gt; Sent: Thursday, October 22, 2009 5:53 PM &lt;br&gt; &lt;p&gt;Christina &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.webado.net&quot;&gt;[link]&lt;/a&gt;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/14affed641cb12e5/d1767bdc44dcb51a?show_docid=d1767bdc44dcb51a</guid>
  <author>
  web...@gmail.com
  (Christina S)
  </author>
  <pubDate>Fri, 23 Oct 2009 04:45:00 UT
</pubDate>
  </item>
  <item>
  <title>Recrawl question(s)</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/14affed641cb12e5/90f1f7841f39ac91?show_docid=90f1f7841f39ac91</link>
  <description>
  I just did a major reorganization of my eCommerce web site and moved &lt;br&gt; categories and products all around. I also generated a .htaccess file &lt;br&gt; that did a &amp;quot;Redirect 301 From To&amp;quot; for everything that moved about. &lt;br&gt; &lt;p&gt;I just kicked off a recrawl with GSiteCrawler and it looks like GSC is &lt;br&gt; crawling the old URLs from what was last in the database. These must
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/14affed641cb12e5/90f1f7841f39ac91?show_docid=90f1f7841f39ac91</guid>
  <author>
  motorheadextraordina...@gmail.com
  (Motorhead Extraordinaire)
  </author>
  <pubDate>Thu, 22 Oct 2009 21:53:18 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Re: robots.txt and web site remap</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/c96e756a746be07b?show_docid=c96e756a746be07b</link>
  <description>
  Works great. Thanks a bunch. &lt;br&gt; &lt;p&gt;Joe &lt;br&gt; &lt;p&gt;MOTORHEAD extraordinaire &lt;br&gt; Professional Storage and Workspace Solutions &lt;br&gt; 79 Park Road - Chelmsford, MA - 01824 &lt;br&gt; Toll Free 800.618.8028 - Direct 978.618.2800 - Fax 978.418.0404 &lt;br&gt; Visit our web site at &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.MotorheadExtraordinaire.com&quot;&gt;[link]&lt;/a&gt; and &lt;br&gt; for our latest specials,
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/c96e756a746be07b?show_docid=c96e756a746be07b</guid>
  <author>
  j...@motorheadextraordinaire.com
  (Joe Germann)
  </author>
  <pubDate>Tue, 20 Oct 2009 05:38:31 UT
</pubDate>
  </item>
  <item>
  <title>Re: robots.txt and web site remap</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/c93648122003ecdf?show_docid=c93648122003ecdf</link>
  <description>
  Use &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://web-sniffer.net/&quot;&gt;[link]&lt;/a&gt; and put in your URL. &lt;br&gt; &lt;p&gt;On 19 oct, 14:17, Joe Germann &amp;lt;motorheadextraordina...@gmail.com&amp;gt; &lt;br&gt; wrote:
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/c93648122003ecdf?show_docid=c93648122003ecdf</guid>
  <author>
  web...@gmail.com
  (webado)
  </author>
  <pubDate>Mon, 19 Oct 2009 19:58:12 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Re: robots.txt and web site remap</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/4e96a3156becbddf?show_docid=4e96a3156becbddf</link>
  <description>
  I implemented the following .htaccess and it appears to work just &lt;br&gt; fine for users. I can plug in my IP and still get access to my web site. &lt;br&gt; Options +FollowSymLinks &lt;br&gt; RewriteEngine On &lt;br&gt; RewriteBase / &lt;br&gt; RewriteCond %{HTTP_USER_AGENT} &lt;br&gt; ^.*(Googlebot|Googlebot|Mediapartners|Adsbot|Feedfetcher)-?(Google|Image)?
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/4e96a3156becbddf?show_docid=4e96a3156becbddf</guid>
  <author>
  motorheadextraordina...@gmail.com
  (Joe Germann)
  </author>
  <pubDate>Mon, 19 Oct 2009 18:17:45 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Re: robots.txt and web site remap</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/a90aa75132d161e5?show_docid=a90aa75132d161e5</link>
  <description>
  It looks like it; just have to play around a bit to figure it all out. &lt;br&gt; &lt;p&gt;Joe
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/a90aa75132d161e5?show_docid=a90aa75132d161e5</guid>
  <author>
  j...@motorheadextraordinaire.com
  (Joe Germann)
  </author>
  <pubDate>Mon, 19 Oct 2009 01:00:41 UT
</pubDate>
  </item>
  <item>
  <title>Re: [GSiteCrawler] Re: robots.txt and web site remap</title>
  <link>http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/eb18d8d282c31c24?show_docid=eb18d8d282c31c24</link>
  <description>
  Yes, one of those examples should work for you. &lt;br&gt; &lt;p&gt; ----- Original Message ----- &lt;br&gt; From: Joe Germann &lt;br&gt; To: gsitecrawler@googlegroups.com &lt;br&gt; Sent: Sunday, October 18, 2009 8:44 PM &lt;br&gt; Subject: [GSiteCrawler] Re: robots.txt and web site remap &lt;br&gt; &lt;p&gt; Thanks for the guidance. I am investigating how to properly set up a .htaccess to do this. It looks like it is straightforward. I just have to read up a bit more and set up a test scenario. &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.askapache.com/htaccess/503-service-temporarily-unavailable.html&quot;&gt;[link]&lt;/a&gt;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/gsitecrawler/browse_thread/thread/73151e82c679a125/eb18d8d282c31c24?show_docid=eb18d8d282c31c24</guid>
  <author>
  web...@gmail.com
  (Christina S)
  </author>
  <pubDate>Mon, 19 Oct 2009 00:55:38 UT
</pubDate>
  </item>
  </channel>
</rss>
