<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<rss version="2.0">
  <channel>
  <title>wikiteam-discuss Google Group</title>
  <link>http://groups.google.com/group/wikiteam-discuss</link>
  <description>Discuss about WikiTeam</description>
  <language>en</language>
  <item>
  <title>Wikimedia Commons grab</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/db108c935a2f036d/0a0a5752b1461c51?show_docid=0a0a5752b1461c51</link>
  <description>
  Hydriz, as regards &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://lists.wikimedia.org/pipermail/labs-l/2013-May/001239.html&quot;&gt;[link]&lt;/a&gt; , it &lt;br&gt; really makes no sense to stop the Commons archiving &amp;quot;in the process of &lt;br&gt; trying to optimize bandwidth and resource usage&amp;quot;. &lt;br&gt; From what I understood, the blockers were special characters and some &lt;br&gt; database corruption. If that&#39;s been fixed or worked around, we must
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/db108c935a2f036d/0a0a5752b1461c51?show_docid=0a0a5752b1461c51</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Wed, 22 May 2013 14:07:00 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:640] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/1de6445c4fd4a434?show_docid=1de6445c4fd4a434</link>
  <description>
  Found a method stupid enough for me: I downloaded the HTML and JSON with &lt;br&gt; wget, &lt;br&gt; &amp;lt;&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;https://archive.org/download/wikia_dump_20121204/2013-05-03-index.zip&quot;&gt;[link]&lt;/a&gt;&amp;gt; &lt;br&gt; Grep tells me there are now 37440 wikis, 29502 last dumped in 2012 and &lt;br&gt; 6111 in 2013, 3988 in April. &lt;br&gt; &lt;p&gt;Nemo
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/1de6445c4fd4a434?show_docid=1de6445c4fd4a434</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Sat, 04 May 2013 09:40:43 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:640] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/df4b1be5ac96c906?show_docid=df4b1be5ac96c906</link>
  <description>
  Emilio J. Rodr�guez-Posada, 03/05/2013 09:29: &lt;br&gt; &amp;gt; I don&#39;t see many new dumps in &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://dumps.wikia.net/&quot;&gt;[link]&lt;/a&gt; but I haven&#39;t &lt;br&gt; &amp;gt; counted. &lt;br&gt; &amp;gt; &lt;br&gt; &amp;gt; &lt;br&gt; &amp;gt; count it ... &lt;br&gt; &lt;p&gt;I will at some point. &lt;br&gt; &lt;p&gt; &amp;gt; But we have dumps only of 10 % of their wikis, which are about 340k. &lt;br&gt; &amp;gt; &lt;br&gt; &amp;gt; &lt;br&gt; &amp;gt; Probably 90% of them are empty (just the mainpage).
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/df4b1be5ac96c906?show_docid=df4b1be5ac96c906</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Fri, 03 May 2013 10:48:07 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:640] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/2c26e4abd2085168?show_docid=2c26e4abd2085168</link>
  <description>
  2013/5/2 Federico Leva (Nemo) &amp;lt;nemow...@gmail.com&amp;gt; &lt;br&gt; &lt;p&gt;count it ... &lt;br&gt; &lt;p&gt;ok &lt;br&gt; &lt;p&gt;Probably 90% of them are empty (just the mainpage). &lt;br&gt; &lt;p&gt;If someone can write a simple bot to scrape ?Special:Statistics page and &lt;br&gt; read the good=XXX value to count how many wikis have real content...
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/2c26e4abd2085168?show_docid=2c26e4abd2085168</guid>
  <author>
  emi...@gmail.com
  (Emilio J. Rodríguez-Posada)
  </author>
  <pubDate>Fri, 03 May 2013 07:29:03 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:640] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/7d33ae05b7dd3a21?show_docid=7d33ae05b7dd3a21</link>
  <description>
  Emilio J. Rodr�guez-Posada, 02/05/2013 18:49: &lt;br&gt; &lt;p&gt;I don&#39;t see many new dumps in &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://dumps.wikia.net/&quot;&gt;[link]&lt;/a&gt; but I haven&#39;t &lt;br&gt; counted. The help page is wrong, by the way, autoconfirmed is enough. &lt;br&gt; &lt;p&gt;But we have dumps only of 10 % of their wikis, which are about 340k. &lt;br&gt; &lt;p&gt;Nemo
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/7d33ae05b7dd3a21?show_docid=7d33ae05b7dd3a21</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Thu, 02 May 2013 17:08:18 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:640] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/bd58f54f92a33a53?show_docid=bd58f54f92a33a53</link>
  <description>
  How many dumps are being generated by sysops? &lt;br&gt; &lt;p&gt;We need to know how many wikis have been edited since last backup and no &lt;br&gt; new dump is available. &lt;br&gt; &lt;p&gt;Anyway, Wikia wikis were saved in December 2012, so it is not an emergency &lt;br&gt; by now. &lt;br&gt; &lt;p&gt;2013/5/1 Federico Leva (Nemo) &amp;lt;nemow...@gmail.com&amp;gt;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/bd58f54f92a33a53?show_docid=bd58f54f92a33a53</guid>
  <author>
  emi...@gmail.com
  (Emilio J. Rodríguez-Posada)
  </author>
  <pubDate>Thu, 02 May 2013 16:49:07 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:640] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/3007b0be843e1f63?show_docid=3007b0be843e1f63</link>
  <description>
  Ah, I hadn&#39;t seen it: now dumps can only be requested by wiki sysops. &lt;br&gt; &amp;lt;&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://community.wikia.com/wiki/Help:Database_download?diff=974378&amp;oldid=957802&quot;&gt;[link]&lt;/a&gt;&amp;gt; &lt;br&gt; What should we do, run a bot to request all sysops to press the button? &lt;br&gt; Just start crawling all wikis directly at last? &lt;br&gt; &lt;p&gt;Nemo
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/3007b0be843e1f63?show_docid=3007b0be843e1f63</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Wed, 01 May 2013 16:27:47 UT
</pubDate>
  </item>
  <item>
  <title>Our collections have moved</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/833d9fa1a046c969/5343a19417652615?show_docid=5343a19417652615</link>
  <description>
  Hi all, &lt;br&gt; In case you have not noticed, all collections related to wikis (&amp;quot;wikiteam&amp;quot; &lt;br&gt; and &amp;quot;wikimediadownloads&amp;quot;) have all been moved to a new collection called &lt;br&gt; &amp;quot;wikicollections&amp;quot;, though still under the web collection. I have already &lt;br&gt; sent an email to the staff regarding the rationale behind this change, and
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/833d9fa1a046c969/5343a19417652615?show_docid=5343a19417652615</guid>
  <author>
  ad...@alphacorp.tk
  (Hydriz Wikipedia)
  </author>
  <pubDate>Sat, 20 Apr 2013 14:14:55 UT
</pubDate>
  </item>
  <item>
  <title>Fwd: [Xmldatadumps-l] making imports suck less, the sequel</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/92f9b9bc40727ba9/4926e8f7a3b554ba?show_docid=4926e8f7a3b554ba</link>
  <description>
  I think this is important for Wikiteam. The dumps we produce are of &lt;br&gt; little value if we don&#39;t ensure they can actually be used/read! &lt;br&gt; &lt;p&gt;Nemo &lt;br&gt; &lt;p&gt;-------- Messaggio originale -------- &lt;br&gt; Oggetto: [Xmldatadumps-l] making imports suck less, the sequel &lt;br&gt; Data: Wed, 10 Apr 2013 23:40:48 +0300 &lt;br&gt; Mittente: Ariel T. Glenn
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/92f9b9bc40727ba9/4926e8f7a3b554ba?show_docid=4926e8f7a3b554ba</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Wed, 10 Apr 2013 21:21:17 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:642] trying to generate a dump of wikilawschool.net for import into wikireader</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/80d218c4da62bcbf/9ce7a6f1f8d4d90e?show_docid=9ce7a6f1f8d4d90e</link>
  <description>
  Looks like Special:Export is disabled &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.wikilawschool.net/wiki/Special:Export&quot;&gt;[link]&lt;/a&gt; which is the method we use &lt;br&gt; to export the content. &lt;br&gt; &lt;p&gt;But API allows content extraction &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.wikilawschool.net/w/api.php?action=query&amp;prop=revisions&amp;titles=API|&quot;&gt;[link]&lt;/a&gt;Main%20Page&amp;amp;rvprop=timestamp|u ser|comment|contentAlthought
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/80d218c4da62bcbf/9ce7a6f1f8d4d90e?show_docid=9ce7a6f1f8d4d90e</guid>
  <author>
  emi...@gmail.com
  (emijrp)
  </author>
  <pubDate>Tue, 09 Apr 2013 10:17:03 UT
</pubDate>
  </item>
  <item>
  <title>trying to generate a dump of wikilawschool.net for import into wikireader</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/80d218c4da62bcbf/028551ce4381f72a?show_docid=028551ce4381f72a</link>
  <description>
  I am trying to generate a dump of: &lt;br&gt; python dumpgen.py --api=&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.wikilawschool.net/w/api.php&quot;&gt;[link]&lt;/a&gt; --xml &lt;br&gt; --curonly --path=dump5 --delay=8 &lt;br&gt; The titles work ok but then when it tries to retrieve the XML, I get: &lt;br&gt; Server is slow... Waiting some seconds and retrying... &lt;br&gt; An error have occurred while retrieving &amp;quot;Main_Page&amp;quot;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/80d218c4da62bcbf/028551ce4381f72a?show_docid=028551ce4381f72a</guid>
  <author>
  ggar...@twentyseven.com
  (Gustavo Garcia)
  </author>
  <pubDate>Tue, 09 Apr 2013 05:34:38 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:611] Move Wikia dumps to Internet Archive</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/e3b492eb7e484e4c?show_docid=e3b492eb7e484e4c</link>
  <description>
  Great, I see you did another backup &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://archive.org/details/wikia_dump_20121204&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;I have added it to &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://code.google.com/p/wikiteam/wiki/AvailableBackups&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;34,000 wikis saved, yeah! Nice work. &lt;br&gt; &lt;p&gt;2012/11/11 Federico Leva (Nemo) &amp;lt;nemow...@gmail.com&amp;gt;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/3a5564af9829be40/e3b492eb7e484e4c?show_docid=e3b492eb7e484e4c</guid>
  <author>
  emi...@gmail.com
  (emijrp)
  </author>
  <pubDate>Thu, 28 Mar 2013 20:58:17 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:638] xml dump of private mediawiki</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/5a027ccca0b9a43a/3198e843bdd86b9e?show_docid=3198e843bdd86b9e</link>
  <description>
  I&#39;m watching &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://svn.wikimedia.org/svnroot/pywikipedia/trunk/pywikipedia/login.py&quot;&gt;[link]&lt;/a&gt; and &lt;br&gt; login through API is a bit a mess... &lt;br&gt; &lt;p&gt;Also, Special:Export allows login? We export text from that special page, &lt;br&gt; not API. We use API only for titles. &lt;br&gt; &lt;p&gt;2013/3/28 Federico Leva (Nemo) &amp;lt;nemow...@gmail.com&amp;gt;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/5a027ccca0b9a43a/3198e843bdd86b9e?show_docid=3198e843bdd86b9e</guid>
  <author>
  emi...@gmail.com
  (emijrp)
  </author>
  <pubDate>Thu, 28 Mar 2013 14:08:23 UT
</pubDate>
  </item>
  <item>
  <title>Re: [wikiteam-discuss:638] xml dump of private mediawiki</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/5a027ccca0b9a43a/1e44c2810eb097cf?show_docid=1e44c2810eb097cf</link>
  <description>
  Scott Kraft, 28/03/2013 14:02: &lt;br&gt; &lt;p&gt;No, not yet. &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;https://code.google.com/p/wikiteam/issues/detail?id=20&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; &lt;p&gt;Nemo
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/5a027ccca0b9a43a/1e44c2810eb097cf?show_docid=1e44c2810eb097cf</guid>
  <author>
  nemow...@gmail.com
  (Federico Leva (Nemo))
  </author>
  <pubDate>Thu, 28 Mar 2013 13:19:58 UT
</pubDate>
  </item>
  <item>
  <title>xml dump of private mediawiki</title>
  <link>http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/5a027ccca0b9a43a/30420c1b89053c87?show_docid=30420c1b89053c87</link>
  <description>
  I have a private mediawiki (a login is required to view and edit pages) &lt;br&gt; installation on my ftp server. I am looking to do an xml dump before I &lt;br&gt; upgrade the mediawiki version (I already backed up the database). I am &lt;br&gt; trying to use this: &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://code.google.com/p/wikiteam/wiki/NewTutorial#I_have_no_shell_access_to_server&quot;&gt;[link]&lt;/a&gt;
  </description>
  <guid isPermaLink="true">http://groups.google.com/group/wikiteam-discuss/browse_thread/thread/5a027ccca0b9a43a/30420c1b89053c87?show_docid=30420c1b89053c87</guid>
  <author>
  sakra...@gmail.com
  (Scott Kraft)
  </author>
  <pubDate>Thu, 28 Mar 2013 13:02:12 UT
</pubDate>
  </item>
  </channel>
</rss>
