* dumpgenerator.py tool: https://code.google.com/p/wikiteam/source/browse/trunk/dumpgenerator.py
* a new tutorial: https://code.google.com/p/wikiteam/wiki/NewTutorial
On 16 dic, 19:11, emijrp <emi...@gmail.com> wrote:
> Dear all;
>
> I'm working in a well known wiki of wikis, WikiIndex.[1] Using its semantic
> properties capabilities, I have generated a dynamic list with all the wikis
> included in WikiIndex and the backup status for all them.[2]
>
> I request your help for downloading all those wikis, and complete the
> table. We have to exclude Wikimedia (Wikipedias, Wiktionaries, etc...) and
> Wikia wikis, which offer public backups, and of course all non-MediaWiki
> wikis (until we develop new backup tools).
>
> So, for example, you choose "Hortipedia" in that list, generate the backup
> with dumpgenerator.py, upload it into Internet Archive and/or Google Code
> (send me an e-mail to add it to Download section), and the last step is to
> add "| backupurl = http://..." to the infobox template in the article in
> WikiIndex (in this casehttp://wikiindex.org/Hortipedia). The table will be
How many are they? Why are you using this list and not
<http://s23.org/wikistats/mediawikis_html.php?sort=good_desc,total_desc&th=0&lines=1348>
which has API URL etc.?
Nemo
That list only includes about ~1300 wikis (truncated query?).
If you prefer to work using that list, OK, but it neither can be
edited to mark wikis as backed up, nor add new wikis in an easy way.
WikiIndex is a wiki, and you can create articles for any wiki you want
(and it will be listed).
Again, use whatever you prefer, but we need a method to avoid dupe
work.
Regards,
emijrp
> <http://s23.org/wikistats/mediawikis_html.php?sort=good_desc,total_des...>
How many wikis to be archived does wikiindex include?
s23.org has over 60.000 wikis in total, way more than wikiindex.
>
> If you prefer to work using that list, OK, but it neither can be
> edited to mark wikis as backed up, nor add new wikis in an easy way.
> WikiIndex is a wiki, and you can create articles for any wiki you want
> (and it will be listed).
>
> Again, use whatever you prefer, but we need a method to avoid dupe
> work.
On wikiindex I don't see any method to flag a wiki as being archived,
either. And it's very easy to see whether a ikis has already been
archived: you only have to look for it on archive.org (given that this
is what we're using).
Nemo
emijrp, 16/12/2011 19:37:How many wikis to be archived does wikiindex include?
That list only includes about ~1300 wikis (truncated query?).
s23.org has over 60.000 wikis in total, way more than wikiindex.
On wikiindex I don't see any method to flag a wiki as being archived, either.
If you prefer to work using that list, OK, but it neither can be
edited to mark wikis as backed up, nor add new wikis in an easy way.
WikiIndex is a wiki, and you can create articles for any wiki you want
(and it will be listed).
Again, use whatever you prefer, but we need a method to avoid dupe
work.
And it's very easy to see whether a ikis has already been archived: you only have to look for it on archive.org (given that this is what we're using).
Nemo
It's not truncated, that's only the list of private mediawikis.
They're not filtered on that list, how many are they on wikindex? I see
4000 on http://wikiindex.org/Category:MediaWiki , and the subcategories
for wiki farms contain about 1700, so this leaves 2300; are those
correctly tagged or could they belong to some wiki farm?
Nemo
Just to change the information that emijrp provides, please don't upload it to the Internet Archive directly, it would bring great trouble for us (or actually me) to pull the identifier with the wikiteam collection. We are currently having some trouble with the earlier identifiers created.
Just either email emijrp directly, and if possible, email me so that I can actually add it into the wikiteam collection. My email is ad...@alphacorp.tk, and I can accept dumps at least up to 10GB :P