IMSLP unfriendlies on facilitating archival

15 views
Skip to first unread message

Luiz Augusto

unread,
Oct 30, 2012, 1:48:03 PM10/30/12
to wikiteam...@googlegroups.com
Firstly, I'm very happy to see that I aren't the only one from Wikimedia projects interested in digital preservation of fragile content. I'm not alone with this insanity! xD

I've made an attempt to public contact the owner from IMSLP.org project requesting data dumps. Unfortunatelly until this moment I got only the default reply "OMG SECURITY ISSUES!" that everyone knows that it isn't the real answer for at least 90% of related scenarios.

So... How difficult will be to write a script to make collaborative small portions copies from that project? They are basically based on default MediaWiki wikitext, default MediaWiki uploaded medias (for the most oldest uploads) and custom code to hide the full external URL and track clicks (for the most recent uploads).

I don't have sufficient knowledge to help on coding but I will be very happy to help on executing and saving the data (and even to poke continually Wikimedia community, staff and Board to create a de facto free fork for that project).

Related discussion:

http://imslp.org/index.php?title=Special:WikiForum&thread=2513

Best,
[[:m:User:555]]

Federico Leva (Nemo)

unread,
Oct 30, 2012, 4:52:05 PM10/30/12
to wikiteam...@googlegroups.com, Nickanc Wikipedia
Luiz Augusto, 30/10/2012 18:48:
> Firstly, I'm very happy to see that I aren't the only one from Wikimedia
> projects interested in digital preservation of fragile content. I'm not
> alone with this insanity! xD

Glad to see you here, indeed like-minded people find each other, not
only in Wikimedia projects. ;-)

> I've made an attempt to public contact the owner from IMSLP.org project
> requesting data dumps. Unfortunatelly until this moment I got only the
> default reply "OMG SECURITY ISSUES!" that everyone knows that it isn't
> the real answer for at least 90% of related scenarios.
>
> So... How difficult will be to write a script to make collaborative
> small portions copies from that project? They are basically based on
> default MediaWiki wikitext, default MediaWiki uploaded medias (for the
> most oldest uploads) and custom code to hide the full external URL and
> track clicks (for the most recent uploads).

It's very easy: https://code.google.com/p/wikiteam/wiki/NewTutorial
We've already tried dumping their wiki several times, but they've always
blocked us; or at least this is what we believe, errors don't always
have a clear source.

> I don't have sufficient knowledge to help on coding but I will be very
> happy to help on executing and saving the data (and even to poke
> continually Wikimedia community, staff and Board to create a de facto
> free fork for that project).
>
> Related discussion:
>
> http://imslp.org/index.php?title=Special:WikiForum&thread=2513

I don't think a fork is in order.
As WMIT we're already in contact with them and if I remember correctly
they're also Internet Archive friends: https://archive.org/details/imslp
If we're not able to convince them to make public dumps of their wiki,
it means that we need better arguments... or better contacts. Nickanc,
can you help?

Nemo

Luiz Augusto

unread,
Oct 30, 2012, 4:56:34 PM10/30/12
to wikiteam...@googlegroups.com

Since my original message to this list, he said that if receives a 100,000 donation, will can set up the XML dumping. I‘ve sent to him the Wikimedia Grants link and offered help on requesting a grant.

Luiz Augusto

unread,
Oct 30, 2012, 5:13:58 PM10/30/12
to wikiteam...@googlegroups.com

Indeed, specially because firstly he have said that a useful data dump will release private data from his users...

My intention is to remove any possibility of against argumentation from him. If he needs only to get paid for running a command, the transparency from the Grants will stop his attempt. If he really needs additional infraestructure for a stable environment, I will try to help him.

Em 30/10/2012 19:04, "Federico Leva (Nemo)" <nemo...@gmail.com> escreveu:
Luiz Augusto, 30/10/2012 21:56:

Since my original message to this list, he said that if receives a
100,000 donation, will can set up the XML dumping. I‘ve sent to him the
Wikimedia Grants link and offered help on requesting a grant.

This doesn't seem to make any sense...

Nemo

Platonides

unread,
Oct 30, 2012, 5:13:26 PM10/30/12
to wikiteam...@googlegroups.com
On 30/10/12 22:04, Federico Leva (Nemo) wrote:
> Luiz Augusto, 30/10/2012 21:56:
>> Since my original message to this list, he said that if receives a
>> 100,000 donation, will can set up the XML dumping. I�ve sent to him the
>> Wikimedia Grants link and offered help on requesting a grant.
>
> This doesn't seem to make any sense...
>
> Nemo

That's just an excuse. "I ask for an exorbitant amount for doing what
they want". We could setup the dumps for him in less than one hour.
*For free*, so he doesn't need to spend $100,000 on that.


Federico Leva (Nemo)

unread,
Oct 30, 2012, 5:04:02 PM10/30/12
to wikiteam...@googlegroups.com
Luiz Augusto, 30/10/2012 21:56:
> Since my original message to this list, he said that if receives a
> 100,000 donation, will can set up the XML dumping. I�ve sent to him the
> Wikimedia Grants link and offered help on requesting a grant.

Hydriz Wikipedia

unread,
Oct 31, 2012, 4:22:41 AM10/31/12
to wikiteam...@googlegroups.com
Agreed, we should just force our way through to download the wiki. Archiving is important, even if they may try to block us.

On Wed, Oct 31, 2012 at 5:13 AM, Platonides <plato...@gmail.com> wrote:
On 30/10/12 22:04, Federico Leva (Nemo) wrote:
> Luiz Augusto, 30/10/2012 21:56:
>> Since my original message to this list, he said that if receives a
>> 100,000 donation, will can set up the XML dumping. I‘ve sent to him the

>> Wikimedia Grants link and offered help on requesting a grant.
>
> This doesn't seem to make any sense...
>
> Nemo

That's just an excuse. "I ask for an exorbitant amount for doing what
they want". We could setup the dumps for him in less than one hour.
*For free*, so he doesn't need to spend $100,000 on that.





--
Regards,
Hydriz

We've created the greatest collection of shared knowledge in history. Help protect Wikipedia. Donate now: http://donate.wikimedia.org

Luiz Augusto

unread,
Nov 12, 2012, 10:18:08 AM11/12/12
to wikiteam...@googlegroups.com
The last public update on this subject that I was able to found is Feldmahler replying on October 31 "Thanks, I'll take a look at that program. [Wikimedia Grants]".

In the meantime, I've managed to run dumpgenerator.py without getting banned by their scripts.

Unfortunatelly I can't give more details until it gets finished without the risk of getting banned if their intention is to owns contents that he isn't the owner (even this brief note may be sufficient to get my background effort stoped...), but at least something is running very very slowly on this subject...
Reply all
Reply to author
Forward
0 new messages