This is great, Erik.
Thank you for your ideas and for sharing the code. I am actually implementing it slightly differently, since I want to monitor the weekly updates that way.
Will keep you updated.
Markus
To view this discussion on the web visit https://groups.google.com/d/msgid/aarddict/1b00f14d-d810-44c4-b270-7b1763dab73an%40googlegroups.com.
Good. Sounds strange. I have a similar (unresolved) issue with enwiktionary: full scraping and updating does not deliver all articles. With dewiki and enwiki I do get all articles.
The creation of the slob is super fast on your machine. For double the articles (approx. 6.8 million), my enwiki takes about 5 days on a 4-core machine running as a VM.
Let me know how to get your files and I will host them in the library at RWTH Aachen, in the Spanish section. I will give you more details later today, as I am in a hurry now.
Thank you for your update.
To view this discussion on the web visit https://groups.google.com/d/msgid/aarddict/f488737a-c3c6-464d-bff4-6133caaee11an%40googlegroups.com.