I just pushed a first cut of a reference implementation for the new dictionary file format (called slob), check out documentation and source code at https://github.com/itkach/slob. Dictionaries in the new format are quite a bit smaller than current Aard Dictionary format: ...
...
Slob is influenced by both Aard Dictionary format and OpenZIM and can store any content, not just text or html.
...
Next I'll be looking into getting this deployed on Android and hooking it up with a mobile user interface.
Is there also a plan how to download the actual wikipedias?
At the moment, I am downloading the german wikipedia for a zimfile creation for kiwix, through the undocumented mwoffliner.js, but I have to say, that I am quite disappointed.
The download is runnung already two days and takes maybe some days more(!!!), so it is not comparable with the former easy aard-dic-creation from Wiki-Dumps.
I would love to get a offline Wikipedia from the mobile version of wikipedia, but for kiwix this is not yet possible and I doubt, that I will follow this too troublesome way for an offline wiki anyway.
Maybe with your way, this would be possible?
Next I'll be looking into getting this deployed on Android and hooking it up with a mobile user interface.
Sounds senseful to me, but I don't know it (Python for Android).
...Yes, I plan to write a wikipedia downloader that would get rendered HTML (and possibly images, at least some of them) from Wikipedia itself via their web API. Such downloader is going to be necessarily slow, but downloaded documents can be stored in a document database such as CouchDB (which can be easily shared and/or replicated) and updated only when they are really modified, so initial download would take a while, but keeping it up to date would be (hopefully) relatively fast...
Will slob support incremental updates to slob files or is this only planned for the raw data using CouchDB?
I think incremental updates are very important for users if large dictionaries (like wiki + images) are used.
It would also be nice to be able to exchange pictures in a slob. This way a slob with very low resolution thumbnails could be published. The slob-reader could than download higher resolution pictures and integrate them into the slob on demand.
This would also allow you to only publish wikis with placeholders instead of images. These placeholders could then be updated afterwards by the user. I imagine that the images could be stored in an Adam7 interlaced format and the application would stopp the download of each image after a certain resolution has been reached.
The application could then have an option to choose the maximum resolution and download the images. As the downloaded wikis only contain placeholders they would be similar in size to wikis without images.