Running the command
mw2slob dump -b $binsize -c lzma2 -o ~/Downloads/$lang"wiki"-$DAT.slob --siteinfo $lang"wiki".si.json ~/data/tmp/$lang"wiki"-NS0-$DAT-ENTERPRISE-HTML.json.tar.gz -f wiki common
results in the following after 4.3 GB of data:
S Bill Sorvino (34988)
ERROR:mw2slob.core:
Traceback (most recent call last):
File "/home/markus/env-slob/lib/python3.9/site-packages/mw2slob/core.py", line 140, in run
for title, aliases, text, error in resulti:
File "/usr/lib/python3.9/multiprocessing/pool.py", line 448, in <genexpr>
return (item for chunk in result for item in chunk)
File "/usr/lib/python3.9/multiprocessing/pool.py", line 870, in next
raise value
zlib.error: Error -3 while decompressing data: invalid block type
Finished adding content in 1 day, 8:07:46
Finalizing...
Sorting... sorted in 0:04:51
Resolving aliases...
Sorting... sorted in 0:04:58
Resolved aliases in 0:04:58
Finalized in 0:10:34
Traceback (most recent call last):
File "/home/markus/env-slob/bin/mw2slob", line 8, in <module>
sys.exit(main())
File "/home/markus/env-slob/lib/python3.9/site-packages/mw2slob/cli.py", line 394, in main
args.func(args)
File "/home/markus/env-slob/lib/python3.9/site-packages/mw2slob/cli.py", line 109, in cli_dump
run(outname, info, itertools.chain(*scrape_articles, dump_articles), args)
File "/home/markus/env-slob/lib/python3.9/site-packages/mw2slob/cli.py", line 67, in run
core.create_slob(
File "/home/markus/env-slob/lib/python3.9/site-packages/mw2slob/core.py", line 197, in create_slob
run(slb, articles, filters, info.interwikimap, info.namespaces, html_encoding)
File "/home/markus/env-slob/lib/python3.9/site-packages/mw2slob/core.py", line 140, in run
for title, aliases, text, error in resulti:
File "/usr/lib/python3.9/multiprocessing/pool.py", line 448, in <genexpr>
return (item for chunk in result for item in chunk)
File "/usr/lib/python3.9/multiprocessing/pool.py", line 870, in next
raise value
zlib.error: Error -3 while decompressing data: invalid block type
Is there anything wrong with the data file?
I checked hashes and they are identical.
The same command is running fine with other dump files.
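Matching hashes only show that my download equals the published file, not that the gzip stream inside it is valid. A check independent of mw2slob could be to decompress the whole archive with Python's standard gzip module and see whether it fails too. The sketch below is only an illustration: the function name check_gzip_stream and the chunk size are my own choices, and I am assuming the file is a single regular gzip stream.

```python
import gzip
import zlib


def check_gzip_stream(path, chunk_size=1 << 20):
    """Decompress the file at `path` to the end without storing the output.

    Returns (ok, bytes_decompressed, error). `error` is None when the
    whole stream decompressed cleanly.
    """
    total = 0
    try:
        with gzip.open(path, "rb") as fh:
            while True:
                chunk = fh.read(chunk_size)
                if not chunk:
                    break
                total += len(chunk)
    except (OSError, EOFError, zlib.error) as exc:
        # OSError covers gzip.BadGzipFile (bad header or CRC mismatch),
        # EOFError a truncated stream, zlib.error invalid deflate data.
        return False, total, exc
    return True, total, None
```

Calling it with the dump path from the command above returns roughly how many decompressed bytes were read before the failure, if there is one. If it reports a zlib.error as well, the archive itself would be the likely culprit rather than mw2slob.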
Any idea?