I tested cmix_hpv3 by Byron Knoll (released Aug. 5) on my Lenovo
laptop, Core i7-1165G7, 2.80 GHz, 16 GB RAM, SSD, Ubuntu under Windows
10.
enwik9 compressed to 113591263 in 52:21:32 at 94% CPU using 6671 MB
max resident memory and 17.2 GB temp files.
decompression OK in 52:14:37 at 94% CPU using 6693 MB memory and 16.3
GB temp files.
The compressor size (cmix) is 397411 bytes consisting of the compiled
output using clang++12 at 119196 bytes before appending the compressed
dictionary and article reordering data. The output of the compressor
(archive9) is a self extracting archive which includes a copy of
itself with the compressed data appended.
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run2$ /usr/bin/time -v ./cmix -e
~/enwik9 archive9
78127 bytes -> 415377 bytes in 63.29 s.
200076 bytes -> 1131233 bytes in 162.54 s.
Detected block types: TEXT: 100.0%
934188796 bytes -> 113591263 bytes in 178537.92 s.
free(): invalid size
Command terminated by signal 6
Command being timed: "./cmix -e /home/matt/enwik9 archive9"
User time (seconds): 169926.00
System time (seconds): 8839.13
Percent of CPU this job got: 94%
Elapsed (wall clock) time (h:mm:ss or m:ss): 52:21:32
Average shared text size (kbytes): 0
Average unshared data size (kbytes): 0
Average stack size (kbytes): 0
Average total size (kbytes): 0
Maximum resident set size (kbytes): 6671896
Average resident set size (kbytes): 0
Major (requiring I/O) page faults: 10375707
Minor (reclaiming a frame) page faults: 2335609765
Voluntary context switches: 10294077
Involuntary context switches: 2793631
Swaps: 0
File system inputs: 1660324680
File system outputs: 6510469224
Socket messages sent: 0
Socket messages received: 0
Signals delivered: 0
Page size (bytes): 4096
Exit status: 0
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run2$ cd ..
matt@DESKTOP-KSVMRD1:~/cmixhpv3$ mkdir run3
matt@DESKTOP-KSVMRD1:~/cmixhpv3$ cd run3
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ cp ../run2/archive9 .
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ /usr/bin/time -v ./archive9
78127 bytes -> 415377 bytes in 62.98 s.
113591263 bytes -> 934188796 bytes in 177352.75 s.
Command being timed: "./archive9"
User time (seconds): 170047.99
System time (seconds): 7368.66
Percent of CPU this job got: 94%
Elapsed (wall clock) time (h:mm:ss or m:ss): 52:14:37
Average shared text size (kbytes): 0
Average unshared data size (kbytes): 0
Average stack size (kbytes): 0
Average total size (kbytes): 0
Maximum resident set size (kbytes): 6693856
Average resident set size (kbytes): 0
Major (requiring I/O) page faults: 10616738
Minor (reclaiming a frame) page faults: 2335736358
Voluntary context switches: 10535460
Involuntary context switches: 3508856
Swaps: 0
File system inputs: 1736866272
File system outputs:
6513257536
Socket messages sent: 0
Socket messages received: 0
Signals delivered: 0
Page size (bytes): 4096
Exit status: 0
In the file listings below, run2 is the compressor output, run3 is the
decompressor output, run is the original compiled compressor.
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run2$ ll
total 17201408
drwxr-xr-x 2 matt matt 4096 Aug 23 13:55 ./
drwxr-xr-x 10 matt matt 4096 Aug 26 11:43 ../
-rw-r--r-- 1 matt matt 9745 Aug 21 09:37 .coda
-rw-r--r-- 1 matt matt 119196 Aug 21 09:33 .decomp_bin
-rw-r--r-- 1 matt matt 415377 Aug 21 09:34 .dict
-rw-r--r-- 1 matt matt 78127 Aug 21 09:33 .dict.comp
-rw-r--r-- 1 matt matt 1404 Aug 21 09:37 .intro
-rw-r--r-- 1 matt matt 999988851 Aug 21 09:37 .main
-rw-r--r-- 1 matt matt 934177647 Aug 21 09:40 .main_phda9prepr
-rw-r--r-- 1 matt matt 999988851 Aug 21 09:37 .main_reordered
-rw-r--r-- 1 matt matt 1131233 Aug 21 09:37 .new_article_order
-rw-r--r-- 1 matt matt 200076 Aug 21 09:34 .new_article_order.comp
-rw-r--r-- 1 matt matt 934188796 Aug 21 09:40 .ready4cmix
-rwxrwxrwx 1 matt matt 113788598 Aug 23 13:55 archive9*
-rwxr-xr-x 1 matt matt 397411 Aug 21 09:31 cmix*
-rw-r--r-- 1 matt matt 197323 Aug 23 13:55 dec1
-rw-r--r-- 1 matt matt 113788586 Aug 23 13:55 dec2
-rw-r--r-- 1 matt matt 12 Aug 23 13:55 header4archive.dat
-rw-r--r-- 1 matt matt 953981679 Aug 21 09:40 out1
-rw-r--r-- 1 matt matt 953981667 Aug 21 09:39 out10
-rw-r--r-- 1 matt matt 953981673 Aug 21 09:39 out11
-rw-r--r-- 1 matt matt 953981677 Aug 21 09:39 out12
-rw-r--r-- 1 matt matt 953981661 Aug 21 09:38 out2
-rw-r--r-- 1 matt matt 985154324 Aug 21 09:38 out3
-rw-r--r-- 1 matt matt 984215015 Aug 21 09:38 out4
-rw-r--r-- 1 matt matt 980080701 Aug 21 09:38 out5
-rw-r--r-- 1 matt matt 999988944 Aug 21 09:38 out7
-rw-r--r-- 1 matt matt 999988935 Aug 21 09:38 out8
-rw-r--r-- 1 matt matt 953981679 Aug 21 09:39 out9
-rw-r--r-- 1 matt matt 149516 Aug 23 13:54 progress.log
-rw-r--r-- 1 matt matt 12 Aug 21 09:33 test.dat
-rw-r--r-- 1 matt matt 937788523 Aug 21 09:39 tmp1a
-rw-r--r-- 1 matt matt 16193156 Aug 21 09:39 tmp1b
-rw-r--r-- 1 matt matt 929737295 Aug 21 09:38 tmp2a
-rw-r--r-- 1 matt matt 24244366 Aug 21 09:38 tmp2b
-rw-r--r-- 1 matt matt 934179051 Aug 21 09:40 un1
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run2$ cd ../run3
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ ll
total 16390188
drwxr-xr-x 2 matt matt 4096 Aug 28 15:58 ./
drwxr-xr-x 10 matt matt 4096 Aug 26 11:43 ../
-rw-r--r-- 1 matt matt 9745 Aug 28 15:55 .coda_decomp
-rw-r--r-- 1 matt matt 78127 Aug 26 11:44 .dict.comp_decomp
-rw-r--r-- 1 matt matt 415377 Aug 26 11:45 .dict_decomp
-rw-r--r-- 1 matt matt 934188796 Aug 28 15:55 .input_decomp
-rw-r--r-- 1 matt matt 1404 Aug 28 15:55 .intro_decomp
-rw-r--r-- 1 matt matt 934177647 Aug 28 15:55 .main_decomp
-rw-r--r-- 1 matt matt 999988851 Aug 28 15:57 .main_decomp_restored
-rw-r--r-- 1 matt matt 999988851 Aug 28 15:58 .main_decomp_restored_sorted
-rw-r--r-- 1 matt matt 113591263 Aug 26 11:45 .ready4cmix_decomp
-rwxr-xr-x 1 matt matt 113788598 Aug 26 11:44 archive9*
-rw-r--r-- 1 matt matt 1000000000 Aug 28 15:59 enwik9_uncompressed
-rw-r--r-- 1 matt matt 953981661 Aug 28 15:56 out10d
-rw-r--r-- 1 matt matt 953981673 Aug 28 15:56 out11d
-rw-r--r-- 1 matt matt 953981667 Aug 28 15:56 out12d
-rw-r--r-- 1 matt matt 953981677 Aug 28 15:56 out13d
-rw-r--r-- 1 matt matt 999988935 Aug 28 15:57 out15d
-rw-r--r-- 1 matt matt 953981679 Aug 28 15:55 out1d
-rw-r--r-- 1 matt matt 980080701 Aug 28 15:56 out2d
-rw-r--r-- 1 matt matt 999049635 Aug 28 15:56 out3d
-rw-r--r-- 1 matt matt 999988944 Aug 28 15:56 out4d
-rw-r--r-- 1 matt matt 984215035 Aug 28 15:56 out5d
-rw-r--r-- 1 matt matt 953981679 Aug 28 15:55 out6d
-rw-r--r-- 1 matt matt 12 Aug 26 11:44 test.dat
-rw-r--r-- 1 matt matt 999990255 Aug 28 15:58 un1_d
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ du
16390184 .
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ md5sum enwik9_uncompressed
e206c3450ac99950df65bf70ef61a12d enwik9_uncompressed
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ cmp enwik9_uncompressed ~/enwik9
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run3$ cd ../run
matt@DESKTOP-KSVMRD1:~/cmixhpv3/run$ ll
total 920
drwxr-xr-x 2 matt matt 4096 Aug 21 09:30 ./
drwxr-xr-x 10 matt matt 4096 Aug 26 11:43 ../
-rwxr-xr-x 1 matt matt 397411 Aug 21 09:30 cmix*
-rwxr-xr-x 1 matt matt 119196 Aug 21 09:27 cmix_orig*
-rw-r--r-- 1 matt matt 78127 Aug 21 09:28 comp_dict
-rw-r--r-- 1 matt matt 200076 Aug 21 09:30 comp_order
-rw-r--r-- 1 matt matt 12 Aug 21 09:30 header.dat
-rw-r--r-- 1 matt matt 0 Aug 21 09:27 patch
-rw-r--r-- 1 matt matt 122083 Aug 21 09:30 progress.log
On Sun, Jul 11, 2021 at 2:59 AM Marcus Hutter <
marcus...@gmx.net> wrote:
>
> Hi Bryan,
>
> On 7/11/21 12:20 AM, Byron Knoll wrote:
>
> I think I might be able to make a submission within those constraints in a couple weeks.
>
> That'll be great!
>
> Currently I am quite close - I can get about 5% time improvement and still beat starlit by over 1%. With 10% time improvement, cmix-hp currently doesn't beat starlit by 1%.
>
> If you don't manage, then please submit the 5% speed-up next month or so, and we'll consider this.
>
> Cheers,
>
> Marcus
>
>
> On Sat, Jul 10, 2021 at 1:20 PM James Bowery <
jabo...@gmail.com> wrote:
>>
>> Given my system's inability to handle submissions based on STARLIT -- presumably because of a bug in the way LLVM produces binary for the Ryzen 3700 CPU (and a frustrating experience I had trying to report the problem to the LLVM folks) -- I'm going to have to rent time on a server farm or upgrade my linode hosted web server to 16GB ram which will double my monthly fee there. I suppose I could buy a new motherboard and CPU.
>>
>> On Sat, Jul 10, 2021 at 3:06 PM Marcus Hutter <
marcus...@gmx.net> wrote:
>>>
>>> Hi All,
>>>
>>> Given that I will accept Artemiy's submission and Bryon's is only 10% slower than Artemiy's and beats Artemiy's by 1%, I think I should permanently relax the 50'000/T to 70'000/T and (re)pconsider Bryon's submission.
>>>
>>> @James. Could you test the submission?
>>> @Bryon. Any chance to make yours 10% faster to put it clearly under the relaxed time constraint?
>>>
>>> Cheers,
>>>
>>> Marcus
--
-- Matt Mahoney,
mattma...@gmail.com