<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <id>http://groups.google.com/group/Hutter-Prize</id>
  <title type="text">Hutter Prize Google Group</title>
  <subtitle type="text">
  Discussion of The Hutter Prize for lossless compression of human knowledge. This prize rewards increasing the quality of representation of human knowledge; currently a sizeable sample of Wikipedia. The prize fund, initially 50,000 Euros, was originally endowed by Marcus Hutter. -- Jim Bowery
  </subtitle>
  <link href="/group/Hutter-Prize/feed/atom_v1_0_msgs.xml" rel="self" title="Hutter Prize feed"/>
  <updated>2008-09-19T14:20:21Z</updated>
  <generator uri="http://groups.google.com" version="1.99">Google Groups</generator>
  <entry>
  <author>
  <name>Matt Mahoney</name>
  <email>matmaho...@yahoo.com</email>
  </author>
  <updated>2008-09-19T14:20:21Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c17cb0f6c15ce393/6b44dca690803871?show_docid=6b44dca690803871</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c17cb0f6c15ce393/6b44dca690803871?show_docid=6b44dca690803871"/>
  <title type="text">Re: [Hutter Prize] Stochastic Context Free Grammar</title>
  <summary type="html" xml:space="preserve">
  I&#39;m not aware of PCFG being applied to text compression, although in theory it might work. The top compressors like paq8hp12 and durilca4linux model language mostly at the level of n-grams and a crude semantic/part-of-speech model in which related words are grouped in a predefined dictionary and whole groups are used as context. I believe that durilca4linux used a clustering algorithm to find related words. However, its top position in the LTCB is probably because it uses twice as much memory. The model itself is more primitive because it uses PPM rather than context mixing, which restricts the semantic model to a 1-dimensional space.
  </summary>
  </entry>
  <entry>
  <author>
  <name>nadam</name>
  <email>na...@freemail.hu</email>
  </author>
  <updated>2008-09-19T12:21:24Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c17cb0f6c15ce393/76062c514f80fa76?show_docid=76062c514f80fa76</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c17cb0f6c15ce393/76062c514f80fa76?show_docid=76062c514f80fa76"/>
  <title type="text">Stochastic Context Free Grammar</title>
  <summary type="html" xml:space="preserve">
  Hi! &lt;br&gt; &lt;p&gt;I&#39;ve only found the Hutter prize some days ago...:) &lt;br&gt; I am wondering: did anyone try an approach of creating an algorithm, &lt;br&gt; which automatically learns a Stochastic Context Free Grammar ( &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://en.wikipedia.org/wiki/PCFG&quot;&gt;[link]&lt;/a&gt; ) based on the data? &lt;br&gt; &lt;p&gt;Creating such an algorithm seems to be non-trivial, so before working
  </summary>
  </entry>
  <entry>
  <author>
  <name>James A. Bowery</name>
  <email>jabow...@gmail.com</email>
  </author>
  <updated>2008-09-11T05:54:06Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/4f6181ff8b42ee9a/3d1f10faad32c8cb?show_docid=3d1f10faad32c8cb</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/4f6181ff8b42ee9a/3d1f10faad32c8cb?show_docid=3d1f10faad32c8cb"/>
  <title type="text">Ocarina Announces $1 Million In Compression Prizes</title>
  <summary type="html" xml:space="preserve">
  OCARINA UNVEILS $1 MILLION PRIZE FUND TO ADVANCE THE STATE OF &lt;br&gt; COMPRESSION RESEARCH FOR DATA STORAGE &lt;br&gt; &lt;p&gt;Prize Announcement Follows Compression Summit Featuring Leading &lt;br&gt; Industry Researchers &lt;br&gt; &lt;p&gt;September 10, 2008 – San Jose, Calif. – Ocarina Networks, provider of &lt;br&gt; the industry’s first online storage optimization solution, today
  </summary>
  </entry>
  <entry>
  <author>
  <name>Dmitry Shkarin</name>
  <email>dmitry.shka...@mtu-net.ru</email>
  </author>
  <updated>2007-10-13T15:19:23Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c6f47b6548a2857a/66daefa56cbb8f48?show_docid=66daefa56cbb8f48</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c6f47b6548a2857a/66daefa56cbb8f48?show_docid=66daefa56cbb8f48"/>
  <title type="text">Re: The XML content</title>
  <summary type="html" xml:space="preserve">
  Hi! &lt;br&gt; It is incorrect. You can try next XML tricks: &lt;br&gt; 1) replacing &amp;lt;a&amp;gt;...&amp;lt;b&amp;gt;...&amp;lt;/b&amp;gt;...&amp;lt;/a&amp;gt; with &lt;br&gt; &amp;lt;a&amp;gt;...&amp;lt;b&amp;gt;...SpecSymbol...SpecS ymbol by stack machine; &lt;br&gt; 2) moving contents of &amp;lt;timestamp&amp;gt;, &amp;lt;id&amp;gt;, &amp;lt;revision&amp;gt;, &amp;lt;page&amp;gt; to the &lt;br&gt; separate stream; &lt;br&gt; 3) moving contents of &amp;lt;text&amp;gt;, &amp;lt;title&amp;gt;, &amp;lt;comment&amp;gt; to the separate &lt;br&gt; stream;
  </summary>
  </entry>
  <entry>
  <author>
  <name>Duffy</name>
  <email>du...@quinda.com</email>
  </author>
  <updated>2007-10-11T17:34:14Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c6f47b6548a2857a/d2b19cf32c7a7812?show_docid=d2b19cf32c7a7812</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c6f47b6548a2857a/d2b19cf32c7a7812?show_docid=d2b19cf32c7a7812"/>
  <title type="text">The XML content</title>
  <summary type="html" xml:space="preserve">
  I&#39;d like to chime in with some interesting empirical results from my &lt;br&gt; attempts to compete for this prize. In due time I&#39;ll reveal all my &lt;br&gt; successful techniques, but I wanted to contribute to the discussion &lt;br&gt; here with a fascinating and revealing unsuccessful technique. &lt;br&gt; Throughout the history of the contest, there has been a latent
  </summary>
  </entry>
  <entry>
  <author>
  <name>Matt Mahoney</name>
  <email>matmaho...@yahoo.com</email>
  </author>
  <updated>2007-09-02T10:06:33Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/b90deb50ca8ffbf2/b717092ca5900857?show_docid=b717092ca5900857</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/b90deb50ca8ffbf2/b717092ca5900857?show_docid=b717092ca5900857"/>
  <title type="text">Re: [Hutter Prize] Re: Verified HKCC-2</title>
  <summary type="html" xml:space="preserve">
  I verified the Sept. 1, 2007 update of HKCC-2. &lt;br&gt; 16,234,075 compressed_enwik8.dat &lt;br&gt; 14,420 decompress.exe &lt;br&gt; 16,248,495 bytes &lt;br&gt; Decompression time was 11847 sec. (Athlon-64 3500+, 2 GB, WinXP) using 880 MB &lt;br&gt; memory slowly climbing to about 940 MB. Output matched enwik8. &lt;br&gt; decompress3.exe does not appear to be updated for the new compressed file, but
  </summary>
  </entry>
  <entry>
  <author>
  <name>James Bowery</name>
  <email>jabow...@gmail.com</email>
  </author>
  <updated>2007-09-01T20:03:48Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/b90deb50ca8ffbf2/4f543d2f428426a3?show_docid=4f543d2f428426a3</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/b90deb50ca8ffbf2/4f543d2f428426a3?show_docid=4f543d2f428426a3"/>
  <title type="text">Re: [Hutter Prize] Verified HKCC-2</title>
  <summary type="html" xml:space="preserve">
  I verified decompress3.exe on a 1.7GHz Athlon, running in 5 hr 45 min.
  </summary>
  </entry>
  <entry>
  <author>
  <name>Matt Mahoney</name>
  <email>matmaho...@yahoo.com</email>
  </author>
  <updated>2007-08-31T14:55:28Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/b90deb50ca8ffbf2/5ffdd488e16cc0be?show_docid=5ffdd488e16cc0be</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/b90deb50ca8ffbf2/5ffdd488e16cc0be?show_docid=5ffdd488e16cc0be"/>
  <title type="text">Verified HKCC-2</title>
  <summary type="html" xml:space="preserve">
  I have verified Alexander Ratushnyak&#39;s entry as of Aug. 29 2007. &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://binet.com.ua/~artest/HKCC-2/&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; The program is a decompressor and a file that decompresses to enwik8. &lt;br&gt; I used the command &lt;br&gt; decompress compressed_enwik8.dat enwik8 &lt;br&gt; On an AMD Athlon-64 3500+, 2 GB memory, Windows XP SP2 (32 bit) the &lt;br&gt; program output enwik8 (verified identical) in 11802 seconds process
  </summary>
  </entry>
  <entry>
  <author>
  <name>Matt Mahoney</name>
  <email>matmaho...@yahoo.com</email>
  </author>
  <updated>2007-07-24T18:06:59Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c0a0a4388b1c791f/a6cf275aee040688?show_docid=a6cf275aee040688</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c0a0a4388b1c791f/a6cf275aee040688?show_docid=a6cf275aee040688"/>
  <title type="text">Re: Alexander Ratushnyak Wins Second Hutter Prize Payout</title>
  <summary type="html" xml:space="preserve">
  I should have done this earlier, but paq8hp12_any source code is now &lt;br&gt; posted to &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://cs.fit.edu/~mmahoney/compression/text.html#1323&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; I also posted a &amp;quot;lite&amp;quot; version of paq, lpaq1, that might be useful for &lt;br&gt; others entering the contest. I wanted an open source context mixing &lt;br&gt; compressor that&#39;s small and fast enough to build on. It is 35 times
  </summary>
  </entry>
  <entry>
  <author>
  <name>James A. Bowery</name>
  <email>jabow...@gmail.com</email>
  </author>
  <updated>2007-07-09T22:59:53Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c0a0a4388b1c791f/4b929e003f95cf2a?show_docid=4b929e003f95cf2a</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/c0a0a4388b1c791f/4b929e003f95cf2a?show_docid=4b929e003f95cf2a"/>
  <title type="text">Alexander Ratushnyak Wins Second Hutter Prize Payout</title>
  <summary type="html" xml:space="preserve">
  Alexander Ratushnyak &amp;lt;&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://prize.hutter1.net/ratushnyak.jpg&quot;&gt;[link]&lt;/a&gt;&amp;gt; is the &lt;br&gt; second winner of The Hutter Prize for Lossless Compression of Human &lt;br&gt; Knowledge &amp;lt;&lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://prize.hutter1.net/&quot;&gt;[link]&lt;/a&gt;&amp;gt;. The Hutter Prize was &lt;br&gt; established and initially underwritten by Marcus Hutter &amp;lt;http:// &lt;br&gt; en.wikipedia.org/wiki/Marcus_H utter&amp;gt; based on the equivalence between
  </summary>
  </entry>
  <entry>
  <author>
  <name>Matt Mahoney</name>
  <email>matmaho...@yahoo.com</email>
  </author>
  <updated>2007-05-16T22:12:25Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/a5a82e99510af570?show_docid=a5a82e99510af570</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/a5a82e99510af570?show_docid=a5a82e99510af570"/>
  <title type="text">Re: paq8hp11</title>
  <summary type="html" xml:space="preserve">
  On May 14 I confirmed compression and decompression for paq8hp12 -7 &lt;br&gt; enwik8. &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://cs.fit.edu/~mmahoney/compression/text.html#1330&quot;&gt;[link]&lt;/a&gt; &lt;br&gt; 16,381,959 enwik8.paq8hp12 &lt;br&gt; 99,696 PAQ8HP12.EXE &lt;br&gt; 16,481,655 total size &lt;br&gt; Timer 3.01 process time was 13082 sec. to compress, 13148 sec. to &lt;br&gt; decompress &lt;br&gt; on a 2.2 GHz Athlon 64 3500+ with 2 GB memory. Windows Task Manager
  </summary>
  </entry>
  <entry>
  <author>
  <name>Attila Olah</name>
  <email>jola...@gmail.com</email>
  </author>
  <updated>2007-05-10T12:07:07Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/bdd2409b25bf469e?show_docid=bdd2409b25bf469e</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/bdd2409b25bf469e?show_docid=bdd2409b25bf469e"/>
  <title type="text">Re: [Hutter Prize] Re: paq8hp11</title>
  <summary type="html" xml:space="preserve">
  No, not yet. I was a bit busy, but now that paq8hp11any is out, i can &lt;br&gt; compress it with 512 Mb. Thanks anyway.
  </summary>
  </entry>
  <entry>
  <author>
  <name>Matt Mahoney</name>
  <email>matmaho...@yahoo.com</email>
  </author>
  <updated>2007-05-04T17:30:34Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/c193ef0a17f3b5b0?show_docid=c193ef0a17f3b5b0</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/c193ef0a17f3b5b0?show_docid=c193ef0a17f3b5b0"/>
  <title type="text">Re: paq8hp11</title>
  <summary type="html" xml:space="preserve">
  Oh, sorry, I deleted it after the test. Do you already have the &lt;br&gt; compressed file? It should be 16,459,515 bytes. &lt;br&gt; -- Matt Mahoney
  </summary>
  </entry>
  <entry>
  <author>
  <name>attila</name>
  <email>jola...@gmail.com</email>
  </author>
  <updated>2007-05-04T13:50:10Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/b6c136f8918af272?show_docid=b6c136f8918af272</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/b6c136f8918af272?show_docid=b6c136f8918af272"/>
  <title type="text">Re: paq8hp11</title>
  <summary type="html" xml:space="preserve">
  Could someone please send me the md5 (or other hash) of the compressed &lt;br&gt; archive? I don&#39;t have enough memory, but I&#39;ll try to do it with &lt;br&gt; virtual memory, probably under wine environment, and I want to be sure &lt;br&gt; I have the right file before trying to uncompress (it will be a time &lt;br&gt; consuming process). I hope I&#39;m not asking too much...
  </summary>
  </entry>
  <entry>
  <author>
  <name>James Bowery</name>
  <email>jabow...@gmail.com</email>
  </author>
  <updated>2007-05-01T22:34:01Z</updated>
  <id>http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/9b844abb2020c75a?show_docid=9b844abb2020c75a</id>
  <link href="http://groups.google.com/group/Hutter-Prize/browse_thread/thread/2c44efd134083b26/9b844abb2020c75a?show_docid=9b844abb2020c75a"/>
  <title type="text">Re: [Hutter Prize] paq8hp11</title>
  <summary type="html" xml:space="preserve">
  My results were the same for an AMD Athlon Xp 2100+ 1.73 GHz with 1.25 &lt;br&gt; GB of RAM except it took compression and decompression took almost &lt;br&gt; exactly 6 hours.
  </summary>
  </entry>
</feed>
