On Apr 10, 10:29 am,
LawCouns...@aol.com wrote:
> On Monday, 9 April 2012 17:03:04 UTC+1, biject wrote:
> > On Apr 9, 5:19 am, James Dow Allen <
jdallen2...@yahoo.com> wrote:
> > > On Apr 9, 12:58 pm,
lawcouns...@gmail.com wrote:
>
> > > > a source with probability of producing an 'a' symbol 25% of time
> > > > a 'b' symbol 25% of times a 'c' symbol 50% of times
>
> > > Allow me to recommend the optimal Huffman code:
> > > c - 0
> > > a - 10
> > > b - 11
> > > This can be improved, though only slightly, using details
> > > you've omitted from your summary.
>
> > > This was so trivial, I'll discount it down to, say $950.
>
> > > If this is unsatisfactory, I'll withdraw from the contest.
> > > Even paid at minimum wage I'm afraid it would take significant
> > > funds (payable in advance, please!) just to elicit a
> > > proper problem statement from you.
>
> > > I don't have PayPal. Contact me for instructions on how
> > > to pay the $950. :-)
>
> > > James
>
> > Lets see c is .5 * 1 = .5 b = .25*2 = .5 c = .25*2 = .5
> > see thats .5 + .5 + .5 = 1.5 for the average sequence while
> > if you encode each with 1.5849625 you save about .0849625 which
> > is more than the .08 It appears your in the money. I have a
> > hunch that there still is something missing in which case I would
> > not count on the money yet.
>
> > First of all does he want at least .08 bits saved in every case
> > or just the average case. If its the average case you could be
> > on the right track. If its every case then since you write only
> > whole numbers of bits the .08 savings gets a little harder. It
> > would be nice if the guy decides you haven't won just what does
> > he want. I have read it several times and yet I do not think its
> > clear enough to tackle without him saying oh I meant this and not
> > that.
>
> > Assuming he doesn't declare you the winner
> > 1) is the savings an average things or does each file have to be less.
> > 2) how do you measure the savings is it .08 from a 1.5849625 per
> > symbol
> > or is it .08 less then 1.5
> > 3) not sure why you say source C = .5 while A and B = .25 the
> > fact is even if the source is A = B = C = 1/3 for short files
> > if you run the sources enough times and created a 100 files each
> > you still could get the same set of 100 files for both cases.
> > So you test set up is not valid. There is nothing magical about
> > your source. Except if I know its a fixed IID souce from say 2 or
> > 3 different models as you create more files. You can with increasing
> > probability determine which one it most likely is. But you can't be
> > 100% certain which one it is unless you do an ever increasing number
> > of file.
> THE COMPLETE SPECIFICATIONS :
> =============================
>
> 1. generates a number eg 1,000 of such sequences ( each sequence composed of ternary symbols 'a' 'b' 'c' , when # of 'c' = # of 'b' + 2 Then sequence ENDS & next sequences begins ) using a source producing symbol 'a' 25% of times symbol 'b' 25% of times symbol 'c' 50% of times ) .... call the total # of symbols in these 1,000 sequences N . NOTE : among these eg 1,000 sequences the # of 'a' is invariable near = the # of 'b' & the # of 'c' is invariable near = 2 * the # of 'b' THUS the probability model here is 25% : 25% : 50%
>
> 2. compresses these eg 1,000 generated sequences using your .exe , & must decode back to the same 1,000 sequences
>
> 3. IF you compressed file bitslength =< 1.5 * N - ( 0.08 * N ) THEN YOU WIN THE REWARDS ! ie if your .exe saves 'on average' 0.08 bit each sequences you WON ( needs not be invariable every time on every conceivable file ! ) , but note the original # of sequences is here taken to be of bitslength N * 1.5 bits long ( as originally 'explicit' stated to be 1.5 * N bits long , NOT 1.5849625 * N bits long )
>
> 4. there is no restrictions on memory storage requirements , you may even show your .exe works on 'research network supercomputer cluster' , BUT processing must complete within a day- Hide quoted text -
>
> - Show quoted text -
=============================================================
UPDATE ANNOUNCEMENT :
US$3M DATA COMPRESSION PRIZE :
=============================================================
AM NOW MADE AWARE OF COUPLE OF SOLUTIONS PUT FORTH CERTAIN
TO EASILY FAR EXCEED THE REQUIRED 0.08 BIT COMPRESSION SAVINGS PER
SEQUENCE NEEDED ....
WILL NOW NEEDS TAKE SOME TIME TEST DEVELOP CONFIRM THE SOLUTION
KEEP YOUR SOLUTIONS COMING .....
.
Warm Regards,
LawCounsels