Hi, all. I noticed a paper called "Memory-Efficient GroupBy-Aggregate using Compressed Buffer Trees" in SoCC 2013 accepted list yesterday. Check out
http://www.socc2013.org/papers.
It argues another GroupBy-Aggregate approach which is different with MapReduce's Sort and Spark's hash-based aggregatation.
It seems a new way but I cannot understand the pros and cons of this approach.
Any comments and discussion are welcome.