Lectures on MapReduce Design Patterns

David G. Boney

Dec 14, 2012, 4:36:06 PM
to Austin-ACM-...@meetup.com, austin...@yahoogroups.com, aust...@googlegroups.com, semantic...@googlegroups.com
Next year, Austin ACM SIGKDD will run a series of weekly lectures on MapReduce design patterns for Hadoop. The lectures will be based on the book "MapReduce Design Patterns" by Donald Miner and Adam Shook. There are 24 patterns in the book, and there will be one lecture per pattern. Each lecture will go through the design pattern and the book's coding examples for it. Dates for specific sessions will be announced in the monthly emails and posted on the calendars at Meetup and Yahoo Groups. Scheduling depends on the availability of the meeting space at Northwest Recreation Center and on my schedule.

Textbooks

MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems by Donald Miner and Adam Shook

Hadoop: The Definitive Guide, 3rd ed., by Tom White

Syllabus

Week  1 - Overview of Hadoop (from Hadoop: The Definitive Guide by Tom White)
Week  2 - Hadoop API, Readers and Writers (from Hadoop: The Definitive Guide by Tom White)
Week  3 - Numerical Summarizations Pattern
Week  4 - Inverted Index Summarizations Pattern
Week  5 - Counting with Counters Pattern
Week  6 - Filtering Pattern
Week  7 - Bloom Filtering Pattern
Week  8 - Top Ten Pattern
Week  9 - Distinct Pattern
Week 10 - Structured to Hierarchical Pattern
Week 11 - Partitioning Pattern
Week 12 - Binning Pattern
Week 13 - Total Order Sorting Pattern
Week 14 - Shuffling Pattern
Week 15 - Reduce Side Join Pattern
Week 16 - Replicated Join Pattern
Week 17 - Composite Join Pattern
Week 18 - Cartesian Product Pattern
Week 19 - Job Chaining Pattern
Week 20 - Chain Folding Pattern
Week 21 - Job Merging Pattern
Week 22 - Custom Input and Output in Hadoop Pattern
Week 23 - Generating Data Pattern
Week 24 - External Source Output Pattern
Week 25 - External Source Input Pattern
Week 26 - Partition Pruning Pattern