I've been working on a python module for running reports in Hadoop.
Its sort of a wrapper around the pig data processing language and some
smarts for running reports on a hadoop cluster and pushing and pulling
data to it. It's designed primarily to make it easier and more
efficient to run complex sets of interdependent reports - I've been
using it to do business reporting on our customer behavior at Zattoo.
I'm hoping to get the chance to give a talk about it at the Hadoop
Summit June 10th, and would love the chance to show it off locally
before that.
Marshall Weir
marsha...@gmail.com