PyCascading brings Python to Cascading. It lets you assemble and run
Cascading flows from Python, and user-defined functions, aggregators,
and filters are also written natively in Python. The Python
interpreter is Jython, so all the libraries available to both Python
and Java are reusable for putting together Cascading jobs quickly.
The usual word counting example would look like this:
for word in tuple.get('line').split():
input | split_lines | GroupBy('word') | Count() | output
Decorators are used to decorate functions to be used in flows, pipes
are chained together with the '|' operator, and merged with '&' in
PyCascading is open source, and hosted at https://github.com/twitter/pycascading
. Please see the examples folder for usage patterns, and submit any
questions or reports to the cascading-user list.