Dear BeScala members,
I am Tran Nam-Luc and I work for the R&D department of Euranova (
http://www.euranova.eu), a Belgian company focused on innovation in IT.
In the scope of one of our research project we have recently developped AROM, a distributed processing framework which main intention is to provide a playground for research and developing on distributed processing.
Indeed AROM is based on the general Data Flow Graph processing model which we believe is more general than the MapReduce model proposed by Google. This not only allows to design distributed algorithms in a more natural way but also leaves more opening for optimizations. Examples of such algorithms comprise relational operations, iterations and, especially, jobs with pipelined topologies.
Based on the DFG processing model, AROM jobs are defined as Directed Ayclic Graphs (DAG) where each vertex is a program processing data, also called operator, and edges represent communication channels between vertices. We provide a comprehensive API for the definition of the job graph and the connections between the operators.
AROM is written in Scala. Using functional programming paradigms to express the jobs permits us to enforce generic and reusable operators (this is also the point-of-view of many projects involved in data processing, like
Scalding for example). At its basis it uses the
Akka actors framework. This architecture potentially allows for deployment on very large clusters and on Cloud infrastructures.
Very recently, we have released AROM as an Open Source project. We are interested in exposing the ideas and concepts of AROM to a community of Scala enthusiasts and in developing a community around the project. Would you be interested in a presentation and a talk around AROM?
Looking forward to hearing from you,
Nam-Luc Tran <namluc DOT tran AT euranova DOT eu>