Skip to content

The captivating AROM of distributed processing

Last month we have had the opportunity to present AROM at the 4th IEEE International Conference on Cloud Computing Technology and Science (CLOUDCOM) in Taipei, the beautiful capital of Taiwan. For the people who are just getting on train, I will quickly here recall what AROM is about, where one can find it and how its taste is.

AROM is a distributed processing framework which aims at providing a playground for research on distributed processing. Its main objective is to provide a convenient environment for quickly implementing and prototyping on distributed algorithms and jobs. This is achieved first by the DataFlow Graph processing model, where jobs are expressed as Directed Acyclic Graphs expresing operations and the dependencies of the data.  By remaining simple and general, this processing model allows for greater flexibility and optimization. Furthermore, the fact that it is written entirely using Scala and the Akka Actors framework ensures for scalability, and the reusing of paradigms borrowed from the functional programming world (such as higher order, anonymous constructs, …)  enforces reusability and genericity of the job implementations.

Oh, and did I tell you that it is fully released in open source?

References

– N.-L. TRAN, S. SKHIRI, A. LESUISSE, E. ZIMANYI, AROM: Processing Big Data with DataFlow Graphs and Functional Programming. Proceedings of the International Workshop on Cloud Computing for Research Collaborations. 4th IEEE International Conference on Cloud Computing Technology and Science (CLOUDCOM): http://code.ulb.ac.be/dbfiles/TraSkhZimLes2012incollection.pdf

–  AROM distributed processing: http://arom-processing.org/

– AROM on GitHub: https://github.com/nltran/arom


 

Releated Posts

Calibrate to Interpret

Trustworthy machine learning is driving a large number of the ML community works in order to improve ML acceptance and adoption. In this paper, we show a first link between uncertainty and explainability, by studying the relation between calibration and interpretation.
Read More

Mass Estimation of Planck Galaxy Clusters using Deep Learning

Galaxy cluster masses can be inferred indirectly using measurements from X-ray band, Sunyaev-Zeldovich (SZ) effect signal or optical observations. Unfortunately, all of them are affected by some bias. Alternatively, we provide an independent estimation of the cluster masses from the Planck PSZ2 catalogue of galaxy clusters using a machine-learning method.
Read More