Section: New Software and Platforms
Platforms
BigGraphs
Participants : Luc Hogie [Contact] , Nicolas Chleq [SED-SOP] , Michel Syska [Coordinator] , David Coudert, Paul Bertot, Flavian Jacquot, Arnaud Legout [ Diana ] , Fabrice Huet [ Scale ] , Éric Madelaine [ Scale ] .
Functional Description
The objective of BigGraphs is to provide a distributed platform for very large graphs processing. A typical data set for testing purpose is a sample of the Twitter graph with 3 millions of nodes and 200 millions of edges. Last year we started the project with the evaluation of existing middlewares (GraphX/Spark and Giraph/Hadoop). After having tested some useful algorithms (written in the BSP model) we decided to develop our own platform.
This platform is based on the existing BigGrph library and this year we have focused on the quality and the improvement of the code. In particular we have designed strong test suites and some non trivial bugs have been fixed. We also have implemented specific data structures for BSP and support for distributed debugging. This comes along with the implementation of algorithms such as BFS or strongly connected components that are run on the NEF cluster.
This project is a joint work of the three EPI Coati , Diana and Scale and is supported by an ADT grant.
URL: http://www.i3s.unice.fr/~hogie/software/index.php?name=grph
The following software are useful tools that bring basic services to the platform (they are not dedicated to BigGrph ).
-
Jac-a-boo : is a framework aiming at facilitating the deployment of distributed Java scientific applications over clusters and is used to start BigGrph . computers.
-
ldjo : (Live Distributed Java Objects) is a framework for the development and the deployment of Java distributed data structures
-
Octojus : provides an object-oriented RPC (Remote Procedure Call) implementation in Java
-
URL : http://www.i3s.unice.fr/~hogie/ {jacaboo,ldjo,octojus}