Section: New Software and Platforms

New Software

Omnisc'IO

Participants : Matthieu Dorier, Shadi Ibrahim, Gabriel Antoniu.

Contact:

Matthieu Dorier

Presentation:

Omnisc'IO is a middleware integrated into the POSIX and MPI-I/O stacks to observe, model and predict the I/O behavior of any HPC application transparently. It is based on formal grammars and implements a modified version of the Sequitur algorithm. Omnisc'IO has been used on Grid'5000 with the CM1 atmospheric simulation, the LAMMPS molecular dynamics simulation, the GTC fusion simulation and the Nek5000 CFD simulation. Omnisc'IO was the subject of a publication at SC14.
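The idea of predicting the next I/O operation from an observed sequence can be illustrated with a much simpler model than the one Omnisc'IO uses. The sketch below is not Sequitur and is not taken from Omnisc'IO: it is a first-order frequency predictor that only remembers which operation most often followed the current one. The class name, method names and the sample trace are all hypothetical.

```python
from collections import Counter, defaultdict


class NextOpPredictor:
    """Toy first-order predictor (an illustrative stand-in, not Sequitur)."""

    def __init__(self):
        # For each symbol, count the symbols that were observed right after it.
        self._followers = defaultdict(Counter)
        self._last = None

    def observe(self, symbol):
        """Record one I/O operation in the order it was issued."""
        if self._last is not None:
            self._followers[self._last][symbol] += 1
        self._last = symbol

    def predict(self):
        """Return the most frequent follower of the last symbol, or None."""
        followers = self._followers.get(self._last)
        if not followers:
            return None
        return followers.most_common(1)[0][0]


# Hypothetical trace: three identical output phases of a simulation.
trace = ["open", "write", "write", "close"] * 3

predictor = NextOpPredictor()
for op in trace:
    predictor.observe(op)

prediction = predictor.predict()
print(prediction)  # prints "open": the only operation ever seen after "close"
```

A grammar-based model such as Sequitur goes further by capturing nested, repeating structure in the sequence, which a single-symbol context like this one cannot represent.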

Users:

Omnisc'IO is currently used only within the KerData team.

URL:

http://omniscio.gforge.inria.fr/

License:

GNU Lesser General Public License (LGPL) version 3.

Status:

This software is available on Inria's forge. Registration with APP is in progress.

Darshan-Web

Participants : Matthieu Dorier, Thomas Bouguet.

Contact:

Matthieu Dorier

Presentation:

Darshan-Web is a web interface for Darshan-Ruby, based on Ruby on Rails and AJAX technologies. It lets users navigate through many Darshan log files and display graphs on demand, directly in a web browser. A demo of Darshan-Web is available at http://darshan-web.irisa.fr/ , which includes two months of logs from ANL's Intrepid supercomputer. The code of this demo is available and can be installed and used by the community.

Users:

The KerData team is currently seeking potential users, in particular from Argonne National Laboratory, and will steer further development according to their feedback.

URL:

http://darshan-ruby.gforge.inria.fr/

License:

GNU Lesser General Public License (LGPL) version 3.

Status:

Prototype and demo available on demand.

JetStream

Participants : Radu Tudoran, Alexandru Costan, Gabriel Antoniu.

Contact:

Alexandru Costan

Presentation:

JetStream is a middleware solution for batch-based, high-performance streaming across cloud data centers. JetStream implements a set of context-aware strategies for optimizing batch-based streaming, which allow it to self-adapt to changing conditions. Additionally, the system provides multi-route streaming across cloud data centers, aggregating bandwidth by leveraging network parallelism. It enables easy deployment across .NET frameworks and seamless binding with event processing engines such as StreamInsight.
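As a rough illustration of multi-route streaming, the sketch below splits one batch of events across several routes in proportion to each route's measured bandwidth. This is an assumption about how such a split could be done, not JetStream's actual strategy or API; the function name, route names and bandwidth figures are hypothetical.

```python
def split_batch(events, route_bandwidths):
    """Split a batch across routes proportionally to their bandwidth.

    `route_bandwidths` maps a route name to a non-negative bandwidth
    estimate (any unit, as long as it is the same for every route).
    Returns a dict mapping each route name to its contiguous slice of events.
    """
    if any(bw < 0 for bw in route_bandwidths.values()):
        raise ValueError("bandwidth estimates must be non-negative")
    total = sum(route_bandwidths.values())
    if total <= 0:
        raise ValueError("at least one route needs a positive bandwidth")

    shares = {}
    start = 0
    cumulative = 0
    for route, bandwidth in route_bandwidths.items():
        # Cut at the cumulative bandwidth fraction so that rounding errors
        # never drop or duplicate an event.
        cumulative += bandwidth
        end = round(len(events) * cumulative / total)
        shares[route] = events[start:end]
        start = end
    return shares


# Hypothetical batch of 10 events and two routes (bandwidths in MB/s).
events = list(range(10))
shares = split_batch(events, {"direct": 30, "relay": 20})
print({route: len(part) for route, part in shares.items()})
```

A self-adapting system would refresh the bandwidth estimates between batches, so the split follows changing network conditions.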

Users:

JetStream is currently used at Microsoft Research ATLE Munich for the management of the Azure cloud infrastructure.

License:

Microsoft Public License.

Status:

Prototype and demo available.

OverFlow

Participants : Radu Tudoran, Alexandru Costan, Gabriel Antoniu.

Contact:

Alexandru Costan

Presentation:

OverFlow is a uniform data management system for scientific workflows running across geographically distributed sites, aiming to reap economic benefits from this geo-diversity. The software is environment-aware: it monitors and models the global cloud infrastructure, offering high and predictable data-handling performance in terms of transfer cost and time, within and across sites. OverFlow proposes a set of pluggable services, grouped in a data-scientist cloud kit. They allow applications to monitor the underlying infrastructure, to exploit smart data compression, deduplication and geo-replication, to evaluate data management costs, to set a tradeoff between money and time, and to optimize the transfer strategy accordingly.
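One simple way to picture the tradeoff between money and time is to choose, among candidate transfer plans, the fastest one that fits a given budget. The sketch below assumes exactly that selection rule; it does not reproduce OverFlow's cost model or interface, and the plan names, durations and prices are made-up example figures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TransferPlan:
    """A candidate way of moving a dataset between two sites (hypothetical)."""
    name: str
    seconds: float  # estimated transfer time
    dollars: float  # estimated monetary cost


def fastest_within_budget(plans, budget):
    """Return the quickest plan whose cost does not exceed `budget`.

    Returns None when no plan is affordable.
    """
    affordable = [plan for plan in plans if plan.dollars <= budget]
    if not affordable:
        return None
    return min(affordable, key=lambda plan: plan.seconds)


# Made-up estimates for three strategies.
plans = [
    TransferPlan("single-route", seconds=120.0, dollars=0.50),
    TransferPlan("three-routes", seconds=50.0, dollars=1.40),
    TransferPlan("compressed", seconds=80.0, dollars=0.70),
]

choice = fastest_within_budget(plans, budget=1.00)
print(choice.name)  # prints "compressed"
```

The symmetric rule, the cheapest plan that meets a deadline, follows by swapping the roles of the two fields.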

Users:

Currently, OverFlow is used for data transfers by the Microsoft Research ATLE Munich team as well as for synthetic benchmarks at the Politehnica University of Bucharest.

License:

GNU Lesser General Public License (LGPL) version 3.

Status:

Registration of the latest version with APP is in progress.

iHadoop

Participants : Tien Dat Phan, Shadi Ibrahim.

Contact:

Shadi Ibrahim

Presentation:

iHadoop is a Hadoop simulator developed in Java on top of SimGrid to simulate the behavior of Hadoop and therefore accurately predict the performance of Hadoop in normal scenarios and under failures.

Users:

iHadoop is an internal software prototype, initially developed to validate our idea of exploring the behavior of Hadoop under failures. iHadoop has undergone a preliminary evaluation within our group and has shown very high accuracy when predicting the execution time of a MapReduce application. We intend to integrate iHadoop within the SimGrid distribution and make it available to the SimGrid community.

License:

GNU Lesser General Public License (LGPL) version 3.

Status:

Available on Inria's forge. Registration with APP is in progress.