EN FR
EN FR


Section: Application Domains

Bioinformatics

Large-scale data management is certainly one of the most important applications of distributed systems of the future. Bioinformatics if a field producing such kinds of applications. For example, DNA sequencing applications make use of MapReduce skeletons.

The MapReduce programming model publicized by Google is a widely used model for deploying application services on platforms for data processing on a large scale (Grids and Clouds). MapReduce implementations extremely are scalable and efficient when located on the datacenters of big companies. However, technological (use of a parallel file system) and algorithmic (hard-coded replication) choices of an implementation such as Hadoop compromise performance on platforms like grids of heterogeneous clusters. Applying it to large scale and volatile environments such as desktop grids still remain a challenge.