<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN" "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8"/>
    <title>Project-Team:ZENITH</title>
    <link rel="stylesheet" href="../static/css/raweb.css" type="text/css"/>
    <meta name="description" content="Research Program - Distributed Data Management"/>
    <meta name="dc.title" content="Research Program - Distributed Data Management"/>
    <meta name="dc.subject" content=""/>
    <meta name="dc.publisher" content="INRIA"/>
    <meta name="dc.date" content="(SCHEME=ISO8601) 2019-01"/>
    <meta name="dc.type" content="Report"/>
    <meta name="dc.language" content="(SCHEME=ISO639-1) en"/>
    <meta name="projet" content="ZENITH"/>
    <script type="text/javascript" src="https://cdn.mathjax.org/mathjax/latest/MathJax.js?config=TeX-MML-AM_CHTML">
      <!-- MathJax -->
    </script>
    <script type="text/javascript" src="../static/js/piwik.js">
      <!-- Piwik JS -->
    </script>
    <noscript>
      <p>
        <img src="https://piwik.inria.fr/matomo.php?idsite=49&amp;rec=1" style="border:0;" alt=""/>
      </p>
      <!-- Piwik Img -->
    </noscript>
  </head>
  <body>
    <div class="tdmdiv">
      <div class="logo">
        <a href="http://www.inria.fr">
          <img style="align:bottom; border:none" src="../static/img/icons/logo_INRIA-coul.jpg" alt="Inria"/>
        </a>
      </div>
      <div class="TdmEntry">
        <div class="tdmentete">
          <a href="uid0.html">Project-Team Zenith</a>
        </div>
        <span>
          <a href="uid1.html">Team, Visitors, External Collaborators</a>
        </span>
      </div>
      <div class="TdmEntry">
        <a href="./uid3.html">Overall Objectives</a>
      </div>
      <div class="TdmEntry">Research Program<ul><li class="tdmActPage"><a href="uid5.html&#10;&#9;&#9;  ">Distributed Data Management</a></li><li><a href="uid6.html&#10;&#9;&#9;  ">Big Data</a></li><li><a href="uid10.html&#10;&#9;&#9;  ">Data Integration</a></li><li><a href="uid11.html&#10;&#9;&#9;  ">Data Analytics</a></li><li><a href="uid18.html&#10;&#9;&#9;  ">High dimensional data processing and search</a></li></ul></div>
      <div class="TdmEntry">Application Domains<ul><li><a href="uid23.html&#10;&#9;&#9;  ">Data-intensive Scientific Applications</a></li></ul></div>
      <div class="TdmEntry">
        <a href="./uid30.html">Highlights of the Year</a>
      </div>
      <div class="TdmEntry">New Software and Platforms<ul><li><a href="uid37.html&#10;&#9;&#9;  ">Pl@ntNet</a></li><li><a href="uid41.html&#10;&#9;&#9;  ">ThePlantGame</a></li><li><a href="uid45.html&#10;&#9;&#9;  ">Chiaroscuro</a></li><li><a href="uid49.html&#10;&#9;&#9;  ">DfAnalyzer</a></li><li><a href="uid55.html&#10;&#9;&#9;  ">CloudMdsQL Compiler</a></li><li><a href="uid59.html&#10;&#9;&#9;  ">Savime</a></li><li><a href="uid64.html&#10;&#9;&#9;  ">OpenAlea</a></li><li><a href="uid69.html&#10;&#9;&#9;  ">Triton Server</a></li><li><a href="uid73.html&#10;&#9;&#9;  ">museval</a></li><li><a href="uid77.html&#10;&#9;&#9;  ">Imitates</a></li><li><a href="uid81.html&#10;&#9;&#9;  ">VersionClimber</a></li><li><a href="uid87.html&#10;&#9;&#9;  ">UMX</a></li></ul></div>
      <div class="TdmEntry">New Results<ul><li><a href="uid92.html&#10;&#9;&#9;  ">Scientific Workflows</a></li><li><a href="uid96.html&#10;&#9;&#9;  ">Query Processing</a></li><li><a href="uid99.html&#10;&#9;&#9;  ">Data Analytics</a></li><li><a href="uid104.html&#10;&#9;&#9;  ">Machine Learning for Biodiversity Informatics</a></li><li><a href="uid112.html&#10;&#9;&#9;  ">Machine Learning for Audio Heritage Data</a></li></ul></div>
      <div class="TdmEntry">Bilateral Contracts and Grants with Industry<ul><li><a href="uid120.html&#10;&#9;&#9;  ">SAFRAN (2018-2019)</a></li><li><a href="uid121.html&#10;&#9;&#9;  ">INA (2019-2022)</a></li></ul></div>
      <div class="TdmEntry">Partnerships and Cooperations<ul><li><a href="uid123.html&#10;&#9;&#9;  ">National Initiatives</a></li><li><a href="uid130.html&#10;&#9;&#9;  ">European Initiatives</a></li><li><a href="uid134.html&#10;&#9;&#9;  ">International Initiatives</a></li></ul></div>
      <div class="TdmEntry">Dissemination<ul><li><a href="uid171.html&#10;&#9;&#9;  ">Promoting Scientific Activities</a></li><li><a href="uid249.html&#10;&#9;&#9;  ">Teaching - Supervision - Juries</a></li><li><a href="uid282.html&#10;&#9;&#9;  ">Popularization</a></li></ul></div>
      <div class="TdmEntry">
        <div>Bibliography</div>
      </div>
      <div class="TdmEntry">
        <ul>
          <li>
            <a id="tdmbibentyear" href="bibliography.html">Publications of the year</a>
          </li>
        </ul>
      </div>
    </div>
    <div id="main">
      <div class="mainentete">
        <div id="head_agauche">
          <small><a href="http://www.inria.fr">
	    
	    Inria
	  </a> | <a href="../index.html">
	    
	    Raweb 
	    2019</a> | <a href="http://www.inria.fr/en/teams/zenith">Presentation of the Project-Team ZENITH</a> | <a href="https://team.inria.fr/zenith/">ZENITH Web Site
	  </a></small>
        </div>
        <div id="head_adroite">
          <table class="qrcode">
            <tr>
              <td>
                <a href="zenith.xml">
                  <img style="align:bottom; border:none" alt="XML" src="../static/img/icons/xml_motif.png"/>
                </a>
              </td>
              <td>
                <a href="zenith.pdf">
                  <img style="align:bottom; border:none" alt="PDF" src="IMG/qrcode-zenith-pdf.png"/>
                </a>
              </td>
              <td>
                <a href="../zenith/zenith.epub">
                  <img style="align:bottom; border:none" alt="e-pub" src="IMG/qrcode-zenith-epub.png"/>
                </a>
              </td>
            </tr>
            <tr>
              <td/>
              <td>PDF
</td>
              <td>e-Pub
</td>
            </tr>
          </table>
        </div>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid3.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid6.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
      <div id="textepage">
        <!--DEBUT2 du corps du module-->
        <h2>Section: 
      Research Program</h2>
        <h3 class="titre3">Distributed Data Management</h3>
        <p>Data management is concerned with the storage,
organization, retrieval and manipulation of data of all kinds, from small and simple
to very large and complex. It has become a major domain of computer science, with a large
international research community and a strong industry. Continuous technology transfer
from research to industry has led to the development of powerful DBMS, now at the heart
of any information system, and of advanced data management capabilities in many kinds of
software products (search engines, application servers, document systems, etc.).</p>
        <p>To deal with the massive scale of scientific data, we exploit
large-scale distributed systems,
with the objective of making distribution transparent to the users and applications. Thus,
we capitalize on the principles of large-scale
distributed systems such as clusters, peer-to-peer (P2P) and cloud.</p>
        <p>Data management in distributed systems has traditionally been achieved by distributed
database systems which enable users to transparently access and update
several databases in a network using a high-level query language (e.g. SQL).
Transparency is achieved through a global schema which hides the local databases'
heterogeneity. In its simplest form, a distributed database system supports a global schema and implements distributed database techniques
(query processing, transaction management, consistency management, etc.). This approach
has proved to be effective for applications that can benefit from centralized control and
full-fledged database capabilities, e.g. information systems. However, it cannot scale up
to more than tens of databases.</p>
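        <p>The role of a global schema can be illustrated with a minimal Python sketch, given here only as an assumption-laden example and not as the design of any particular system. Two in-memory SQLite databases stand in for autonomous local databases with heterogeneous schemas (different table names, column names and units). The global relation <code>observation(id, species, mass_kg)</code>, the mappings and the helper <code>query_global</code> are all hypothetical names introduced for illustration.</p>

```python
import sqlite3

# Two autonomous local databases with heterogeneous schemas (hypothetical data).
site_a = sqlite3.connect(":memory:")
site_a.execute("CREATE TABLE samples (id INTEGER, species TEXT, mass_g REAL)")
site_a.executemany("INSERT INTO samples VALUES (?, ?, ?)",
                   [(1, "oak", 1200.0), (2, "pine", 800.0)])

site_b = sqlite3.connect(":memory:")
site_b.execute("CREATE TABLE specimen (ref INTEGER, taxon TEXT, mass_kg REAL)")
site_b.executemany("INSERT INTO specimen VALUES (?, ?, ?)",
                   [(10, "oak", 2.5), (11, "birch", 0.4)])

# Global schema (assumed): observation(id, species, mass_kg).
# Each mapping expresses the global relation in terms of one local schema,
# hiding differences in naming and units.
MAPPINGS = [
    (site_a, "SELECT id, species, mass_g / 1000.0 AS mass_kg FROM samples"),
    (site_b, "SELECT ref AS id, taxon AS species, mass_kg FROM specimen"),
]

def query_global(where_sql="1 = 1", params=()):
    """Evaluate a selection on the global relation and union the local answers.

    where_sql is trusted text in this sketch; a real system would parse and
    rewrite the query instead of concatenating strings.
    """
    rows = []
    for conn, local_view in MAPPINGS:
        sql = f"SELECT id, species, mass_kg FROM ({local_view}) WHERE {where_sql}"
        rows.extend(conn.execute(sql, params).fetchall())
    return sorted(rows)

# The caller sees a single relation and never names a local database.
oak_rows = query_global("species = ?", ("oak",))
```

        <p>In this sketch the caller only formulates queries against the global relation; adding a further database means adding one more mapping, and maintaining these mappings under centralized control is one reason why the approach is hard to scale to many autonomous databases.</p>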
        <p>Parallel database systems extend the distributed database approach
to improve performance (transaction throughput or query response time) by exploiting
database partitioning using a multiprocessor or cluster system. Although data integration
systems and parallel database systems can scale up to hundreds of data sources or database
partitions, they still rely on a centralized global schema and strong assumptions about
the network.</p>
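        <p>Database partitioning can be sketched in a few lines of Python. The example below is a simplified illustration under stated assumptions: rows are hash-partitioned on their key across a hypothetical number of nodes (<code>NUM_PARTITIONS</code>), each partition computes a local aggregate, and a coordinator merges the partial results. Threads merely stand in for the nodes of a multiprocessor or cluster; the function names are invented for this example.</p>

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
import zlib

NUM_PARTITIONS = 4  # hypothetical number of nodes

def partition_of(key):
    """Deterministic hash partitioning: equal keys always map to one node."""
    return zlib.crc32(key.encode("utf-8")) % NUM_PARTITIONS

def load(rows):
    """Distribute (key, value) rows over the partitions."""
    partitions = [[] for _ in range(NUM_PARTITIONS)]
    for key, value in rows:
        partitions[partition_of(key)].append((key, value))
    return partitions

def local_sum(partition):
    """Work done independently on one partition: SUM(value) GROUP BY key."""
    totals = Counter()
    for key, value in partition:
        totals[key] += value
    return totals

def parallel_group_sum(partitions):
    """Run the local aggregation on every partition, then merge the partials."""
    with ThreadPoolExecutor(max_workers=NUM_PARTITIONS) as pool:
        partials = list(pool.map(local_sum, partitions))
    merged = Counter()
    for partial in partials:
        merged.update(partial)  # Counter.update adds the partial sums
    return dict(merged)

rows = [("oak", 3), ("pine", 5), ("oak", 4), ("birch", 1), ("pine", 2)]
partitions = load(rows)
totals = parallel_group_sum(partitions)
```

        <p>Because all rows with the same key are placed in the same partition, each local aggregate is already final for its keys and the merge step is cheap. Note that the partitioning function and the coordinator remain centralized decisions, in line with the limitation discussed above.</p>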
        <p>In contrast, peer-to-peer (P2P) systems adopt a completely decentralized approach to
data sharing. By distributing data storage and processing across autonomous peers in
the network, they can scale without the need for powerful servers.
P2P systems typically have millions of users sharing petabytes of
data over the Internet. Although very useful, these systems are quite simple (e.g. file
sharing), support limited functions (e.g. keyword search) and use simple techniques (e.g.
resource location by flooding) which have performance problems.
A P2P solution is well-suited to support the
collaborative nature of scientific applications as it provides
scalability, dynamicity, autonomy and decentralized control. Peers can
be the participants or organizations involved in collaboration and may
share data and applications while keeping full control over their
(local) data sources.
But for very large-scale scientific data analysis,
we believe cloud computing (see next section) is the right
approach as it can provide virtually infinite computing, storage and
networking resources.
However, current cloud architectures are proprietary, ad hoc, and may
deprive users of the control of their own data. Thus, we postulate
that a hybrid P2P/cloud architecture is more appropriate for
scientific data management, by combining the best of both
approaches. In particular, it will enable the clean integration of the
users’ own computational resources with different clouds.</p>
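        <p>The performance problem of resource location by flooding, mentioned above for simple P2P systems, can be made concrete with a small Python sketch. It is an illustration only: the overlay topology, the peer names, the stored files and the function <code>flood_search</code> are hypothetical. A query is forwarded to all neighbors until its time-to-live (TTL) is exhausted, and every forward is counted as one message, including those sent to peers that have already seen the query.</p>

```python
from collections import deque

# Hypothetical unstructured overlay: each peer knows only its neighbors.
OVERLAY = {
    "p1": ["p2", "p3"],
    "p2": ["p1", "p4"],
    "p3": ["p1", "p4"],
    "p4": ["p2", "p3", "p5"],
    "p5": ["p4"],
}

# Hypothetical local content of each peer.
STORED = {"p5": {"genome.csv"}, "p3": {"notes.txt"}}

def flood_search(start, filename, ttl):
    """Flood a query from start; return (peers holding filename, messages sent)."""
    seen = {start}
    hits = set()
    messages = 0
    queue = deque([(start, ttl)])
    while queue:
        peer, remaining = queue.popleft()
        if filename in STORED.get(peer, set()):
            hits.add(peer)
        if remaining == 0:
            continue  # TTL exhausted: this peer does not forward the query
        for neighbor in OVERLAY[peer]:
            messages += 1  # every forward costs a message, even a duplicate
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, remaining - 1))
    return hits, messages

found, cost = flood_search("p1", "genome.csv", ttl=3)
```

        <p>Even in this five-peer overlay, most messages are duplicates delivered to peers that already hold the query, and a TTL that is too small misses the resource entirely. Both effects grow with the size of the network, which is why flooding scales poorly.</p>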
      </div>
      <!--FIN du corps du module-->
    </div>
  </body>
</html>
