<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN" "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8"/>
    <title>Project-Team:ZENITH</title>
    <link rel="stylesheet" href="../static/css/raweb.css" type="text/css"/>
    <meta name="description" content="Overall Objectives - Overall Objectives"/>
    <meta name="dc.title" content="Overall Objectives - Overall Objectives"/>
    <meta name="dc.subject" content=""/>
    <meta name="dc.publisher" content="INRIA"/>
    <meta name="dc.date" content="(SCHEME=ISO8601) 2016-01"/>
    <meta name="dc.type" content="Report"/>
    <meta name="dc.language" content="(SCHEME=ISO639-1) en"/>
    <meta name="projet" content="ZENITH"/>
    <script type="text/javascript" src="https://raweb.inria.fr/rapportsactivite/RA2016/static/MathJax/MathJax.js?config=TeX-MML-AM_CHTML">
      <!--MathJax-->
    </script>
  </head>
  <body>
    <div class="tdmdiv">
      <div class="logo">
        <a href="http://www.inria.fr">
          <img style="align:bottom; border:none" src="../static/img/icons/logo_INRIA-coul.jpg" alt="Inria"/>
        </a>
      </div>
      <div class="TdmEntry">
        <div class="tdmentete">
          <a href="uid0.html">Project-Team Zenith</a>
        </div>
        <span>
          <a href="uid1.html">Members</a>
        </span>
      </div>
      <div class="tdmActPage">
        <a href="./uid3.html">Overall Objectives</a>
      </div>
      <div class="TdmEntry">Research Program<ul><li><a href="uid10.html&#10;&#9;&#9;  ">Data Management</a></li><li><a href="uid11.html&#10;&#9;&#9;  ">Distributed Data Management</a></li><li><a href="uid12.html&#10;&#9;&#9;  ">Cloud Data Management</a></li><li><a href="uid13.html&#10;&#9;&#9;  ">Big Data</a></li><li><a href="uid17.html&#10;&#9;&#9;  ">Data Integration</a></li><li><a href="uid18.html&#10;&#9;&#9;  ">Data Analytics</a></li><li><a href="uid22.html&#10;&#9;&#9;  ">Data Search</a></li></ul></div>
      <div class="TdmEntry">Application Domains<ul><li><a href="uid27.html&#10;&#9;&#9;  ">Data-intensive Scientific Applications</a></li></ul></div>
      <div class="TdmEntry">
        <a href="./uid33.html">Highlights of the Year</a>
      </div>
      <div class="TdmEntry">New Software and Platforms<ul><li><a href="uid37.html&#10;&#9;&#9;  ">Pl@ntNet</a></li><li><a href="uid38.html&#10;&#9;&#9;  ">The Plant Game: crowdsourced plants identification</a></li><li><a href="uid39.html&#10;&#9;&#9;  ">Smart’Flore</a></li><li><a href="uid40.html&#10;&#9;&#9;  ">Snoop &amp; SnoopIm</a></li><li><a href="uid41.html&#10;&#9;&#9;  ">MultiSite-Rec</a></li><li><a href="uid42.html&#10;&#9;&#9;  ">Chiaroscuro</a></li><li><a href="uid43.html&#10;&#9;&#9;  ">LogMagnet</a></li><li><a href="uid44.html&#10;&#9;&#9;  ">FP-Hadoop</a></li><li><a href="uid45.html&#10;&#9;&#9;  ">CloudMdsQL Compiler</a></li><li><a href="uid46.html&#10;&#9;&#9;  ">AgroLD</a></li><li><a href="uid47.html&#10;&#9;&#9;  ">SciFloware</a></li></ul></div>
      <div class="TdmEntry">New Results<ul><li><a href="uid49.html&#10;&#9;&#9;  ">Data Integration</a></li><li><a href="uid52.html&#10;&#9;&#9;  ">Data Management</a></li><li><a href="uid55.html&#10;&#9;&#9;  ">Scientific Workflows</a></li><li><a href="uid59.html&#10;&#9;&#9;  ">Data Analytics</a></li><li><a href="uid62.html&#10;&#9;&#9;  ">Data Search</a></li></ul></div>
      <div class="TdmEntry">Bilateral Contracts and Grants with Industry<ul><li><a href="uid69.html&#10;&#9;&#9;  ">Microsoft ZcloudFlow (2013-2017)</a></li><li><a href="uid70.html&#10;&#9;&#9;  ">Triton I-lab (2014-2016)</a></li></ul></div>
      <div class="TdmEntry">Partnerships and Cooperations<ul><li><a href="uid72.html&#10;&#9;&#9;  ">Regional Initiatives</a></li><li><a href="uid75.html&#10;&#9;&#9;  ">National Initiatives</a></li><li><a href="uid81.html&#10;&#9;&#9;  ">European Initiatives</a></li><li><a href="uid86.html&#10;&#9;&#9;  ">International Initiatives</a></li><li><a href="uid101.html&#10;&#9;&#9;  ">International Research Visitors</a></li></ul></div>
      <div class="TdmEntry">Dissemination<ul><li><a href="uid104.html&#10;&#9;&#9;  ">Scientific Animation</a></li><li><a href="uid155.html&#10;&#9;&#9;  ">Teaching - Supervision - Juries</a></li><li><a href="uid182.html&#10;&#9;&#9;  ">Popularization</a></li></ul></div>
      <div class="TdmEntry">
        <div>Bibliography</div>
      </div>
      <div class="TdmEntry">
        <ul>
          <li>
            <a id="tdmbibentmajor" href="bibliography.html">Major publications</a>
          </li>
          <li>
            <a id="tdmbibentyear" href="bibliography.html#year">Publications of the year</a>
          </li>
        </ul>
      </div>
    </div>
    <div id="main">
      <div class="mainentete">
        <div id="head_agauche">
          <small><a href="http://www.inria.fr">
	    
	    Inria
	  </a> | <a href="../index.html">
	    
	    Raweb 
	    2016</a> | <a href="http://www.inria.fr/en/teams/zenith">Presentation of the Project-Team ZENITH</a> | <a href="https://team.inria.fr/zenith/">ZENITH Web Site
	  </a></small>
        </div>
        <div id="head_adroite">
          <table class="qrcode">
            <tr>
              <td>
                <a href="zenith.xml">
                  <img style="align:bottom; border:none" alt="XML" src="../static/img/icons/xml_motif.png"/>
                </a>
              </td>
              <td>
                <a href="zenith.pdf">
                  <img style="align:bottom; border:none" alt="PDF" src="IMG/qrcode-zenith-pdf.png"/>
                </a>
              </td>
              <td>
                <a href="../zenith/zenith.epub">
                  <img style="align:bottom; border:none" alt="e-pub" src="IMG/qrcode-zenith-epub.png"/>
                </a>
              </td>
            </tr>
            <tr>
              <td/>
              <td>PDF
</td>
              <td>e-Pub
</td>
            </tr>
          </table>
        </div>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid1.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid10.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
      <div id="textepage">
        <!--DEBUT2 du corps du module-->
        <h2>Section: 
      Overall Objectives</h2>
        <h3 class="titre3">Overall Objectives</h3>
        <p>Data-intensive science such as agronomy, astronomy, biology and
environmental science must deal with overwhelming amounts of
experimental data produced through empirical observation and
simulation. Such data must be processed (cleaned, transformed, analyzed) in all kinds of
ways in order to draw new conclusions, prove scientific theories and
produce knowledge. However, constant progress in scientific
observational instruments (e.g. satellites, sensors, large hadron
collider) and simulation tools (that foster in silico experimentation) creates
a huge data overload. For example, climate modeling data are growing
so fast that they will lead to collections of hundreds of exabytes by 2020.</p>
        <p>Scientific data is very complex, in particular because of
heterogeneous methods used for producing data, the uncertainty of
captured data, the inherently multi-scale nature (spatial scale,
temporal scale) of many sciences and the growing use of imaging
(e.g. molecular imaging), resulting in data with hundreds of
attributes, dimensions or descriptors.
Modern science research is also highly collaborative, involving scientists from different disciplines
(e.g. biologists, soil scientists, and geologists working on an
environmental project), in some cases from different organizations
in different countries. Each discipline or
organization tends to produce and manage its own data, in specific
formats, with its own processes. Thus, integrating such distributed data
gets difficult as the amounts of heterogeneous data grow.</p>
        <p>Despite their variety, we can identify common features of scientific
data: big data; manipulated through complex, distributed
workflows; typically complex, e.g. multidimensional or graph-based;
with uncertainty in the data values, e.g., to reflect data capture or
observation; important metadata about experiments and their
provenance; and mostly append-only (with rare updates).</p>
        <p>Relational DBMSs, which have proved effective in many application domains (e.g. business
transactions, business intelligence), are not efficient at dealing
with scientific data or big data, which is typically unstructured.
In particular, they have been criticized for their “one size fits all” approach.
As an alternative , more specialized solutions are being developped
such as NoSQL/NewSQL DBMSs and data processing frameworks (e.g. Spark) on top of
distributed file systems (e.g. HDFS).</p>
        <p>The three main challenges of scientific data management can be
summarized by: (1) scale (big data, big applications); (2) complexity
(uncertain, multi-scale data with lots of dimensions), (3)
heterogeneity (in particular, data semantics heterogeneity).
These challenges are also those of data science, with the goal of making
sense out of data by combining data management, machine learning,
statistics and other disciplines.
The overall goal of Zenith is to address these challenges, by
proposing innovative solutions with significant advantages in terms of
scalability, functionality, ease of use, and performance. To produce
generic results, these solutions are in terms of architectures, models
and algorithms that can be implemented in terms of components or
services in specific computing environments, e.g. cloud.
We design and validate our solutions by working closely
with our scientific application partners such as INRA and IRD in
France, or the National Research Institute on e-medicine (MACC) in Brazil. To
further validate our solutions and extend the scope of our results, we
also foster industrial collaborations, even in non scientific
applications, provided that they exhibit similar challenges.</p>
        <p>Our approach is to capitalize on the principles of distributed and parallel data
management. In particular, we exploit: high-level languages as
the basis for data independence and automatic optimization; data
semantics to improve information retrieval and automate data integration; declarative
languages to manipulate data and workflows; and highly distributed
and parallel environments such as P2P, cluster and cloud. We also
exploit machine learning and statistics for data analytics and data search.
To reflect our approach, we organize our research program in five
complementary themes:</p>
        <ul>
          <li>
            <p class="notaparagraph"><a name="uid4"> </a>Data integration, including data capture and cleaning;</p>
          </li>
          <li>
            <p class="notaparagraph"><a name="uid5"> </a>Data management, in particular, indexing and privacy;</p>
          </li>
          <li>
            <p class="notaparagraph"><a name="uid6"> </a>Scientific workflows, in particular, in grid and cloud;</p>
          </li>
          <li>
            <p class="notaparagraph"><a name="uid7"> </a>Data analytics, including data mining and statistics;</p>
          </li>
          <li>
            <p class="notaparagraph"><a name="uid8"> </a>Data search, including machine learning and content-based image retrieval.</p>
          </li>
        </ul>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid1.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid10.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
    </div>
  </body>
</html>
