<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN" "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8"/>
    <title>Project-Team:GALEN</title>
    <link rel="stylesheet" href="../static/css/raweb.css" type="text/css"/>
    <meta name="description" content="Research Program - Shape, Grouping and Recognition"/>
    <meta name="dc.title" content="Research Program - Shape, Grouping and Recognition"/>
    <meta name="dc.subject" content=""/>
    <meta name="dc.publisher" content="INRIA"/>
    <meta name="dc.date" content="(SCHEME=ISO8601) 2016-01"/>
    <meta name="dc.type" content="Report"/>
    <meta name="dc.language" content="(SCHEME=ISO639-1) en"/>
    <meta name="projet" content="GALEN"/>
    <script type="text/javascript" src="https://raweb.inria.fr/rapportsactivite/RA2016/static/MathJax/MathJax.js?config=TeX-MML-AM_CHTML">
      <!--MathJax-->
    </script>
  </head>
  <body>
    <div class="tdmdiv">
      <div class="logo">
        <a href="http://www.inria.fr">
          <img style="align:bottom; border:none" src="../static/img/icons/logo_INRIA-coul.jpg" alt="Inria"/>
        </a>
      </div>
      <div class="TdmEntry">
        <div class="tdmentete">
          <a href="uid0.html">Project-Team Galen</a>
        </div>
        <span>
          <a href="uid1.html">Members</a>
        </span>
      </div>
      <div class="TdmEntry">Overall Objectives<ul><li><a href="./uid3.html">GALEN@Centrale-Paris</a></li></ul></div>
      <div class="TdmEntry">Research Program<ul><li class="tdmActPage"><a href="uid5.html&#10;&#9;&#9;  ">Shape, Grouping and Recognition</a></li><li><a href="uid7.html&#10;&#9;&#9;  ">Machine Learning &amp; Structured Prediction</a></li><li><a href="uid11.html&#10;&#9;&#9;  ">Self-Paced Learning with Missing Information</a></li><li><a href="uid15.html&#10;&#9;&#9;  ">Discrete Biomedical Image Perception</a></li></ul></div>
      <div class="TdmEntry">Application Domains<ul><li><a href="uid19.html&#10;&#9;&#9;  ">Testing for Difference in Functional Brain Connectivity</a></li><li><a href="uid20.html&#10;&#9;&#9;  ">Lung Tumor Detection and Characterization</a></li><li><a href="uid21.html&#10;&#9;&#9;  ">Protein function prediction</a></li><li><a href="uid22.html&#10;&#9;&#9;  ">Imaging biomarkers for chronic lung diseases</a></li><li><a href="uid23.html&#10;&#9;&#9;  ">Co-segmentation and Co-registration of Subcortical Brain Structures</a></li></ul></div>
      <div class="TdmEntry">
        <a href="./uid25.html">Highlights of the Year</a>
      </div>
      <div class="TdmEntry">New Software and Platforms<ul><li><a href="uid40.html&#10;&#9;&#9;  ">DISD</a></li><li><a href="uid44.html&#10;&#9;&#9;  ">DPMS</a></li><li><a href="uid48.html&#10;&#9;&#9;  ">ECP</a></li><li><a href="uid52.html&#10;&#9;&#9;  ">DROP</a></li><li><a href="uid56.html&#10;&#9;&#9;  ">FastPD</a></li><li><a href="uid59.html&#10;&#9;&#9;  ">GraPeS</a></li><li><a href="uid63.html&#10;&#9;&#9;  ">HOAP-SVM</a></li><li><a href="uid67.html&#10;&#9;&#9;  ">LBSD</a></li><li><a href="uid71.html&#10;&#9;&#9;  ">TeXMeG</a></li><li><a href="uid75.html&#10;&#9;&#9;  ">mrf-registration</a></li><li><a href="uid79.html&#10;&#9;&#9;  ">Newton-MRF</a></li><li><a href="uid83.html&#10;&#9;&#9;  ">Relative MMD</a></li><li><a href="uid87.html&#10;&#9;&#9;  ">Deep Graph Structure Discovery</a></li></ul></div>
      <div class="TdmEntry">New Results<ul><li><a href="uid92.html&#10;&#9;&#9;  ">Learning Grammars for Architecture-Specific Facade Parsing</a></li><li><a href="uid93.html&#10;&#9;&#9;  ">Non-Rigid Surface Registration</a></li><li><a href="uid94.html&#10;&#9;&#9;  ">Monocular Surface Reconstruction using 3D Deformable Part Models</a></li><li><a href="uid95.html&#10;&#9;&#9;  ">Learning with Non-modular loss functions</a></li><li><a href="uid96.html&#10;&#9;&#9;  ">Asymptotic Variance of MMD and Relative MMD</a></li><li><a href="uid97.html&#10;&#9;&#9;  ">Deconvolution and Deinterlacing of Video Sequences</a></li><li><a href="uid98.html&#10;&#9;&#9;  ">A Variational Bayesian Approach for Restoring Data Corrupted with Non-Gaussian Noise</a></li><li><a href="uid99.html&#10;&#9;&#9;  ">The Majorize-Minimize Subspace Algorithm and Block Parallelization</a></li><li><a href="uid100.html&#10;&#9;&#9;  ">Stochastic Forward-Backward and Primal-Dual Approximation Algorithms with Application to Online Image Restoration</a></li><li><a href="uid101.html&#10;&#9;&#9;  ">Random primal-dual proximal iterations for sparse multiclass SVM</a></li><li><a href="uid102.html&#10;&#9;&#9;  ">PALMA, an improved algorithm for DOSY signal processing</a></li><li><a href="uid103.html&#10;&#9;&#9;  ">Graph-based change detection and classification in satellite image pairs</a></li><li><a href="uid104.html&#10;&#9;&#9;  ">Graphical models in artificial vision</a></li><li><a href="uid105.html&#10;&#9;&#9;  ">Pattern analysis of EEG signals with epileptic activity</a></li></ul></div>
      <div class="TdmEntry">Partnerships and Cooperations<ul><li><a href="uid107.html&#10;&#9;&#9;  ">National Initiatives</a></li><li><a href="uid129.html&#10;&#9;&#9;  ">European Initiatives</a></li><li><a href="uid196.html&#10;&#9;&#9;  ">International Initiatives</a></li></ul></div>
      <div class="TdmEntry">Dissemination<ul><li><a href="uid213.html&#10;&#9;&#9;  ">Promoting Scientific Activities</a></li><li><a href="uid239.html&#10;&#9;&#9;  ">Teaching - Supervision - Juries</a></li></ul></div>
      <div class="TdmEntry">
        <div>Bibliography</div>
      </div>
      <div class="TdmEntry">
        <ul>
          <li>
            <a id="tdmbibentyear" href="bibliography.html">Publications of the year</a>
          </li>
          <li>
            <a id="tdmbibentfoot" href="bibliography.html#References">References in notes</a>
          </li>
        </ul>
      </div>
    </div>
    <div id="main">
      <div class="mainentete">
        <div id="head_agauche">
          <small><a href="http://www.inria.fr">
	    
	    Inria
	  </a> | <a href="../index.html">
	    
	    Raweb 
	    2016</a> | <a href="http://www.inria.fr/en/teams/galen">Presentation of the Project-Team GALEN</a></small>
        </div>
        <div id="head_adroite">
          <table class="qrcode">
            <tr>
              <td>
                <a href="galen.xml">
                  <img style="align:bottom; border:none" alt="XML" src="../static/img/icons/xml_motif.png"/>
                </a>
              </td>
              <td>
                <a href="galen.pdf">
                  <img style="align:bottom; border:none" alt="PDF" src="IMG/qrcode-galen-pdf.png"/>
                </a>
              </td>
              <td>
                <a href="../galen/galen.epub">
                  <img style="align:bottom; border:none" alt="e-pub" src="IMG/qrcode-galen-epub.png"/>
                </a>
              </td>
            </tr>
            <tr>
              <td/>
              <td>PDF
</td>
              <td>e-Pub
</td>
            </tr>
          </table>
        </div>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid3.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid7.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
      <div id="textepage">
        <!--DEBUT2 du corps du module-->
        <h2>Section: 
      Research Program</h2>
        <h3 class="titre3">Shape, Grouping and Recognition</h3>
        <p>A general framework for the fundamental problems of image segmentation, object recognition and scene analysis is the interpretation of an image in terms of a set of symbols and relations among them. Abstractly stated, image interpretation amounts to mapping an observed image, <span class="math"><math xmlns="http://www.w3.org/1998/Math/MathML"><mi>X</mi></math></span> to a set of symbols <span class="math"><math xmlns="http://www.w3.org/1998/Math/MathML"><mi>Y</mi></math></span>. Of particular interest are the symbols <span class="math"><math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>Y</mi><mo>*</mo></msup></math></span> that <i>optimally explain the underlying image</i>, as measured by a scoring function <span class="math"><math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi></math></span> that aims at distinguishing correct (consistent with human labellings) from incorrect interpretations:</p>
        <div align="center" class="mathdisplay">
          <a name="uid6"/>
          <table width="100%">
            <tr valign="middle">
              <td align="center">
                <math xmlns="http://www.w3.org/1998/Math/MathML">
                  <mrow>
                    <msup>
                      <mi>Y</mi>
                      <mo>*</mo>
                    </msup>
                    <mo>=</mo>
                    <msub>
                      <mi> argmax </mi>
                      <mi>Y</mi>
                    </msub>
                    <mi>s</mi>
                    <mrow>
                      <mo>(</mo>
                      <mi>X</mi>
                      <mo>,</mo>
                      <mi>Y</mi>
                      <mo>)</mo>
                    </mrow>
                  </mrow>
                </math>
              </td>
              <td class="eqno" width="10" align="right">(1)</td>
            </tr>
          </table>
        </div>
        <p class="notaparagraph">Applying this framework requires (a) identifying which symbols and relations to use (b) learning a scoring function <span class="math"><math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi></math></span> from training data and (c) optimizing over <span class="math"><math xmlns="http://www.w3.org/1998/Math/MathML"><mi>Y</mi></math></span> in Eq.<a title="Shape, Grouping and Recognition" href="./uid5.html#uid6">1</a>.</p>
        <p>One of the main themes of our work is the development of methods that jointly address (a,b,c) in a shape-grouping framework in order to reliably extract, describe, model and detect shape information from natural and medical images. A principal motivation for using a shape-based framework is the understanding that shape- and more generally, grouping- based representations can go all the way from image features to objects. Regarding aspect (a), image representation, we cater for the extraction of image features that respect the shape properties of image structures. Such features are typically constructed to be purely geometric (e.g. boundaries, symmetry axes, image segments), or appearance-based, such as image descriptors. The use of machine learning has been shown to facilitate the robust and efficient extraction of such features, while the grouping of local evidence is known to be necessary to disambiguate the potentially noisy local measurements. In our research we have worked on improving feature extraction, proposing novel blends of invariant geometric- and appearance- based features, as well as grouping algorithms that allow for the efficient construction of optimal assemblies of local features.</p>
        <p>Regarding aspect (b) we have worked on learning scoring functions for detection with deformable models that can exploit the developed low-level representations, while also being amenable to efficient optimization. Our works in this direction build on the graph-based framework to construct models that reflect the shape properties of the structure being modeled. We have used discriminative learning to exploit boundary- and symmetry-based representations for the construction of hierarchical models for shape detection, while for medical images we have developed methods for the end-to-end discriminative training of deformable contour models that combine low-level descriptors with contour-based organ boundary representations.</p>
        <p>Regarding aspect (c) we have developed algorithms which implement top-down/bottom-up computation both in deterministic and stochastic optimization. The main idea is that ‘bottom-up’, image-based guidance is necessary for efficient detection, while ‘top-down’, object-based knowledge can disambiguate and help reliably interpret a given image; a combination of both modes of operation is necessary to combine accuracy with efficiency. In particular we have developed novel techniques for object detection that employ combinatorial optimization tools (A* and Branch-and-Bound) to tame the combinatorial complexity, achieving a best-case performance that is logarithmic in the number of pixels.</p>
        <p>In the long run we aim at scaling up shape-based methods to 3D detection and pose estimation and large-scale object detection. One aspect which seems central to this is the development of appropriate mid-level representations. This is a problem that has received increased interest lately in the 2D case and is relatively mature, but in 3D it has been pursued primarily through ad-hoc schemes. We anticipate that questions pertaining to part sharing in 3D will be addressed most successfully by relying on explicit 3D representations. On the one hand depth sensors, such as Microsoft’s Kinect, are now cheap enough to bring surface modeling and matching into the mainstream of computer vision - so these advances may be directly exploitable at test time for detection. On the other hand, even if we do not use depth information at test time, having 3D information can simplify the modeling task during training. In on-going work with collaborators we have started exploring combinations of such aspects, namely (i) the use of surface analysis tools to match surfaces from depth sensors (ii) using branch-and-bound for efficient inference in 3D space and (iii) groupwise-registration to build statistical 3D surface models. In the coming years we intend to pursue a tighter integration of these different directions for scalable 3D object recognition.</p>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid3.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid7.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
    </div>
  </body>
</html>
