<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN" "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8"/>
    <title>Project-Team:WILLOW</title>
    <link rel="stylesheet" href="../static/css/raweb.css" type="text/css"/>
    <meta name="description" content="New Results - 3D object and scene modeling, analysis, and retrieval"/>
    <meta name="dc.title" content="New Results - 3D object and scene modeling, analysis, and retrieval"/>
    <meta name="dc.creator" content="Mathieu Aubry"/>
    <meta name="dc.creator" content="Bryan Russell"/>
    <meta name="dc.creator" content="Josef Sivic"/>
    <meta name="dc.creator" content="Mathieu Aubry"/>
    <meta name="dc.creator" content="Bryan Russell"/>
    <meta name="dc.creator" content="Alyosha Efros"/>
    <meta name="dc.creator" content="Josef Sivic"/>
    <meta name="dc.creator" content="Mathieu Andreux"/>
    <meta name="dc.creator" content="Emanuele Rodola"/>
    <meta name="dc.creator" content="Mathieu Aubry"/>
    <meta name="dc.creator" content="Daniel Cremers"/>
    <meta name="dc.creator" content="Jean Ponce"/>
    <meta name="dc.creator" content="Martial Hebert"/>
    <meta name="dc.creator" content="Jean Ponce"/>
    <meta name="dc.creator" content="Martial Hebert"/>
    <meta name="dc.subject" content=""/>
    <meta name="dc.publisher" content="INRIA"/>
    <meta name="dc.date" content="(SCHEME=ISO8601) 2014-01"/>
    <meta name="dc.type" content="Report"/>
    <meta name="dc.language" content="(SCHEME=ISO639-1) en"/>
    <meta name="projet" content="WILLOW"/>
    <!-- Piwik -->
    <script type="text/javascript" src="/rapportsactivite/piwik.js"></script>
    <noscript><p><img src="//piwik.inria.fr/piwik.php?idsite=49" style="border:0;" alt="" /></p></noscript>
    <!-- End Piwik Code -->
  </head>
  <body>
    <div class="tdmdiv">
      <div class="logo">
        <a href="http://www.inria.fr">
          <img style="align:bottom; border:none" src="../static/img/icons/logo_INRIA-coul.jpg" alt="Inria"/>
        </a>
      </div>
      <div class="TdmEntry">
        <div class="tdmentete">
          <a href="uid0.html">Project-Team Willow</a>
        </div>
        <span>
          <a href="uid1.html">Members</a>
        </span>
      </div>
      <div class="TdmEntry">Overall Objectives<ul><li><a href="./uid3.html">Statement</a></li></ul></div>
      <div class="TdmEntry">Research Program<ul><li><a href="uid5.html&#10;&#9;&#9;  ">3D object and scene modeling, analysis,
and retrieval</a></li><li><a href="uid7.html&#10;&#9;&#9;  ">Category-level object and scene recognition</a></li><li><a href="uid8.html&#10;&#9;&#9;  ">Image restoration, manipulation and enhancement</a></li><li><a href="uid9.html&#10;&#9;&#9;  ">Human activity capture and classification</a></li></ul></div>
      <div class="TdmEntry">Application Domains<ul><li><a href="uid14.html&#10;&#9;&#9;  ">Introduction</a></li><li><a href="uid15.html&#10;&#9;&#9;  ">Quantitative image analysis in science and humanities</a></li><li><a href="uid16.html&#10;&#9;&#9;  ">Video Annotation, Interpretation, and Retrieval</a></li></ul></div>
      <div class="TdmEntry">New Software and Platforms<ul><li><a href="uid18.html&#10;&#9;&#9;  ">SPArse Modeling Software (SPAMS)</a></li><li><a href="uid22.html&#10;&#9;&#9;  ">Efficient video descriptors for action recognition</a></li><li><a href="uid23.html&#10;&#9;&#9;  ">Weakly Supervised Action Labeling in Videos Under Ordering Constraints</a></li><li><a href="uid24.html&#10;&#9;&#9;  ">Visual Place Recognition with Repetitive Structures</a></li><li><a href="uid25.html&#10;&#9;&#9;  ">Seeing 3D chairs: exemplar part-based 2D-3D alignment
using a large dataset of CAD models</a></li><li><a href="uid26.html&#10;&#9;&#9;  ">Painting-to-3D Model Alignment Via Discriminative Visual Elements</a></li><li><a href="uid27.html&#10;&#9;&#9;  ">Painting recognition from wearable cameras</a></li><li><a href="uid28.html&#10;&#9;&#9;  ">Learning and transferring mid-level image representations using convolutional neural networks</a></li></ul></div>
      <div class="TdmEntry">New Results<ul><li><a href="uid30.html&#10;&#9;&#9;  ">Highlights of the Year</a></li><li class="tdmActPage"><a href="uid33.html&#10;&#9;&#9;  ">3D object and scene modeling, analysis, and retrieval</a></li><li><a href="uid41.html&#10;&#9;&#9;  ">Category-level object and scene recognition</a></li><li><a href="uid51.html&#10;&#9;&#9;  ">Image restoration, manipulation and enhancement</a></li><li><a href="uid54.html&#10;&#9;&#9;  ">Human activity capture and classification</a></li></ul></div>
      <div class="TdmEntry">Bilateral Contracts and Grants with Industry<ul><li><a href="uid61.html&#10;&#9;&#9;  ">MSR-Inria joint lab: Image and video mining for science and
humanities (Inria)</a></li><li><a href="uid62.html&#10;&#9;&#9;  ">Google: Learning to annotate videos from movie scripts (Inria)</a></li></ul></div>
      <div class="TdmEntry">Partnerships and Cooperations<ul><li><a href="uid64.html&#10;&#9;&#9;  ">National Initiatives</a></li><li><a href="uid66.html&#10;&#9;&#9;  ">European Initiatives</a></li><li><a href="uid71.html&#10;&#9;&#9;  ">International Initiatives</a></li><li><a href="uid76.html&#10;&#9;&#9;  ">International Research Visitors</a></li></ul></div>
      <div class="TdmEntry">Dissemination<ul><li><a href="uid80.html&#10;&#9;&#9;  ">Promoting Scientific Activities</a></li><li><a href="uid117.html&#10;&#9;&#9;  ">Teaching - Supervision - Juries</a></li><li><a href="uid160.html&#10;&#9;&#9;  ">Invited presentations</a></li></ul></div>
      <div class="TdmEntry">
        <div>Bibliography</div>
      </div>
      <div class="TdmEntry">
        <ul>
          <li>
            <a id="tdmbibentyear" href="bibliography.html">Publications of the year</a>
          </li>
        </ul>
      </div>
    </div>
    <div id="main">
      <div class="mainentete">
        <div id="head_agauche">
          <small><a href="http://www.inria.fr">
	    
	    Inria
	  </a> | <a href="../index.html">
	    
	    Raweb 
	    2014</a> | <a href="http://www.inria.fr/en/teams/willow">Presentation of the Project-Team WILLOW</a> | <a href="http://www.di.ens.fr/willow">WILLOW Web Site
	  </a></small>
        </div>
        <div id="head_adroite">
          <table class="qrcode">
            <tr>
              <td>
                <a href="willow.xml">
                  <img style="align:bottom; border:none" alt="XML" src="../static/img/icons/xml_motif.png"/>
                </a>
              </td>
              <td>
                <a href="willow.pdf">
                  <img style="align:bottom; border:none" alt="PDF" src="IMG/qrcode-willow-pdf.png"/>
                </a>
              </td>
              <td>
                <a href="../willow/willow.epub">
                  <img style="align:bottom; border:none" alt="e-pub" src="IMG/qrcode-willow-epub.png"/>
                </a>
              </td>
            </tr>
            <tr>
              <td/>
              <td>PDF
</td>
              <td>e-Pub
</td>
            </tr>
          </table>
        </div>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid30.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid41.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
      <div id="textepage">
        <!--DEBUT2 du corps du module-->
        <h2>Section: 
      New Results</h2>
        <h3 class="titre3">3D object and scene modeling, analysis, and retrieval</h3>
        <div align="center" style="margin-top:10px">
          <a name="uid34">
            <!--...-->
          </a>
          <table title="" class="objectContainer">
            <caption align="bottom"><strong>Figure
	1. </strong>
          Our system automatically aligns and recovers the viewpoint of paintings, drawings, and historical photographs to a 3D model of an architectural site. 
        </caption>
            <tr align="center">
              <td>
                <table>
                  <tr>
                    <td xmlns="" style="height:3px;" align="center">
                      <img xmlns="http://www.w3.org/1999/xhtml" style="width:427.0pt" alt="IMG/Aubry13.jpg" src="IMG/Aubry13.jpg"/>
                    </td>
                  </tr>
                </table>
              </td>
            </tr>
          </table>
        </div>
        <a name="uid35"/>
        <h4 class="titre4">Painting-to-3D Model Alignment Via Discriminative Visual Elements</h4>
        <p class="participants"><span class="part">Participants</span> :
	Mathieu Aubry, Bryan Russell [Intel Labs] , Josef Sivic.</p>
        <p>In this work we describe a technique that can reliably align arbitrary 2D depictions of an architectural site, including drawings, paintings and historical photographs, with a 3D model of the site. This is a tremendously difficult task as the appearance and scene structure in the 2D depictions can be very different from the appearance and geometry of the 3D model, e.g., due to the specific rendering style, drawing error, age, lighting or change of seasons. In addition, we face a hard search problem: the number of possible alignments of the painting to a large 3D model, such as a partial reconstruction of a city, is huge. To address these issues, we develop a new compact representation of complex 3D scenes. The 3D model of the scene is represented by a small set of discriminative visual elements that are automatically learnt from rendered views. Similar to object detection, the set of visual elements, as well as the weights of individual features for each element, are learnt in a discriminative fashion. We show that the learnt visual elements are reliably matched in 2D depictions of the scene despite large variations in rendering style (e.g. watercolor, sketch, historical photograph) and structural changes (e.g. missing scene parts, large occluders) of the scene. We demonstrate an application of the proposed approach to automatic re-photography to find an approximate viewpoint of historical paintings and photographs with respect to a 3D model of the site. The proposed alignment procedure is validated via a human user study on a new database of paintings and sketches spanning several sites. The results demonstrate that our algorithm produces significantly better alignments than several baseline methods.
This work has been published at ACM Transactions on Graphics 2014 <a href="./bibliography.html#willow-2014-bid5">[3]</a>  and its extension has appeared at RFIA 2014 <a href="./bibliography.html#willow-2014-bid6">[17]</a> .
The problem addressed in this work is illustrated in Figure <a title="3D object and scene modeling, analysis, and retrieval" href="./uid33.html#uid34">1</a>  and example results are shown in Figure <a title="3D object and scene modeling, analysis, and retrieval" href="./uid33.html#uid36">2</a> .</p>
        <div align="center" style="margin-top:10px">
          <a name="uid36">
            <!--...-->
          </a>
          <table title="" class="objectContainer">
            <caption align="bottom"><strong>Figure
	2. </strong>Example alignments of non-photographic depictions to 3D models. Notice that we are able to align depictions rendered in different styles and having a variety of viewpoints with respect to the 3D models.</caption>
            <tr align="center">
              <td>
                <table>
                  <tr>
                    <td xmlns="" style="height:3px;" align="center">
                      <img xmlns="http://www.w3.org/1999/xhtml" style="width:427.0pt" alt="IMG/Aubry13im3.jpg" src="IMG/Aubry13im3.jpg"/>
                    </td>
                  </tr>
                </table>
              </td>
            </tr>
          </table>
        </div>
        <a name="uid37"/>
        <h4 class="titre4">Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models</h4>
        <p class="participants"><span class="part">Participants</span> :
	Mathieu Aubry, Bryan Russell [Intel labs] , Alyosha Efros [UC Berkeley] , Josef Sivic.</p>
        <p>This work poses object category detection in images as a type of 2D-to-3D alignment problem, utilizing the large quantities of 3D CAD models that have been made publicly available online. Using the “chair” class as a running example, we propose an exemplar-based 3D category representation, which can explicitly model chairs of different styles as well as the large variation in viewpoint. We develop an approach to establish part-based correspondences between 3D CAD models and real photographs. This is achieved by (i) representing each 3D model using a set of view-dependent mid-level visual elements learned from synthesized views in a discriminative fashion, (ii) carefully calibrating the individual element detectors on a common dataset of negative images, and (iii) matching visual elements to the test image allowing for small mutual deformations but preserving the viewpoint and style constraints. We demonstrate the ability of our system to align 3D models with 2D objects in the challenging PASCAL VOC images, which depict a wide variety of chairs in complex scenes.
This work has been published at CVPR 2014 <a href="./bibliography.html#willow-2014-bid2">[9]</a> .</p>
        <a name="uid38"/>
        <h4 class="titre4">Anisotropic Laplace-Beltrami Operators for Shape Analysis</h4>
        <p class="participants"><span class="part">Participants</span> :
	Mathieu Andreux [TUM] , Emanuele Rodola [TUM] , Mathieu Aubry, Daniel Cremers [TUM] .</p>
        <p>This work introduces an anisotropic Laplace-Beltrami operator for shape analysis. While keeping useful properties of the standard Laplace-Beltrami operator, it introduces variability in the directions of principal curvature, giving rise to a more intuitive and semantically meaningful diffusion process. Although the benefits of anisotropic diffusion have already been noted in the area of mesh processing (e.g. surface regularization), focusing on the Laplacian itself, rather than on the diffusion process it induces, opens the possibility to effectively replace the omnipresent Laplace-Beltrami operator in many shape analysis methods. After providing a mathematical formulation and analysis of this new operator, we derive a practical implementation on discrete meshes. Further, we demonstrate the effectiveness of our new operator when employed in conjunction with different methods for shape segmentation and matching.
This work has been published at the Sixth Workshop on Non-Rigid Shape Analysis and Deformable Image Alignment (NORDIA) 2014 <a href="./bibliography.html#willow-2014-bid7">[8]</a> .</p>
        <a name="uid39"/>
        <h4 class="titre4">Trinocular Geometry Revisited</h4>
        <p class="participants"><span class="part">Participants</span> :
	Jean Ponce, Martial Hebert [CMU] .</p>
        <p>When do the visual rays associated with triplets of point
correspondences converge, that is, intersect in a common
point? Classical models of trinocular geometry based on
the fundamental matrices and trifocal tensor associated
with the corresponding cameras only provide partial answers
to this fundamental question, in large part because of
underlying, but seldom explicit, general configuration assumptions.
In this project, we use elementary tools from projective
line geometry to provide necessary and sufficient geometric
and analytical conditions for convergence in terms
of transversals to triplets of visual rays, without any such
assumptions. In turn, this yields a novel and simple minimal
parameterization of trinocular geometry for cameras
with non-collinear or collinear pinholes.
This work has been published at CVPR 2014 <a href="./bibliography.html#willow-2014-bid8">[15]</a> .</p>
        <a name="uid40"/>
        <h4 class="titre4">On Image Contours of Projective Shapes</h4>
        <p class="participants"><span class="part">Participants</span> :
	Jean Ponce, Martial Hebert [CMU] .</p>
        <p>This work revisits classical properties of the outlines of solid shapes bounded by smooth surfaces, and shows that they can be established in a purely projective setting, without appealing to Euclidean measurements such as normals or curvatures. In particular, we give new synthetic proofs of Koenderink's famous theorem on convexities and concavities of the image contour, and of the fact that the rim turns in the same direction as the viewpoint in the tangent plane at a convex point, and in the opposite direction at a hyperbolic point. This suggests that projective geometry should not be viewed merely as an analytical device for linearizing calculations (its main role in structure from motion), but as the proper framework for studying the relation between solid shape and its perspective projections. Unlike previous work in this area, the proposed approach does not require an oriented setting, nor does it rely on any choice of coordinate system or analytical considerations.
This work has been published at ECCV 2014 <a href="./bibliography.html#willow-2014-bid9">[14]</a> .</p>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid30.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid41.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
    </div>
  </body>
</html>
