<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN" "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8"/>
    <title>Project-Team:MUTANT</title>
    <link rel="stylesheet" href="../static/css/raweb.css" type="text/css"/>
    <meta name="description" content="Research Program - Real-time Machine Listening"/>
    <meta name="dc.title" content="Research Program - Real-time Machine Listening"/>
    <meta name="dc.subject" content=""/>
    <meta name="dc.publisher" content="INRIA"/>
    <meta name="dc.date" content="(SCHEME=ISO8601) 2014-01"/>
    <meta name="dc.type" content="Report"/>
    <meta name="dc.language" content="(SCHEME=ISO639-1) en"/>
    <meta name="projet" content="MUTANT"/>
    <!-- Piwik -->
    <script type="text/javascript" src="/rapportsactivite/piwik.js"></script>
    <noscript><p><img src="//piwik.inria.fr/piwik.php?idsite=49" style="border:0;" alt="" /></p></noscript>
    <!-- End Piwik Code -->
  </head>
  <body>
    <div class="tdmdiv">
      <div class="logo">
        <a href="http://www.inria.fr">
          <img style="align:bottom; border:none" src="../static/img/icons/logo_INRIA-coul.jpg" alt="Inria"/>
        </a>
      </div>
      <div class="TdmEntry">
        <div class="tdmentete">
          <a href="uid0.html">Project-Team Mutant</a>
        </div>
        <span>
          <a href="uid1.html">Members</a>
        </span>
      </div>
      <div class="TdmEntry">
        <a href="./uid3.html">Overall Objectives</a>
      </div>
      <div class="TdmEntry">Research Program<ul><li class="tdmActPage"><a href="uid8.html&#10;&#9;&#9;  ">Real-time Machine Listening</a></li><li><a href="uid9.html&#10;&#9;&#9;  ">Synchronous and realtime programming for computer music</a></li><li><a href="uid12.html&#10;&#9;&#9;  ">Off-the-shelf Operating Systems for Real-time Audio</a></li></ul></div>
      <div class="TdmEntry">Application Domains<ul><li><a href="uid14.html&#10;&#9;&#9;  ">Authoring and Performing Interactive Music</a></li><li><a href="uid16.html&#10;&#9;&#9;  ">Realtime Music Information Retrieval</a></li><li><a href="uid17.html&#10;&#9;&#9;  ">Automatic Accompaniment/Creative Tools for Entertainment Industry</a></li></ul></div>
      <div class="TdmEntry">New Software and Platforms<ul><li><a href="uid20.html&#10;&#9;&#9;  ">Antescofo</a></li><li><a href="uid22.html&#10;&#9;&#9;  ">Ascograph: Antescofo Visual Editor</a></li><li><a href="uid23.html&#10;&#9;&#9;  ">Antescofo Timed Test Platform</a></li></ul></div>
      <div class="TdmEntry">New Results<ul><li><a href="uid29.html&#10;&#9;&#9;  ">Highlights of the Year</a></li><li><a href="uid30.html&#10;&#9;&#9;  ">Time-Coherency of Bayesian Priors for Sequential Alignment</a></li><li><a href="uid31.html&#10;&#9;&#9;  ">Online Methods for Audio Segmentation and Clustering</a></li><li><a href="uid32.html&#10;&#9;&#9;  ">Model-based Testing an Interactive Music System</a></li><li><a href="uid33.html&#10;&#9;&#9;  ">Antescofo Temporal Pattern</a></li><li><a href="uid34.html&#10;&#9;&#9;  ">OpenMusic reactive Model</a></li><li><a href="uid35.html&#10;&#9;&#9;  ">Representation of Rhythm and Quantization</a></li></ul></div>
      <div class="TdmEntry">Partnerships and Cooperations<ul><li><a href="uid37.html&#10;&#9;&#9;  ">National Initiatives</a></li><li><a href="uid49.html&#10;&#9;&#9;  ">European Initiatives</a></li><li><a href="uid51.html&#10;&#9;&#9;  ">International Initiatives</a></li><li><a href="uid60.html&#10;&#9;&#9;  ">International Research Visitors</a></li></ul></div>
      <div class="TdmEntry">Dissemination<ul><li><a href="uid70.html&#10;&#9;&#9;  ">Promoting Scientific Activities</a></li><li><a href="uid78.html&#10;&#9;&#9;  ">Teaching - Supervision - Juries</a></li><li><a href="uid90.html&#10;&#9;&#9;  ">Popularization</a></li></ul></div>
      <div class="TdmEntry">
        <div>Bibliography</div>
      </div>
      <div class="TdmEntry">
        <ul>
          <li>
            <a id="tdmbibentmajor" href="bibliography.html">Major publications</a>
          </li>
          <li>
            <a id="tdmbibentyear" href="bibliography.html#year">Publications of the year</a>
          </li>
          <li>
            <a id="tdmbibentfoot" href="bibliography.html#References">References in notes</a>
          </li>
        </ul>
      </div>
    </div>
    <div id="main">
      <div class="mainentete">
        <div id="head_agauche">
          <small><a href="http://www.inria.fr">
	    
	    Inria
	  </a> | <a href="../index.html">
	    
	    Raweb 
	    2014</a> | <a href="http://www.inria.fr/en/teams/mutant">Presentation of the Project-Team MUTANT</a> | <a href="http://repmus.ircam.fr/MuTant">MUTANT Web Site
	  </a></small>
        </div>
        <div id="head_adroite">
          <table class="qrcode">
            <tr>
              <td>
                <a href="mutant.xml">
                  <img style="align:bottom; border:none" alt="XML" src="../static/img/icons/xml_motif.png"/>
                </a>
              </td>
              <td>
                <a href="mutant.pdf">
                  <img style="align:bottom; border:none" alt="PDF" src="IMG/qrcode-mutant-pdf.png"/>
                </a>
              </td>
              <td>
                <a href="../mutant/mutant.epub">
                  <img style="align:bottom; border:none" alt="e-pub" src="IMG/qrcode-mutant-epub.png"/>
                </a>
              </td>
            </tr>
            <tr>
              <td/>
              <td>PDF
</td>
              <td>e-Pub
</td>
            </tr>
          </table>
        </div>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid3.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid9.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
      <div id="textepage">
        <!--DEBUT2 du corps du module-->
        <h2>Section: 
      Research Program</h2>
        <h3 class="titre3">Real-time Machine Listening</h3>
        <p>When human listeners are confronted with musical sounds, they rapidly and automatically
find their way in the music.
Even musically untrained listeners have an exceptional ability to make rapid judgments
about music from short examples, such as determining music style, performer, beating,
and specific events such as instruments or pitches.
Making computer systems capable of similar capabilities requires advances in both music cognition,
and analysis and retrieval systems employing signal processing and machine learning.</p>
        <p>In a panel session at the 13th National Conference on Artificial Intelligence in 1996,
Rodney Brooks (noted figure in robotics) remarked that while automatic speech recognition
was a highly researched domain, there had been few works trying to build machines able to
understand “non-speech sound”.
He went further to name this as one of the biggest challenges faced by
Artificial Intelligence <a href="./bibliography.html#mutant-2014-bid5">[41]</a> .
More than 15 years have passed.
Systems now exist that are able to analyze the contents of music and audio signals
and communities such as International Symposium on Music Information Retrieval (MIR)
and Sound and Music Computing (SMC) have formed.
But we still lack reliable Real-Time machine listening systems.</p>
        <p>The first thorough study of machine listening appeared in Eric Scheirer's PhD thesis
at MIT Media Lab in 2001 <a href="./bibliography.html#mutant-2014-bid1">[40]</a>  with a focus on low-level listening
such as pitch and musical tempo, paving the way for a decade of research.
Since the work of Scheirer, the literature has focused on task-dependent methods
for machine listening such as pitch estimation, beat detection, structure discovery and more.
Unfortunately, the majority of existing approaches are designed for information retrieval
on large databases or off-line methods.
Whereas the very act of listening is real-time, very little literature exists for supporting
real-time machine listening.
This argument becomes more clear while looking at the yearly
<a href="http://www.music-ir.org/mirex/wiki/MIREX_HOME">Music Information Retrieval Evaluation eXchange (MIREX)</a> ,
with different retrieval tasks and submitted systems from international institutions,
where almost no emphasis exists on real-time machine listening.
Most MIR contributions focus on off-line approaches to information retrieval
(where the system has access to future data)
with less focus on on-line and realtime approaches to information decoding.</p>
        <p>On another front, most MIR algorithms suffer from modeling of temporal structures
and temporal dynamics specific to music
(where most algorithms have roots in speech or biological sequence without correct
adoption to temporal streams such as music).
Despite tremendous progress using modern signal processing and statistical learning,
there is much to be done to achieve the same level of abstract understanding for example
in text and image analysis on music data.
On another hand, it is important to notice that even untrained listeners are easily able
to capture many aspects of formal and symbolic structures from an audio stream in realtime.
Realtime machine listening is thus still a major challenge for artificial sciences
that should be addressed both on application and theoretical fronts.</p>
        <p>In the MuTant project, we focus on realtime and online methods of music information
retrieval out of audio signals.
One of the primary goals of such systems is to fill in the gap between
<i>signal representation</i> and <i>symbolic information</i>
(such as pitch, tempo, expressivity, etc.)
contained in music signals.
MuTant's current activities focus on two main applications:
<i>score following</i> or realtime audio-to-score alignment <a href="./bibliography.html#mutant-2014-bid6">[2]</a> ,
and realtime transcription of music signals <a href="./bibliography.html#mutant-2014-bid7">[29]</a> 
with impacts both on signal processing using machine learning techniques
and their application in real-world scenarios.</p>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid3.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid9.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
    </div>
  </body>
</html>
