EN FR
EN FR


Bibliography

Major publications by the team in recent years
  • 1F. Bahja, J. Di Martino, E. H. Ibn Elhaj, D. Aboutajdine.

    An overview of the CATE algorithms for real-time pitch determination, in: Signal, Image and Video Processing, 2013. [ DOI : 10.1007/s11760-013-0488-4 ]

    https://hal.inria.fr/hal-00831660
  • 2J. Barker, E. Vincent, N. Ma, H. Christensen, P. Green.

    The PASCAL CHiME Speech Separation and Recognition Challenge, in: Computer Speech and Language, February 2013, vol. 27, no 3, pp. 621-633. [ DOI : 10.1016/j.csl.2012.10.004 ]

    https://hal.inria.fr/hal-00743529
  • 3A. Bonneau, D. Fohr, I. Illina, D. Jouvet, O. Mella, L. Mesbahi, L. Orosanu.

    Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde, in: Traitement Automatique des Langues, 2013, vol. 53, no 3.

    https://hal.inria.fr/hal-00834278
  • 4D. Jouvet, D. Fohr.

    Combining Forward-based and Backward-based Decoders for Improved Speech Recognition Performance, in: InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Lyon, France, August 2013.

    https://hal.inria.fr/hal-00834282
  • 5A. Ozerov, M. Lagrange, E. Vincent.

    Uncertainty-based learning of acoustic models from noisy data, in: Computer Speech and Language, February 2013, vol. 27, no 3, pp. 874-894. [ DOI : 10.1016/j.csl.2012.07.002 ]

    https://hal.inria.fr/hal-00717992
  • 6A. Ozerov, E. Vincent, F. Bimbot.

    A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, in: IEEE Transactions on Audio, Speech and Language Processing, May 2012, vol. 20, no 4, pp. 1118 - 1133, 16.

    https://hal.archives-ouvertes.fr/hal-00626962
  • 7A. Piquard-Kipffer, L. Sprenger-Charolles.

    Predicting reading level at the end of Grade 2 from skills assessed in kindergarten: contribution of phonemic discrimination (Follow-up of 85 French-speaking children from 4 to 8 years old), in: Topics in Cognitive Psychology, 2013.

    https://hal.inria.fr/hal-00833951
Publications of the year

Doctoral Dissertations and Habilitation Theses

Articles in International Peer-Reviewed Journals

  • 9K. Adiloğlu, E. Vincent.

    Variational Bayesian Inference for Source Separation and Robust Feature Extraction, in: IEEE Transactions on Audio Speech and Language Processing, June 2016. [ DOI : 10.1109/TASLP.2016.2583794 ]

    https://hal.inria.fr/hal-00726146
  • 10M. Aron, M.-O. Berger, E. Kerrien, B. Wrobel-Dautcourt, B. Potard, Y. Laprie.

    Multimodal acquisition of articulatory data: Geometrical and temporal registration, in: Journal of the Acoustical Society of America, 2016, vol. 139, no 2, 13 p. [ DOI : 10.1121/1.4940666 ]

    https://hal.inria.fr/hal-01269578
  • 11J. Barker, R. Marxer, E. Vincent, S. Watanabe.

    The third 'CHIME' speech separation and recognition challenge: Analysis and outcomes, in: Computer Speech and Language, October 2016.

    https://hal.inria.fr/hal-01382108
  • 12F. Bimbot, E. Deruty, G. Sargent, E. Vincent.

    System & Contrast : A Polymorphous Model of the Inner Organization of Structural Segments within Music Pieces, in: Music Perception, 2016, 41 p.

    https://hal.inria.fr/hal-01188244
  • 13M. Cadot, Y. Laprie.

    Extraction d’un modèle articulatoire à partir d’une analyse tri-directionnelle de cinéradiographies d’un locuteur, in: Revue des Nouvelles Technologies de l'Information, 2016, vol. Fouille de Données Complexes, no RNTI-E-31, pp. 73-92.

    https://hal.archives-ouvertes.fr/hal-01346987
  • 14B. Elie, Y. Laprie.

    Extension of the single-matrix formulation of the vocal tract: consideration of bilateral channels and connection of self-oscillating models of the vocal folds with a glottal chink, in: Speech Communication, September 2016, vol. 82, pp. 85-96. [ DOI : 10.1016/j.specom.2016.06.002 ]

    https://hal.archives-ouvertes.fr/hal-01199792
  • 15D. Fitzgerald, A. Liutkus, R. Badeau.

    Projection-based demixing of spatial audio, in: IEEE Transactions on Audio, Speech and Language Processing, May 2016.

    https://hal.inria.fr/hal-01260588
  • 16S. Gannot, E. Vincent, S. Markovich-Golan, A. Ozerov.

    A consolidated perspective on multi-microphone speech enhancement and source separation, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, December 2016.

    https://hal.inria.fr/hal-01414179
  • 17X. Jaureguiberry, E. Vincent, G. Richard.

    Fusion methods for speech enhancement and audio source separation, in: IEEE Transactions on Audio, Speech and Language Processing, April 2016.

    https://hal.archives-ouvertes.fr/hal-01120685
  • 18A. A. Nugraha, A. Liutkus, E. Vincent.

    Multichannel audio source separation with deep neural networks, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, June 2016, vol. 24, no 10, pp. 1652-1664. [ DOI : 10.1109/TASLP.2016.2580946 ]

    https://hal.inria.fr/hal-01163369
  • 19S. Ouni, S. Dahmani.

    Is markerless acquisition of speech production accurate ?, in: Journal of the Acoustical Society of America, May 2016, vol. 139, no 6.

    https://hal.inria.fr/hal-01315579
  • 20G. Sargent, F. Bimbot, E. Vincent.

    Estimating the structural segmentation of popular music pieces under regularity constraints, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017.

    https://hal.inria.fr/hal-01403210
  • 21E. Vincent, S. Watanabe, A. A. Nugraha, J. Barker, R. Marxer.

    An analysis of environment, microphone and data simulation mismatches in robust speech recognition, in: Computer Speech and Language, November 2016.

    https://hal.inria.fr/hal-01399180

Invited Conferences

  • 22E. Vincent.

    Séparation de sources: quand l'acoustique rencontre le machine learning, in: 13e Congrès Français d'Acoustique, Le Mans, France, April 2016.

    https://hal.inria.fr/hal-01398720

International Conferences with Proceedings

  • 23K. Bartkova, D. Jouvet, E. Delais-Roussarie.

    Prosodic Parameters and Prosodic Structures of French Emotional Data, in: Speech Prosody 2016, Boston, United States, Speech Prosody 2016, May 2016.

    https://hal.inria.fr/hal-01293516
  • 24N. Bertin, E. Camberlein, E. Vincent, R. Lebarbenchon, S. Peillon, É. Lamandé, S. Sivasankaran, F. Bimbot, I. Illina, A. Tom, S. Fleury, E. Jamet.

    A French corpus for distant-microphone speech processing in real homes, in: Interspeech 2016, San Francisco, United States, September 2016.

    https://hal.inria.fr/hal-01343060
  • 25M. Cadot, A. Bonneau.

    Du fichier audio à l’intonation en Français :Graphes pour l’apprentissage de 3 classes intonatives, in: Fouille de données complexes (FDC@EGC2016), Reims, France, Proceedings of FDC@EGC2016, January 2016.

    https://hal.archives-ouvertes.fr/hal-01292121
  • 26A. Currey, I. Illina, D. Fohr.

    Dynamic adjustment of language models for automatic speech recognition using word similarity , in: IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, CA, United States, proceeding of IEEE Workshop on Spoken Language Technology, December 2016.

    https://hal.archives-ouvertes.fr/hal-01384365
  • 27K. Déguernel, E. Vincent, G. Assayag.

    Using Multidimensional Sequences For Improvisation In The OMax Paradigm, in: 13th Sound and Music Computing Conference, Hamburg, Germany, August 2016.

    https://hal.inria.fr/hal-01346797
  • 28B. Elie, G. Chardon.

    Robust tonal and noise separation in presence of colored noise, and application to voiced fricatives, in: 22nd International Congress on Acoustics (ICA), Buenos Aires, Argentina, September 2016.

    https://hal.archives-ouvertes.fr/hal-01372313
  • 29B. Elie, Y. Laprie.

    A glottal chink model for the synthesis of voiced fricatives, in: International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, IEEE, March 2016.

    https://hal.archives-ouvertes.fr/hal-01314308
  • 30B. Elie, Y. Laprie.

    Copy synthesis of phrase-level utterances, in: EUSIPCO2016, Budapest, Hungary, August 2016.

    https://hal.archives-ouvertes.fr/hal-01278462
  • 31B. Elie, Y. Laprie.

    Copy synthesis of running speech based on vocal tract imaging and audio recording, in: 22nd International Congress on Acoustics (ICA), Buenos Aires, Argentina, September 2016.

    https://hal.archives-ouvertes.fr/hal-01372310
  • 32B. Elie, Y. Laprie, P.-A. Vuissoz, F. Odille.

    High spatiotemporal cineMRI films using compressed sensing for acquiring articulatory data, in: EUSIPCO2016, Budapest, Hungary, August 2016.

    https://hal.archives-ouvertes.fr/hal-01372320
  • 33B. Elizalde, A. Kumar, A. Shah, R. Badlani, E. Vincent, B. Raj, I. Lane.

    Experiments on the DCASE Challenge 2016: Acoustic scene classification and sound event detection in real life recording, in: DCASE2016 Workshop on Detection and Classification of Acoustic Scenes and Events, Budapest, Hungary, September 2016.

    https://hal.inria.fr/hal-01354007
  • 34D. Fitzgerald, A. Liutkus, R. Badeau.

    PROJET - Spatial Audio Separation Using Projections, in: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, IEEE, 2016.

    https://hal.archives-ouvertes.fr/hal-01248014
  • 35M. Fontaine, C. Vanwynsberghe, A. Liutkus, R. Badeau.

    Sketching for nearfield acoustic imaging of heavy-tailed sources, in: 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Grenoble, France, Proc. 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), February 2017.

    https://hal.archives-ouvertes.fr/hal-01401988
  • 36S. Ghosh, C. Fauth, A. Sini, Y. Laprie.

    L1-L2 Interference: The case of final devoicing of French voiced fricatives in final position by German learners, in: Interspeech 2016, San Francisco, United States, September 2016, vol. 2016, pp. 3156 - 3160. [ DOI : 10.21437/Interspeech.2016-954 ]

    https://hal.inria.fr/hal-01397176
  • 37S. Leglaive, U. Simsekli, A. Liutkus, R. Badeau, G. Richard.

    Alpha-Stable Multichannel Audio Source Separation, in: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, Proc. 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, March 2017.

    https://hal.archives-ouvertes.fr/hal-01416366
  • 38V. Q. Nguyen, F. Colas, E. Vincent, F. Charpillet.

    Localizing an Intermittent and Moving Sound Source Using a Mobile Robot, in: International Conference on Intelligent Robots and Systems (IROS), Deajeon, South Korea, October 2016.

    https://hal.archives-ouvertes.fr/hal-01354006
  • 39A. A. Nugraha, A. Liutkus, E. Vincent.

    Multichannel music separation with deep neural networks, in: European Signal Processing Conference (EUSIPCO), Budapest, Hungary, Proceedings of the 24th European Signal Processing Conference (EUSIPCO), August 2016, pp. 1748-1752.

    https://hal.inria.fr/hal-01334614
  • 40S. Ouni, V. Colotte, S. Dahmani, S. Azzi.

    Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech, in: Interspeech 2016, San Francisco, United States, ISCA, November 2016, vol. 2016, pp. 580 - 584. [ DOI : 10.21437/Interspeech.2016-730 ]

    https://hal.inria.fr/hal-01398528
  • 41A. Piquard-Kipffer.

    Storytelling with a digital album that use an avatar as narrator, in: XVIèmes rencontres internationales en orthophonie - Orthophonie et technologies innovantes, PARIS, France, XVIèmes rencontres internationales en orthophonie - Orthophonie et technologies innovantes, December 2016.

    https://hal.inria.fr/hal-01403204
  • 42D. Ribas, E. Vincent, J. R. Calvo.

    A study of speech distortion conditions in real scenarios for speech processing applications, in: 2016 IEEE Workshop on Spoken Language Technology, San Diego, United States, December 2016.

    https://hal.inria.fr/hal-01377638
  • 43G. Serrière, C. Cerisara, D. Fohr, O. Mella.

    Weakly-supervised text-to-speech alignment confidence measure, in: International Conference on Computational Linguistics (COLING), Osaka, Japan, Proceedings of the 26th International Conference on Computational Linguistics (COLING), December 2016.

    https://hal.archives-ouvertes.fr/hal-01378355
  • 44I. Sheikh, I. Illina, D. Fohr, G. Linares.

    Document Level Semantic Context for Retrieving OOV Proper Names, in: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), shanghai, China, Proceeding of IEEE ICASSP 2016, IEEE, March 2016, pp. 6050-6054. [ DOI : 10.1109/ICASSP.2016.7472839 ]

    https://hal.archives-ouvertes.fr/hal-01331716
  • 45I. Sheikh, I. Illina, D. Fohr, G. Linares.

    Improved Neural Bag-of-Words Model to Retrieve Out-of-Vocabulary Words in Speech Recognition, in: INTERSPEECH 2016, San Francisco, United States, Proceedings of INTERSPEECH 2016, September 2016, vol. 2016. [ DOI : 10.21437/Interspeech.2016-1219 ]

    https://hal.archives-ouvertes.fr/hal-01384488
  • 46I. Sheikh, I. Illina, D. Fohr, G. Linares.

    Learning Word Importance with the Neural Bag-of-Words Model, in: ACL, Representation Learning for NLP (Repl4NLP) workshop, Berlin, Germany, Proceedings of ACL 2016, August 2016.

    https://hal.archives-ouvertes.fr/hal-01331720
  • 47I. Sheikh, I. Illina, D. Fohr.

    How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News, in: LREC 2016, Portoroz, Slovenia, proceedings of LREC 2016, May 2016.

    https://hal.archives-ouvertes.fr/hal-01331714
  • 48A. J. R. Simpson, G. Roma, E. M. Grais, R. D. Mason, C. Hummersone, A. Liutkus, M. D. Plumbley.

    Evaluation of Audio Source Separation Models Using Hypothesis-Driven Non-Parametric Statistical Methods, in: European Signal Processing Conference, Budapest, Hungary, EURASIP, August 2016.

    https://hal.inria.fr/hal-01410176
  • 49S. Sivasankaran, E. Vincent, I. Illina.

    Discriminative importance weighting of augmented training data for acoustic model training, in: 42th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, United States, March 2017.

    https://hal.inria.fr/hal-01415759
  • 50F.-R. Stöter, A. Liutkus, R. Badeau, B. Edler, P. Magron.

    Common Fate Model for Unison source Separation, in: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, Proceedings of the 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2016.

    https://hal.archives-ouvertes.fr/hal-01248012
  • 51J. Trouvain, A. Bonneau, V. Colotte, C. Fauth, D. Fohr, D. Jouvet, J. Jügler, Y. Laprie, O. Mella, B. Möbius, F. Zimmerer.

    The IFCASL Corpus of French and German Non-native and Native Read Speech, in: LREC'2016, 10th edition of the Language Resources and Evaluation Conference, Portorož, Slovenia, Proceedings LREC'2016, May 2016.

    https://hal.inria.fr/hal-01293935
  • 52F. Zimmerer, A. Bonneau, B. Andreeva.

    Influence of L1 prominence on L2 production: French and German speakers, in: Speech Prosody 2016, Boston, United States, May 2016, vol. 2016, pp. 370 - 374. [ DOI : 10.21437/SpeechProsody.2016-76 ]

    https://hal.inria.fr/hal-01399974

National Conferences with Proceedings

  • 53B. Elie, Y. Laprie, P.-A. Vuissoz.

    Acquisition temps-réel de données articulatoires par IRM : application à la synthèse par copie, in: 13ème Congrès Français d'Acoustique (CFA 2016), Le Mans, France, SFA, April 2016.

    https://hal.archives-ouvertes.fr/hal-01314313

Conferences without Proceedings

  • 54F. Zimmerer, J. Trouvain, A. Bonneau.

    Methods of investigating vowel interferences of French learners of German, in: New Sounds 2016, Aarhus, Denmark, June 2016.

    https://hal.inria.fr/hal-01400005

Scientific Books (or Scientific Book chapters)

  • 55J. Barker, R. Marxer, E. Vincent, S. Watanabe.

    The CHiME challenges: Robust speech recognition in everyday environments, in: New era for robust speech recognition - Exploiting deep learning, Springer, October 2016.

    https://hal.inria.fr/hal-01383263
  • 56M. Cadot.

    Recoder les variables pour obtenir un modèle implicatif optimal, in: L'Analyse Statisqtique Implicative, R. Gras (editor), Cépaduès, December 2016.

    https://hal.archives-ouvertes.fr/hal-01398229

Internal Reports

  • 57P. Magron, R. Badeau, A. Liutkus.

    Generalized Wiener filtering for positive alpha-stable random variables, Télécom ParisTech, June 2016.

    https://hal.archives-ouvertes.fr/hal-01340797
  • 58G. Sargent, F. Bimbot, E. Vincent.

    Supplementary material to the article: Estimating the structural segmentation of popular music pieces under regularity constraints, IRISA-Inria, Campus de Beaulieu, 35042 Rennes cedex ; Inria Nancy, équipe Multispeech, September 2016.

    https://hal.inria.fr/hal-01368683

Scientific Popularization

Patents

  • 61S. Ouni, G. Gris.

    Dispositif de traitement d’image, January 2016, no 15 52058, Le rapport de recherche reconnait la brevetabilité.

    https://hal.inria.fr/hal-01294028

Other Publications

  • 62B. Dumortier, E. Vincent, M. Deaconu, P. Cornu.

    Efficient optimisation of wind power under acoustic constraints, November 2016, working paper or preprint.

    https://hal.inria.fr/hal-01393125
  • 63B. Elie, Y. Laprie.

    Acoustic impact of the glottal chink on the production of fricatives: A numerical study, December 2016, working paper or preprint.

    https://hal.archives-ouvertes.fr/hal-01423206
  • 64A. Piquard-Kipffer, T. Léonova.

    Parcours scolaire de 166 dysphasiques et/ou dyslexiques-dysorthographiques âgés de 6 à 20 ans en situation de handicap : Schooling experiences of 166 dysphasic or dyslexics-dysorthographic children, aged from 6 to 20 in a handicap situation, July 2016, working paper or preprint.

    https://hal.inria.fr/hal-01402986
  • 65A. Piquard-Kipffer, O. Mella, J. Miranda, D. Jouvet, L. Orosanu.

    Terminal portable de communication et affichage de la reconnaissance vocale. Enjeux et rapports à l'écrit. Étude préliminaire auprès d'adultes déficients auditifs, March 2016, 15p. p, In M.Frisch (Eds) Le réseau Idéki : Didactiques, métiers de l'humain et Intelligence collective. Nouveaux espaces et dispositifs en question. Nouveaux horizons en éducation, formation et en recherche. L'harmattan, Collection I.D.

    https://hal.inria.fr/hal-01239910
References in notes
  • 66A. Liutkus, R. Badeau.

    Generalized Wiener filtering with fractional power spectrograms, in: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2015, pp. 266–270.