EN FR
EN FR


Bibliography

Major publications by the team in recent years
  • 1F. Bahja, J. Di Martino, E. H. Ibn Elhaj, D. Aboutajdine.

    An overview of the CATE algorithms for real-time pitch determination, in: Signal, Image and Video Processing, 2013. [ DOI : 10.1007/s11760-013-0488-4 ]

    https://hal.inria.fr/hal-00831660
  • 2J. Barker, E. Vincent, N. Ma, H. Christensen, P. Green.

    The PASCAL CHiME Speech Separation and Recognition Challenge, in: Computer Speech and Language, February 2013, vol. 27, no 3, pp. 621-633. [ DOI : 10.1016/j.csl.2012.10.004 ]

    https://hal.inria.fr/hal-00743529
  • 3A. Bonneau, D. Fohr, I. Illina, D. Jouvet, O. Mella, L. Mesbahi, L. Orosanu.

    Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde, in: Traitement Automatique des Langues, 2013, vol. 53, no 3.

    https://hal.inria.fr/hal-00834278
  • 4D. Jouvet, D. Fohr.

    Combining Forward-based and Backward-based Decoders for Improved Speech Recognition Performance, in: InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Lyon, France, August 2013.

    https://hal.inria.fr/hal-00834282
  • 5A. Ozerov, M. Lagrange, E. Vincent.

    Uncertainty-based learning of acoustic models from noisy data, in: Computer Speech and Language, February 2013, vol. 27, no 3, pp. 874-894. [ DOI : 10.1016/j.csl.2012.07.002 ]

    https://hal.inria.fr/hal-00717992
  • 6A. Ozerov, E. Vincent, F. Bimbot.

    A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, in: IEEE Transactions on Audio, Speech and Language Processing, May 2012, vol. 20, no 4, pp. 1118 - 1133, 16.

    https://hal.archives-ouvertes.fr/hal-00626962
Publications of the year

Doctoral Dissertations and Habilitation Theses

  • 7A. Gorin.

    Acoustic Model Structuring for Improving Automatic Speech Recognition Performance, University of Lorraine, November 2014.

    https://hal.inria.fr/tel-01102029

Articles in International Peer-Reviewed Journals

  • 8A. Benichoux, L. S. R. Simon, E. Vincent, R. Gribonval.

    Convex regularizations for the simultaneous recording of room impulse responses, in: IEEE Transactions on Signal Processing, January 2014. [ DOI : 10.1109/TSP.2014.2303431 ]

    https://hal.inria.fr/hal-00934941
  • 9C. Fauth, A. Bonneau, O. Mella, V. Colotte, D. Fohr, D. Jouvet, Y. Laprie, J. Trouvain.

    Constitution d'un Corpus de Français Langue Etrangère destiné aux Apprenants Allemands, in: SHS Web of Conferences, July 2014, vol. 8, 14 p. [ DOI : 10.1051/shsconf/20140801186 ]

    https://hal.inria.fr/hal-01080630
  • 10N. Ito, E. Vincent, T. Nakatani, N. Ono, S. Araki, S. Sagayama.

    Blind suppression of nonstationary diffuse noise based on spatial covariance matrix decomposition, in: Journal of Signal Processing Systems, July 2014.

    https://hal.inria.fr/hal-01020255
  • 11Y. Laprie, R. Sock, B. Vaxelaire, B. Elie.

    Comment faire parler les images aux rayons X du conduit vocal ?, in: SHS Web of Conferences, July 2014, vol. 8, 14 p. [ DOI : 10.1051/shsconf/20140801344 ]

    https://hal.inria.fr/hal-01059887
  • 12N. Liu, A. Liutkus, J.-F. Aubry, L. Marsac, M. Tanter, L. Daudet.

    Random Calibration for Accelerating MR-ARFI Guided Ultrasonic Focusing in Transcranial Therapy, in: Physics in Medicine and Biology, January 2015, vol. 60, no 3, 21 p. [ DOI : 10.1088/0031-9155/60/3/1069 ]

    https://hal.inria.fr/hal-01104616
  • 13A. Liutkus, D. Fitzgerald, Z. Rafii, B. Pardo, L. Daudet.

    Kernel Additive Models for Source Separation, in: IEEE Transactions on Signal Processing, June 2014. [ DOI : 10.1109/TSP.2014.2332434 ]

    https://hal.inria.fr/hal-01011044
  • 14A. Liutkus, D. Martina, S. Popoff, G. Chardon, O. Katz, G. Lerosey, S. Gigan, L. Daudet, I. Carron.

    Imaging With Nature: Compressive Imaging Using a Multiply Scattering Medium, in: Scientific Reports, July 2014, vol. 4. [ DOI : 10.1038/srep05552 ]

    https://hal.inria.fr/hal-01025647
  • 15S. Raczynski, E. Vincent.

    Genre-based music language modelling with latent hierarchical Pitman-Yor process allocation, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, January 2014, vol. 22, no 3, pp. 672-681.

    https://hal.inria.fr/hal-00804567
  • 16E. Vincent, N. Bertin, R. Gribonval, F. Bimbot.

    From blind to guided audio source separation: How models and side information can improve the separation of sound, in: IEEE Signal Processing Magazine, May 2014, vol. 31, no 3, pp. 107-115.

    https://hal.inria.fr/hal-00922378

Invited Conferences

  • 17E. Vincent, A. Sini, F. Charpillet.

    Audio source localization by optimal control of a mobile robot, in: 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, April 2015.

    https://hal.inria.fr/hal-01103949

International Conferences with Proceedings

  • 18K. Bartkova, D. Jouvet.

    Links between Manual Punctuation Marks and Automatically Detected Prosodic Structures, in: Speech Prosody 2014, Dublin, Ireland, May 2014.

    https://hal.archives-ouvertes.fr/hal-00998031
  • 19J. Beliao, A. Liutkus.

    OOPS: une approche orientée objet pour l'interrogation et l'analyse linguistique de l'interface prosodie/syntaxe/discours, in: 4e Congrès Mondial de Linguistique Française, Berlin, Germany, July 2014, vol. 8, pp. 2565-2581. [ DOI : 10.1051/shsconf/20140801273 ]

    https://hal.archives-ouvertes.fr/hal-01053422
  • 20F. Bimbot, G. Sargent, E. Deruty, C. Guichaoua, E. Vincent.

    Semiotic Description of Music Structure: an Introduction to the Quaero/Metiss Structural Annotations, in: AES 53rd International Conference on Semantic Audio, London, United Kingdom, January 2014, 12 p, P1-1.

    https://hal.archives-ouvertes.fr/hal-00931859
  • 21B. Dumortier, E. Vincent.

    Blind RT60 estimation robust across room sizes and source distances, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Firenze, Italy, May 2014.

    https://hal.inria.fr/hal-00941061
  • 22B. Elie, Y. Laprie.

    Audiovisual to area and length functions inversion of human tract , in: Eusipco 2014, Lisbonne, Portugal, September 2014.

    https://hal.inria.fr/hal-01096547
  • 23C. Fauth, A. Bonneau.

    L1-L2 interference: the case of devoicing of French voiced obstruents in final position by German learners - Pilot study, in: International Workshop on Multilinguality in Speech Research: Data, Methods and Models, Dagstuhl, Germany, Bernd Möbius et Jürgen Trouvain, Université de la Sarre, Allemagne, April 2014.

    https://hal.inria.fr/hal-01095183
  • 24C. Fauth, A. Bonneau, F. Zimmerer, J. Trouvain, B. Andreeva, V. Colotte, D. Fohr, D. Jouvet, J. Jügler, Y. Laprie, O. Mella, B. Möbius.

    Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process, in: LREC - 9th Language Resources and Evaluation Conference, Reykjavik, Iceland, The European Language Resources Association, May 2014.

    https://hal.inria.fr/hal-00979026
  • 25D. Fitzgerald, A. Liutkus, Z. Rafii, B. Pardo, L. Daudet.

    Harmonic/Percussive Separation Using Kernel Additive Modelling, in: IET Irish Signals & Systems Conference 2014, Limerick, Ireland, June 2014.

    https://hal.inria.fr/hal-01000001
  • 26A. Gorin, D. Jouvet.

    Component Structuring and Trajectory Modeling for Speech Recognition, in: Interspeech, Singapoore, Singapore, September 2014.

    https://hal.inria.fr/hal-01063653
  • 27A. Gorin, D. Jouvet.

    Explicit trajectories and speaker class modeling for child and adult speech recognition, in: XXXème édition des Journées d'Etudes sur la Parole, Le Mans, France, June 2014.

    https://hal.inria.fr/hal-01080343
  • 28A. Gorin, D. Jouvet.

    Structured GMM Based on Unsupervised Clustering for Recognizing Adult and Child Speech, in: SLSP - 2nd International Conference on Statistical Language and Speech Processing, Grenoble, France, October 2014, pp. 108 - 119. [ DOI : 10.1007/978-3-319-11397-5_8 ]

    https://hal.inria.fr/hal-01090472
  • 29A. Gorin, D. Jouvet, E. Vincent, D. Tran.

    Investigating Stranded GMM for Improving Automatic Speech Recognition, in: 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), Nancy, France, May 2014.

    https://hal.inria.fr/hal-01003054
  • 30I. Illina, D. Fohr, G. Linares.

    Extension du vocabulaire d’un système de transcription avec de nouveaux noms propres en utilisant un corpus diachronique, in: Journées d'Etude sur la parole, Le Mans, France, June 2014.

    https://hal.inria.fr/hal-01092214
  • 31I. Illina, D. Fohr, G. Linares.

    Proper Name Retrieval from Diachronic Documents for Automatic Speech Transcription using Lexical and Temporal Context, in: Workshop on Speech, Language and Audio in Multimedia, Penang, Malaysia, September 2014.

    https://hal.inria.fr/hal-01092224
  • 32X. Jaureguiberry, E. Vincent, G. Richard.

    Multiple-order non-negative matrix factorization for speech enhancement, in: Interspeech, Singapore, June 2014, 4 p.

    https://hal.archives-ouvertes.fr/hal-01023399
  • 33X. Jaureguiberry, E. Vincent, G. Richard.

    Variational Bayesian model averaging for audio source separation, in: SSP (IEEE Workshop on Statistical Signal Processing), Australia, June 2014, 4 p.

    https://hal.archives-ouvertes.fr/hal-00986909
  • 34D. Jouvet, D. Fohr.

    About Combining Forward and Backward-Based Decoders for Selecting Data for Unsupervised Training of Acoustic Models, in: INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapour, Singapore, September 2014.

    https://hal.inria.fr/hal-01090483
  • 35S. Kırbız, A. Ozerov, A. Liutkus, L. Girin.

    Perceptual coding-based informed source separation, in: 22nd European Signal Processing Conference (EUSIPCO-2014), Lisbonne, Portugal, September 2014.

    https://hal.inria.fr/hal-01016314
  • 36O. Lachhab, J. Di Martino, E. H. Ibn Elhaj, A. Hammouch.

    Improving the recognition of pathological voice using the discriminant HLDA transformation, in: 3rd International IEEE Colloquium on Information Science and Technology, Tetuan-Chefchaouen, Morocco, October 2014.

    https://hal.inria.fr/hal-01093309
  • 37Y. Laprie, M. Aron, M.-O. Berger, B. Wrobel-Dautcourt.

    Studying MRI acquisition protocols of sustained sounds with a multimodal acquisition system, in: 10th International Seminar on Speech Production (ISSP), Köln, Germany, May 2014.

    https://hal.inria.fr/hal-01002121
  • 38Y. Laprie, B. Vaxelaire, M. Cadot.

    Geometric articulatory model adapted to the production of consonants, in: 10th International Seminar on Speech Production (ISSP), Köln, Germany, May 2014.

    https://hal.inria.fr/hal-01002125
  • 39A. Liutkus, R. Badeau.

    Generalized Wiener filtering with fractional power spectrograms, in: 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, IEEE, April 2015.

    https://hal.archives-ouvertes.fr/hal-01110028
  • 40A. Liutkus, D. Fitzgerald, Z. Rafii.

    Scalable audio separation with light kernel additive modelling, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, IEEE, April 2015.

    https://hal.inria.fr/hal-01114890
  • 41A. Liutkus, D. Martina, S. Gigan, L. Daudet.

    Compressed sensing under strong noise. Application to imaging through multiply scattering media, in: European Signal Processing Conference (EUSIPCO), Lisbon, Portugal, September 2014.

    https://hal.inria.fr/hal-01074786
  • 42A. Liutkus, Z. Rafii, B. Pardo, D. Fitzgerald, L. Daudet.

    Kernel Spectrogram models for source separation, in: HSCMA, Nancy, France, May 2014.

    https://hal.inria.fr/hal-00959384
  • 43U. Musti, S. Ouni, Z. Ziheng.

    3D Visual Speech Animation from Image Sequences, in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP), Bangalore, India, ACM, December 2014.

    https://hal.archives-ouvertes.fr/hal-01086073
  • 44L. Orosanu, D. Jouvet.

    Combining words and syllables for speech transcription, in: XXXème édition des Journées d'Etudes sur la Parole, Le Mans, France, June 2014.

    https://hal.inria.fr/hal-01080351
  • 45L. Orosanu, D. Jouvet.

    Hybrid language models for speech transcription, in: INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapour, Singapore, September 2014.

    https://hal.inria.fr/hal-01090478
  • 46N. Souviraà-Labastie, A. Olivero, E. Vincent, F. Bimbot.

    Audio source separation using multiple deformed references, in: Eusipco, Lisboa, Portugal, September 2014.

    https://hal.inria.fr/hal-01017571
  • 47N. Souviraà-Labastie, E. Vincent, F. Bimbot.

    Music separation guided by cover tracks: designing the joint NMF model, in: 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, April 2015.

    https://hal.archives-ouvertes.fr/hal-01108675
  • 48I. Steiner, P. Knopp, S. Musche, A. Schmiedel, A. Braun, S. Ouni.

    Investigating the effects of posture and noise on speech production, in: 10th International Seminar on Speech Production (ISSP), Cologne, Germany, Susanne Fuchs, Martine Grice, Anne Hermes, Leonardo Lancia, Doris Mücke, May 2014.

    https://hal.archives-ouvertes.fr/hal-01086066
  • 49D. Tran, N. Ono, E. Vincent.

    Fast DNN training based on auxiliary function technique, in: 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Queensland, Australia, April 2015.

    https://hal.inria.fr/hal-01107809
  • 50D. Tran, E. Vincent, D. Jouvet.

    Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.

    https://hal.inria.fr/hal-00954654
  • 51D. Tran, E. Vincent, D. Jouvet.

    Fusion of Multiple Uncertainty Estimators and Propagators for Noise Robust ASR, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.

    https://hal.inria.fr/hal-00955185
  • 52D. Tran, E. Vincent, D. Jouvet.

    Discriminative uncertainty estimation for noise robust ASR, in: 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Queensland, Australia, April 2015.

    https://hal.inria.fr/hal-01103969
  • 53E. Vincent, A. Gkiokas, D. Schnitzer, A. Flexer.

    An investigation of likelihood normalization for robust ASR, in: Interspeech, Singapore, Singapore, September 2014.

    https://hal.inria.fr/hal-01006142

National Conferences with Proceedings

  • 54M. Cadot, Y. Laprie.

    Méthodologie 3-way d'extraction d'un modèle articulatoire de la parole à partir des données d'un locuteur, in: Atelier Fouille de Données Complexes des 14èmes Journées Francophones "Extraction et Gestion des Connaissances", Rennes, France, January 2014, pp. 1-12.

    https://hal.archives-ouvertes.fr/hal-00934436
  • 55J. Thiemann, E. Vincent, S. Van De Par.

    Spatial properties of the DEMAND noise recordings, in: 40th Annual German Congress on Acoustics (DAGA 2014), Oldenburg, Germany, March 2014.

    https://hal.inria.fr/hal-00985979

Conferences without Proceedings

  • 56P.-A. Vuissoz, F. Odille, Y. Laprie, E. Vincent, G. Hossu, J. Felblinger.

    Speech Cine SSFP with optical microphone synchronization and motion compensated reconstruction, in: ISMRM Workshop on Motion Correction in MRI, Tromso, Norway, July 2014.

    https://hal.inria.fr/hal-00994526
  • 57P.-A. Vuissoz, F. Odille, E. Vincent, J. Felblinger, Y. Laprie.

    Synchronisation vocale et mouvement compensé en reconstruction pour une ciné IRM de la parole, in: 2e Congrès de la SFRMBM, Grenoble, France, March 2015.

    https://hal.inria.fr/hal-01104230

Scientific Books (or Scientific Book chapters)

  • 58Z. Rafii, A. Liutkus, B. Pardo.

    REPET for Background/Foreground Separation in Audio, in: Blind Source Separation, G. Naik, W. Wang (editors), Springer Berlin Heidelberg, 2014, pp. 395-411. [ DOI : 10.1007/978-3-642-55016-4_14 ]

    https://hal.inria.fr/hal-01025563

Internal Reports

Scientific Popularization

  • 62E. Vincent.

    Les sons à domicile, April 2014, Séminaire SAILOR "Imaginer des nouveaux lieux de vie", Séminaire SAILOR "Imaginer des nouveaux lieux de vie".

    https://hal.inria.fr/hal-00977674

Other Publications

  • 63A. Bonneau.

    Phonetic variation in non-native speech, April 2014, Spring School : "Individual-centered Approaches to Speech Processing".

    https://hal.inria.fr/hal-01095804
  • 64A. Piquard-Kipffer.

    Critères d’évaluation d’un album numérique pour des enfants en difficulté de langage, December 2014, pp. 287-309, In M. Frisch (Eds) Le réseau Idéki : objets de recherche, d’éducation et de formation émergents, problématisés, mis en tension, réélaborés. Préface de Joël Lebeaume. Paris : L’harmattan, Collection I.D, 287-309.

    https://hal.inria.fr/hal-01097278
  • 65Y. Salaün, E. Vincent, N. Bertin, N. Souviraà-Labastie, X. Jaureguiberry, D. T. Tran, F. Bimbot.

    The Flexible Audio Source Separation Toolbox Version 2.0, May 2014, ICASSP.

    https://hal.inria.fr/hal-00957412
  • 66N. Souviraà-Labastie, A. Olivero, E. Vincent, F. Bimbot.

    Multi-channel audio source separation using multiple deformed references, November 2014.

    https://hal.inria.fr/hal-01070298
  • 67D. T. Tran, E. Vincent, D. Jouvet.

    Nonparametric uncertainty estimation and propagation for noise robust ASR, January 2015.

    https://hal.inria.fr/hal-01114329
  • 68E. Vincent.

    Evaluation campaigns and reproducibility, January 2014, Journée GdR ISIS "reproductibilité en traitement du signal et des images".

    https://hal.inria.fr/hal-00927741
References in notes
  • 69F. Bahja.

    Détection du fondamental de la parole en temps réel : application aux voix pathologiques, Université Mohammed V-Agdal UFR Informatique et Télécommunications Laboratoire LRIT Unité associée au CNRST, URAC 29, Faculté des sciences, June 2013.

    https://tel.archives-ouvertes.fr/tel-00927147
  • 70D. Fohr, O. Mella.

    CoALT: A Software for Comparing Automatic Labelling Tools, in: Language Resources and Evaluation LREC 2012, Istanbul, Turkey, May 2012, pp. 325-328.

    https://hal.archives-ouvertes.fr/hal-00761781
  • 71D. Jouvet, D. Fohr.

    Analysis and Combination of Forward and Backward based Decoders for Improved Speech Transcription, in: TSD - 16th International Conference on Text, Speech and Dialogue - 2013, Pilsen, Czech Republic, I. Habernal, V. Matoušek (editors), Lecture Notes in Artificial Intelligence, Springer Verlag, September 2013, vol. 8082, pp. 84-91.

    https://hal.inria.fr/hal-00834296
  • 72S. Ouni, L. Mangeonjean, I. Steiner.

    VisArtico: a visualization tool for articulatory data, in: 13th Annual Conference of the International Speech Communication Association - InterSpeech 2012, Portland, OR, United States, September 2012.

    https://hal.inria.fr/hal-00730733