Bibliography
Major publications by the team in recent years
-
1F. Bahja, J. Di Martino, E. H. Ibn Elhaj, D. Aboutajdine.
An overview of the CATE algorithms for real-time pitch determination, in: Signal, Image and Video Processing, 2013. [ DOI : 10.1007/s11760-013-0488-4 ]
https://hal.inria.fr/hal-00831660 -
2J. Barker, E. Vincent, N. Ma, H. Christensen, P. Green.
The PASCAL CHiME Speech Separation and Recognition Challenge, in: Computer Speech and Language, February 2013, vol. 27, no 3, pp. 621-633. [ DOI : 10.1016/j.csl.2012.10.004 ]
https://hal.inria.fr/hal-00743529 -
3A. Bonneau, D. Fohr, I. Illina, D. Jouvet, O. Mella, L. Mesbahi, L. Orosanu.
Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde, in: Traitement Automatique des Langues, 2013, vol. 53, no 3.
https://hal.inria.fr/hal-00834278 -
4D. Jouvet, D. Fohr.
Combining Forward-based and Backward-based Decoders for Improved Speech Recognition Performance, in: InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Lyon, France, August 2013.
https://hal.inria.fr/hal-00834282 -
5A. Ozerov, M. Lagrange, E. Vincent.
Uncertainty-based learning of acoustic models from noisy data, in: Computer Speech and Language, February 2013, vol. 27, no 3, pp. 874-894. [ DOI : 10.1016/j.csl.2012.07.002 ]
https://hal.inria.fr/hal-00717992 -
6A. Ozerov, E. Vincent, F. Bimbot.
A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, in: IEEE Transactions on Audio, Speech and Language Processing, May 2012, vol. 20, no 4, pp. 1118 - 1133, 16.
https://hal.archives-ouvertes.fr/hal-00626962 -
7A. Piquard-Kipffer, L. Sprenger-Charolles.
Predicting reading level at the end of Grade 2 from skills assessed in kindergarten: contribution of phonemic discrimination (Follow-up of 85 French-speaking children from 4 to 8 years old), in: Topics in Cognitive Psychology, 2013.
https://hal.inria.fr/hal-00833951
Doctoral Dissertations and Habilitation Theses
-
8I. Sheikh.
Exploiting Semantic and Topic Context to Improve Recognition of Proper Names in Diachronic Audio Documents, Université de Lorraine, November 2016.
https://hal.archives-ouvertes.fr/tel-01400694
Articles in International Peer-Reviewed Journals
-
9K. Adiloğlu, E. Vincent.
Variational Bayesian Inference for Source Separation and Robust Feature Extraction, in: IEEE Transactions on Audio Speech and Language Processing, June 2016. [ DOI : 10.1109/TASLP.2016.2583794 ]
https://hal.inria.fr/hal-00726146 -
10M. Aron, M.-O. Berger, E. Kerrien, B. Wrobel-Dautcourt, B. Potard, Y. Laprie.
Multimodal acquisition of articulatory data: Geometrical and temporal registration, in: Journal of the Acoustical Society of America, 2016, vol. 139, no 2, 13 p. [ DOI : 10.1121/1.4940666 ]
https://hal.inria.fr/hal-01269578 -
11J. Barker, R. Marxer, E. Vincent, S. Watanabe.
The third 'CHIME' speech separation and recognition challenge: Analysis and outcomes, in: Computer Speech and Language, October 2016.
https://hal.inria.fr/hal-01382108 -
12F. Bimbot, E. Deruty, G. Sargent, E. Vincent.
System & Contrast : A Polymorphous Model of the Inner Organization of Structural Segments within Music Pieces, in: Music Perception, 2016, 41 p.
https://hal.inria.fr/hal-01188244 -
13M. Cadot, Y. Laprie.
Extraction d’un modèle articulatoire à partir d’une analyse tri-directionnelle de cinéradiographies d’un locuteur, in: Revue des Nouvelles Technologies de l'Information, 2016, vol. Fouille de Données Complexes, no RNTI-E-31, pp. 73-92.
https://hal.archives-ouvertes.fr/hal-01346987 -
14B. Elie, Y. Laprie.
Extension of the single-matrix formulation of the vocal tract: consideration of bilateral channels and connection of self-oscillating models of the vocal folds with a glottal chink, in: Speech Communication, September 2016, vol. 82, pp. 85-96. [ DOI : 10.1016/j.specom.2016.06.002 ]
https://hal.archives-ouvertes.fr/hal-01199792 -
15D. Fitzgerald, A. Liutkus, R. Badeau.
Projection-based demixing of spatial audio, in: IEEE Transactions on Audio, Speech and Language Processing, May 2016.
https://hal.inria.fr/hal-01260588 -
16S. Gannot, E. Vincent, S. Markovich-Golan, A. Ozerov.
A consolidated perspective on multi-microphone speech enhancement and source separation, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, December 2016.
https://hal.inria.fr/hal-01414179 -
17X. Jaureguiberry, E. Vincent, G. Richard.
Fusion methods for speech enhancement and audio source separation, in: IEEE Transactions on Audio, Speech and Language Processing, April 2016.
https://hal.archives-ouvertes.fr/hal-01120685 -
18A. A. Nugraha, A. Liutkus, E. Vincent.
Multichannel audio source separation with deep neural networks, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, June 2016, vol. 24, no 10, pp. 1652-1664. [ DOI : 10.1109/TASLP.2016.2580946 ]
https://hal.inria.fr/hal-01163369 -
19S. Ouni, S. Dahmani.
Is markerless acquisition of speech production accurate ?, in: Journal of the Acoustical Society of America, May 2016, vol. 139, no 6.
https://hal.inria.fr/hal-01315579 -
20G. Sargent, F. Bimbot, E. Vincent.
Estimating the structural segmentation of popular music pieces under regularity constraints, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017.
https://hal.inria.fr/hal-01403210 -
21E. Vincent, S. Watanabe, A. A. Nugraha, J. Barker, R. Marxer.
An analysis of environment, microphone and data simulation mismatches in robust speech recognition, in: Computer Speech and Language, November 2016.
https://hal.inria.fr/hal-01399180
Invited Conferences
-
22E. Vincent.
Séparation de sources: quand l'acoustique rencontre le machine learning, in: 13e Congrès Français d'Acoustique, Le Mans, France, April 2016.
https://hal.inria.fr/hal-01398720
International Conferences with Proceedings
-
23K. Bartkova, D. Jouvet, E. Delais-Roussarie.
Prosodic Parameters and Prosodic Structures of French Emotional Data, in: Speech Prosody 2016, Boston, United States, Speech Prosody 2016, May 2016.
https://hal.inria.fr/hal-01293516 -
24N. Bertin, E. Camberlein, E. Vincent, R. Lebarbenchon, S. Peillon, É. Lamandé, S. Sivasankaran, F. Bimbot, I. Illina, A. Tom, S. Fleury, E. Jamet.
A French corpus for distant-microphone speech processing in real homes, in: Interspeech 2016, San Francisco, United States, September 2016.
https://hal.inria.fr/hal-01343060 -
25M. Cadot, A. Bonneau.
Du fichier audio à l’intonation en Français :Graphes pour l’apprentissage de 3 classes intonatives, in: Fouille de données complexes (FDC@EGC2016), Reims, France, Proceedings of FDC@EGC2016, January 2016.
https://hal.archives-ouvertes.fr/hal-01292121 -
26A. Currey, I. Illina, D. Fohr.
Dynamic adjustment of language models for automatic speech recognition using word similarity , in: IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, CA, United States, proceeding of IEEE Workshop on Spoken Language Technology, December 2016.
https://hal.archives-ouvertes.fr/hal-01384365 -
27K. Déguernel, E. Vincent, G. Assayag.
Using Multidimensional Sequences For Improvisation In The OMax Paradigm, in: 13th Sound and Music Computing Conference, Hamburg, Germany, August 2016.
https://hal.inria.fr/hal-01346797 -
28B. Elie, G. Chardon.
Robust tonal and noise separation in presence of colored noise, and application to voiced fricatives, in: 22nd International Congress on Acoustics (ICA), Buenos Aires, Argentina, September 2016.
https://hal.archives-ouvertes.fr/hal-01372313 -
29B. Elie, Y. Laprie.
A glottal chink model for the synthesis of voiced fricatives, in: International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, IEEE, March 2016.
https://hal.archives-ouvertes.fr/hal-01314308 -
30B. Elie, Y. Laprie.
Copy synthesis of phrase-level utterances, in: EUSIPCO2016, Budapest, Hungary, August 2016.
https://hal.archives-ouvertes.fr/hal-01278462 -
31B. Elie, Y. Laprie.
Copy synthesis of running speech based on vocal tract imaging and audio recording, in: 22nd International Congress on Acoustics (ICA), Buenos Aires, Argentina, September 2016.
https://hal.archives-ouvertes.fr/hal-01372310 -
32B. Elie, Y. Laprie, P.-A. Vuissoz, F. Odille.
High spatiotemporal cineMRI films using compressed sensing for acquiring articulatory data, in: EUSIPCO2016, Budapest, Hungary, August 2016.
https://hal.archives-ouvertes.fr/hal-01372320 -
33B. Elizalde, A. Kumar, A. Shah, R. Badlani, E. Vincent, B. Raj, I. Lane.
Experiments on the DCASE Challenge 2016: Acoustic scene classification and sound event detection in real life recording, in: DCASE2016 Workshop on Detection and Classification of Acoustic Scenes and Events, Budapest, Hungary, September 2016.
https://hal.inria.fr/hal-01354007 -
34D. Fitzgerald, A. Liutkus, R. Badeau.
PROJET - Spatial Audio Separation Using Projections, in: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, IEEE, 2016.
https://hal.archives-ouvertes.fr/hal-01248014 -
35M. Fontaine, C. Vanwynsberghe, A. Liutkus, R. Badeau.
Sketching for nearfield acoustic imaging of heavy-tailed sources, in: 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Grenoble, France, Proc. 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), February 2017.
https://hal.archives-ouvertes.fr/hal-01401988 -
36S. Ghosh, C. Fauth, A. Sini, Y. Laprie.
L1-L2 Interference: The case of final devoicing of French voiced fricatives in final position by German learners, in: Interspeech 2016, San Francisco, United States, September 2016, vol. 2016, pp. 3156 - 3160. [ DOI : 10.21437/Interspeech.2016-954 ]
https://hal.inria.fr/hal-01397176 -
37S. Leglaive, U. Simsekli, A. Liutkus, R. Badeau, G. Richard.
Alpha-Stable Multichannel Audio Source Separation, in: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, Proc. 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, March 2017.
https://hal.archives-ouvertes.fr/hal-01416366 -
38V. Q. Nguyen, F. Colas, E. Vincent, F. Charpillet.
Localizing an Intermittent and Moving Sound Source Using a Mobile Robot, in: International Conference on Intelligent Robots and Systems (IROS), Deajeon, South Korea, October 2016.
https://hal.archives-ouvertes.fr/hal-01354006 -
39A. A. Nugraha, A. Liutkus, E. Vincent.
Multichannel music separation with deep neural networks, in: European Signal Processing Conference (EUSIPCO), Budapest, Hungary, Proceedings of the 24th European Signal Processing Conference (EUSIPCO), August 2016, pp. 1748-1752.
https://hal.inria.fr/hal-01334614 -
40S. Ouni, V. Colotte, S. Dahmani, S. Azzi.
Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech, in: Interspeech 2016, San Francisco, United States, ISCA, November 2016, vol. 2016, pp. 580 - 584. [ DOI : 10.21437/Interspeech.2016-730 ]
https://hal.inria.fr/hal-01398528 -
41A. Piquard-Kipffer.
Storytelling with a digital album that use an avatar as narrator, in: XVIèmes rencontres internationales en orthophonie - Orthophonie et technologies innovantes, PARIS, France, XVIèmes rencontres internationales en orthophonie - Orthophonie et technologies innovantes, December 2016.
https://hal.inria.fr/hal-01403204 -
42D. Ribas, E. Vincent, J. R. Calvo.
A study of speech distortion conditions in real scenarios for speech processing applications, in: 2016 IEEE Workshop on Spoken Language Technology, San Diego, United States, December 2016.
https://hal.inria.fr/hal-01377638 -
43G. Serrière, C. Cerisara, D. Fohr, O. Mella.
Weakly-supervised text-to-speech alignment confidence measure, in: International Conference on Computational Linguistics (COLING), Osaka, Japan, Proceedings of the 26th International Conference on Computational Linguistics (COLING), December 2016.
https://hal.archives-ouvertes.fr/hal-01378355 -
44I. Sheikh, I. Illina, D. Fohr, G. Linares.
Document Level Semantic Context for Retrieving OOV Proper Names, in: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), shanghai, China, Proceeding of IEEE ICASSP 2016, IEEE, March 2016, pp. 6050-6054. [ DOI : 10.1109/ICASSP.2016.7472839 ]
https://hal.archives-ouvertes.fr/hal-01331716 -
45I. Sheikh, I. Illina, D. Fohr, G. Linares.
Improved Neural Bag-of-Words Model to Retrieve Out-of-Vocabulary Words in Speech Recognition, in: INTERSPEECH 2016, San Francisco, United States, Proceedings of INTERSPEECH 2016, September 2016, vol. 2016. [ DOI : 10.21437/Interspeech.2016-1219 ]
https://hal.archives-ouvertes.fr/hal-01384488 -
46I. Sheikh, I. Illina, D. Fohr, G. Linares.
Learning Word Importance with the Neural Bag-of-Words Model, in: ACL, Representation Learning for NLP (Repl4NLP) workshop, Berlin, Germany, Proceedings of ACL 2016, August 2016.
https://hal.archives-ouvertes.fr/hal-01331720 -
47I. Sheikh, I. Illina, D. Fohr.
How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News, in: LREC 2016, Portoroz, Slovenia, proceedings of LREC 2016, May 2016.
https://hal.archives-ouvertes.fr/hal-01331714 -
48A. J. R. Simpson, G. Roma, E. M. Grais, R. D. Mason, C. Hummersone, A. Liutkus, M. D. Plumbley.
Evaluation of Audio Source Separation Models Using Hypothesis-Driven Non-Parametric Statistical Methods, in: European Signal Processing Conference, Budapest, Hungary, EURASIP, August 2016.
https://hal.inria.fr/hal-01410176 -
49S. Sivasankaran, E. Vincent, I. Illina.
Discriminative importance weighting of augmented training data for acoustic model training, in: 42th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, United States, March 2017.
https://hal.inria.fr/hal-01415759 -
50F.-R. Stöter, A. Liutkus, R. Badeau, B. Edler, P. Magron.
Common Fate Model for Unison source Separation, in: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, Proceedings of the 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2016.
https://hal.archives-ouvertes.fr/hal-01248012 -
51J. Trouvain, A. Bonneau, V. Colotte, C. Fauth, D. Fohr, D. Jouvet, J. Jügler, Y. Laprie, O. Mella, B. Möbius, F. Zimmerer.
The IFCASL Corpus of French and German Non-native and Native Read Speech, in: LREC'2016, 10th edition of the Language Resources and Evaluation Conference, Portorož, Slovenia, Proceedings LREC'2016, May 2016.
https://hal.inria.fr/hal-01293935 -
52F. Zimmerer, A. Bonneau, B. Andreeva.
Influence of L1 prominence on L2 production: French and German speakers, in: Speech Prosody 2016, Boston, United States, May 2016, vol. 2016, pp. 370 - 374. [ DOI : 10.21437/SpeechProsody.2016-76 ]
https://hal.inria.fr/hal-01399974
National Conferences with Proceedings
-
53B. Elie, Y. Laprie, P.-A. Vuissoz.
Acquisition temps-réel de données articulatoires par IRM : application à la synthèse par copie, in: 13ème Congrès Français d'Acoustique (CFA 2016), Le Mans, France, SFA, April 2016.
https://hal.archives-ouvertes.fr/hal-01314313
Conferences without Proceedings
-
54F. Zimmerer, J. Trouvain, A. Bonneau.
Methods of investigating vowel interferences of French learners of German, in: New Sounds 2016, Aarhus, Denmark, June 2016.
https://hal.inria.fr/hal-01400005
Scientific Books (or Scientific Book chapters)
-
55J. Barker, R. Marxer, E. Vincent, S. Watanabe.
The CHiME challenges: Robust speech recognition in everyday environments, in: New era for robust speech recognition - Exploiting deep learning, Springer, October 2016.
https://hal.inria.fr/hal-01383263 -
56M. Cadot.
Recoder les variables pour obtenir un modèle implicatif optimal, in: L'Analyse Statisqtique Implicative, R. Gras (editor), Cépaduès, December 2016.
https://hal.archives-ouvertes.fr/hal-01398229
Internal Reports
-
57P. Magron, R. Badeau, A. Liutkus.
Generalized Wiener filtering for positive alpha-stable random variables, Télécom ParisTech, June 2016.
https://hal.archives-ouvertes.fr/hal-01340797 -
58G. Sargent, F. Bimbot, E. Vincent.
Supplementary material to the article: Estimating the structural segmentation of popular music pieces under regularity constraints, IRISA-Inria, Campus de Beaulieu, 35042 Rennes cedex ; Inria Nancy, équipe Multispeech, September 2016.
https://hal.inria.fr/hal-01368683
Scientific Popularization
-
59A. Liutkus, E. Vincent.
Démixer la musique, in: Interstices, January 2016.
https://hal.inria.fr/hal-01350450 -
60A. Piquard-Kipffer.
Faire voir une histoire : Louis et son incroyable chien Noisette, in: Les Cahiers Pédagogiques, February 2016, vol. Hors série numérique N°42, 7 p.
https://hal.inria.fr/hal-01191878
Patents
-
61S. Ouni, G. Gris.
Dispositif de traitement d’image, January 2016, no 15 52058, Le rapport de recherche reconnait la brevetabilité.
https://hal.inria.fr/hal-01294028
Other Publications
-
62B. Dumortier, E. Vincent, M. Deaconu, P. Cornu.
Efficient optimisation of wind power under acoustic constraints, November 2016, working paper or preprint.
https://hal.inria.fr/hal-01393125 -
63B. Elie, Y. Laprie.
Acoustic impact of the glottal chink on the production of fricatives: A numerical study, December 2016, working paper or preprint.
https://hal.archives-ouvertes.fr/hal-01423206 -
64A. Piquard-Kipffer, T. Léonova.
Parcours scolaire de 166 dysphasiques et/ou dyslexiques-dysorthographiques âgés de 6 à 20 ans en situation de handicap : Schooling experiences of 166 dysphasic or dyslexics-dysorthographic children, aged from 6 to 20 in a handicap situation, July 2016, working paper or preprint.
https://hal.inria.fr/hal-01402986 -
65A. Piquard-Kipffer, O. Mella, J. Miranda, D. Jouvet, L. Orosanu.
Terminal portable de communication et affichage de la reconnaissance vocale. Enjeux et rapports à l'écrit. Étude préliminaire auprès d'adultes déficients auditifs, March 2016, 15p. p, In M.Frisch (Eds) Le réseau Idéki : Didactiques, métiers de l'humain et Intelligence collective. Nouveaux espaces et dispositifs en question. Nouveaux horizons en éducation, formation et en recherche. L'harmattan, Collection I.D.
https://hal.inria.fr/hal-01239910
-
66A. Liutkus, R. Badeau.
Generalized Wiener filtering with fractional power spectrograms, in: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2015, pp. 266–270.