Bibliography

Major publications by the team in recent years

[1] F. Bahja, J. Di Martino, E. H. Ibn Elhaj, D. Aboutajdine.

An overview of the CATE algorithms for real-time pitch determination, in: Signal, Image and Video Processing, 2013. [ DOI : 10.1007/s11760-013-0488-4 ]

https://hal.inria.fr/hal-00831660
[2] J. Barker, E. Vincent, N. Ma, H. Christensen, P. Green.

The PASCAL CHiME Speech Separation and Recognition Challenge, in: Computer Speech and Language, February 2013, vol. 27, no. 3, pp. 621-633. [ DOI : 10.1016/j.csl.2012.10.004 ]

https://hal.inria.fr/hal-00743529
[3] A. Bonneau, D. Fohr, I. Illina, D. Jouvet, O. Mella, L. Mesbahi, L. Orosanu.

Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde, in: Traitement Automatique des Langues, 2013, vol. 53, no. 3.

https://hal.inria.fr/hal-00834278
[4] D. Jouvet, D. Fohr.

Combining Forward-based and Backward-based Decoders for Improved Speech Recognition Performance, in: InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Lyon, France, August 2013.

https://hal.inria.fr/hal-00834282
[5] A. Ozerov, M. Lagrange, E. Vincent.

Uncertainty-based learning of acoustic models from noisy data, in: Computer Speech and Language, February 2013, vol. 27, no. 3, pp. 874-894. [ DOI : 10.1016/j.csl.2012.07.002 ]

https://hal.inria.fr/hal-00717992
[6] A. Ozerov, E. Vincent, F. Bimbot.

A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, in: IEEE Transactions on Audio, Speech and Language Processing, May 2012, vol. 20, no. 4, pp. 1118 - 1133, 16.

https://hal.archives-ouvertes.fr/hal-00626962
[7] A. Piquard-Kipffer, L. Sprenger-Charolles.

Predicting reading level at the end of Grade 2 from skills assessed in kindergarten: contribution of phonemic discrimination (Follow-up of 85 French-speaking children from 4 to 8 years old), in: Topics in Cognitive Psychology, 2013.

https://hal.inria.fr/hal-00833951

Publications of the year

Doctoral Dissertations and Habilitation Theses

[8] I. Sheikh.

Exploiting Semantic and Topic Context to Improve Recognition of Proper Names in Diachronic Audio Documents, Université de Lorraine, November 2016.

https://hal.archives-ouvertes.fr/tel-01400694

Articles in International Peer-Reviewed Journals

[9] K. Adiloğlu, E. Vincent.

Variational Bayesian Inference for Source Separation and Robust Feature Extraction, in: IEEE Transactions on Audio Speech and Language Processing, June 2016. [ DOI : 10.1109/TASLP.2016.2583794 ]

https://hal.inria.fr/hal-00726146
[10] M. Aron, M.-O. Berger, E. Kerrien, B. Wrobel-Dautcourt, B. Potard, Y. Laprie.

Multimodal acquisition of articulatory data: Geometrical and temporal registration, in: Journal of the Acoustical Society of America, 2016, vol. 139, no. 2, 13 p. [ DOI : 10.1121/1.4940666 ]

https://hal.inria.fr/hal-01269578
[11] J. Barker, R. Marxer, E. Vincent, S. Watanabe.

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes, in: Computer Speech and Language, October 2016.

https://hal.inria.fr/hal-01382108
[12] F. Bimbot, E. Deruty, G. Sargent, E. Vincent.

System & Contrast: A Polymorphous Model of the Inner Organization of Structural Segments within Music Pieces, in: Music Perception, 2016, 41 p.

https://hal.inria.fr/hal-01188244
[13] M. Cadot, Y. Laprie.

Extraction d’un modèle articulatoire à partir d’une analyse tri-directionnelle de cinéradiographies d’un locuteur, in: Revue des Nouvelles Technologies de l'Information, 2016, vol. Fouille de Données Complexes, no. RNTI-E-31, pp. 73-92.

https://hal.archives-ouvertes.fr/hal-01346987
[14] B. Elie, Y. Laprie.

Extension of the single-matrix formulation of the vocal tract: consideration of bilateral channels and connection of self-oscillating models of the vocal folds with a glottal chink, in: Speech Communication, September 2016, vol. 82, pp. 85-96. [ DOI : 10.1016/j.specom.2016.06.002 ]

https://hal.archives-ouvertes.fr/hal-01199792
[15] D. Fitzgerald, A. Liutkus, R. Badeau.

Projection-based demixing of spatial audio, in: IEEE Transactions on Audio, Speech and Language Processing, May 2016.

https://hal.inria.fr/hal-01260588
[16] S. Gannot, E. Vincent, S. Markovich-Golan, A. Ozerov.

A consolidated perspective on multi-microphone speech enhancement and source separation, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, December 2016.

https://hal.inria.fr/hal-01414179
[17] X. Jaureguiberry, E. Vincent, G. Richard.

Fusion methods for speech enhancement and audio source separation, in: IEEE Transactions on Audio, Speech and Language Processing, April 2016.

https://hal.archives-ouvertes.fr/hal-01120685
[18] A. A. Nugraha, A. Liutkus, E. Vincent.

Multichannel audio source separation with deep neural networks, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, June 2016, vol. 24, no. 10, pp. 1652-1664. [ DOI : 10.1109/TASLP.2016.2580946 ]

https://hal.inria.fr/hal-01163369
[19] S. Ouni, S. Dahmani.

Is markerless acquisition of speech production accurate?, in: Journal of the Acoustical Society of America, May 2016, vol. 139, no. 6.

https://hal.inria.fr/hal-01315579
[20] G. Sargent, F. Bimbot, E. Vincent.

Estimating the structural segmentation of popular music pieces under regularity constraints, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017.

https://hal.inria.fr/hal-01403210
[21] E. Vincent, S. Watanabe, A. A. Nugraha, J. Barker, R. Marxer.

An analysis of environment, microphone and data simulation mismatches in robust speech recognition, in: Computer Speech and Language, November 2016.

https://hal.inria.fr/hal-01399180

Invited Conferences

[22] E. Vincent.

Séparation de sources: quand l'acoustique rencontre le machine learning, in: 13e Congrès Français d'Acoustique, Le Mans, France, April 2016.

https://hal.inria.fr/hal-01398720

International Conferences with Proceedings

[23] K. Bartkova, D. Jouvet, E. Delais-Roussarie.

Prosodic Parameters and Prosodic Structures of French Emotional Data, in: Speech Prosody 2016, Boston, United States, Speech Prosody 2016, May 2016.

https://hal.inria.fr/hal-01293516
[24] N. Bertin, E. Camberlein, E. Vincent, R. Lebarbenchon, S. Peillon, É. Lamandé, S. Sivasankaran, F. Bimbot, I. Illina, A. Tom, S. Fleury, E. Jamet.

A French corpus for distant-microphone speech processing in real homes, in: Interspeech 2016, San Francisco, United States, September 2016.

https://hal.inria.fr/hal-01343060
[25] M. Cadot, A. Bonneau.

Du fichier audio à l’intonation en Français : Graphes pour l’apprentissage de 3 classes intonatives, in: Fouille de données complexes (FDC@EGC2016), Reims, France, Proceedings of FDC@EGC2016, January 2016.

https://hal.archives-ouvertes.fr/hal-01292121
[26] A. Currey, I. Illina, D. Fohr.

Dynamic adjustment of language models for automatic speech recognition using word similarity, in: IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, CA, United States, Proceedings of IEEE Workshop on Spoken Language Technology, December 2016.

https://hal.archives-ouvertes.fr/hal-01384365
[27] K. Déguernel, E. Vincent, G. Assayag.

Using Multidimensional Sequences For Improvisation In The OMax Paradigm, in: 13th Sound and Music Computing Conference, Hamburg, Germany, August 2016.

https://hal.inria.fr/hal-01346797
[28] B. Elie, G. Chardon.

Robust tonal and noise separation in presence of colored noise, and application to voiced fricatives, in: 22nd International Congress on Acoustics (ICA), Buenos Aires, Argentina, September 2016.

https://hal.archives-ouvertes.fr/hal-01372313
[29] B. Elie, Y. Laprie.

A glottal chink model for the synthesis of voiced fricatives, in: International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, IEEE, March 2016.

https://hal.archives-ouvertes.fr/hal-01314308
[30] B. Elie, Y. Laprie.

Copy synthesis of phrase-level utterances, in: EUSIPCO2016, Budapest, Hungary, August 2016.

https://hal.archives-ouvertes.fr/hal-01278462
[31] B. Elie, Y. Laprie.

Copy synthesis of running speech based on vocal tract imaging and audio recording, in: 22nd International Congress on Acoustics (ICA), Buenos Aires, Argentina, September 2016.

https://hal.archives-ouvertes.fr/hal-01372310
[32] B. Elie, Y. Laprie, P.-A. Vuissoz, F. Odille.

High spatiotemporal cineMRI films using compressed sensing for acquiring articulatory data, in: EUSIPCO2016, Budapest, Hungary, August 2016.

https://hal.archives-ouvertes.fr/hal-01372320
[33] B. Elizalde, A. Kumar, A. Shah, R. Badlani, E. Vincent, B. Raj, I. Lane.

Experiments on the DCASE Challenge 2016: Acoustic scene classification and sound event detection in real life recording, in: DCASE2016 Workshop on Detection and Classification of Acoustic Scenes and Events, Budapest, Hungary, September 2016.

https://hal.inria.fr/hal-01354007
[34] D. Fitzgerald, A. Liutkus, R. Badeau.

PROJET - Spatial Audio Separation Using Projections, in: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, IEEE, 2016.

https://hal.archives-ouvertes.fr/hal-01248014
[35] M. Fontaine, C. Vanwynsberghe, A. Liutkus, R. Badeau.

Sketching for nearfield acoustic imaging of heavy-tailed sources, in: 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Grenoble, France, Proc. 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), February 2017.

https://hal.archives-ouvertes.fr/hal-01401988
[36] S. Ghosh, C. Fauth, A. Sini, Y. Laprie.

L1-L2 Interference: The case of final devoicing of French voiced fricatives in final position by German learners, in: Interspeech 2016, San Francisco, United States, September 2016, vol. 2016, pp. 3156 - 3160. [ DOI : 10.21437/Interspeech.2016-954 ]

https://hal.inria.fr/hal-01397176
[37] S. Leglaive, U. Simsekli, A. Liutkus, R. Badeau, G. Richard.

Alpha-Stable Multichannel Audio Source Separation, in: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, Proc. 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, March 2017.

https://hal.archives-ouvertes.fr/hal-01416366
[38] V. Q. Nguyen, F. Colas, E. Vincent, F. Charpillet.

Localizing an Intermittent and Moving Sound Source Using a Mobile Robot, in: International Conference on Intelligent Robots and Systems (IROS), Daejeon, South Korea, October 2016.

https://hal.archives-ouvertes.fr/hal-01354006
[39] A. A. Nugraha, A. Liutkus, E. Vincent.

Multichannel music separation with deep neural networks, in: European Signal Processing Conference (EUSIPCO), Budapest, Hungary, Proceedings of the 24th European Signal Processing Conference (EUSIPCO), August 2016, pp. 1748-1752.

https://hal.inria.fr/hal-01334614
[40] S. Ouni, V. Colotte, S. Dahmani, S. Azzi.

Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech, in: Interspeech 2016, San Francisco, United States, ISCA, November 2016, vol. 2016, pp. 580 - 584. [ DOI : 10.21437/Interspeech.2016-730 ]

https://hal.inria.fr/hal-01398528
[41] A. Piquard-Kipffer.

Storytelling with a digital album that use an avatar as narrator, in: XVIèmes rencontres internationales en orthophonie - Orthophonie et technologies innovantes, PARIS, France, XVIèmes rencontres internationales en orthophonie - Orthophonie et technologies innovantes, December 2016.

https://hal.inria.fr/hal-01403204
[42] D. Ribas, E. Vincent, J. R. Calvo.

A study of speech distortion conditions in real scenarios for speech processing applications, in: 2016 IEEE Workshop on Spoken Language Technology, San Diego, United States, December 2016.

https://hal.inria.fr/hal-01377638
[43] G. Serrière, C. Cerisara, D. Fohr, O. Mella.

Weakly-supervised text-to-speech alignment confidence measure, in: International Conference on Computational Linguistics (COLING), Osaka, Japan, Proceedings of the 26th International Conference on Computational Linguistics (COLING), December 2016.

https://hal.archives-ouvertes.fr/hal-01378355
[44] I. Sheikh, I. Illina, D. Fohr, G. Linares.

Document Level Semantic Context for Retrieving OOV Proper Names, in: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, Proceedings of IEEE ICASSP 2016, IEEE, March 2016, pp. 6050-6054. [ DOI : 10.1109/ICASSP.2016.7472839 ]

https://hal.archives-ouvertes.fr/hal-01331716
[45] I. Sheikh, I. Illina, D. Fohr, G. Linares.

Improved Neural Bag-of-Words Model to Retrieve Out-of-Vocabulary Words in Speech Recognition, in: INTERSPEECH 2016, San Francisco, United States, Proceedings of INTERSPEECH 2016, September 2016, vol. 2016. [ DOI : 10.21437/Interspeech.2016-1219 ]

https://hal.archives-ouvertes.fr/hal-01384488
[46] I. Sheikh, I. Illina, D. Fohr, G. Linares.

Learning Word Importance with the Neural Bag-of-Words Model, in: ACL, Representation Learning for NLP (Repl4NLP) workshop, Berlin, Germany, Proceedings of ACL 2016, August 2016.

https://hal.archives-ouvertes.fr/hal-01331720
[47] I. Sheikh, I. Illina, D. Fohr.

How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News, in: LREC 2016, Portorož, Slovenia, Proceedings of LREC 2016, May 2016.

https://hal.archives-ouvertes.fr/hal-01331714
[48] A. J. R. Simpson, G. Roma, E. M. Grais, R. D. Mason, C. Hummersone, A. Liutkus, M. D. Plumbley.

Evaluation of Audio Source Separation Models Using Hypothesis-Driven Non-Parametric Statistical Methods, in: European Signal Processing Conference, Budapest, Hungary, EURASIP, August 2016.

https://hal.inria.fr/hal-01410176
[49] S. Sivasankaran, E. Vincent, I. Illina.

Discriminative importance weighting of augmented training data for acoustic model training, in: 42nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, United States, March 2017.

https://hal.inria.fr/hal-01415759
[50] F.-R. Stöter, A. Liutkus, R. Badeau, B. Edler, P. Magron.

Common Fate Model for Unison source Separation, in: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, Proceedings of the 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2016.

https://hal.archives-ouvertes.fr/hal-01248012
[51] J. Trouvain, A. Bonneau, V. Colotte, C. Fauth, D. Fohr, D. Jouvet, J. Jügler, Y. Laprie, O. Mella, B. Möbius, F. Zimmerer.

The IFCASL Corpus of French and German Non-native and Native Read Speech, in: LREC'2016, 10th edition of the Language Resources and Evaluation Conference, Portorož, Slovenia, Proceedings LREC'2016, May 2016.

https://hal.inria.fr/hal-01293935
[52] F. Zimmerer, A. Bonneau, B. Andreeva.

Influence of L1 prominence on L2 production: French and German speakers, in: Speech Prosody 2016, Boston, United States, May 2016, vol. 2016, pp. 370 - 374. [ DOI : 10.21437/SpeechProsody.2016-76 ]

https://hal.inria.fr/hal-01399974

National Conferences with Proceedings

[53] B. Elie, Y. Laprie, P.-A. Vuissoz.

Acquisition temps-réel de données articulatoires par IRM : application à la synthèse par copie, in: 13ème Congrès Français d'Acoustique (CFA 2016), Le Mans, France, SFA, April 2016.

https://hal.archives-ouvertes.fr/hal-01314313

Conferences without Proceedings

[54] F. Zimmerer, J. Trouvain, A. Bonneau.

Methods of investigating vowel interferences of French learners of German, in: New Sounds 2016, Aarhus, Denmark, June 2016.

https://hal.inria.fr/hal-01400005

Scientific Books (or Scientific Book chapters)

[55] J. Barker, R. Marxer, E. Vincent, S. Watanabe.

The CHiME challenges: Robust speech recognition in everyday environments, in: New era for robust speech recognition - Exploiting deep learning, Springer, October 2016.

https://hal.inria.fr/hal-01383263
[56] M. Cadot.

Recoder les variables pour obtenir un modèle implicatif optimal, in: L'Analyse Statistique Implicative, R. Gras (editor), Cépaduès, December 2016.

https://hal.archives-ouvertes.fr/hal-01398229

Internal Reports

[57] P. Magron, R. Badeau, A. Liutkus.

Generalized Wiener filtering for positive alpha-stable random variables, Télécom ParisTech, June 2016.

https://hal.archives-ouvertes.fr/hal-01340797
[58] G. Sargent, F. Bimbot, E. Vincent.

Supplementary material to the article: Estimating the structural segmentation of popular music pieces under regularity constraints, IRISA-Inria, Campus de Beaulieu, 35042 Rennes cedex ; Inria Nancy, équipe Multispeech, September 2016.

https://hal.inria.fr/hal-01368683

Scientific Popularization

[59] A. Liutkus, E. Vincent.

Démixer la musique, in: Interstices, January 2016.

https://hal.inria.fr/hal-01350450
[60] A. Piquard-Kipffer.

Faire voir une histoire : Louis et son incroyable chien Noisette, in: Les Cahiers Pédagogiques, February 2016, vol. Hors série numérique N°42, 7 p.

https://hal.inria.fr/hal-01191878

Patents

[61] S. Ouni, G. Gris.

Dispositif de traitement d’image, January 2016, no. 15 52058, the search report acknowledges patentability.

https://hal.inria.fr/hal-01294028

Other Publications

[62] B. Dumortier, E. Vincent, M. Deaconu, P. Cornu.

Efficient optimisation of wind power under acoustic constraints, November 2016, working paper or preprint.

https://hal.inria.fr/hal-01393125
[63] B. Elie, Y. Laprie.

Acoustic impact of the glottal chink on the production of fricatives: A numerical study, December 2016, working paper or preprint.

https://hal.archives-ouvertes.fr/hal-01423206
[64] A. Piquard-Kipffer, T. Léonova.

Parcours scolaire de 166 dysphasiques et/ou dyslexiques-dysorthographiques âgés de 6 à 20 ans en situation de handicap : Schooling experiences of 166 dysphasic or dyslexics-dysorthographic children, aged from 6 to 20 in a handicap situation, July 2016, working paper or preprint.

https://hal.inria.fr/hal-01402986
[65] A. Piquard-Kipffer, O. Mella, J. Miranda, D. Jouvet, L. Orosanu.

Terminal portable de communication et affichage de la reconnaissance vocale. Enjeux et rapports à l'écrit. Étude préliminaire auprès d'adultes déficients auditifs, March 2016, 15 p., in: M. Frisch (ed.), Le réseau Idéki : Didactiques, métiers de l'humain et Intelligence collective. Nouveaux espaces et dispositifs en question. Nouveaux horizons en éducation, formation et en recherche, L'Harmattan, Collection I.D.

https://hal.inria.fr/hal-01239910

References in notes

[66] A. Liutkus, R. Badeau.

Generalized Wiener filtering with fractional power spectrograms, in: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2015, pp. 266–270.
