Bibliography

Major publications by the team in recent years
  • [1] X. Alameda-Pineda, R. Horaud.

    A Geometric Approach to Sound Source Localization from Time-Delay Estimates, in: IEEE Transactions on Audio, Speech and Language Processing, June 2014, vol. 22, no 6, pp. 1082-1095. [ DOI : 10.1109/TASLP.2014.2317989 ]

    https://hal.inria.fr/hal-00975293
  • [2] X. Alameda-Pineda, R. Horaud.

    Vision-Guided Robot Hearing, in: International Journal of Robotics Research, April 2015, vol. 34, no 4-5, pp. 437-456. [ DOI : 10.1177/0278364914548050 ]

    https://hal.inria.fr/hal-00990766
  • [3] N. Andreff, B. Espiau, R. Horaud.

    Visual Servoing from Lines, in: International Journal of Robotics Research, 2002, vol. 21, no 8, pp. 679–700.

    http://hal.inria.fr/hal-00520167
  • [4] S. Ba, X. Alameda-Pineda, A. Xompero, R. Horaud.

    An On-line Variational Bayesian Model for Multi-Person Tracking from Cluttered Scenes, in: Computer Vision and Image Understanding, December 2016, vol. 153, pp. 64–76. [ DOI : 10.1016/j.cviu.2016.07.006 ]

    https://hal.inria.fr/hal-01349763
  • [5] Y. Ban, X. Alameda-Pineda, F. Badeig, S. Ba, R. Horaud.

    Tracking a Varying Number of People with a Visually-Controlled Robotic Head, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Vancouver, Canada, September 2017.

    https://hal.inria.fr/hal-01542987
  • [6] F. Cuzzolin, D. Mateus, R. Horaud.

    Robust Temporally Coherent Laplacian Protrusion Segmentation of 3D Articulated Bodies, in: International Journal of Computer Vision, March 2015, vol. 112, no 1, pp. 43-70. [ DOI : 10.1007/s11263-014-0754-0 ]

    https://hal.archives-ouvertes.fr/hal-01053737
  • [7] A. Deleforge, F. Forbes, R. Horaud.

    Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds, in: International Journal of Neural Systems, February 2015, vol. 25, no 1, 21 p. [ DOI : 10.1142/S0129065714400036 ]

    https://hal.inria.fr/hal-00960796
  • [8] A. Deleforge, F. Forbes, R. Horaud.

    High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables, in: Statistics and Computing, September 2015, vol. 25, no 5, pp. 893-911. [ DOI : 10.1007/s11222-014-9461-5 ]

    https://hal.inria.fr/hal-00863468
  • [9] A. Deleforge, R. Horaud, Y. Y. Schechner, L. Girin.

    Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression, in: IEEE Transactions on Audio, Speech and Language Processing, April 2015, vol. 23, no 4, pp. 718-731. [ DOI : 10.1109/TASLP.2015.2405475 ]

    https://hal.inria.fr/hal-01112834
  • [10] V. Drouard, R. Horaud, A. Deleforge, S. Ba, G. Evangelidis.

    Robust Head-Pose Estimation Based on Partially-Latent Mixture of Linear Regressions, in: IEEE Transactions on Image Processing, March 2017, vol. 26, no 3, pp. 1428 - 1440. [ DOI : 10.1109/TIP.2017.2654165 ]

    https://hal.inria.fr/hal-01413406
  • [11] G. Evangelidis, M. Hansard, R. Horaud.

    Fusion of Range and Stereo Data for High-Resolution Scene-Modeling, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, November 2015, vol. 37, no 11, pp. 2178 - 2192. [ DOI : 10.1109/TPAMI.2015.2400465 ]

    https://hal.archives-ouvertes.fr/hal-01110031
  • [12] I. D. Gebru, X. Alameda-Pineda, F. Forbes, R. Horaud.

    EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, December 2016, vol. 38, no 12, pp. 2402 - 2415. [ DOI : 10.1109/TPAMI.2016.2522425 ]

    https://hal.inria.fr/hal-01261374
  • [13] M. Hansard, G. Evangelidis, Q. Pelorson, R. Horaud.

    Cross-Calibration of Time-of-flight and Colour Cameras, in: Computer Vision and Image Understanding, April 2015, vol. 134, pp. 105-115. [ DOI : 10.1016/j.cviu.2014.09.001 ]

    https://hal.inria.fr/hal-01059891
  • [14] M. Hansard, R. Horaud, M. Amat, G. Evangelidis.

    Automatic Detection of Calibration Grids in Time-of-Flight Images, in: Computer Vision and Image Understanding, April 2014, vol. 121, pp. 108-118. [ DOI : 10.1016/j.cviu.2014.01.007 ]

    https://hal.inria.fr/hal-00936333
  • [15] M. Hansard, R. Horaud.

    Cyclopean geometry of binocular vision, in: Journal of the Optical Society of America A, September 2008, vol. 25, no 9, pp. 2357-2369. [ DOI : 10.1364/JOSAA.25.002357 ]

    http://hal.inria.fr/inria-00435548
  • [16] M. Hansard, R. Horaud.

    Cyclorotation Models for Eyes and Cameras, in: IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, March 2010, vol. 40, no 1, pp. 151-161. [ DOI : 10.1109/TSMCB.2009.2024211 ]

    http://hal.inria.fr/inria-00435549
  • [17] M. Hansard, R. Horaud.

    A Differential Model of the Complex Cell, in: Neural Computation, September 2011, vol. 23, no 9, pp. 2324-2357. [ DOI : 10.1162/NECO_a_00163 ]

    http://hal.inria.fr/inria-00590266
  • [18] M. Hansard, S. Lee, O. Choi, R. Horaud.

    Time of Flight Cameras: Principles, Methods, and Applications, Springer Briefs in Computer Science, Springer, October 2012, 95 p.

    http://hal.inria.fr/hal-00725654
  • [19] R. Horaud, G. Csurka, D. Demirdjian.

    Stereo Calibration from Rigid Motions, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, December 2000, vol. 22, no 12, pp. 1446–1452. [ DOI : 10.1109/34.895977 ]

    http://hal.inria.fr/inria-00590127
  • [20] R. Horaud, F. Forbes, M. Yguel, G. Dewaele, J. Zhang.

    Rigid and Articulated Point Registration with Expectation Conditional Maximization, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, March 2011, vol. 33, no 3, pp. 587-602. [ DOI : 10.1109/TPAMI.2010.94 ]

    http://hal.inria.fr/inria-00590265
  • [21] R. Horaud, M. Niskanen, G. Dewaele, E. Boyer.

    Human Motion Tracking by Registering an Articulated Surface to 3-D Points and Normals, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, January 2009, vol. 31, no 1, pp. 158-163. [ DOI : 10.1109/TPAMI.2008.108 ]

    http://hal.inria.fr/inria-00446898
  • [22] V. Khalidov, F. Forbes, R. Horaud.

    Conjugate Mixture Models for Clustering Multimodal Data, in: Neural Computation, February 2011, vol. 23, no 2, pp. 517-557. [ DOI : 10.1162/NECO_a_00074 ]

    http://hal.inria.fr/inria-00590267
  • [23] D. Knossow, R. Ronfard, R. Horaud.

    Human Motion Tracking with a Kinematic Parameterization of Extremal Contours, in: International Journal of Computer Vision, September 2008, vol. 79, no 3, pp. 247-269. [ DOI : 10.1007/s11263-007-0116-2 ]

    http://hal.inria.fr/inria-00590247
  • [24] D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, R. Horaud.

    A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, August 2016, vol. 24, no 8, pp. 1408-1423. [ DOI : 10.1109/TASLP.2016.2554286 ]

    https://hal.inria.fr/hal-01301762
  • [25] X. Li, L. Girin, F. Badeig, R. Horaud.

    Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function, in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Daejeon, South Korea, IEEE, October 2016, pp. 2819-2826. [ DOI : 10.1109/IROS.2016.7759437 ]

    https://hal.inria.fr/hal-01349771
  • [26] X. Li, L. Girin, R. Horaud, S. Gannot.

    Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, November 2016, vol. 24, no 11, pp. 2171 - 2186. [ DOI : 10.1109/TASLP.2016.2598319 ]

    https://hal.inria.fr/hal-01349691
  • [27] X. Li, L. Girin, R. Horaud, S. Gannot.

    Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, October 2017, vol. 25, no 10, pp. 1997 - 2012, 16 pages, 4 figures, 4 tables. [ DOI : 10.1109/TASLP.2017.2740001 ]

    https://hal.inria.fr/hal-01413417
  • [28] M. Sapienza, M. Hansard, R. Horaud.

    Real-time Visuomotor Update of an Active Binocular Head, in: Autonomous Robots, January 2013, vol. 34, no 1, pp. 33-45. [ DOI : 10.1007/s10514-012-9311-2 ]

    http://hal.inria.fr/hal-00768615
  • [29] A. Zaharescu, E. Boyer, R. Horaud.

    Topology-Adaptive Mesh Deformation for Surface Evolution, Morphing, and Multi-View Reconstruction, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, April 2011, vol. 33, no 4, pp. 823-837. [ DOI : 10.1109/TPAMI.2010.116 ]

    http://hal.inria.fr/inria-00590271
  • [30] A. Zaharescu, E. Boyer, R. Horaud.

    Keypoints and Local Descriptors of Scalar Functions on 2D Manifolds, in: International Journal of Computer Vision, October 2012, vol. 100, no 1, pp. 78-98. [ DOI : 10.1007/s11263-012-0528-5 ]

    http://hal.inria.fr/hal-00699620
  • [31] A. Zaharescu, R. Horaud.

    Robust Factorization Methods Using A Gaussian/Uniform Mixture Model, in: International Journal of Computer Vision, March 2009, vol. 81, no 3, pp. 240-258. [ DOI : 10.1007/s11263-008-0169-x ]

    http://hal.inria.fr/inria-00446987
Publications of the year

Doctoral Dissertations and Habilitation Theses

Articles in International Peer-Reviewed Journals

  • [35] G. Evangelidis, R. Horaud.

    Joint Alignment of Multiple Point Sets with Batch and Incremental Expectation-Maximization, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, June 2018, vol. 40, no 6, pp. 1397 - 1410, https://arxiv.org/abs/1609.01466. [ DOI : 10.1109/TPAMI.2017.2717829 ]

    https://hal.inria.fr/hal-01413414
  • [36] I. Gebru, S. Ba, X. Li, R. Horaud.

    Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, May 2018, vol. 40, no 5, pp. 1086 - 1099, https://arxiv.org/abs/1603.09725. [ DOI : 10.1109/TPAMI.2017.2648793 ]

    https://hal.inria.fr/hal-01413403
  • [37] S. Lathuilière, B. Massé, P. Mesejo, R. Horaud.

    Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction, in: Pattern Recognition Letters, May 2018, https://arxiv.org/abs/1711.06834. [ DOI : 10.1016/j.patrec.2018.05.023 ]

    https://hal.inria.fr/hal-01643775
  • [38] X. Li, S. Gannot, L. Girin, R. Horaud.

    Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, May 2018, vol. 26, no 10, pp. 1755-1768, https://arxiv.org/abs/1711.07911. [ DOI : 10.1109/TASLP.2018.2839362 ]

    https://hal.inria.fr/hal-01645749
  • [39] X. Li, L. Girin, S. Gannot, R. Horaud.

    Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, January 2019, https://arxiv.org/abs/1711.07911. [ DOI : 10.1109/TASLP.2019.2892412 ]

    https://hal.inria.fr/hal-01799809
  • [40] X. Li, L. Girin, R. Horaud.

    Expectation-Maximization for Speech Source Separation using Convolutive Transfer Function, in: CAAI Transactions on Intelligent Technologies, January 2019. [ DOI : 10.1049/trit.2018.1061 ]

    https://hal.inria.fr/hal-01982250
  • [41] R. T. Marriott, A. Pashevich, R. Horaud.

    Plane-extraction from depth-data using a Gaussian mixture regression model, in: Pattern Recognition Letters, July 2018, vol. 110, pp. 44-50, https://arxiv.org/abs/1710.01925 - 2 figures, 1 table. [ DOI : 10.1016/j.patrec.2018.03.024 ]

    https://hal.inria.fr/hal-01663984
  • [42] B. Massé, S. Ba, R. Horaud.

    Tracking Gaze and Visual Focus of Attention of People Involved in Social Interaction, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, November 2018, vol. 40, no 11, pp. 2711 - 2724, https://arxiv.org/abs/1703.04727. [ DOI : 10.1109/TPAMI.2017.2782819 ]

    https://hal.inria.fr/hal-01511414
  • [43] D. Xu, X. Alameda-Pineda, J. Song, E. Ricci, N. Sebe.

    Cross-Paced Representation Learning with Partial Curricula for Sketch-based Image Retrieval, in: IEEE Transactions on Image Processing, September 2018, vol. 27, no 9, pp. 4410-4421. [ DOI : 10.1109/TIP.2018.2837381 ]

    https://hal.inria.fr/hal-01803694

International Conferences with Proceedings

  • [44] Y. Ban, X. Li, X. Alameda-Pineda, L. Girin, R. Horaud.

    Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking, in: ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, Alberta, Canada, IEEE, April 2018, pp. 6553-6557. [ DOI : 10.1109/ICASSP.2018.8462100 ]

    https://hal.inria.fr/hal-01718114
  • [45] S. Lathuilière, B. Massé, P. Mesejo, R. Horaud.

    Deep Reinforcement Learning for Audio-Visual Gaze Control, in: IROS 2018 - IEEE/RSJ International Conference on Intelligent Robots and Systems, Madrid, Spain, October 2018, pp. 1-8. [ DOI : 10.1109/IROS.2018.8594327 ]

    https://hal.inria.fr/hal-01851738
  • [46] S. Lathuilière, P. Mesejo, X. Alameda-Pineda, R. Horaud.

    DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model, in: ECCV 2018 - European Conference on Computer Vision, Munich, Germany, September 2018, pp. 1-16.

    https://hal.inria.fr/hal-01851511
  • [47] S. Leglaive, L. Girin, R. Horaud.

    A variance modeling framework based on variational autoencoders for speech enhancement, in: MLSP 2018 - IEEE International Workshop on Machine Learning for Signal Processing, Aalborg, Denmark, IEEE, September 2018, pp. 1-6. [ DOI : 10.1109/MLSP.2018.8516711 ]

    https://hal.inria.fr/hal-01832826
  • [48] X. Li, Y. Ban, L. Girin, X. Alameda-Pineda, R. Horaud.

    A Cascaded Multiple-Speaker Localization and Tracking System, in: Proceedings of the LOCATA Challenge Workshop - a satellite event of IWAENC 2018, Tokyo, Japan, September 2018, pp. 1-5.

    https://hal.inria.fr/hal-01957137
  • [49] X. Li, S. Gannot, L. Girin, R. Horaud.

    Multisource MINT Using the Convolutive Transfer Function, in: ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, Alberta, Canada, IEEE, April 2018, pp. 756-760. [ DOI : 10.1109/ICASSP.2018.8462607 ]

    https://hal.inria.fr/hal-01718106
  • [50] X. Li, B. Mourgue, L. Girin, S. Gannot, R. Horaud.

    Online Localization of Multiple Moving Speakers in Reverberant Environments, in: 10th IEEE Workshop on Sensor Array and Multichannel Signal Processing (SAM 2018), Sheffield, United Kingdom, IEEE, July 2018, pp. 405-409. [ DOI : 10.1109/SAM.2018.8448423 ]

    https://hal.inria.fr/hal-01795462
  • [51] A. Siarohin, E. Sangineto, S. Lathuilière, N. Sebe.

    Deformable GANs for Pose-based Human Image Generation, in: IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, United States, June 2018, pp. 3408-3416, https://arxiv.org/abs/1801.00055.

    https://hal.archives-ouvertes.fr/hal-01761539
  • [52] W. Wang, X. Alameda-Pineda, D. Xu, P. Fua, E. Ricci, N. Sebe.

    Every Smile is Unique: Landmark-Guided Diverse Smile Generation, in: IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, United States, June 2018, pp. 7083-7092, https://arxiv.org/abs/1802.01873.

    https://hal.inria.fr/hal-01759335

Scientific Books (or Scientific Book chapters)

  • [53] X. Alameda-Pineda, E. Ricci, N. Sebe.

    Multimodal behavior analysis in the wild: Advances and challenges, Academic Press (Elsevier), December 2018.

    https://hal.inria.fr/hal-01858395
  • [54] L. Girin, S. Gannot, X. Li.

    Audio source separation into the wild, in: Multimodal Behavior Analysis in the Wild, Computer Vision and Pattern Recognition, Academic Press (Elsevier), November 2018, pp. 53-78. [ DOI : 10.1016/B978-0-12-814601-9.00022-5 ]

    https://hal.inria.fr/hal-01943375

Other Publications