MULTISPEECH - 2018 - Rapport annuel d'activité

MULTISPEECH

MULTISPEECH - 2018

Project-Team Multispeech

Team, Visitors, External Collaborators

Overall Objectives

Research Program

Application Domains

Highlights of the Year

New Software and Platforms

New Results

Bilateral Contracts and Grants with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Previous |

Home | Next next

Section: New Software and Platforms

dnnsep

Multichannel audio source separation with deep neural networks

Keywords: Audio - Source Separation - Deep learning

Scientific Description: dnnsep is the only source separation software relying on multichannel Wiener filtering based on deep learning. Deep neural networks are used to initialize and reestimate the power spectrum of the sources at every iteration of an expectation-maximization (EM) algorithm. This results in state-of-the-art separation quality for both speech and music.

Functional Description: Combines deep neural networks and multichannel signal processing for speech enhancement and separation of musical recordings.

Release Functional Description: This version derives from version 1.0 (not 1.9). Differences concerns the use of a bidirectional long short-term memory (BLSTM) neural network, smoothing of the multichannel Wiener filter (MWF) over time and frequency, usage of the principal component of the MWF filter, adding a new generalized eigenvector beamformer with blind analytical normalization (GEVB) filter, and normalizing the training and test signals.

Participants: Aditya Nugraha, Emmanuel Vincent and Antoine Liutkus
Contact: Emmanuel Vincent

Previous |

Home | Next next