MULTISPEECH - 2017 - Annual activity report

MULTISPEECH

MULTISPEECH - 2017

Project-Team Multispeech

Personnel

Overall Objectives

Research Program

Application Domains

Highlights of the Year

New Software and Platforms

New Results

Bilateral Contracts and Grants with Industry

Bilateral Contracts with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Previous |

Home | Next next

Section: New Software and Platforms

KATS

Kaldi-based Automatic Transcription System

Keyword: Speech recognition

Functional Description: KATS is a multipass system for transcribing audio data, and in particular radio or TV shows in French, English or Arabic. It is based on the Kaldi speech recognition tools. It relies on Deep Neural Network (DNN) modeling for speech detection and acoustic modeling of the phones (speech sounds). Higher order statistical language models and recurrent neural network language models can be used for improving performance through rescoring of multiple hypotheses.

News Of The Year: Better acoustic models have been developed for French, English and Arabic languages. An NN-based speech detection module has been included, as well as rescoring with RNN language models.

Contact: Dominique Fohr

Previous |

Home | Next next