Section: New Software and Platforms

KATS

Kaldi-based Automatic Transcription System

Keyword: Speech recognition

Functional Description

KATS is a multipass system for transcribing audio data, in particular radio and TV shows. The audio stream is first split into homogeneous segments, and each segment is decoded with the most appropriate acoustic model by a large-vocabulary continuous speech recognition engine. In this new software, the recognition engine is based on the Kaldi toolkit and uses deep neural network (DNN) based acoustic models. An additional pass then rescores the n-best hypotheses with a higher-order language model.
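
The final rescoring pass can be illustrated with a minimal Python sketch. It is not the KATS or Kaldi implementation: the `Hypothesis` class, the `rescore_nbest` function, the toy language model, and the weight value are all hypothetical names and numbers chosen for illustration. The idea shown is only that each n-best hypothesis keeps its acoustic score, has its language model score replaced by the score from a stronger model, and the list is then re-ranked.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Hypothesis:
    words: Tuple[str, ...]
    acoustic_logprob: float  # log-likelihood from the acoustic model
    lm_logprob: float        # log-probability from the language model


def combined_score(h: Hypothesis, lm_weight: float) -> float:
    """Log-linear combination of acoustic and language model scores."""
    return h.acoustic_logprob + lm_weight * h.lm_logprob


def rescore_nbest(
    nbest: List[Hypothesis],
    new_lm: Callable[[Tuple[str, ...]], float],
    lm_weight: float = 10.0,  # assumed value, normally tuned on held-out data
) -> List[Hypothesis]:
    """Replace first-pass LM scores with new_lm scores and re-rank."""
    rescored = [
        Hypothesis(h.words, h.acoustic_logprob, new_lm(h.words))
        for h in nbest
    ]
    return sorted(
        rescored,
        key=lambda h: combined_score(h, lm_weight),
        reverse=True,
    )


def toy_lm(words: Tuple[str, ...]) -> float:
    """Stand-in for a higher-order LM: favours one known word sequence."""
    return -3.0 if words == ("recognize", "speech") else -8.0


# Two hypotheses with made-up first-pass scores.
nbest = [
    Hypothesis(("wreck", "a", "nice", "beach"), -99.0, -4.0),
    Hypothesis(("recognize", "speech"), -100.0, -4.5),
]

first_pass_best = max(nbest, key=lambda h: combined_score(h, 10.0))
best = rescore_nbest(nbest, toy_lm)[0]
print(" ".join(first_pass_best.words))
print(" ".join(best.words))
```

With these made-up scores, the first-pass ranking prefers the first hypothesis, while the rescored ranking prefers the second, because the stand-in language model assigns it a much higher probability. In a real system the language model weight would be tuned rather than fixed.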