Techreport,

A New Approach to Continuous Speech Recognition Using LSTM Recurrent Neural Networks

, , and .
IDSIA-14-03. IDSIA, www.idsia.ch/\-techrep.html, (May 2003)

Abstract

This paper presents an algorithm for continuous speech recognition built from two Long Short-Term Memory (LSTM) recurrent neural networks. A first LSTM network performs frame-level phone probability estimation. A second network maps these phone predictions onto words. In contrast to HMMs, this allows greater exploitation of long-timescale correlations. Simulation results are presented for a hand-segmented subset of the "Numbers-95" database. These results include isolated phone prediction, continuous frame-level phone prediction and continuous word prediction. We conclude that despite its early stage of development, our new model is already competitive with existing approaches on certain aspects of speech recognition and promising on others, warranting further research.

Tags

Users

  • @tb2332

Comments and Reviews