Artikel,

Environment and Sensor Robustness in Automatic Speech Recognition

.
International Journal of Innovative Science and Modern Engineering (IJISME), 1 (2): 31-37 (Januar 2013)

Zusammenfassung

Most of the presently available speech recognition systems work efficiently only in some ideal conditions. This is due to the fact that these systems are based on some assumptions related to the operating conditions. The system works efficiently if the actual working environment is identical with the environment for which the system is built. Performance of the speech recognition system considerably degrades if mismatch between the training and the testing environment occurs. In the present study, mismatch due to sensor variability and environment has been considered and Cepstral Mean Normalization (CMN) and Spectral subtraction methods have been investigated as front-end methods for the reduction of noise. A Hidden Markov Model (HMM) based speech recognition system has been built with Mel-Frequency Cepstral Coefficient (MFCC) as feature vector. It has been observed that there is a 15% enhancement of system performance in channel and environment mismatched condition compared to baseline performance when CMN and spectral subtraction methods have been applied for noise reduction.

Tags

Nutzer

  • @ijisme_beiesp

Kommentare und Rezensionenanzeigen / verbergen

  • @ijisme_beiesp
    vor 3 Jahren (zuletzt bearbeitetvor 3 Jahren)
    good
Bitte melden Sie sich an um selbst Rezensionen oder Kommentare zu erstellen.