Article,

Effect of high-frequency spectral components in computer recognition of dysarthric speech based on a Mel-cepstral stochastic model.

P. Polur, and G. Miller.
J Rehabil Res Dev, 42 (3): 363--371 (2005)

Abstract

Computer speech recognition of individuals with dysarthria, such as cerebral palsy patients, requires a robust technique that can handle conditions of very high variability and limited training data. In this study, a hidden Markov model (HMM) was constructed and conditions investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system intended to act as an assistive/control tool. In particular, we investigated the effect of high-frequency spectral components on the recognition rate of the system to determine if they contributed useful additional information to the system. A small-size vocabulary spoken by three cerebral palsy subjects was chosen. Mel-frequency cepstral coefficients extracted with the use of 15 ms frames served as training input to an ergodic HMM setup. Subsequent results demonstrated that no significant useful information was available to the system for enhancing its ability to discriminate dysarthric speech above 5.5 kHz in the current set of dysarthric data. The level of variability in input dysarthric speech patterns limits the reliability of the system. However, its application as a rehabilitation/control tool to assist dysarthric motor-impaired individuals such as cerebral palsy subjects holds sufficient promise.

BibTeX key: Polur2005a
entry type: article
year: 2005
journal: J Rehabil Res Dev
number: 3
pages: 363--371
volume: 42
timestamp: 2007.06.22
username: ar0berts
pmid: 16187248
groups: public

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{Polur2005a, abstract = {Computer speech recognition of individuals with dysarthria, such as cerebral palsy patients, requires a robust technique that can handle conditions of very high variability and limited training data. In this study, a hidden Markov model (HMM) was constructed and conditions investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system intended to act as an assistive/control tool. In particular, we investigated the effect of high-frequency spectral components on the recognition rate of the system to determine if they contributed useful additional information to the system. A small-size vocabulary spoken by three cerebral palsy subjects was chosen. Mel-frequency cepstral coefficients extracted with the use of 15 ms frames served as training input to an ergodic HMM setup. Subsequent results demonstrated that no significant useful information was available to the system for enhancing its ability to discriminate dysarthric speech above 5.5 kHz in the current set of dysarthric data. The level of variability in input dysarthric speech patterns limits the reliability of the system. However, its application as a rehabilitation/control tool to assist dysarthric motor-impaired individuals such as cerebral palsy subjects holds sufficient promise.}, added-at = {2014-07-19T21:03:42.000+0200}, author = {Polur, Prasad D and Miller, Gerald E}, biburl = {https://www.bibsonomy.org/bibtex/248951c67d7f575b56fb57d8ce30702e1/ar0berts}, groups = {public}, interhash = {0bc032c28d37c74f3df8025407845bc8}, intrahash = {48951c67d7f575b56fb57d8ce30702e1}, journal = {J Rehabil Res Dev}, keywords = {Artificial Intelligence; Cerebral Palsy; Communication Aids for Disabled; Dysarthria; Humans; Male; Markov Chains; Models, Theoretical; Signal Processing, Computer-Assisted; Speech Acoustics; Intelligibility; Production Measurement; Recognition Software}, number = 3, pages = {363--371}, pmid = {16187248}, timestamp = {2014-07-19T21:03:42.000+0200}, title = {Effect of high-frequency spectral components in computer recognition of dysarthric speech based on a Mel-cepstral stochastic model.}, username = {ar0berts}, volume = 42, year = 2005 }

BibSonomy

Effect of high-frequency spectral components in computer recognition of dysarthric speech based on a Mel-cepstral stochastic model.

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on