
PHONOLOGICAL FEATURE BASED VARIABLE FRAME RATE SCHEME FOR IMPROVED SPEECH RECOGNITION

A. Sangwan and J. Hansen. Automatic Speech Recognition and Understanding (ASRU), pages 582-586. (December 2007)

Abstract

In this paper, we propose a new scheme for variable frame rate (VFR) feature processing based on high-level segmentation (HLS) of speech into broad phone classes. Traditional fixed-rate processing cannot accurately reflect the dynamics of continuous speech. In contrast, the proposed VFR scheme adapts the temporal representation of the speech signal by tying the framing strategy to the detected phone class sequence. The phone classes are detected and segmented using appropriately trained phonological features (PFs). In this manner, the proposed scheme can track the evolution of speech driven by the underlying phonetic content and exploit the non-uniform information flow rate of speech through a variable framing strategy. The new VFR scheme is applied to automatic speech recognition on the TIMIT and NTIMIT corpora, where it is compared to a traditional fixed window-size/frame-rate scheme. Our experiments yield encouraging results, with relative reductions in word error rate (WER) of 24% and 8% for the TIMIT and NTIMIT tasks, respectively.
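The idea of tying the framing strategy to a detected phone class sequence can be illustrated with a minimal sketch. This is not the authors' implementation: the class labels, the window lengths and frame shifts in `FRAMING_MS`, and the function name `variable_frames` are all hypothetical and chosen only for illustration. The sketch assumes the segmentation is already available as a list of `(start_ms, end_ms, label)` tuples and gives each segment its own window size and frame shift.

```python
from typing import Dict, List, Tuple

Segment = Tuple[float, float, str]  # (start_ms, end_ms, phone_class)
Frame = Tuple[float, float, str]    # (frame_start_ms, frame_end_ms, phone_class)

# Hypothetical (window length, frame shift) in milliseconds per broad class.
# These numbers are illustrative assumptions, NOT values from the paper.
FRAMING_MS: Dict[str, Tuple[float, float]] = {
    "vowel": (25.0, 10.0),
    "fricative": (20.0, 8.0),
    "stop": (10.0, 4.0),
    "silence": (30.0, 15.0),
}


def variable_frames(
    segments: List[Segment],
    framing: Dict[str, Tuple[float, float]] = FRAMING_MS,
) -> List[Frame]:
    """Cut each segment into frames using that segment's class-specific
    window length and frame shift. Frames are clipped at the segment end."""
    frames: List[Frame] = []
    for start, end, label in segments:
        window, shift = framing[label]
        t = start
        while t < end:
            frames.append((t, min(t + window, end), label))
            t += shift
    return frames


# Example: a 100 ms vowel followed by a 20 ms stop.
segments = [(0.0, 100.0, "vowel"), (100.0, 120.0, "stop")]
frames = variable_frames(segments)
```

With these assumed settings, the short stop segment is sampled more densely (a frame every 4 ms) than the longer vowel segment (a frame every 10 ms), which is the kind of non-uniform temporal resolution a variable framing strategy aims for.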

Links and resources

BibTeX key: phonovfr
entry type: inproceedings
booktitle: Automatic Speech Recognition and Understanding (ASRU)
year: 2007
month: December
pages: 582-586
url: http://sites.google.com/site/publicationsabhijeetsangwan/Home/publication-pdfs/ASRU_Final_Submission.pdf?attredirects=0

