Article,

Comparison of Different Modeling Techniques for Robust Prototype Matching of Speech Pitch-Contours

, , and .
Kognitive Systeme, (2016)

Abstract

The goal of this investigation is to enable an automatic classification for the discourse particle (DP) “hm”. As stated in the introduction, in human communication the interlocutors are able to differentiate the functional meaning of DPs only by their intonation. The intonational curve can be represented using the pitch-values extracted from the raw speech material. This estimation of the pitch is a separate field of research and has three major difficulties, due to continuous speech and the short duration of the DPs: 1) the beginning and end of the DP can interfere with other spoken content, 2) gaps within the acoustical course, and 3) wrongly estimated pitch- values. To process these errors and reduce the complexity of the following analysis algorithm a pre-processing of the pitch-contour is necessary. The pre-processing of the pitch-contour will reduce the number of pitch-values. Being one of the smallest utterance units, the number of measuring points for “hm” is already small which leads to the main issue of this paper: The development of a method which ensures a reliable and robust classification of the DP “hm”. Therefore, three different methods will be presented and compared to evaluate the best one. Furthermore, to ensure the reliability of the algorithm not only for human-computer interaction (HCI), and to gain more training data, tests were carried out on two datasets containing HCI and human-human interaction (HHI).

Tags

Users

  • @isiegert

Comments and Reviews