Abstract
The goal of this investigation is to enable an automatic classification for the discourse particle
(DP) “hm”. As stated in the introduction, in human communication the interlocutors are able
to differentiate the functional meaning of DPs only by their intonation. The intonational curve
can be represented using the pitch-values extracted from the raw speech material. This estimation
of the pitch is a separate field of research and has three major difficulties, due to continuous
speech and the short duration of the DPs: 1) the beginning and end of the DP can interfere
with other spoken content, 2) gaps within the acoustical course, and 3) wrongly estimated pitch-
values. To process these errors and reduce the complexity of the following analysis algorithm a
pre-processing of the pitch-contour is necessary. The pre-processing of the pitch-contour will
reduce the number of pitch-values. Being one of the smallest utterance units, the number of
measuring points for “hm” is already small which leads to the main issue of this paper: The
development of a method which ensures a reliable and robust classification of the DP “hm”.
Therefore, three different methods will be presented and compared to evaluate the best one.
Furthermore, to ensure the reliability of the algorithm not only for human-computer interaction
(HCI), and to gain more training data, tests were carried out on two datasets containing HCI
and human-human interaction (HHI).
Users
Please
log in to take part in the discussion (add own reviews or comments).