Article

Speech Centric Multimodal Interfaces for Mobile Communications Systems

Telektronikk (2003)

Abstract

Natural conversational man-machine interfaces should support multimodal interaction. In general, a multimodal system combines natural input modes such as speech, touch, manual gestures, gaze, and head and body movements, and derives the meaning of these combined inputs. The response can be presented by a multimedia system. The main focus of this paper is the technical aspects of implementing multimodal interfaces on mobile terminals. Given the limited size and processing power of these terminals, we have restricted the functionality to speech-centric multimodal interfaces with two input modes, speech (audio) and touch, and two output modes, audio and vision. That is, the input combines automatic speech recognition with a pen for clicking areas on the touch-screen or pushing buttons on the small terminal. The output is either speech (synthetic or pre-recorded) or text and graphics.
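As a rough illustration of the kind of input fusion the abstract describes (the paper itself gives no code, and every class and function name below is hypothetical), the following Python sketch pairs deictic words in a recognised utterance with pen taps that fall inside a temporal window around the speech. It is a minimal sketch of late fusion of speech and touch, not the authors' implementation.

```python
"""Illustrative sketch: late fusion of a speech-recognition hypothesis
with pen/touch events in a speech-centric multimodal interface.
All names here are hypothetical, not taken from the paper."""

from dataclasses import dataclass
from typing import List


@dataclass
class TouchEvent:
    """A pen tap on the terminal's touch-screen."""
    x: int
    y: int
    timestamp_ms: int


@dataclass
class SpeechHypothesis:
    """Best hypothesis returned by the speech recogniser."""
    text: str
    start_ms: int
    end_ms: int


def fuse(speech: SpeechHypothesis,
         touches: List[TouchEvent],
         window_ms: int = 2000) -> dict:
    """Resolve deictic words ('here', 'there', ...) against pen taps that
    occurred close in time to the utterance; return a combined frame."""
    deictic_words = {"here", "this", "that", "there"}
    # Keep only taps inside the temporal window around the utterance.
    nearby = [t for t in touches
              if speech.start_ms - window_ms <= t.timestamp_ms <= speech.end_ms + window_ms]
    slots = {}
    tap_iter = iter(nearby)
    for word in speech.text.lower().split():
        if word in deictic_words:
            tap = next(tap_iter, None)
            slots[word] = (tap.x, tap.y) if tap else None
    return {"utterance": speech.text, "resolved": slots}


if __name__ == "__main__":
    speech = SpeechHypothesis("Show me the bus routes from here to there", 1000, 3500)
    touches = [TouchEvent(120, 340, 1800), TouchEvent(410, 90, 3200)]
    print(fuse(speech, touches))
```

The design choice shown here is late (decision-level) fusion: the recogniser and the touch handler each produce their own events, and a separate fusion step aligns them in time, which keeps the speech and pen components independent on a resource-limited terminal.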
