What's in a Gaze? The Role of Eye-Gaze in Reference Resolution in Multimodal Conversational Interfaces

Proceedings of IUI '08: 13th International Conference on Intelligent User Interfaces, Gran Canaria, Spain, pages 20-29. (2008)
DOI: 10.1145/1378773.1378777

Abstract

Multimodal conversational interfaces allow users to carry on a dialog with a graphical display using speech to accomplish a particular task. Motivated by previous psycholinguistic findings, we examine how eye-gaze contributes to reference resolution in such a setting. Specifically, we present an integrated probabilistic framework that combines speech and eye-gaze for reference resolution. We further examine the relationship between eye-gaze and increased domain modeling with corresponding linguistic processing. Our empirical results show that the incorporation of eye-gaze significantly improves reference resolution performance. This improvement is most dramatic when a simple domain model is used. Our results also show that minimal domain modeling combined with eye-gaze significantly outperforms complex domain modeling without eye-gaze, which indicates that eye-gaze can potentially compensate for a lack of domain modeling in reference resolution.
