sign in · help · news · about · deen

BibSonomy ::  publication ::

The blue social bookmark and publication sharing system.
entry of diego_ma:    
(0)
This publication has not been reviewed yet.
rating distribution
average user rating
?
The average rating is computed over all reviews. However, some of them may be invisible to you due to the visibility setting chosen by the reviewers.
(0.0 of 5.0 based on 0 reviews)

Question Answering on a Case Insensitive Corpus

by: Wei Li, Rohini Srihari, Cheng Niu, and Xiaoge Li
In: Proc. ACL 2003 Workshop on Multilingual Summarization and Question Answering (2003) , p. 84-93.
Citation format (all formats):

Abstract

Most question answering QA systems rely on both keyword index and Named Entity NE tagging. The corpus from which the QA systems attempt to retrieve answers is usually mixed case text. However, there are numerous corpora that consist of case insensitive documents, e.g. speech recognition results. This paper presents a successful approach to QA on a case insensitive corpus, whereby a preprocessing module is designed to restore the case-sensitive form. The document pool with the restored case then feeds the QA system, which remains unchanged. The case restoration preprocessing is implemented as a Hidden Markov Model trained on a large raw corpus of case sensitive documents. It is demonstrated that this approach leads to very limited degradation in QA benchmarking 2.8%, mainly due to the limited degradation in the underlying information extraction support.

BibTeX record

Endnote record

a gripper