Inproceedings,

RiQuA: A Corpus of Rich Quotation Annotation for English Literary Text

, and .
Proceedings of the 12th Language Resources and Evaluation Conference, page 835--841. Marseille, France, European Language Resources Association, (May 2020)

Abstract

We introduce RiQuA (RIch QUotation Annotations), a corpus that provides quotations, including their interpersonal structure (speakers and addressees) for English literary text. The corpus comprises 11 works of 19th-century literature that were manually doubly annotated for direct and indirect quotations. For each quotation, its span, speaker, addressee, and cue are identified (if present). This provides a rich view of dialogue structures not available from other available corpora. We detail the process of creating this dataset, discuss the annotation guidelines, and analyze the resulting corpus in terms of inter-annotator agreement and its properties. RiQuA, along with its annotations guidelines and associated scripts, are publicly available for use, modification, and experimentation.

Tags

Users

  • @albinzehe

Comments and Reviews