A Novel Approach for Identification and Linking of Short Quotations in Scholarly Texts and Literary Works

Journal of Computational Literary Studies (2023)
DOI: doi.org/10.48694/jcls.3590

Abstract

We present two approaches for identifying and linking short quotations between scholarly works and literary works: ProQuo, a specialized pipeline, and ProQuoLM, a more general language-model-based approach. Our evaluation shows that both approaches outperform a strong baseline and that they perform at a comparable level overall. We compare the performance of ProQuoLM on texts with and without (page) reference information and find that ProQuoLM does not use the reference information. Based on these findings, we propose two steps for future improvement: further analysis of how a larger context window affects the handling of long-distance references, and the introduction of positional information about the literary work so that ProQuoLM can utilize reference information.
