Inproceedings,

A Novel Approach for Identification and Linking of Short Quotations in Scholarly Texts and Literary Works

, and .
Proceedings of the 2nd Annual Conference of Computational Literary Studies, (2023)

Abstract

We present two approaches for the identification and linking of short quotations between scholarly works and literary works: ProQuo, a specialized pipeline, and ProQuoLM, a more general language model based approach. Our evaluation shows that both approaches outperform a strong baseline and the overall performance is on the same level. We compare the performance of ProQuoLM on texts with and without (page) reference information and find that reference information is not used. Based on our findings, we propose the following steps for future improvements: further analysis of the influence of a bigger context window for better handling of long distance references and the introduction of positional information of the literary work so that reference information can be utilized by ProQuoLM.

Tags

Users

  • @jaeschke

Comments and Reviews