A dependency parser analyzes the grammatical structure of a sentence, establishing relationships between "head" words and words which modify those heads.
This paper presents a flexible framework for generating very short abstractive summaries. The key idea is to use a word graph data structure referred to as the Opinosis-Graph to represent the text to be summarized. Then, we repeatedly find paths through this graph to produce concise summaries. We consider Opinosis a "shallow" abstractive summarizer as it uses the original text itself to generate summaries. This is unlike a true abstractive summarizer that would need a deeper level of natural language understanding.
While the evaluation is on an opinion dataset, the approach itself is general in that, it can be applied to any corpus containing high amounts of redundancies, for example, Twitter comments or user comments on blog/news articles. A very similar work to ours (published at the same time and at the same conference) is the following:
Multi-sentence compression: Finding shortest paths in word graphs
Proceedings of the 23rd International Conference on Computaional Linguistics (COLING 10). Beijing, China, August 23-27, 2010. Katja Filippova
Katja's work was evaluated on a news dataset (google news) for both English and Spanish while ours was evaluated on user reviews from various sources (English only). She studies the informativeness and grammaticality of sentences and in a similar way we evaluate these aspects by studying how close the Opinosis summaries are compared to the human composed summaries in terms of information overlap and readability (using a human assessor).
T. Gao, X. Yao, и D. Chen. (2021)cite arxiv:2104.08821Comment: Accepted to EMNLP 2021. The code and pre-trained models are available at https://github.com/princeton-nlp/simcse.
Q. Le, и T. Mikolov. Proceedings of the 31st International Conference on Machine Learning, том 32 из Proceedings of Machine Learning Research, стр. 1188--1196. Bejing, China, PMLR, (июня 2014)
M. Cohen, D. Massaro, и R. Clark. Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI), стр. 499-504. Pittsburgh, PA, USA, (октября 2002)
J. Reynar, и A. Ratnaparkhi. Proceedings of the fifth conference on Applied natural language processing, стр. 16--19. Stroudsburg, PA, USA, Association for Computational Linguistics, (1997)
R. McDonald, и G. Satta. Proceedings of the 10th International Conference on Parsing Technologies, стр. 121--132. Stroudsburg, PA, USA, Association for Computational Linguistics, (2007)
K. Ganesan, C. Zhai, и J. Han. Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), стр. 340--348. Beijing, China, Coling 2010 Organizing Committee, (августа 2010)
S. Lim, и S. Cho. Modeling Decisions for Artificial Intelligence, Second
International Conference, MDAI 2005, Proceedings, том 3558 из Lecture Notes in Computer Science, стр. 305--315. Tsukuba, Japan, Springer, (июля 2005)
J. Clarke, и M. Lapata. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), (2007)
D. Reidsma. Proceedings of the 10th International Conference on Conceptual Structures (ICCS 2002), том 2393 из Lecture Notes in Computer Science, стр. 151-165. Springer, (2002)