Inproceedings,

Introducing a structure into a set of similar concepts

Human Language Technologies as a Challenge for Computer Science and Linguistics, Poznań, Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, (2015)

Abstract

In this paper, we examine the idea of supporting domain ontology creation by automatically clustering selected terms identified with a terminology extraction method. We discuss the problem of introducing a structure into a set of similar concepts. We extract terminology from economic articles in the Polish Wikipedia and then select several sets of similar concepts present among the top 5,500 extracted terms. We describe two methods for automatically clustering such groups of phrases on the basis of their distributional properties, i.e. the quantitative characteristics of the contexts in which they occur in texts, and test them on two data sets.
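
As a rough illustration of clustering terms by their distributional properties, the following minimal Python sketch counts the words around each term and groups terms whose context counts are similar. It is not the method of the paper, which the abstract does not specify: the window size, the cosine measure, the single-link merging, the threshold, and the names context_vectors, cosine and cluster are all assumptions made for illustration, and terms are simplified to single tokens.

```python
from collections import Counter
from math import sqrt


def context_vectors(terms, sentences, window=2):
    """Count the words found within `window` tokens of each term occurrence.

    Assumption: terms are single lowercase tokens and sentences are
    whitespace-tokenized; real terminology would need phrase matching.
    """
    vectors = {term: Counter() for term in terms}
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, token in enumerate(tokens):
            if token in vectors:
                left = tokens[max(0, i - window):i]
                right = tokens[i + 1:i + 1 + window]
                vectors[token].update(left + right)
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse count vectors (Counter objects)."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(terms, vectors, threshold=0.5):
    """Single-link agglomerative clustering (an assumed, illustrative choice).

    Repeatedly merges the two most similar clusters until the best
    similarity between any two clusters falls below `threshold`.
    """
    clusters = [[term] for term in terms]
    while len(clusters) > 1:
        best, pair = -1.0, None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                sim = max(
                    cosine(vectors[a], vectors[b])
                    for a in clusters[i]
                    for b in clusters[j]
                )
                if sim > best:
                    best, pair = sim, (i, j)
        if best < threshold:
            break
        i, j = pair
        merged = clusters.pop(j)  # j > i, so index i stays valid
        clusters[i].extend(merged)
    return clusters
```

With a handful of invented toy sentences, terms that share contexts (for example two words that both appear between "raised the" and "rate") would be expected to land in the same cluster, while the threshold controls how readily groups are merged.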
