Introducing a structure into a set of similar concepts
A. Mykowiecka, and M. Marciniak. Human Language Technologies as a Challenge for Computer Science and Linguistics, Poznań, Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, (2015)
Abstract
In the paper, we examine the idea of supporting domain ontology creation by an automatic clustering of selected terms identified using a terminology extraction method. We discuss the problem of introducing a structure into a set of similar concepts. We extract terminology
from economic articles in Polish Wikipedia, then we select several sets of similar concepts present in the top 5,500 extracted terms. We describe two methods for automatic clustering of such groups of phrases on the basis of their distributional properties, i.e. the quantitative
characteristics of the contexts of their occurrences in texts and test them on two sets of data.
%0 Conference Paper
%1 mykowiecka_introducing_2015
%A Mykowiecka, Agnieszka
%A Marciniak, Małgorzata
%B Human Language Technologies as a Challenge for Computer Science and Linguistics
%C Poznań
%D 2015
%E Vetulani, Zygmunt
%E Mariani, Joseph
%I Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu
%K clustering
%T Introducing a structure into a set of similar concepts
%U http://ltc.amu.edu.pl/book/
%X In the paper, we examine the idea of supporting domain ontology creation by an automatic clustering of selected terms identified using a terminology extraction method. We discuss the problem of introducing a structure into a set of similar concepts. We extract terminology
from economic articles in Polish Wikipedia, then we select several sets of similar concepts present in the top 5,500 extracted terms. We describe two methods for automatic clustering of such groups of phrases on the basis of their distributional properties, i.e. the quantitative
characteristics of the contexts of their occurrences in texts and test them on two sets of data.
%@ 978-83-932640-8-7
@inproceedings{mykowiecka_introducing_2015,
abstract = {In the paper, we examine the idea of supporting domain ontology creation by an automatic clustering of selected terms identified using a terminology extraction method. We discuss the problem of introducing a structure into a set of similar concepts. We extract terminology
from economic articles in Polish Wikipedia, then we select several sets of similar concepts present in the top 5,500 extracted terms. We describe two methods for automatic clustering of such groups of phrases on the basis of their distributional properties, i.e. the quantitative
characteristics of the contexts of their occurrences in texts and test them on two sets of data.},
added-at = {2018-11-04T16:54:56.000+0100},
address = {Poznań},
author = {Mykowiecka, Agnieszka and Marciniak, Małgorzata},
biburl = {https://www.bibsonomy.org/bibtex/296d5f6c50beb489761d7d58bcf4d208d/lepsky},
booktitle = {Human {Language} {Technologies} as a {Challenge} for {Computer} {Science} and {Linguistics}},
editor = {Vetulani, Zygmunt and Mariani, Joseph},
interhash = {868853c861099f97719b06c9f9aaad67},
intrahash = {96d5f6c50beb489761d7d58bcf4d208d},
isbn = {978-83-932640-8-7},
keywords = {clustering},
publisher = {Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu},
timestamp = {2018-11-04T16:54:56.000+0100},
title = {Introducing a structure into a set of similar concepts},
url = {http://ltc.amu.edu.pl/book/},
year = 2015
}