E. Alfonseca, S. Bilac, and S. Pharies. Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING-08), (2008)
Abstract
Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). In the case of IR systems, they usually have to cope with noisy data, as user queries are usually written quickly and submitted without review. This work attempts at improving the current approaches for German decompounding when applied to query keywords. The results show an increase of more than 10% in accuracy compared to other state-of-the-art methods.
%0 Conference Paper
%1 Alfonseca:EtAl:08a
%A Alfonseca, Enrique
%A Bilac, Slaven
%A Pharies, Stefan
%B Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING-08)
%D 2008
%K 2008 cicling compounds german ir splitting
%T German Decompounding in a Difficult Corpus
%U http://www.springerlink.com/content/tw815263702576ww/fulltext.pdf
%X Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). In the case of IR systems, they usually have to cope with noisy data, as user queries are usually written quickly and submitted without review. This work attempts at improving the current approaches for German decompounding when applied to query keywords. The results show an increase of more than 10% in accuracy compared to other state-of-the-art methods.
@inproceedings{Alfonseca:EtAl:08a,
abstract = {Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). In the case of IR systems, they usually have to cope with noisy data, as user queries are usually written quickly and submitted without review. This work attempts at improving the current approaches for German decompounding when applied to query keywords. The results show an increase of more than 10% in accuracy compared to other state-of-the-art methods.},
added-at = {2008-09-15T20:18:34.000+0200},
author = {Alfonseca, Enrique and Bilac, Slaven and Pharies, Stefan},
biburl = {https://www.bibsonomy.org/bibtex/25d7ae643b66e0be21f289d4c514950e0/seandalai},
booktitle = {Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING-08)},
interhash = {299a3d9fa484ee9b58d4657c6d6211fb},
intrahash = {5d7ae643b66e0be21f289d4c514950e0},
keywords = {2008 cicling compounds german ir splitting},
timestamp = {2008-09-15T20:18:34.000+0200},
title = {German Decompounding in a Difficult Corpus},
url = {http://www.springerlink.com/content/tw815263702576ww/fulltext.pdf},
year = 2008
}