@seandalai

German Decompounding in a Difficult Corpus

, , and . Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING-08), (2008)

Abstract

Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). In the case of IR systems, they usually have to cope with noisy data, as user queries are usually written quickly and submitted without review. This work attempts at improving the current approaches for German decompounding when applied to query keywords. The results show an increase of more than 10% in accuracy compared to other state-of-the-art methods.

Links and resources

Tags

community

  • @dblp
  • @seandalai
@seandalai's tags highlighted