Finding important information in unstructured text
From Language and Information Technologies
Jump to: navigation, search
A vast majority of the information we deal with in everyday life consists of raw, unstructured text, where the most important facts or concepts are not always readily available, but hidden in the myriad of details that accompany them. To handle and digest the sheer amount of information we are exposed to in this information age, more sophisticated procedures are required to unveil the important parts of a text, and to allow us to process more information in less time. The goal of this project is to develop robust and accurate techniques to automatically extract important information from unstructured text, in the form of keyphrases (keyphrase extraction) or entire sentences (extractive summarization).
Funded by Google
[edit]
A. Dickinger, A. Scharl, H. Stern, A. Weichselbraun, and K. Wöber. Information and Communication Technologies in Tourism 2008, Proceedings
of the International Conference in Innsbruck, Austria, 2008, Springer, Vienna-New York, (2008)
M. Hearst. Proceedings of the 37th annual meeting of the Association for Computational
Linguistics on Computational Linguistics, page 3--10. Morristown, NJ, USA, Association for Computational Linguistics, (1999)
Y. Chan, and D. Roth. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, page 551--560. Stroudsburg, PA, USA, Association for Computational Linguistics, (2011)
H. Lang, G. Wohlgenannt, and A. Weichselbraun. International Conference on Information Resources Management (Conf-IRM), Vienna, Austria, AIS, (2012)Forthcoming (accepted 6 February 2012).