Incollection,

Linguistics to Structure Unstructured Information

, , and .
Towards the Internet of Services: The THESEUS Research Program, Springer, Berlin, (2014)
DOI: 10.1007/978-3-319-06755-1_29

Abstract

The extraction of semantics of unstructured documents requires the recognition and classification of textual patterns, their variability, and their inter-relationships, i.e., the analysis of the linguistic structure of documents. Being the integral part of a larger real-life application, this linguistic analysis process must be robust, fast and adaptable. This creates a big challenge for the development of the necessary linguistic base components. In this drill-down, we present several dimensions of this challenge and show how they have been successfully tackled in Ordo.

Tags

Users

  • @flint63

Comments and Reviews