Abstract
The extraction of semantics of unstructured documents requires the recognition and classification of textual patterns, their variability, and their inter-relationships, i.e., the analysis of the linguistic structure of documents. Being the integral part of a larger real-life application, this linguistic analysis process must be robust, fast and adaptable. This creates a big challenge for the development of the necessary linguistic base components. In this drill-down, we present several dimensions of this challenge and show how they have been successfully tackled in Ordo.
Users
Please
log in to take part in the discussion (add own reviews or comments).