Abstract
Documents can be assigned keywords by frequency analysis of the terms found in the document text, which arguably is the primary
source of knowledge about the document itself. By including a hierarchi- cally organised domain specific thesaurus as a secondknowledge source the quality of such keywords was improved considerably, as measured by match to previously manually assignedkeywords. In the presented ex- periment, the combination of the evidence from frequency analysis and the hierarchically organisedthesaurus was done using inductive logic programming.
Users
Please
log in to take part in the discussion (add own reviews or comments).