,

Algorithms That Learn To Extract Information BBN: Description Of The Sift System As Used For MUC-7

, , , , , , , и .
MUC-7, (1998)

Аннотация

For MUC-7, BBN has for the first time fielded a fully-trained system for NE, TE, and TR; results are all the output of statistical language models trained on annotated data, rather than programs executing handwritten rules. Such trained systems have some significant advantages: . They can be easily ported to new domains by simply annotating data with semantic answers. . The complex interactions that make rule-based systems difficult to develop and maintain can here be learned automatically from the training data. We believe that the results in this evaluation are evidence that such trained systems, even at their current level of development, can perform roughly on a par with rules hand-tailored by experts. Since MUC-3, BBN has been steadily increasing the proportion of the information extraction process that is statistically trained. Already in MET-1, our name-finding results were the output of a fully statistical, HMM-based model, and that statistical Identifinder^TM model was als...

тэги

Пользователи данного ресурса

  • @kabloom

Комментарии и рецензии