copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MeSH Up: effective MeSH text classification for improved document retrieval

D. "Trieschnigg, P. "Peznik, V. "Lee, F. "De Jong, W. "Kraaij, and D. "Rebholz-Schuhmann. Bioinformatics, 25 (11): 1412-1418 (April 2009)

Abstract

Motivation: Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by reducing the ambiguity inherent to free-text data. Different methods of automating the assignment of MeSH concepts have been proposed to replace manual annotation, but they are either limited to a small subset of MeSH or have only been compared with a limited number of other systems. Results: We compare the performance of six MeSH classification systems MetaMap, EAGL, a language and a vector space modelbased approach, a K-Nearest Neighbor (KNN) approach and MTI in terms of reproducing and complementing manual MeSH annotations. A KNN system clearly outperforms the other published approaches and scales well with large amounts of text using the full MeSH thesaurus. Our measurements demonstrate to what extent manual MeSH annotations can be reproduced and how they can be complemented by automatic annotations. We also show that a statistically significant improvement can be obtained in information retrieval (IR) when the text of a user’s query is automatically annotated with MeSH concepts, compared to using the original textual query alone. Conclusions: The annotation of biomedical texts using controlled vocabularies such as MeSH can be automated to improve textonly IR. Furthermore, the automatic MeSH annotation system we propose is highly scalable and it generates improvements in IR comparable with those observed for manual annotations.

Cite this publication

@article{trieschnigg2009effective, abstract = {Motivation: Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by reducing the ambiguity inherent to free-text data. Different methods of automating the assignment of MeSH concepts have been proposed to replace manual annotation, but they are either limited to a small subset of MeSH or have only been compared with a limited number of other systems. Results: We compare the performance of six MeSH classification systems [MetaMap, EAGL, a language and a vector space modelbased approach, a K-Nearest Neighbor (KNN) approach and MTI] in terms of reproducing and complementing manual MeSH annotations. A KNN system clearly outperforms the other published approaches and scales well with large amounts of text using the full MeSH thesaurus. Our measurements demonstrate to what extent manual MeSH annotations can be reproduced and how they can be complemented by automatic annotations. We also show that a statistically significant improvement can be obtained in information retrieval (IR) when the text of a user’s query is automatically annotated with MeSH concepts, compared to using the original textual query alone. Conclusions: The annotation of biomedical texts using controlled vocabularies such as MeSH can be automated to improve textonly IR. Furthermore, the automatic MeSH annotation system we propose is highly scalable and it generates improvements in IR comparable with those observed for manual annotations.}, added-at = {2021-04-06T09:54:15.000+0200}, author = {"Trieschnigg, Dolf" and "Peznik, Piotr" and "Lee, Vivian" and "De Jong, Franciska" and "Kraaij, Wessel" and "Rebholz-Schuhmann, Dietrich"}, biburl = {https://www.bibsonomy.org/bibtex/2ff323a515c1192bf846894b230c29c62/valerijajurkas}, interhash = {064ee763578f6a258372dc0039d8a439}, intrahash = {ff323a515c1192bf846894b230c29c62}, journal = {Bioinformatics}, keywords = {MeSH classification document retrieval}, language = {English}, month = {April}, number = 11, pages = {1412-1418}, timestamp = {2021-04-06T10:58:12.000+0200}, title = {MeSH Up: effective MeSH text classification for improved document retrieval}, volume = 25, year = 2009 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MeSH Up: effective MeSH text classification for improved document retrieval

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML MeSH Up: effective MeSH text classification for improved document retrieval

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MeSH Up: effective MeSH text classification for improved document retrieval

Comments and Reviews
(0)