Article,

Biomedical Literature Mining: Challenges and Solutions in the ‘Omics’ Era

D. Chaussabel.
Am J Pharmacogenomics, 4 (6): 383-393 (2004)

Abstract

It is now obvious that the rate-limiting step in high throughput experimentation is neither data acquisition nor analysis, but rather our ability to interpret data on a genome-wide scale. Indeed, the explosion of data sampling capacity combined with increasing publication rates greatly impairs our ability to find meaning in vast collections of data. In order to support data interpretation, bioinformatic tools are needed to identify critical information contained in large bodies of literature. However, extracting knowledge embedded in free text is an arduous task, compounded in the biomedical field by an inconsistent gene nomenclature, domain-specific language and restricted access to full text articles. This paper presents a selection of currently available biomedical literature mining software. These tools rely on statistic and, more recently, semantic analyses (Natural Language Processing) to automatically extract information from the literature. In addition, a literature mining strategy has been developed to explore patterns of term occurrences in abstracts. This method automatically identifies relevant keywords in collections of abstracts, and uses a pattern discovery algorithm to generate a visual interface for exploring functional associations among genes. Term occurrence heatmaps can also be combined with gene expression profiles to provide valuable functional annotations. Furthermore, as demonstrated with tumor cell line literature profiling results, this approach can be applied to a variety of themes beyond genomic data analysis. Altogether, these examples illustrate how literature analysis can be employed to support knowledge discovery in biomedical research.

BibTeX key: review.bioTextMining.2004
entry type: article
year: 2004
journal: Am J Pharmacogenomics
number: 6
pages: 383-393
volume: 4
Document: http://www.provalisresearch.com/Documents/LiteratureAnalysis.pdf

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 review.bioTextMining.2004 %A Chaussabel, Damien %D 2004 %J Am J Pharmacogenomics %K CAT CAT-REV-text-mining biomedical literature mining review %N 6 %P 383-393 %T Biomedical Literature Mining: Challenges and Solutions in the ‘Omics’ Era %U http://www.provalisresearch.com/Documents/LiteratureAnalysis.pdf %V 4 %X It is now obvious that the rate-limiting step in high throughput experimentation is neither data acquisition nor analysis, but rather our ability to interpret data on a genome-wide scale. Indeed, the explosion of data sampling capacity combined with increasing publication rates greatly impairs our ability to find meaning in vast collections of data. In order to support data interpretation, bioinformatic tools are needed to identify critical information contained in large bodies of literature. However, extracting knowledge embedded in free text is an arduous task, compounded in the biomedical field by an inconsistent gene nomenclature, domain-specific language and restricted access to full text articles. This paper presents a selection of currently available biomedical literature mining software. These tools rely on statistic and, more recently, semantic analyses (Natural Language Processing) to automatically extract information from the literature. In addition, a literature mining strategy has been developed to explore patterns of term occurrences in abstracts. This method automatically identifies relevant keywords in collections of abstracts, and uses a pattern discovery algorithm to generate a visual interface for exploring functional associations among genes. Term occurrence heatmaps can also be combined with gene expression profiles to provide valuable functional annotations. Furthermore, as demonstrated with tumor cell line literature profiling results, this approach can be applied to a variety of themes beyond genomic data analysis. Altogether, these examples illustrate how literature analysis can be employed to support knowledge discovery in biomedical research.

@article{review.bioTextMining.2004, abstract = {It is now obvious that the rate-limiting step in high throughput experimentation is neither data acquisition nor analysis, but rather our ability to interpret data on a genome-wide scale. Indeed, the explosion of data sampling capacity combined with increasing publication rates greatly impairs our ability to find meaning in vast collections of data. In order to support data interpretation, bioinformatic tools are needed to identify critical information contained in large bodies of literature. However, extracting knowledge embedded in free text is an arduous task, compounded in the biomedical field by an inconsistent gene nomenclature, domain-specific language and restricted access to full text articles. This paper presents a selection of currently available biomedical literature mining software. These tools rely on statistic and, more recently, semantic analyses (Natural Language Processing) to automatically extract information from the literature. In addition, a literature mining strategy has been developed to explore patterns of term occurrences in abstracts. This method automatically identifies relevant keywords in collections of abstracts, and uses a pattern discovery algorithm to generate a visual interface for exploring functional associations among genes. Term occurrence heatmaps can also be combined with gene expression profiles to provide valuable functional annotations. Furthermore, as demonstrated with tumor cell line literature profiling results, this approach can be applied to a variety of themes beyond genomic data analysis. Altogether, these examples illustrate how literature analysis can be employed to support knowledge discovery in biomedical research.}, added-at = {2010-06-16T03:37:20.000+0200}, author = {Chaussabel, Damien}, biburl = {https://www.bibsonomy.org/bibtex/2bbaa2e32974dcbf2cc841679811f98c4/huiyangsfsu}, interhash = {ce021618722c47692bb5731e32a48c9b}, intrahash = {bbaa2e32974dcbf2cc841679811f98c4}, journal = {Am J Pharmacogenomics}, keywords = {CAT CAT-REV-text-mining biomedical literature mining review}, number = 6, pages = {383-393}, timestamp = {2010-11-12T02:10:51.000+0100}, title = {Biomedical Literature Mining: Challenges and Solutions in the ‘Omics’ Era}, url = {http://www.provalisresearch.com/Documents/LiteratureAnalysis.pdf}, volume = 4, year = 2004 }

BibSonomy

Biomedical Literature Mining: Challenges and Solutions in the ‘Omics’ Era

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on