Opinion Mining for Biomedical Text Data: Feature Space Design and Feature Selection

R. Swaminathan, A. Sharma, and H. Yang.
The Ninth International Workshop on Data Mining in Bioinformatics (BIOKDD 2010), July 2010

Abstract

Unstructured text (e.g., journal articles) remains the primary means of publishing biomedical research results. To extract and integrate knowledge from such data, text mining has been routinely applied. One important task is extracting relationships between bio-entities such as foods and diseases. Most existing studies, however, stop short of analyzing the extracted relationships further, for example their polarity and the level of certainty with which the authors report a given relationship. The latter is termed the relationship strength and is marked at three levels: weak, medium, and strong. We have previously reported a preliminary study on this issue [22], and here we detail our studies on constructing a novel feature space for effectively predicting the polarity and strength of a relationship. Unlike previous work, four types of polarity instead of three are considered, namely positive, negative, neutral, and no-relationship. Another contribution is that, in addition to the commonly accepted lexicon-based features, we have identified a set of novel features that capture both the semantic and structural aspects of a relationship. Our intensive evaluations demonstrate that combining these new features with the lexicon-based ones achieves the best accuracy for polarity prediction (~0.91). This, however, is not the case for strength prediction, where lexicon-based features alone are sufficient (~0.96).

BibTeX key: rajesh.opinion.biokdd.2010
entry type: inproceedings
booktitle: The Ninth International Workshop on Data Mining in Bioinformatics (BIOKDD 2010)
year: 2010
month: July
Document: http://cs.sfsu.edu/~huiyang/publications.htm

Cite this publication

@inproceedings{rajesh.opinion.biokdd.2010,
  abstract = {Unstructured text (e.g., journal articles) remains the primary means of publishing biomedical research results. To extract and integrate knowledge from such data, text mining has been routinely applied. One important task is extracting relationships between bio-entities such as foods and diseases. Most existing studies, however, stop short of analyzing the extracted relationships further, for example their polarity and the level of certainty with which the authors report a given relationship. The latter is termed the relationship strength and is marked at three levels: weak, medium, and strong. We have previously reported a preliminary study on this issue [22], and here we detail our studies on constructing a novel feature space for effectively predicting the polarity and strength of a relationship. Unlike previous work, four types of polarity instead of three are considered, namely positive, negative, neutral, and no-relationship. Another contribution is that, in addition to the commonly accepted lexicon-based features, we have identified a set of novel features that capture both the semantic and structural aspects of a relationship. Our intensive evaluations demonstrate that combining these new features with the lexicon-based ones achieves the best accuracy for polarity prediction (~0.91). This, however, is not the case for strength prediction, where lexicon-based features alone are sufficient (~0.96).},
  added-at = {2011-02-17T04:11:20.000+0100},
  author = {Swaminathan, Rajesh and Sharma, Abhishek and Yang, Hui},
  biburl = {https://www.bibsonomy.org/bibtex/25ae21641bcf5f454a9043c4692587fbd/huiyangsfsu},
  booktitle = {The Ninth International Workshop on Data Mining in Bioinformatics (BIOKDD 2010)},
  interhash = {bf495e1a01e76663d59dbae3acf4f024},
  intrahash = {5ae21641bcf5f454a9043c4692587fbd},
  keywords = {CAT CAT-OPINION-bio bk-ngx feature mining opinion selection},
  month = {July},
  timestamp = {2011-02-17T04:11:20.000+0100},
  title = {Opinion Mining for Biomedical Text Data: Feature Space Design and Feature Selection},
  url = {http://cs.sfsu.edu/~huiyang/publications.htm},
  year = 2010
}
