copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Group testing for pathway analysis improves comparability of different microarray datasets.

T. Manoli, N. Gretz, H. Gröne, M. Kenzelmann, R. Eils, and B. Brors. Bioinformatics, 22 (20): 2500--2506 (October 2006)
DOI: 10.1093/bioinformatics/btl424

Abstract

The wide use of DNA microarrays for the investigation of the cell transcriptome triggered the invention of numerous methods for the processing of microarray data and lead to a growing number of microarray studies that examine the same biological conditions. However, comparisons made on the level of gene lists obtained by different statistical methods or from different datasets hardly converge. We aimed at examining such discrepancies on the level of apparently affected biologically related groups of genes, e.g. metabolic or signalling pathways. This can be achieved by group testing procedures, e.g. over-representation analysis, functional class scoring (FCS), or global tests.Three public prostate cancer datasets obtained with the same microarray platform (HGU95A/HGU95Av2) were analyzed. Each dataset was subjected to normalization by either variance stabilizing normalization (vsn) or mixed model normalization (MMN). Then, statistical analysis of microarrays was applied to the vsn-normalized data and mixed model analysis to the data normalized by MMN. For multiple testing adjustment the false discovery rate was calculated and the threshold was set to 0.05. Gene lists from the same method applied to different datasets showed overlaps between 42 and 52\%, while lists from different methods applied to the same dataset had between 63 and 85\% of genes in common. A number of six gene lists obtained by the two statistical methods applied to the three datasets was then subjected to group testing by Fisher's exact test. Group testing by GSEA and global test was applied to the three datasets, as well. Fisher's exact test followed by global test showed more consistent results with respect to the concordance between analyses on gene lists obtained by different methods and different datasets than the GSEA. However, all group testing methods identified pathways that had already been described to be involved in the pathogenesis of prostate cancer. Moreover, pathways recurrently identified in these analyses are more likely to be reliable than those from a single analysis on a single dataset.

Links and resources

BibTeX key: Manoli2006
entry type: article
year: 2006
month: Oct
institution: Theoretical Bioinformatics, German Cancer Reseach Center, 69120 Heidelberg, Germany.
journal: Bioinformatics
number: 20
pages: 2500--2506
volume: 22
medline-pst: ppublish
pii: btl424
pmid: 16895928
owner: bbrors
__markedentry: bbrors:6
language: eng
DOI: 10.1093/bioinformatics/btl424
url: http://dx.doi.org/10.1093/bioinformatics/btl424

Cite this publication

%0 Journal Article %1 Manoli2006 %A Manoli, Theodora %A Gretz, Norbert %A Gröne, Hermann-Josef %A Kenzelmann, Marc %A Eils, Roland %A Brors, Benedikt %D 2006 %J Bioinformatics %K Analysis, Array Biological, Data Databases, Expression Factual; Gene Humans; Interpretation, Markers, Neoplasm Neoplasms, Oligonucleotide Profiling, Proteins, Reproducibility Results; Sensitivity Sequence Signal Specificity; Statistical; Transduction; Tumor analysis analysis; and metabolism; methods; of %N 20 %P 2500--2506 %R 10.1093/bioinformatics/btl424 %T Group testing for pathway analysis improves comparability of different microarray datasets. %U http://dx.doi.org/10.1093/bioinformatics/btl424 %V 22 %X The wide use of DNA microarrays for the investigation of the cell transcriptome triggered the invention of numerous methods for the processing of microarray data and lead to a growing number of microarray studies that examine the same biological conditions. However, comparisons made on the level of gene lists obtained by different statistical methods or from different datasets hardly converge. We aimed at examining such discrepancies on the level of apparently affected biologically related groups of genes, e.g. metabolic or signalling pathways. This can be achieved by group testing procedures, e.g. over-representation analysis, functional class scoring (FCS), or global tests.Three public prostate cancer datasets obtained with the same microarray platform (HGU95A/HGU95Av2) were analyzed. Each dataset was subjected to normalization by either variance stabilizing normalization (vsn) or mixed model normalization (MMN). Then, statistical analysis of microarrays was applied to the vsn-normalized data and mixed model analysis to the data normalized by MMN. For multiple testing adjustment the false discovery rate was calculated and the threshold was set to 0.05. Gene lists from the same method applied to different datasets showed overlaps between 42 and 52\%, while lists from different methods applied to the same dataset had between 63 and 85\% of genes in common. A number of six gene lists obtained by the two statistical methods applied to the three datasets was then subjected to group testing by Fisher's exact test. Group testing by GSEA and global test was applied to the three datasets, as well. Fisher's exact test followed by global test showed more consistent results with respect to the concordance between analyses on gene lists obtained by different methods and different datasets than the GSEA. However, all group testing methods identified pathways that had already been described to be involved in the pathogenesis of prostate cancer. Moreover, pathways recurrently identified in these analyses are more likely to be reliable than those from a single analysis on a single dataset.

@article{Manoli2006, __markedentry = {[bbrors:6]}, abstract = {The wide use of DNA microarrays for the investigation of the cell transcriptome triggered the invention of numerous methods for the processing of microarray data and lead to a growing number of microarray studies that examine the same biological conditions. However, comparisons made on the level of gene lists obtained by different statistical methods or from different datasets hardly converge. We aimed at examining such discrepancies on the level of apparently affected biologically related groups of genes, e.g. metabolic or signalling pathways. This can be achieved by group testing procedures, e.g. over-representation analysis, functional class scoring (FCS), or global tests.Three public prostate cancer datasets obtained with the same microarray platform (HGU95A/HGU95Av2) were analyzed. Each dataset was subjected to normalization by either variance stabilizing normalization (vsn) or mixed model normalization (MMN). Then, statistical analysis of microarrays was applied to the vsn-normalized data and mixed model analysis to the data normalized by MMN. For multiple testing adjustment the false discovery rate was calculated and the threshold was set to 0.05. Gene lists from the same method applied to different datasets showed overlaps between 42 and 52\%, while lists from different methods applied to the same dataset had between 63 and 85\% of genes in common. A number of six gene lists obtained by the two statistical methods applied to the three datasets was then subjected to group testing by Fisher's exact test. Group testing by GSEA and global test was applied to the three datasets, as well. Fisher's exact test followed by global test showed more consistent results with respect to the concordance between analyses on gene lists obtained by different methods and different datasets than the GSEA. However, all group testing methods identified pathways that had already been described to be involved in the pathogenesis of prostate cancer. Moreover, pathways recurrently identified in these analyses are more likely to be reliable than those from a single analysis on a single dataset.}, added-at = {2015-04-09T12:36:21.000+0200}, author = {Manoli, Theodora and Gretz, Norbert and Gr{\"{o}}ne, Hermann-Josef and Kenzelmann, Marc and Eils, Roland and Brors, Benedikt}, biburl = {https://www.bibsonomy.org/bibtex/28c460f5acb8884960e581f12b7635689/bbrors}, doi = {10.1093/bioinformatics/btl424}, institution = {Theoretical Bioinformatics, German Cancer Reseach Center, 69120 Heidelberg, Germany.}, interhash = {ee2cb3336d2060f70402f8ba113e4baf}, intrahash = {8c460f5acb8884960e581f12b7635689}, journal = {Bioinformatics}, keywords = {Analysis, Array Biological, Data Databases, Expression Factual; Gene Humans; Interpretation, Markers, Neoplasm Neoplasms, Oligonucleotide Profiling, Proteins, Reproducibility Results; Sensitivity Sequence Signal Specificity; Statistical; Transduction; Tumor analysis analysis; and metabolism; methods; of}, language = {eng}, medline-pst = {ppublish}, month = Oct, number = 20, owner = {bbrors}, pages = {2500--2506}, pii = {btl424}, pmid = {16895928}, timestamp = {2015-04-09T12:36:21.000+0200}, title = {Group testing for pathway analysis improves comparability of different microarray datasets.}, url = {http://dx.doi.org/10.1093/bioinformatics/btl424}, volume = 22, year = 2006 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Group testing for pathway analysis improves comparability of different microarray datasets.

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Group testing for pathway analysis improves comparability of different microarray datasets.

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Group testing for pathway analysis improves comparability of different microarray datasets.

Comments and Reviews
(0)