copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A fully Bayesian model to cluster gene-expression profiles

C. Vogl, F. Sanchez-Cabo, G. Stocker, S. Hubbard, O. Wolkenhauer, and Z. Trajanoski. Bioinformatics (Oxford, England), (September 2005)PMID: 16204092.
DOI: 10.1093/bioinformatics/bti1122

Abstract

MOTIVATION: With cDNA or oligonucleotide chips, gene-expression levels of essentially all genes in a genome can be simultaneously monitored over a time-course or under different experimental conditions. After proper normalization of the data, genes are often classified into co-expressed classes (clusters) to identify subgroups of genes that share common regulatory elements, a common function or a common cellular origin. With most methods, e.g. k-means, the number of clusters needs to be specified in advance; results depend strongly on this choice. Even with likelihood-based methods, estimation of this number is difficult. Furthermore, missing values often cause problems and lead to the loss of data. RESULTS: We propose a fully probabilistic Bayesian model to cluster gene-expression profiles. The number of classes does not need to be specified in advance; instead it is adjusted dynamically using a Reversible Jump Markov Chain Monte Carlo sampler. Imputation of missing values is integrated into the model. With simulations, we determined the speed of convergence of the sampler as well as the accuracy of the inferred variables. Results were compared with the widely used k-means algorithm. With our method, biologically related co-expressed genes could be identified in a yeast transcriptome dataset, even when some values were missing. AVAILABILITY: The code is available at http://genome.tugraz.at/BayesianClustering/

Cite this publication

%0 Journal Article %1 vogl_fully_2005 %A Vogl, C %A Sanchez-Cabo, F %A Stocker, G %A Hubbard, S %A Wolkenhauer, O %A Trajanoski, Z %D 2005 %J Bioinformatics (Oxford, England) %K Algorithms, Analysis, Array Artificial Automated Bayes Cluster Computer Expression Family, Gene Genetic, Intelligence, Models, Multigene Oligonucleotide Pattern Profiling, Recognition, Rostock SBI Sequence Simulation, Theorem, %P ii130--136 %R 10.1093/bioinformatics/bti1122 %T A fully Bayesian model to cluster gene-expression profiles %U http://www.ncbi.nlm.nih.gov/pubmed/16204092 %V 21 Suppl 2 %X MOTIVATION: With cDNA or oligonucleotide chips, gene-expression levels of essentially all genes in a genome can be simultaneously monitored over a time-course or under different experimental conditions. After proper normalization of the data, genes are often classified into co-expressed classes (clusters) to identify subgroups of genes that share common regulatory elements, a common function or a common cellular origin. With most methods, e.g. k-means, the number of clusters needs to be specified in advance; results depend strongly on this choice. Even with likelihood-based methods, estimation of this number is difficult. Furthermore, missing values often cause problems and lead to the loss of data. RESULTS: We propose a fully probabilistic Bayesian model to cluster gene-expression profiles. The number of classes does not need to be specified in advance; instead it is adjusted dynamically using a Reversible Jump Markov Chain Monte Carlo sampler. Imputation of missing values is integrated into the model. With simulations, we determined the speed of convergence of the sampler as well as the accuracy of the inferred variables. Results were compared with the widely used k-means algorithm. With our method, biologically related co-expressed genes could be identified in a yeast transcriptome dataset, even when some values were missing. AVAILABILITY: The code is available at http://genome.tugraz.at/BayesianClustering/

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A fully Bayesian model to cluster gene-expression profiles

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML A fully Bayesian model to cluster gene-expression profiles

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A fully Bayesian model to cluster gene-expression profiles

Comments and Reviews
(0)