Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Gibbons, F. D.
Right arrow Articles by Roth, F. P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gibbons, F. D.
Right arrow Articles by Roth, F. P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 10, 1574-1581, October 2002

METHODS
Judging the Quality of Gene Expression-Based Clustering Methods Using Gene Annotation

Francis D. Gibbons, and Frederick P. Roth1

Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA

We compare several commonly used expression-based gene clustering algorithms using a figure of merit based on the mutual information between cluster membership and known gene attributes. By studying various publicly available expression data sets we conclude that enrichment of clusters for biological function is, in general, highest at rather low cluster numbers. As a measure of dissimilarity between the expression patterns of two genes, no method outperforms Euclidean distance for ratio-based measurements, or Pearson distance for non-ratio-based measurements at the optimal choice of cluster number. We show the self-organized-map approach to be best for both measurement types at higher numbers of clusters. Clusters of genes derived from single- and average-linkage hierarchical clustering tend to produce worse-than-random results.

[The algorithm described is available at http://llama.med.harvard.edu, under Software.]


1 Corresponding author.


12:1574-1581 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
A. Bhattacharya and R. K. De
Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles
Bioinformatics, June 1, 2008; 24(11): 1359 - 1366.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. H. Singh, D. M. Wolf, P. Wang, and A. P. Arkin
Modularity of stress response evolution
PNAS, May 27, 2008; 105(21): 7500 - 7505.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
J. C. Trinidad, A. Thalhammer, C. G. Specht, A. J. Lynn, P. R. Baker, R. Schoepfer, and A. L. Burlingame
Quantitative Analysis of Synaptic Phosphorylation and Protein Expression
Mol. Cell. Proteomics, April 1, 2008; 7(4): 684 - 696.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Joshi, Y. Van de Peer, and T. Michoel
Analysis of a Gibbs sampler method for model-based clustering of gene expression data
Bioinformatics, January 15, 2008; 24(2): 176 - 183.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Dotan-Cohen, A. A. Melkman, and S. Kasif
Hierarchical tree snipping: clustering guided by prior knowledge
Bioinformatics, December 15, 2007; 23(24): 3335 - 3342.
[Abstract] [Full Text] [PDF]


Home page
Ann. Surg. Oncol.Home page
C. Duong, D. M. Greenawalt, A. Kowalczyk, M. L. Ciavarella, G. Raskutti, W. K. Murray, W. A. Phillips, and R. J. S. Thomas
Pretreatment Gene Expression Profiles Can Be Used to Predict Response to Neoadjuvant Chemoradiotherapy in Esophageal Cancer.
Ann. Surg. Oncol., December 1, 2007; 14(12): 3602 - 3609.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Shi, M. Klustein, I. Simon, T. Mitchell, and Z. Bar-Joseph
Continuous hidden process model for time series expression experiments
Bioinformatics, July 1, 2007; 23(13): i459 - i467.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Li, Y. Sun, and M. Zhan
The discovery of transcriptional modules by a two-stage matrix decomposition approach
Bioinformatics, February 15, 2007; 23(4): 473 - 479.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D.-W. Kim, K.-Y. Lee, K. H. Lee, and D. Lee
Towards clustering of incomplete microarray data without the use of imputation
Bioinformatics, January 1, 2007; 23(1): 107 - 113.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. K. Ng, G. J. McLachlan, K. Wang, L. Ben-Tovim Jones, and S.-W. Ng
A Mixture model with random-effects components for clustering correlated gene-expression profiles
Bioinformatics, July 15, 2006; 22(14): 1745 - 1752.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Tuikkala, L. Elo, O. S. Nevalainen, and T. Aittokallio
Improving missing value estimation in microarray data with gene ontology
Bioinformatics, March 1, 2006; 22(5): 566 - 572.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Grotkjaer, O. Winther, B. Regenberg, J. Nielsen, and L. K. Hansen
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm
Bioinformatics, January 1, 2006; 22(1): 58 - 67.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Persson, H. Wei, J. Milne, G. P. Page, and C. R. Somerville
Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets
PNAS, June 14, 2005; 102(24): 8633 - 8638.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D.-W. Kim, K. H. Lee, and D. Lee
Detecting clusters of different geometrical shapes in microarray gene expression data
Bioinformatics, May 1, 2005; 21(9): 1927 - 1934.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. R. Pinto, L. A. Cowart, Y. A. Hannun, B. Rohrer, and J. S. Almeida
Local correlation of expression profiles with gene annotations--proof of concept for a general conciliatory method
Bioinformatics, April 1, 2005; 21(7): 1037 - 1045.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. Arnau, S. Mars, and I. Marín
Iterative Cluster Analysis of Protein Interaction Data
Bioinformatics, February 1, 2005; 21(3): 364 - 378.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. H. Bo, B. Dysvik, and I. Jonassen
LSimpute: accurate estimation of missing values in microarray data with least squares methods
Nucleic Acids Res., February 20, 2004; 32(3): e34 - e34.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.