Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Genome Res. 13:1706-1718, 2003
©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kim, P. M.
Right arrow Articles by Tidor, B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kim, P. M.
Right arrow Articles by Tidor, B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Methods

Subsystem Identification Through Dimensionality Reduction of Large-Scale Gene Expression Data

Philip M. Kim1 and Bruce Tidor2,3,4

1 Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA 2 Biological Engineering Division, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA 3 Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA

The availability of parallel, high-throughput biological experiments that simultaneously monitor thousands of cellular observables provides an opportunity for investigating cellular behavior in a highly quantitative manner at multiple levels of resolution. One challenge to more fully exploit new experimental advances is the need to develop algorithms to provide an analysis at each of the relevant levels of detail. Here, the data analysis method non-negative matrix factorization (NMF) has been applied to the analysis of gene array experiments. Whereas current algorithms identify relationships on the basis of large-scale similarity between expression patterns, NMF is a recently developed machine learning technique capable of recognizing similarity between subportions of the data corresponding to localized features in expression space. A large data set consisting of 300 genome-wide expression measurements of yeast was used as sample data to illustrate the performance of the new approach. Local features detected are shown to map well to functional cellular subsystems. Functional relationships predicted by the new analysis are compared with those predicted using standard approaches; validation using bioinformatic databases suggests predictions using the new approach may be up to twice as accurate as some conventional approaches.


Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.903503.

4 Corresponding author.
E-MAIL tidor{at}mit.edu; FAX (617)252-1816.

[Supplemental material is available online at www.genome.org.]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
E. Mejia-Roa, P. Carmona-Saez, R. Nogales, C. Vicente, M. Vazquez, X. Y. Yang, C. Garcia, F. Tirado, and A. Pascual-Montano
bioNMF: a web-based tool for nonnegative matrix factorization in biology
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W523 - W528.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Bhattacharya and R. K. De
Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles
Bioinformatics, June 1, 2008; 24(11): 1359 - 1366.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
J. H. Graber, J. Salisbury, L. N. Hutchins, and T. Blumenthal
C. elegans sequences that control trans-splicing and operon pre-mRNA processing
RNA, September 1, 2007; 13(9): 1409 - 1426.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Kim and H. Park
Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis
Bioinformatics, June 15, 2007; 23(12): 1495 - 1502.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Tamayo, D. Scanfeld, B. L. Ebert, M. A. Gillette, C. W. M. Roberts, and J. P. Mesirov
Metagene projection for cross-platform, cross-species characterization of global transcriptional states
PNAS, April 3, 2007; 104(14): 5959 - 5964.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Li, Y. Sun, and M. Zhan
The discovery of transcriptional modules by a two-stage matrix decomposition approach
Bioinformatics, February 15, 2007; 23(4): 473 - 479.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
P. Fogel, S. S. Young, D. M. Hawkins, and N. Ledirac
Inferential, robust non-negative matrix factorization analysis of microarray data
Bioinformatics, January 1, 2007; 23(1): 44 - 49.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J.-P. Brunet, P. Tamayo, T. R. Golub, and J. P. Mesirov
Metagenes and molecular pattern discovery using matrix factorization
PNAS, March 23, 2004; 101(12): 4164 - 4169.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2003 by Cold Spring Harbor Laboratory Press.