Genome Research cityscape

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Published online before print August 9, 2006, 10.1101/gr.5144106
Genome Res. 16:1126-1135, 2006
©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
OPEN ACCESS ARTICLE
This Article
OPEN ACCESS ARTICLE
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow All Versions of this Article:
gr.5144106v1
16/9/1126    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Seringhaus, M.
Right arrow Articles by Gerstein, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Seringhaus, M.
Right arrow Articles by Gerstein, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Methods

Predicting essential genes in fungal genomes

Michael Seringhaus1,6, Alberto Paccanaro2,6, Anthony Borneman3, Michael Snyder1,3 and Mark Gerstein1,4,5,7

1Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA; 2Department of Computer Science, Royal Holloway University of London, Egham, TW20 0EX, United Kingdom; 3Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA; 4Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA; 5Department of Computer Science, Yale University, New Haven, Connecticut 06520, USA

Essential genes are required for an organism's viability, and the ability to identify these genes in pathogens is crucial to directed drug development. Predicting essential genes through computational methods is appealing because it circumvents expensive and difficult experimental screens. Most such prediction is based on homology mapping to experimentally verified essential genes in model organisms. We present here a different approach, one that relies exclusively on sequence features of a gene to estimate essentiality and offers a promising way to identify essential genes in unstudied or uncultured organisms. We identified 14 characteristic sequence features potentially associated with essentiality, such as localization signals, codon adaptation, GC content, and overall hydrophobicity. Using the well-characterized baker's yeast Saccharomyces cerevisiae, we employed a simple Bayesian framework to measure the correlation of each of these features with essentiality. We then employed the 14 features to learn the parameters of a machine learning classifier capable of predicting essential genes. We trained our classifier on known essential genes in S. cerevisiae and applied it to the closely related and relatively unstudied yeast Saccharomyces mikatae. We assessed predictive success in two ways: First, we compared all of our predictions with those generated by homology mapping between these two species. Second, we verified a subset of our predictions with eight in vivo knockouts in S. mikatae, and we present here the first experimentally confirmed essential genes in this species.


6 These authors contributed equally to this work.

7 Corresponding author.

E-mail mark.gerstein{at}yale.edu; fax (360) 838-7861.

[Supplemental material is available online at www.genome.org. and http://www.gersteinlab.org/proj/predess/.]

Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5144106. Freely available online through the Genome Research Open Access option.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?





Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2006 by Cold Spring Harbor Laboratory Press.