Genome Research Attend a BioResearch Product Faire

Home Help [Feedback] [For Subscribers] [Archive] [Search] --
 QUICK SEARCH:   [advanced]


     


Published online before print October 12, 2004
Genome Research, DOI: 10.1101/gr.2816704
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
gr.2816704v1
14/11/2330    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tenney, A. E.
Right arrow Articles by Brent, M. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tenney, A. E.
Right arrow Articles by Brent, M. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Genome Research Vol 0, Issue 2004, gr.2816704, 2004
© 2004 Cold Spring Harbor Laboratory Press

Methods

Gene prediction and verification in a compact genome with numerous small introns

Aaron E. Tenney1, Randall H. Brown1, Charles Vaske1,4, Jennifer K. Lodge2, Tamara L. Doering3 and Michael R. Brent1,5

1 Laboratory for Computational Genomics and Department of Computer Science, Washington University, St. Louis, Missouri 63130, USA , 2 Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine, St. Louis, Missouri 63104, USA , 3 Department of Molecular Microbiology, Washington University Medical School, St. Louis, Missouri 63110-1093, USA

The genomes of clusters of related eukaryotes are now being sequenced at an increasing rate, creating a need for accurate, low-cost annotation of exon-intron structures. In this paper, we demonstrate that reverse transcription-polymerase chain reaction (RT-PCR) and direct sequencing based on predicted gene structures satisfy this need, at least for single-celled eukaryotes. The TWINSCAN gene prediction algorithm was adapted for the fungal pathogen Cryptococcus neoformans by using a precise model of intron lengths in combination with ungapped alignments between the genome sequences of the two closely related Cryptococcus varieties. This approach resulted in ~60% of known genes being predicted exactly right at every coding base and splice site. When previously unannotated TWINSCAN predictions were tested by RT-PCR and direct sequencing, 75% of targets spanning two predicted introns were amplified and produced high-quality sequence. When targets spanning the complete predicted open reading frame were tested, 72% of them amplified and produced high-quality sequence. We conclude that sequencing a small number of expressed sequence tags (ESTs) to provide training data, running TWINSCAN on an entire genome, and then performing RT-PCR and direct sequencing on all of its predictions would be a cost-effective method for obtaining an experimentally verified genome annotation.


[Supplemental material is available online at www.genome.org. All sequences, predictions, primers, traces, accession numbers, and links to software are available at http://genes.cse.wustl.edu/tenney-04-crypto-data/].

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2816704. Article published online before print in October 2004.

4 Present address: Dept. of Biomolecular Engineering, Univ. of California-Santa Cruz, Santa Cruz, California 95064, USA.

5 Corresponding author.
E-mail brent{at}cse.wustl.edu; fax (314) 935-7302.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Genome Res.Home page
D. DeCaprio, J. P. Vinson, M. D. Pearson, P. Montgomery, M. Doherty, and J. E. Galagan
Conrad: Gene prediction using conditional random fields
Genome Res., September 1, 2007; 17(9): 1389 - 1398.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
A. Yoneda and T. L. Doering
A Eukaryotic Capsular Polysaccharide Is Synthesized Intracellularly and Secreted via Exocytosis
Mol. Biol. Cell, December 1, 2006; 17(12): 5131 - 5140.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. E. Galagan, M. R. Henn, L.-J. Ma, C. A. Cuomo, and B. Birren
Genomics of the fungal kingdom: Insights into eukaryotic biology
Genome Res., December 1, 2005; 15(12): 1620 - 1631.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
C. Wei, P. Lamesch, M. Arumugam, J. Rosenberg, P. Hu, M. Vidal, and M. R. Brent
Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions
Genome Res., April 1, 2005; 15(4): 577 - 582.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Yandell, A. M. Bailey, S. Misra, S. Shu, C. Wiel, M. Evans-Holm, S. E. Celniker, and G. M. Rubin
A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome
PNAS, February 1, 2005; 102(5): 1566 - 1571.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] --
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2004 by Cold Spring Harbor Laboratory Press.