Genome Research cityscape

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Flicek, P.
Right arrow Articles by Brent, M. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Flicek, P.
Right arrow Articles by Brent, M. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Vol 13, Issue 1, 46-54, January 2003

LETTER

Leveraging the Mouse Genome for Gene Prediction in Human: From Whole-Genome Shotgun Reads to a Global Synteny Map

Paul Flicek1,2, Evan Keibler1, Ping Hu1, Ian Korf1,3 and Michael R. Brent1,4

1Department of Computer Science and Engineering and2 Department of Biomedical Engineering, Washington University, St. Louis, Missouri 63130, USA

The availability of draft sequences for both the mouse and human genomes makes it possible, for the first time, to annotate whole mammalian genomes using comparative methods. TWINSCAN is a gene-prediction system that combines the methods of single-genome predictors like GENSCAN with information derived from genome comparison, thereby improving accuracy. Because TWINSCAN uses genomic sequence only, it is less biased toward highly and/or ubiquitously expressed genes than GENEWISE, GENOMESCAN, and other methods based on evidence derived from transcripts. We show that TWINSCAN improves gene prediction in human using intermediate products from various stages of the sequencing and analysis of the mouse genome, from low-redundancy, whole-genome shotgun reads to the draft assembly and the synteny map. TWINSCAN improves on the prior state of the art even when alignments from only 1X coverage of the mouse genome are available. Gene prediction accuracy improves steadily from 1X through 3X, more slowly from 3X to 4X, and relatively little thereafter. The assembly and the synteny map greatly speed the computations, however. Our human annotation using the mouse assembly is conservative, predicting only 25,622 genes, and appears to be one of the best de novo annotations of the human genome to date.


3 Present address: The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.

4 Corresponding author.

E-MAIL brent{at}cse.wustl.edu; FAX (314) 935-7302.

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.830003.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Genome Res.Home page
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al.
Targeted discovery of novel human exons by comparative genomics
Genome Res., December 1, 2007; 17(12): 1763 - 1773.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Keibler, M. Arumugam, and M. R. Brent
The Treeterbi and Parallel Treeterbi algorithms: efficient, optimal decoding for ordinary, generalized and pair HMMs
Bioinformatics, March 1, 2007; 23(5): 545 - 554.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Agrawal and G. D. Stormo
Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans
Bioinformatics, May 15, 2006; 22(10): 1239 - 1244.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. J. van Baren and M. R. Brent
Iterative gene prediction and pseudogene removal improves genome annotation.
Genome Res., May 1, 2006; 16(5): 678 - 685.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. S. Hinrichs, D. Karolchik, R. Baertsch, G. P. Barber, G. Bejerano, H. Clawson, M. Diekhans, T. S. Furey, R. A. Harte, F. Hsu, et al.
The UCSC Genome Browser Database: update 2006
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D590 - D598.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. R. Brent
Genome annotation past, present, and future: How to define an ORF at each locus
Genome Res., December 1, 2005; 15(12): 1777 - 1786.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. E. Allen and S. L. Salzberg
JIGSAW: integration of multiple sources of evidence for gene prediction
Bioinformatics, September 15, 2005; 21(18): 3596 - 3603.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. Ayele, B. J. Haas, N. Kumar, H. Wu, Y. Xiao, S. Van Aken, T. R. Utterback, J. R. Wortman, O. R. White, and C. D. Town
Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis
Genome Res., April 1, 2005; 15(4): 487 - 495.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
C. Wei, P. Lamesch, M. Arumugam, J. Rosenberg, P. Hu, M. Vidal, and M. R. Brent
Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions
Genome Res., April 1, 2005; 15(4): 577 - 582.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Yandell, A. M. Bailey, S. Misra, S. Shu, C. Wiel, M. Evans-Holm, S. E. Celniker, and G. M. Rubin
A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome
PNAS, February 1, 2005; 102(5): 1566 - 1571.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. E. Tenney, R. H. Brown, C. Vaske, J. K. Lodge, T. L. Doering, and M. R. Brent
Gene prediction and verification in a compact genome with numerous small introns
Genome Res., November 1, 2004; 14(11): 2330 - 2335.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. Birney, M. Clamp, and R. Durbin
GeneWise and Genomewise
Genome Res., May 1, 2004; 14(5): 988 - 995.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. Q. Wu, D. Shteynberg, M. Arumugam, R. A. Gibbs, and M. R. Brent
Identification of Rat Genes by TWINSCAN Gene Prediction, RT-PCR, and Direct Sequencing
Genome Res., April 1, 2004; 14(4): 665 - 671.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. E. Allen, M. Pertea, and S. L. Salzberg
Computational Gene Prediction Using Multiple Sources of Evidence
Genome Res., January 1, 2004; 14(1): 142 - 148.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. H. Margulies, M. Blanchette, NISC Comparative Sequencing Program, D. Haussler, and E. D. Green
Identification and Characterization of Multi-Species Conserved Sequences
Genome Res., December 1, 2003; 13(12): 2507 - 2518.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. Guigo, E. T. Dermitzakis, P. Agarwal, C. P. Ponting, G. Parra, A. Reymond, J. F. Abril, E. Keibler, R. Lyle, C. Ucla, et al.
Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes
PNAS, February 4, 2003; 100(3): 1140 - 1145.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.