Genome Research songbird

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Reese, M. G.
Right arrow Articles by Lewis, S. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Reese, M. G.
Right arrow Articles by Lewis, S. E.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 10, Issue 4, 483-501, April 2000

LETTER
Genome Annotation Assessment in Drosophila melanogaster

Martin G. Reese,1,4 George Hartzell,1 Nomi L. Harris,1 Uwe Ohler,1,2 Josep F. Abril,3 and Suzanna E. Lewis1

1 Berkeley Drosophila Genome Project, Department of Molecular and Cell Biology, University of California, Berkeley, California 94720-3200 USA; 2 Chair for Pattern Recognition, University of Erlangen-Nuremberg, D-91058 Erlangen, Germany;3 Institut Municipal d'Investigació Médica---Universitat Pompeu Fabra, Department of Medical Informatics (IMIM---UPF), 08003 Barcelona, Spain

Computational methods for automated genome annotation are critical to our community's ability to make full use of the large volume of genomic sequence being generated and released. To explore the accuracy of these automated feature prediction tools in the genomes of higher organisms, we evaluated their performance on a large, well-characterized sequence contig from the Adh region of Drosophila melanogaster. This experiment, known as the Genome Annotation Assessment Project (GASP), was launched in May 1999. Twelve groups, applying state-of-the-art tools, contributed predictions for features including gene structure, protein homologies, promoter sites, and repeat elements. We evaluated these predictions using two standards, one based on previously unreleased high-quality full-length cDNA sequences and a second based on the set of annotations generated as part of an in-depth study of the region by a group of Drosophila experts. Although these standard sets only approximate the unknown distribution of features in this region, we believe that when taken in context the results of an evaluation based on them are meaningful. The results were presented as a tutorial at the conference on Intelligent Systems in Molecular Biology (ISMB-99) in August 1999. Over 95% of the coding nucleotides in the region were correctly identified by the majority of the gene finders, and the correct intron/exon structures were predicted for >40% of the genes. Homology-based annotation techniques recognized and associated functions with almost half of the genes in the region; the remainder were only identified by the ab initio techniques. This experiment also presents the first assessment of promoter prediction techniques for a significant number of genes in a large contiguous region. We discovered that the promoter predictors' high false-positive rates make their predictions difficult to use. Integrating gene finding and cDNA/EST alignments with promoter predictions decreases the number of false-positive classifications but discovers less than one-third of the promoters in the region. We believe that by establishing standards for evaluating genomic annotations and by assessing the performance of existing automated genome annotation tools, this experiment establishes a baseline that contributes to the value of ongoing large-scale annotation projects and should guide further research in genome informatics.


4 Corresponding author.


10:483-501 ©2000 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/00 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
J. Cell Biol.Home page
J. Zou, M. A. Hallen, C. D. Yankel, and S. A. Endow
A microtubule-destabilizing kinesin motor regulates spindle length and anchoring in oocytes
J. Cell Biol., February 6, 2008; 180(3): 459 - 466.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
C. M. Bergman and H. Quesneville
Discovering and detecting transposable elements in genome sequences
Brief Bioinform, November 1, 2007; 8(6): 382 - 392.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
U. Ohler
Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction
Nucleic Acids Res., November 6, 2006; 34(20): 5943 - 5950.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
I. Friedberg
Automated protein function prediction--the genomic challenge
Brief Bioinform, September 1, 2006; 7(3): 225 - 242.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
S. Bandyopadhyay, R. Sharan, and T. Ideker
Systematic identification of functional orthologs based on protein network comparison
Genome Res., March 1, 2006; 16(3): 428 - 435.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
L. Yu, P. M. Haverty, J. Mariani, Y. Wang, H.-Y. Shen, M. A. Schwarzschild, Z. Weng, and J.-F. Chen
Genetic and pharmacological inactivation of adenosine A2A receptor reveals an Egr-2-mediated transcriptional regulatory network in the mouse striatum
Physiol Genomics, September 21, 2005; 23(1): 89 - 102.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
B. C. Meyers, S. S. Tej, T. H. Vu, C. D. Haudenschild, V. Agrawal, S. B. Edberg, H. Ghazal, and S. Decola
The Use of MPSS for Whole-Genome Transcriptional Analysis in Arabidopsis
Genome Res., August 1, 2004; 14(8): 1641 - 1653.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. V. Sun, L. Chen, F. Greil, N. Negre, T.-R. Li, G. Cavalli, H. Zhao, B. van Steensel, and K. P. White
Protein-DNA interaction mapping using genomic tiling path microarrays in Drosophila
PNAS, August 5, 2003; 100(16): 9428 - 9433.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer
Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes
Plant Physiology, July 1, 2003; 132(3): 1162 - 1176.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Sorek and H. M. Safer
A novel algorithm for computational identification of contaminated EST libraries
Nucleic Acids Res., February 1, 2003; 31(3): 1067 - 1074.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
G. Parra, P. Agarwal, J. F. Abril, T. Wiehe, J. W. Fickett, and R. Guigo
Comparative Gene Prediction in Human and Mouse
Genome Res., January 1, 2003; 13(1): 108 - 117.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
L. D. Stein, C. Mungall, S. Shu, M. Caudy, M. Mangone, A. Day, E. Nickerson, J. E. Stajich, T. W. Harris, A. Arva, et al.
The Generic Genome Browser: A Building Block for a Model Organism System Database
Genome Res., October 1, 2002; 12(10): 1599 - 1610.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
M. S. Halfon and A. M. Michelson
Exploring genetic regulatory networks in metazoan development: methods and models
Physiol Genomics, September 3, 2002; 10(3): 131 - 143.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
D. S. Hekmat-Scafe, C. R. Scafe, A. J. McKinney, and M. A. Tanouye
Genome-Wide Analysis of the Odorant-Binding Protein Gene Family in Drosophila melanogaster
Genome Res., September 1, 2002; 12(9): 1357 - 1369.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
K. L. Howe, T. Chothia, and R. Durbin
GAZE: A Generic Framework for the Integration of Gene-Prediction Data by Dynamic Programming
Genome Res., September 1, 2002; 12(9): 1418 - 1427.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. Stapleton, G. Liao, P. Brokstein, L. Hong, P. Carninci, T. Shiraki, Y. Hayashizaki, M. Champe, J. Pacleb, K. Wan, et al.
The Drosophila Gene Collection: Identification of Putative Full-Length cDNAs for 70% of D. melanogaster Genes
Genome Res., August 1, 2002; 12(8): 1294 - 1300.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
X. Morin, R. Daneman, M. Zavortink, and W. Chia
A protein trap strategy to detect GFP-tagged proteins expressed from their endogenous loci in Drosophila
PNAS, December 6, 2001; (2001) 261408198.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. J. Schmid and C. F. Aquadro
The Evolutionary Analysis of ""Orphans"" From the Drosophila Genome Identifies Rapidly Diverging and Incorrectly Annotated Genes
Genetics, October 1, 2001; 159(2): 589 - 598.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
R. S. Hewes and P. H. Taghert
Neuropeptides and Neuropeptide Receptors in the Drosophila melanogaster Genome
Genome Res., June 1, 2001; 11(6): 1126 - 1142.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
Z. Kan, E. C. Rouchka, W. R. Gish, and D. J. States
Gene Structure Prediction and Alternative Splicing Analysis Using Genomically Aligned ESTs
Genome Res., May 1, 2001; 11(5): 889 - 900.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
C. Gemund, C. Ramu, B. Altenberg-Greulich, and T. J. Gibson
Gene2EST: a BLAST2 server for searching expressed sequence tag (EST) databases with eukaryotic gene-sized queries
Nucleic Acids Res., March 15, 2001; 29(6): 1272 - 1277.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. Scherf, A. Klingenhoff, K. Frech, K. Quandt, R. Schneider, K. Grote, M. Frisch, V. Gailus-Durner, A. Seidel, R. Brack-Werner, et al.
First Pass Annotation of Promoters on Human Chromosome 22
Genome Res., March 1, 2001; 11(3): 333 - 340.
[Abstract] [Full Text]


Home page
Genome Res.Home page
J. Andrews, G. G. Bouffard, C. Cheadle, J. Lü, K. G. Becker, and B. Oliver
Gene Discovery Using Computational and Microarray Analysis of Transcription in the Drosophila melanogaster Testis
Genome Res., December 1, 2000; 10(12): 2030 - 2043.
[Abstract] [Full Text]


Home page
Genome Res.Home page
G. K.-S. Wong, D. A. Passey, Y.-z. Huang, Z. Yang, and J. Yu
Is "Junk" DNA Mostly Intron DNA?
Genome Res., November 1, 2000; 10(11): 1672 - 1678.
[Abstract] [Full Text]


Home page
Genome Res.Home page
R. Guigó, P. Agarwal, J. F. Abril, M. Burset, and J. W. Fickett
An Assessment of Gene Prediction Accuracy in Large DNA Sequences
Genome Res., October 1, 2000; 10(10): 1631 - 1642.
[Abstract] [Full Text]


Home page
J. Cell Biol.Home page
M. E. Fortini, M. P. Skupski, M. S. Boguski, and I. K. Hariharan
A Survey of Human Disease Gene Counterparts in the Drosophila Genome
J. Cell Biol., July 24, 2000; 150(2): F23 - F30.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Biol.Home page
D. K. Morrison, M. S. Murakami, and V. Cleghon
Protein Kinases and Phosphatases in the Drosophila Genome
J. Cell Biol., July 24, 2000; 150(2): F57 - F62.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
X. Morin, R. Daneman, M. Zavortink, and W. Chia
A protein trap strategy to detect GFP-tagged proteins expressed from their endogenous loci in Drosophila
PNAS, December 18, 2001; 98(26): 15050 - 15055.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
S. Bandyopadhyay, R. Sharan, and T. Ideker
Systematic identification of functional orthologs based on protein network comparison
Genome Res., March 1, 2006; 16(3): 428 - 435.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.