Vol. 10, Issue 4, 502-510, April 2000
METHODS
MAGPIE/EGRET Annotation of the 2.9-Mb Drosophila melanogaster Adh Region
Terry
Gaasterland,1,4
Alexander
Sczyrba,1
Elizabeth
Thomas,1,2
Gulriz
Aytekin-Kurban,1
Paul
Gordon,3 and
Christoph W.
Sensen3
1 The Rockefeller University, Laboratory of Computational
Genomics, New York, New York 10021 USA; 2 Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York 11724 USA;
3 Institute for Marine Biosciences, Halifax, Nova Scotia, Canada
Our challenge in annotating the 2.91-Mb Adh region of the
Drosophila melanogaster genome was to identify genetic and
genomic features automatically, completely, and precisely within a
6-week period. To do so, we augmented the MAGPIE microbial
genome annotation system to handle eukaryotic genomic sequence data. The new configuration required the integration of eukaryotic
gene-finding tools and DNA repeat tools into the automatic data
collection module. It also required us to define in MAGPIE
new strategies to combine data about eukaryotic exon predictions with
functional data to refine the exon predictions. At the heart of the
resulting new eukaryotic genome annotation system is a reverse
comparison of public protein and complementary DNA sequences against
the input genome to identify missing exons and to refine exon
boundaries. The software modules that add eukaryotic genome annotation
capability to MAGPIE are available as EGRET
(Eukaryotic Genome Rapid
Evaluation Tool).
4
Corresponding author.
10:502-510 ©2000 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/00 $5.00