|
Vol. 11, Issue 5, 710-730, May 2001
LETTER
From First Base: The Sequence of the Tip of the X Chromosome of Drosophila melanogaster, a Comparison of Two Sequencing Strategies
Panayiotis V.
Benos,1,15
Melanie K.
Gatt,2,11
Lee
Murphy,3
David
Harris,3
Bart
Barrell,3
Concepcion
Ferraz,4
Sophie
Vidal,4
Christine
Brun,4
Jacques
Demaille,4
Edouard
Cadieu,5
Stephane
Dreano,5
Stéphanie
Gloux,5
Valerie
Lelaure,5
Stephanie
Mottier,5
Francis
Galibert,5
Dana
Borkova,6
Belen
Miñana,6
Fotis C.
Kafatos,6
Slava
Bolshakov,6,7
Inga
Sidén-Kiamos,7
George
Papagiannakis,7
Lefteris
Spanos,7
Christos
Louis,7,8
Encarnación
Madueño,9
Beatriz
de Pablos,9
Juan
Modolell,9
Annette
Peter,10
Petra
Schöttler,10
Meike
Werner,10
Fotini
Mourkioti,10
Nicole
Beinert,10
Gordon
Dowe,10
Ulrich
Schäfer,10
Herbert
Jäckle,10
Alain
Bucheton,4
Debbie
Callister,11
Lorna
Campbell,11
Nadine S.
Henderson,11
Paul J.
McMillan,11
Cathy
Salles,11
Evelyn
Tait,11
Phillipe
Valenti,11
Robert D.C.
Saunders,11,12
Alain
Billaud,13
Lior
Pachter,14
David M.
Glover,2,11 and
Michael
Ashburner1,2,16
1 EMBL Outstation, The European Bioinformatics Institute,
Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK;
2 Department of Genetics, University of Cambridge, Cambridge,
CB2 3EH, UK; 3 Sanger Centre, Wellcome Trust Genome Campus,
Hinxton, Cambridge, CB10 1SA, UK; 4 Montpellier University
Medical School, IGH-Institut de Génétique Humaine-CNRS,
34396 Montpellier Cedex 5, France; 5 UPR 41, CNRS,
Recombinaisons Génétiques, Faculte de Medecine, 35043 Rennes Cedex, France; 6 European Molecular Biology Laboratory
(EMBL), D-69117 Heidelberg, Germany; 7 Institute of Molecular
Biology and Biotechnology, FORTH, GR-71110 Heraklion, Greece;
8 Department of Biology, University of Crete, 71409 Heraklion,
Crete, Greece; 9 Centro de Biología Molecular Severo
Ochoa, CSIC and Universidad Autónoma de Madrid, 28049 Madrid,
Spain; 10 Max-Planck-Institut für biophysikalische
Chemie, Department of Molecular Developmental Biology, D-37070
Göttingen, Germany; 11 Department of Anatomy and
Physiology, CRC Cell Cycle Genetics Group, University of Dundee,
Dundee, DD1 4HN, UK; 12 Department of Biological Sciences, The
Open University, Milton Keynes, MK7 6AA, UK; 13 Fondation Jean
Dausset-CEPH (Centre d'Etude du Polymorphisme Humain), 75010 Paris,
France; 14 Department of Mathematics, University of California
at Berkeley, California 94720-3840, USA
We present the sequence of a contiguous 2.63 Mb of
DNA extending from the tip of the X chromosome of
Drosophila melanogaster. Within this sequence, we predict 277 protein coding genes, of which 94 had been sequenced already in the
course of studying the biology of their gene products, and examples of
12 different transposable elements. We show that an interval between
bands 3A2 and 3C2, believed in the 1970s to show a correlation between the number of bands on the polytene chromosomes and the 20 genes identified by conventional genetics, is predicted to contain 45 genes
from its DNA sequence. We have determined the insertion sites of
P-elements from 111 mutant lines, about half of which are in a
position likely to affect the expression of novel predicted genes, thus
representing a resource for subsequent functional genomic analysis. We
compare the European Drosophila Genome Project sequence with
the corresponding part of the independently assembled and annotated
Joint Sequence determined through "shotgun" sequencing. Discounting
differences in the distribution of known transposable elements between
the strains sequenced in the two projects, we detected three major
sequence differences, two of which are probably explained by errors in
assembly; the origin of the third major difference is unclear. In
addition there are eight sequence gaps within the Joint Sequence. At
least six of these eight gaps are likely to be sites of transposable
elements; the other two are complex. Of the 275 genes in common to both
projects, 60% are identical within 1% of their predicted amino-acid
sequence and 31% show minor differences such as in choice of
translation initiation or termination codons; the remaining 9% show
major differences in interpretation.
[All of the sequences analyzed in this paper have been deposited in
the EMBL-Bank database under the following accession nos.: AL009146,
AL009147, AL009171, AL009188-AL009196, AL021067, AL021086,
AL021106-AL021108, AL021726, AL021728, AL022017, AL022018, AL022139,
AL023873, AL023874, AL023893, AL024453, AL024455-AL024457, AL024485,
AL030993, AL030994, AL031024-AL031028, AL031128, AL031173, AL031366,
AL031367, AL031581-AL031583, AL031640, AL031765, AL031883, AL031884,
AL034388, AL034544, AL035104, AL035105, AL035207, AL035245, AL035331,
AL035632, AL049535, AL050231, AL050232, AL109630, AL121804, AL121806,
AL132651, AL132792, AL132797, AL133503-AL133506, AL138678, AL138971, AL138972, and Z98269. A single file (FASTA format) of the
2.6-Mb contig is available from
ftp://ftp.ebi.ac.uk/pub/databases/edgp/contigs/contig_1.fa.]
15
Present address: Department of Genetics, School of
Medicine, Washington University, 4566 Scott Avenue,St. Louis, MO 63110 USA.
16
Corresponding author.
11:710-730 ©2001 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/01 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
G. Bosco, P. Campbell, J. T. Leiva-Neto, and T. A. Markow
Analysis of Drosophila Species Genome Size and Satellite DNA Content Reveals Significant Differences Among Strains as Well as Between Species
Genetics,
November 1, 2007;
177(3):
1277 - 1290.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Ashburner and C. M. Bergman
Drosophila melanogaster: A case study of a model genomic sequence and its consequences
Genome Res.,
December 1, 2005;
15(12):
1661 - 1667.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. P. Abad, B. de Pablos, K. Osoegawa, P. J. de Jong, A. Martin-Gallardo, and A. Villasante
Genomic Analysis of Drosophila melanogaster Telomeres: Full-length Copies of HeT-A and TART Elements at Telomeres
Mol. Biol. Evol.,
September 1, 2004;
21(9):
1613 - 1619.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
O. Jaillon, C. Dossat, R. Eckenberg, K. Eiglmeier, B. Segurens, J.-M. Aury, C. W. Roth, C. Scarpelli, P. T. Brey, J. Weissenbach, et al.
Assessing the Drosophila melanogaster and Anopheles gambiae Genome Annotations Using Genome-Wide Sequence Comparisons
Genome Res.,
July 1, 2003;
13(7):
1595 - 1599.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Bray, I. Dubchak, and L. Pachter
AVID: A Global Alignment Program
Genome Res.,
January 1, 2003;
13(1):
97 - 102.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Bartolome, X. Maside, and B. Charlesworth
On the Abundance and Distribution of Transposable Elements in the Genome of Drosophila melanogaster
Mol. Biol. Evol.,
June 1, 2002;
19(6):
926 - 937.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. N. Bolshakov, P. Topalis, C. Blass, E. Kokoza, A. della Torre, F. C. Kafatos, and C. Louis
A Comparative Genomic Analysis of Two Distant Diptera, the Fruit Fly, Drosophila melanogaster, and the Malaria Mosquito, Anopheles gambiae
Genome Res.,
January 1, 2002;
12(1):
57 - 66.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|