Published online before print
October 12, 2004, 10.1101/gr.2648404
Genome Res. 14:2235-2244, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Letter
An intermediate grade of finished genomic sequence suitable for comparative analyses
Robert W. Blakesley1,2,3,
Nancy F. Hansen1,3,
James C. Mullikin1,2,3,
Pamela J. Thomas1,
Jennifer C. McDowell1,
Baishali Maskeri1,
Alice C. Young1,
Beatrice Benjamin1,
Shelise Y. Brooks1,
Bradley I. Coleman1,
Jyoti Gupta1,
Shi-Ling Ho1,
Eric M. Karlins1,
Quino L. Maduro1,
Sirintorn Stantripop1,
Cyrus Tsurgeon1,
Jennifer L. Vogt1,
Michelle A. Walker1,
Catherine A. Masiello1,
Xiaobin Guan1,
NISC Comparative Sequencing Program1,2,
Gerard G. Bouffard1,2 and
Eric D. Green1,2,4
1 NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
2 Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Although the cost of generating draft-quality genomic sequence continues to decline, refining that sequence by the process of "sequence finishing" remains expensive. Near-perfect finished sequence is an appropriate goal for the human genome and a small set of reference genomes; however, such a high-quality product cannot be cost-justified for large numbers of additional genomes, at least for the foreseeable future. Here we describe the generation and quality of an intermediate grade of finished genomic sequence (termed comparative-grade finished sequence), which is tailored for use in multispecies sequence comparisons. Our analyses indicate that this sequence is very high quality (with the residual gaps and errors mostly falling within repetitive elements) and reflects 99% of the total sequence. Importantly, comparative-grade sequence finishing requires 40-fold less reagents and 10-fold less personnel effort compared to the generation of near-perfect finished sequence, such as that produced for the human genome. Although applied here to finishing sequence derived from individual bacterial artificial chromosome (BAC) clones, one could envision establishing routines for refining sequences emanating from whole-genome shotgun sequencing projects to a similar quality level. Our experience to date demonstrates that comparative-grade sequence finishing represents a practical and affordable option for sequence refinement en route to comparative analyses.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2648404. Article published online before print in October 2004.
3 These authors contributed equally to this work.
4 Corresponding author. E-mail egreen{at}nhgri.nih.gov; fax (301) 402-2040.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
K. C. Worley, G. M. Weinstock, and R. A. Gibbs
Rats in the genomic era
Physiol Genomics,
February 19, 2008;
32(3):
273 - 282.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Caceres, National Institutes of Health Intramural Sequencin, R. T. Sullivan, and J. W. Thomas
A recurrent inversion on the eutherian X chromosome
PNAS,
November 20, 2007;
104(47):
18571 - 18576.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. H. Margulies, G. M. Cooper, G. Asimenos, D. J. Thomas, C. N. Dewey, A. Siepel, E. Birney, D. Keefe, A. S. Schwartz, M. Hou, et al.
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome
Genome Res.,
June 1, 2007;
17(6):
760 - 774.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Hurle, W. Swanson, NISC Comparative Sequencing Program, and E. D. Green
Comparative sequence analyses reveal rapid and divergent evolutionary changes of the WFDC locus in the primate lineage
Genome Res.,
March 1, 2007;
17(3):
276 - 286.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. E. Stajich and H. Lapp
Open source tools and toolkits for bioinformatics: significance, and where are we?
Brief Bioinform,
September 1, 2006;
7(3):
287 - 296.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Caceres, NISC Comparative Sequencing Program, and J. W. Thomas
The Gene of Retroviral Origin Syncytin 1 is Specific to Hominoids and is Inactive in Old World Monkeys
J. Hered.,
March 1, 2006;
97(2):
100 - 106.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Fadiel, I. Anidi, and K. D. Eichenbaum
Farm animal genomics and informatics: an update
Nucleic Acids Res.,
November 7, 2005;
33(19):
6308 - 6318.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. M. Cooper, E. A. Stone, G. Asimenos, NISC Comparative Sequencing Program, E. D. Green, S. Batzoglou, and A. Sidow
Distribution and intensity of constraint in mammalian genomic sequence
Genome Res.,
July 1, 2005;
15(7):
901 - 913.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. H. Margulies, J. P. Vinson, NISC Comparative Sequencing Program, W. Miller, D. B. Jaffe, K. Lindblad-Toh, J. L. Chang, E. D. Green, E. S. Lander, J. C. Mullikin, et al.
An initial strategy for the systematic identification of functional elements in the human genome by low-redundancy comparative sequencing
PNAS,
March 29, 2005;
102(13):
4795 - 4800.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
The ENCODE Project Consortium
The ENCODE (ENCyclopedia Of DNA Elements) Project
Science,
October 22, 2004;
306(5696):
636 - 640.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|