Genome Research scroll

Home Help [Feedback] [For Subscribers] [Archive] [Search] --
 QUICK SEARCH:   [advanced]


     


Published online before print March 13, 2008
Genome Research, DOI: 10.1101/gr.7337908
This Article
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow All Versions of this Article:
gr.7337908v1
18/5/810    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Google Scholar
Right arrow Articles by Butler, J.
Right arrow Articles by Jaffe, D. B.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Butler, J.
Right arrow Articles by Jaffe, D. B.
Related Content
Right arrowRelated Articles
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Resource

ALLPATHS: De novo assembly of whole-genome shotgun microreads

Jonathan Butler1, Iain MacCallum1, Michael Kleber1,6, Ilya A. Shlyakhter1, Matthew K. Belmonte1,2, Eric S. Lander1,3,4,5, Chad Nusbaum1, and David B. Jaffe1,7

1 Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA; 2 Department of Human Development, Cornell University, Ithaca, New York 14853, USA; 3 Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02139, USA; 4 Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; 5 Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, USA

New DNA sequencing technologies deliver data at dramatically lower costs but demand new analytical methods to take full advantage of the very short reads that they produce. We provide an initial, theoretical solution to the challenge of de novo assembly from whole-genome shotgun "microreads." For 11 genomes of sizes up to 39 Mb, we generated high-quality assemblies from 80x coverage by paired 30-base simulated reads modeled after real Illumina-Solexa reads. The bacterial genomes of Campylobacter jejuni and Escherichia coli assemble optimally, yielding single perfect contigs, and larger genomes yield assemblies that are highly connected and accurate. Assemblies are presented in a graph form that retains intrinsic ambiguities such as those arising from polymorphism, thereby providing information that has been absent from previous genome assemblies. For both C. jejuni and E. coli, this assembly graph is a single edge encompassing the entire genome. Larger genomes produce more complicated graphs, but the vast majority of the bases in their assemblies are present in long edges that are nearly always perfect. We describe a general method for genome assembly that can be applied to all types of DNA sequence data, not only short read data, but also conventional sequence reads.


6 Present addresses:

Google, Inc., Cambridge, MA 02142, USA

7 Corresponding author.

E-mail jaffe{at}broad.mit.edu; fax (617) 258-0901.

[Supplemental material is available online at www.genome.org.]

Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.7337908


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?

Related Articles

De novo bacterial genome sequencing: Millions of very short reads assembled on a desktop computer
David Hernandez, Patrice François, Laurent Farinelli, Magne Østerås, and Jacques Schrenzel
Genome Res. 2008 18: 802-809. [Abstract] [Full Text] [PDF]

Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
Daniel R. Zerbino and Ewan Birney
Genome Res. 2008 18: 821-829. [Abstract] [Full Text] [PDF]

SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing
Juliane C. Dohm, Claudio Lottaz, Tatiana Borodina, and Heinz Himmelbauer
Genome Res. 2007 17: 1697-1706. [Abstract] [Full Text] [PDF]






Home Help [Feedback] [For Subscribers] [Archive] [Search] --
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2008 by Cold Spring Harbor Laboratory Press.