Genome Research cityscape

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Genome Res. 13:2164-2170, 2003
©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Huang, X.
Right arrow Articles by Hillier, L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Huang, X.
Right arrow Articles by Hillier, L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Methods

PCAP: A Whole-Genome Assembly Program

Xiaoqiu Huang1,4, Jianmin Wang1, Srinivas Aluru2, Shiaw-Pyng Yang3 and LaDeana Hillier3

1 Department of Computer Science Iowa State University, Ames, Iowa 50011-1040, USA 2 Department of Electrical and Computer Engineering, Iowa State University, Ames, Iowa 50011-1040, USA 3 Genome Sequencing Center, Washington University Medical School, St. Louis, Missouri 63108, USA

We describe a whole-genome assembly program named PCAP for processing tens of millions of reads. The PCAP program has several features to address efficiency and accuracy issues in assembly. Multiple processors are used to perform most time-consuming computations in assembly. A more sensitive method is used to avoid missing overlaps caused by sequencing errors. Repetitive regions of reads are detected on the basis of many overlaps with other reads, instead of many shorter word matches with other reads. Contaminated end regions of reads are identified and removed. Generation of a consensus sequence for a contig is based on an alignment of reads in the contig, in which both base quality values and coverage information are used to determine every consensus base. The PCAP program was tested on a mouse whole-genome data set of 30 million reads and a human Chromosome 20 data set of 1.7 million reads. The program is freely available for academic use.


[The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: the Mouse Genome Sequencing Consortium 2002; J. Mullikin. The assembled mouse sequences are available at our Web site, http://seq.cs.iastate.edu.]

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1390403.

4 Corresponding author.
E-MAIL xqhuang{at}cs.iastate.edu; FAX (515) 294-0258.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Genome Res.Home page
D. R. Zerbino and E. Birney
Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
Genome Res., May 1, 2008; 18(5): 821 - 829.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Denisov, B. Walenz, A. L. Halpern, J. Miller, N. Axelrod, S. Levy, and G. Sutton
Consensus generation and variant detection by Celera Assembler
Bioinformatics, April 15, 2008; 24(8): 1035 - 1040.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. J. Chaisson and P. A. Pevzner
Short read fragment assembly of bacterial genomes
Genome Res., February 1, 2008; 18(2): 324 - 330.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. V. Zimin, D. R. Smith, G. Sutton, and J. A. Yorke
Assembly reconciliation
Bioinformatics, January 1, 2008; 24(1): 42 - 45.
[Abstract] [Full Text] [PDF]


Home page
DNA ResHome page
K. Kurokawa, T. Itoh, T. Kuwahara, K. Oshima, H. Toh, A. Toyoda, H. Takami, H. Morita, V. K. Sharma, T. P. Srivastava, et al.
Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes
DNA Res, October 16, 2007; (2007) dsm018v2.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
B. S. Samuel, E. E. Hansen, J. K. Manchester, P. M. Coutinho, B. Henrissat, R. Fulton, P. Latreille, K. Kim, R. K. Wilson, and J. I. Gordon
Genomic and metabolic adaptations of Methanobrevibacter smithii to the human gut
PNAS, June 19, 2007; 104(25): 10643 - 10648.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
R. L. Warren, D. Varabei, D. Platt, X. Huang, D. Messina, S.-P. Yang, J. W. Kronstad, M. Krzywinski, W. C. Warren, J. W. Wallis, et al.
Physical map-assisted whole-genome shotgun sequence assemblies.
Genome Res., June 1, 2006; 16(6): 768 - 775.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
X. Huang, S.-P. Yang, A. T. Chinwalla, L. W. Hillier, P. Minx, E. R. Mardis, and R. K. Wilson
Application of a superword array in genome assembly
Nucleic Acids Res., January 5, 2006; 34(1): 201 - 205.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. L. Salzberg and J. A. Yorke
Beware of mis-assembled genomes
Bioinformatics, December 15, 2005; 21(24): 4320 - 4321.
[Full Text] [PDF]


Home page
Plant Physiol.Home page
Q. Dong, C. J. Lawrence, S. D. Schlueter, M. D. Wilkerson, S. Kurtz, C. Lushbough, and V. Brendel
Comparative Plant Genomics Resources at PlantGDB
Plant Physiology, October 1, 2005; 139(2): 610 - 618.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. P. Vinson, D. B. Jaffe, K. O'Neill, E. K. Karlsson, N. Stange-Thomann, S. Anderson, J. P. Mesirov, N. Satoh, Y. Satou, C. Nusbaum, et al.
Assembly of polymorphic genomes: Algorithms and application to Ciona savignyi
Genome Res., August 1, 2005; 15(8): 1127 - 1135.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Xu and J. I. Gordon
MapLinker: a software tool that aids physical map-linked whole genome shotgun assembly
Bioinformatics, April 1, 2005; 21(7): 1265 - 1266.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
V. Magrini, W. C. Warren, J. Wallis, W. E. Goldman, J. Xu, E. R. Mardis, and J. D. McPherson
Fosmid-Based Physical Mapping of the Histoplasma capsulatum Genome
Genome Res., August 1, 2004; 14(8): 1603 - 1609.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
P. Havlak, R. Chen, K. J. Durbin, A. Egan, Y. Ren, X.-Z. Song, G. M. Weinstock, and R. A. Gibbs
The Atlas Genome Assembly System
Genome Res., April 1, 2004; 14(4): 721 - 732.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2003 by Cold Spring Harbor Laboratory Press.