Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wang, J.
Right arrow Articles by Li, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wang, J.
Right arrow Articles by Li, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 5, 824-831, May 2002

METHODS
RePS: A Sequence Assembler That Masks Exact Repeats Identified from the Shotgun Data

Jun Wang,1,2,3,5 Gane Ka-Shu Wong,1,2,4,5 Peixiang Ni,2 Yujun Han,2 Xiangang Huang,2 Jianguo Zhang,2 Chen Ye,2 Yong Zhang,2,3 Jianfei Hu,2,3 Kunlin Zhang,2,3 Xin Xu,1 Lijuan Cong,1 Hong Lu,1 Xide Ren,1 Xiaoyu Ren,1 Jun He,1 Lin Tao,1,2 Douglas A. Passey,4 Jian Wang,1,2 Huanming Yang,1,2 Jun Yu,1,2,4 and Songgang Li2,3

1 Hangzhou Genomics Institute, Institute of Bioinformatics of Zhejiang University, Key Laboratory of Bioinformatics of Zhejiang Province, Hangzhou 310007, China; 2 Beijing Genomic Institute, Center of Genomics and Bioinformatics, Chinese Academy of Sciences, Beijing, 101300, China; 3 College of Life Sciences, Peking University, Beijing, 100871, China; 4 University of Washington Genome Center, Department of Medicine, Fluke Hall, M/C 352145, Seattle, Washington 98195, USA

We describe a sequence assembler, RePS (repeat-masked Phrap with scaffolding), that explicitly identifies exact 20mer repeats from the shotgun data and removes them prior to the assembly. The established software Phrap is used to compute meaningful error probabilities for each base. Clone-end-pairing information is used to construct scaffolds that order and orient the contigs. We show with real data for human and rice that reasonable assemblies are possible even at coverages of only 4× to 6×, despite having up to 42.2% in exact repeats.

[The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: P. Green and A.F. Smit.]


5 Corresponding authors.


12:824-831 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
G. Denisov, B. Walenz, A. L. Halpern, J. Miller, N. Axelrod, S. Levy, and G. Sutton
Consensus generation and variant detection by Celera Assembler
Bioinformatics, April 15, 2008; 24(8): 1035 - 1040.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Morgulis, E. M. Gertz, A. A. Schaffer, and R. Agarwala
WindowMasker: window-based masker for sequenced genomes
Bioinformatics, January 15, 2006; 22(2): 134 - 141.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Wang, Q. Xia, X. He, M. Dai, J. Ruan, J. Chen, G. Yu, H. Yuan, Y. Hu, R. Li, et al.
SilkDB: a knowledgebase for silkworm biology and genomics
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D399 - D402.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
P. A. Pevzner, H. Tang, and G. Tesler
De Novo Repeat Classification and Fragment Assembly
Genome Res., September 1, 2004; 14(9): 1786 - 1796.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
J. Tang, H. Xia, M. Cao, X. Zhang, W. Zeng, S. Hu, W. Tong, J. Wang, J. Wang, J. Yu, et al.
A Comparison of Rice Chloroplast Genomes
Plant Physiology, May 1, 2004; 135(1): 412 - 420.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
X. Huang, J. Wang, S. Aluru, S.-P. Yang, and L. Hillier
PCAP: A Whole-Genome Assembly Program
Genome Res., September 1, 2003; 13(9): 2164 - 2170.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
X. Li and M. S. Waterman
Estimating the Repeat Structure and Length of DNA Sequences Using {ell}-Tuples
Genome Res., August 1, 2003; 13(8): 1916 - 1922.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. C. Mullikin and Z. Ning
The Phusion Assembler
Genome Res., January 1, 2003; 13(1): 81 - 90.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.