Genome Research Econo tag

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Genome Res. 13:1916-1922, 2003
©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Li, X.
Right arrow Articles by Waterman, M. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Li, X.
Right arrow Articles by Waterman, M. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Methods

Estimating the Repeat Structure and Length of DNA Sequences Using {ell}-Tuples

Xiaoman Li1,3,4 and Michael S. Waterman1,2

1 Department of Mathematics, University of Southern California, Los Angeles, California 90089, USA 2 Celera Genomics, Rockville, Maryland 20850, USA

In shotgun sequencing projects, the genome or BAC length is not always known. We approach estimating genome length by first estimating the repeat structure of the genome or BAC, sometimes of interest in its own right, on the basis of a set of random reads from a genome project. Moreover, we can find the consensus for repeat families before assembly. Our methods are based on the {ell}-tuple content of the reads.


Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1251803.

3 Present address: Department of Statistics, Harvard University, Cambridge, MA 02138, USA.

4 Corresponding author. E-MAIL xiaomanl{at}yahoo.com; FAX (617) 496-8057.

[Supplemental material available online at www.genome.org.]

5 The left end of all reads consist of a homogeneous Poisson process with parameter c/L (Lander et al. 1988).

6 GroupNum is the maximal number of groups we used.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
Y. Zhang and M. S. Waterman
An Eulerian path approach to local multiple alignment for DNA sequences
PNAS, February 1, 2005; 102(5): 1285 - 1290.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
B. Raphael, D. Zhi, H. Tang, and P. Pevzner
A novel method for multiple alignment of sequences with repeated and shuffled elements
Genome Res., November 1, 2004; 14(11): 2336 - 2346.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
P. A. Pevzner, H. Tang, and G. Tesler
De Novo Repeat Classification and Fragment Assembly
Genome Res., September 1, 2004; 14(9): 1786 - 1796.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
R. DeMarco, A. T. Kowaltowski, A. A. Machado, M. B. Soares, C. Gargioni, T. Kawano, V. Rodrigues, A. M. B. N. Madeira, R. A. Wilson, C. F. M. Menck, et al.
Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with High Transcriptional Activities from the Human Parasite Schistosoma mansoni
J. Virol., March 15, 2004; 78(6): 2967 - 2978.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2003 by Cold Spring Harbor Laboratory Press.