Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Published online before print July 19, 2002, 10.1101/gr.88502
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
GR-885Rv1
12/8/1269    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Bao, Z.
Right arrow Articles by Eddy, S. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bao, Z.
Right arrow Articles by Eddy, S. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 8, 1269-1276, August 2002

METHODS
Automated De Novo Identification of Repeat Sequence Families in Sequenced Genomes

Zhirong Bao, and Sean R. Eddy1

Howard Hughes Medical Institute and Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA

Repetitive sequences make up a major part of eukaryotic genomes. We have developed an approach for the de novo identification and classification of repeat sequence families that is based on extensions to the usual approach of single linkage clustering of local pairwise alignments between genomic sequences. Our extensions use multiple alignment information to define the boundaries of individual copies of the repeats and to distinguish homologous but distinct repeat element families. When tested on the human genome, our approach was able to properly identify and group known transposable elements. The program, RECON, should be useful for first-pass automatic classification of repeats in newly sequenced genomes.

[The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: R. Klein.]


1 Corresponding author.


12:1269-1276 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
DNA ResHome page
S. Sato, Y. Nakamura, T. Kaneko, E. Asamizu, T. Kato, M. Nakao, S. Sasamoto, A. Watanabe, A. Ono, K. Kawashima, et al.
Genome Structure of the Legume, Lotus japonicus
DNA Res, May 28, 2008; (2008) dsn008v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Saha, S. Bridges, Z. V. Magbanua, and D. G. Peterson
Empirical comparison of ab initio repeat finding programs
Nucleic Acids Res., April 1, 2008; 36(7): 2284 - 2294.
[Abstract] [Full Text] [PDF]


Home page
DNA ResHome page
T. Kaneko, N. Nakajima, S. Okamoto, I. Suzuki, Y. Tanabe, M. Tamaoki, Y. Nakamura, F. Kasai, A. Watanabe, K. Kawashima, et al.
Complete Genomic Structure of the Bloom-forming Toxic Cyanobacterium Microcystis aeruginosa NIES-843
DNA Res, January 11, 2008; (2008) dsm026v1.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
B. A. Kronmiller and R. P. Wise
TEnest: Automated Chronological Annotation and Visualization of Nested Plant Transposable Elements
Plant Physiology, January 1, 2008; 146(1): 45 - 59.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. A. Campbell, W. Zhu, N. Jiang, H. Lin, S. Ouyang, K. L. Childs, B. J. Haas, J. P. Hamilton, and C. R. Buell
Identification and Characterization of Lineage-Specific Genes within the Poaceae
Plant Physiology, December 1, 2007; 145(4): 1311 - 1322.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
C. M. Bergman and H. Quesneville
Discovering and detecting transposable elements in genome sequences
Brief Bioinform, November 1, 2007; 8(6): 382 - 392.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
V. L. Jensen, P. S. Albert, and D. L. Riddle
Caenorhabditis elegans SDF-9 Enhances Insulin/Insulin-Like Signaling Through Interaction With DAF-2
Genetics, September 1, 2007; 177(1): 661 - 666.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Hou, P. Berman, C.-H. Hsu, and R. S. Harris
HomologMiner: looking for homologous genomic groups in whole genomes
Bioinformatics, April 15, 2007; 23(8): 917 - 925.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. S. Small, M. Brudno, M. M. Hill, and A. Sidow
Extreme genomic variation in a natural population
PNAS, March 27, 2007; 104(13): 5698 - 5703.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Dieterich, W. Roeseler, P. Sobetzko, and R. J. Sommer
Pristionchus.org: a genome-centric database of the nematode satellite species Pristionchus pacificus
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D498 - D502.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Achaz, F. Boyer, E. P. C. Rocha, A. Viari, and E. Coissac
Repseek, a tool to retrieve approximate repeats from large DNA sequences
Bioinformatics, January 1, 2007; 23(1): 119 - 121.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
S. M. Johnson, F. J. Tan, H. L. McCullough, D. P. Riordan, and A. Z. Fire
Flexibility and constraint in the nucleosome core landscape of Caenorhabditis elegans chromatin
Genome Res., December 1, 2006; 16(12): 1505 - 1516.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. Holligan, X. Zhang, N. Jiang, E. J. Pritham, and S. R. Wessler
The Transposable Element Landscape of the Model Legume Lotus japonicus
Genetics, December 1, 2006; 174(4): 2215 - 2228.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Tempel, M. Giraud, D. Lavenier, I.-C. Lerman, A.-S. Valin, I. Couee, A. E. Amrani, and J. Nicolas
Domain organization within repeated DNA sequences: application to the study of a family of transposable elements
Bioinformatics, August 15, 2006; 22(16): 1948 - 1954.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Toth, G. Deak, E. Barta, and G. B. Kiss
PLOTREP: a web tool for defragmentation and visual analysis of dispersed genomic repeats.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W708 - W713.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. T. Webster, E. Axelsson, and H. Ellegren
Strong Regional Biases in Nucleotide Substitution in the Chicken Genome
Mol. Biol. Evol., June 1, 2006; 23(6): 1203 - 1216.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
P. Bertone, V. Trifonov, J. S. Rozowsky, F. Schubert, O. Emanuelsson, J. Karro, M.-Y. Kao, M. Snyder, and M. Gerstein
Design optimization methods for genomic DNA tiling arrays
Genome Res., February 1, 2006; 16(2): 271 - 281.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Morgulis, E. M. Gertz, A. A. Schaffer, and R. Agarwala
WindowMasker: window-based masker for sequenced genomes
Bioinformatics, January 15, 2006; 22(2): 134 - 141.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. P. Chan, G. Pertea, F. Cheung, D. Lee, L. Zheng, C. Whitelaw, A. C. Pontaroli, P. SanMiguel, Y. Yuan, J. Bennetzen, et al.
The TIGR Maize Database
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D771 - D776.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Schneeberger, K. Malde, E. Coward, and I. Jonassen
Masking repeats while clustering ESTs
Nucleic Acids Res., April 14, 2005; 33(7): 2176 - 2180.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. H. Margulies, NISC Comparative Sequencing Program, V. V. B. Maduro, P. J. Thomas, J. P. Tomkins, C. T. Amemiya, M. Luo, and E. D. Green
Comparative sequencing provides insights about the structure and conservation of marsupial and monotreme genomes
PNAS, March 1, 2005; 102(9): 3354 - 3359.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Campagna, C. Romualdi, N. Vitulo, M. Del Favero, M. Lexa, N. Cannata, and G. Valle
RAP: a new computer program for de novo identification of repeated sequences in whole genomes
Bioinformatics, March 1, 2005; 21(5): 582 - 588.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
F. A. Feltus, J. Wan, S. R. Schulze, J. C. Estill, N. Jiang, and A. H. Paterson
An SNP Resource for Rice Genetics and Breeding Based on Subspecies Indica and Japonica Genome Alignments
Genome Res., September 1, 2004; 14(9): 1812 - 1819.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Biedler and Z. Tu
Non-LTR Retrotransposons in the African Malaria Mosquito, Anopheles gambiae: Unprecedented Diversity and Evidence of Recent Activity
Mol. Biol. Evol., November 1, 2003; 20(11): 1811 - 1825.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. L. Chen and L. Shapiro
Identification of Long Intergenic Repeat Sequences Associated with DNA Methylation Sites in Caulobacter crescentus and Other {alpha}-Proteobacteria
J. Bacteriol., August 15, 2003; 185(16): 4997 - 5002.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. Feschotte, L. Swamy, and S. R. Wessler
Genome-Wide Analysis of mariner-Like Transposable Elements in Rice Reveals Complex Relationships With Stowaway Miniature Inverted Repeat Transposable Elements (MITEs)
Genetics, February 1, 2003; 163(2): 747 - 758.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.