Genome Research songbird

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Published online before print June 13, 2007, 10.1101/gr.5894107
Genome Res. 17:1101-1110, 2007
©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
OPEN ACCESS ARTICLE
This Article
OPEN ACCESS ARTICLE
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow All Versions of this Article:
gr.5894107v1
17/7/1101    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kim, J. H.
Right arrow Articles by Li, L. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kim, J. H.
Right arrow Articles by Li, L. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Methods

Diploid genome reconstruction of Ciona intestinalis and comparative analysis with Ciona savignyi

Jong Hyun Kim1,4, Michael S. Waterman2,3, and Lei M. Li2,3

1 Department of Computer Science, Yonsei University, Seoul, 120-749, Republic of Korea; 2 Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, USA; 3 Department of Mathematics, University of Southern California, Los Angeles, California 90089, USA

One of the main goals in genome sequencing projects is to determine a haploid consensus sequence even when clone libraries are constructed from homologous chromosomes. However, it has been noticed that haplotypes can be inferred from genome assemblies by investigating phase conservation in sequenced reads. In this study, we seek to infer haplotypes, a diploid consensus sequence, from the genome assembly of an organism, Ciona intestinalis. The Ciona intestinalis genome is an ideal resource from which haplotypes can be inferred because of the high polymorphism rate (1.2%). The haplotype estimation scheme consists of polymorphism detection and phase estimation. The core step of our method is a Gibbs sampling procedure. The mate-pair information from two-end sequenced clone inserts is exploited to provide long-range continuity. We estimate the polymorphism rate of Ciona intestinalis to be 1.2% and 1.5%, according to two different polymorphism counting schemes. The distribution of heterozygosity number is well fit by a compound Poisson distribution. The N50 length of haplotype segments is 37.9 kb in our assembly, while the N50 scaffold length of the Ciona intestinalis assembly is 190 kb. We also infer diploid gene sequences from haplotype segments. According to our reconstruction, 85.4% of predicted gene sequences are continuously covered by single haplotype segments. Our results indicate 97% accuracy in haplotype estimation, based on a simulated data set. We conduct a comparative analysis with Ciona savignyi, and discover interesting patterns of conserved DNA elements in chordates.


4 Corresponding author.

E-mail jonghkim{at}usc.edu; fax (213) 740-2437.

[Supplemental material is available at www.genome.org. The diploid genome sequence of Ciona intestinalis is downloadable from http://www-rcf.usc.edu/~lilei/diploid.html. The software to reconstruct haplotypes, called hapBuild, is available on request.]

Article published online before print. Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.5894107


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
V. Bansal and V. Bafna
HapCUT: an efficient and accurate algorithm for the haplotype assembly problem
Bioinformatics, August 15, 2008; 24(16): i153 - i159.
[Abstract] [PDF]


Home page
Genome ResHome page
V. Bansal, A. L. Halpern, N. Axelrod, and V. Bafna
An MCMC algorithm for haplotype assembly from whole-genome sequence data
Genome Res., August 1, 2008; 18(8): 1336 - 1346.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
Y. Harada, Y. Takagaki, M. Sunagawa, T. Saito, L. Yamada, H. Taniguchi, E. Shoguchi, and H. Sawada
Mechanism of Self-Sterility in a Hermaphroditic Chordate
Science, April 25, 2008; 320(5875): 548 - 550.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Denisov, B. Walenz, A. L. Halpern, J. Miller, N. Axelrod, S. Levy, and G. Sutton
Consensus generation and variant detection by Celera Assembler
Bioinformatics, April 15, 2008; 24(8): 1035 - 1040.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2007 by Cold Spring Harbor Laboratory Press.