|
|
|
|
Published online before print
December 21, 2006, 10.1101/gr.6052507 Genome Res. 17:249-257, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Resource Molecular refinement of gibbon genome rearrangements1 Department of Genetics and Microbiology, University of Bari, 70126 Bari, Italy; 2 Washington University Genome Sequencing Center, Washington University School of Medicine, St. Louis, Missouri 63108, USA; 3 Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA; 4 Gibbon Conservation Center, Santa Clarita, California 91380, USA; 5 Howard Hughes Medical Institute, Seattle, Washington 98195, USA
The gibbon karyotype is known to be extensively rearranged when compared to the human and to the ancestral primate karyotype. By combining a bioinformatics (paired-end sequence analysis) approach and a molecular cytogenetics approach, we have refined the synteny block arrangement of the white-cheeked gibbon (Nomascus leucogenys, NLE) with respect to the human genome. We provide the first detailed clone framework map of the gibbon genome and refine the location of 86 evolutionary breakpoints to <1 Mb resolution. An additional 12 breakpoints, mapping primarily to centromeric and telomeric regions, were mapped to 5 Mb resolution. Our combined FISH and BES analysis indicates that we have effectively subcloned 49 of these breakpoints within NLE gibbon BAC clones, mapped to a median resolution of 79.7 kb. Interestingly, many of the intervals associated with translocations were gene-rich, including some genes associated with normal skeletal development. Comparisons of NLE breakpoints with those of other gibbon species reveal variability in the position, suggesting that chromosomal rearrangement has been a longstanding property of this particular ape lineage. Our data emphasize the synergistic effect of combining computational genomics and cytogenetics and provide a framework for ultimate sequence and assembly of the gibbon genome.
Hominidae (humans and great apes) and, to a lesser extent, Old World monkeys, possess karyotypes closely resembling the hypothetical hominoid ancestor. Most evolutionary chromosomal rearrangements between ape lineages involve pericentric (including the centromere) or paracentric (not including the centromere) inversions (Yunis and Prakash 1982
Gibbons, then, provide a unique perspective of a highly rearranged ape genome with two major advantages: (1) neutrally evolving DNA shows a relatively short genetic distance (<0.05 substitutions/site) to the high-quality human reference sequence; and (2) the gibbon represents a phylogenetic link between the great apes and the Old World monkeys, providing a unique perspective of evolutionary change between 15 and 20 million years of species separation (Goodman 1999
Two complementary approaches were undertaken to resolve the organization of the gibbon genome with respect to the human genome. First, we developed a FISH-based framework to define the major synteny relationships between all gibbon and human chromosomes, using data provided by interspecies chromosome painting analysis as a starting point. We selected 500 index human BAC clones at an average density of 1 clone every 6 Mb across the human sequence map and hybridized each against metaphase chromosome preparations of the white-cheeked gibbon. Reiterative FISH experiments were then performed to further refine the interval, using 450 additional BAC clones; in many cases identifying a large insert BAC clone that spanned a breakpoint. The analysis established a total of 121 homologous syntenic blocks between the two species, defining the approximate locations of 107 breakpoints. These included 98 breaks mapping within interstitial euchromatin and the delineation of nine human telomeres that are not telomeric in gibbon.
As a second approach, we constructed a comparative clone map between the gibbon and human genomes by mapping the end sequences of an NLE gibbon large-insert BAC library against the human genome reference sequence assembly (Fujiyama et al. 2002
Putative rearrangements were computationally inferred when two or more independent discordant BAC clones supported the same type of rearrangement at a genomic position. We identified a total of 201 putative insertion/deletion (<1 Mb in length), 126 inversion, and 124 interchromosomal rearrangement breakpoints (Supplemental Table 1). We focused on validation of the largest (>1 Mb) inversion and of interchromosomal rearrangement events (breakpoints = 107) using a reciprocal FISH assay (Nickerson and Nelson 1998
Using this approach, we confirmed experimentally and computationally 50 evolutionary chromosomal breakpoints corresponding to eight inversions and 17 translocations between the human and NLE gibbon genomes (Table 1). In addition to these 50 breakpoints, we identified eight computational breakpoints where only one of the two ends of the rearrangement could be confirmed experimentally. These frequently mapped to regions enriched for highly repetitive regions of the genome where neither FISH nor computational methods could accurately refine the breakpoint regions. We note that 25% (26/103) of our computationally predicted locations showed multisite locations enriched in subtelomeric and pericentromeric regions, indicative of lineage-specific duplication differences between the two genomes.
We constructed a mySQL database of gibbon concordant, discordant and interchromosomal BES and incorporated the results as customized tracks on the human genome assembly (http://humanparalogy.gs.washington.edu). This allowed experimentally validated regions to be curated in more detail (Fig. 3). For example, a translocation breakpoint between the human and gibbon genomes should be conspicuous by the absence of concordant gibbon BES across an interval in addition to interchromosomal pairs abutting the actual breakpoint. Based on the extent of concordant and discordant BAC clones near each breakpoint, we further refined the breakpoint locations for 49 of the 50 regions. We determined that the median interval distance was 79.7 kb with some breakpoints refined to a distance of 1314 kb. At this level of resolution, we found no definitive evidence of a gene disruption, although many of the breakpoints will require additional resolution by DNA sequencing.
Almost half of the breakpoints were not confirmed by computational placement of gibbon BACs against the human genome. Indeed, in 18 cases, the break mapped to a gibbon centromere, while nine homologous syntenic block breaks corresponded to the position of a new telomere. In this respect it is interesting that 15 out of 26 gibbon centromeres join homologous syntenic blocks that are noncontiguous in humans. A detailed synteny map that summarizes the refinement of the breakpoints from the perspective of human and NLE gibbon chromosome organization is provided (Supplemental Tables 2 and 3, respectively; the organization of each NLE chromosome is also displayed at http://www.biologia.uniba.it/gibbon). Figures 4 and 5 provide a graphical summary of the synteny block organization in gibbon and human genomes, respectively. In both figures the homologous syntenic block numbering was derived from the hypothesized ancestral Hominoidea chromosomal arrangement reported by Muller et al. (2003)
We provide the first detailed clone framework map of the gibbon genome and refine the location of 86 evolutionary breakpoints to <1 Mb resolution. An additional 12 breakpoints, mapping primarily to centromeric and telomeric regions, were mapped to 5 Mb resolution. Our combined FISH and BES analysis indicates that we have effectively subcloned 49 of these breakpoints within NLE gibbon BAC clones mapped to a median resolution of 79.7 kb. Interestingly, several of the intervals were gene-rich (Table 1), containing conserved genes such as matrilin-2 (an extracellular cartilage matrix protein), guanine-nucleotide binding protein, gamma 5 (membrane-associated G protein), phospholipase scramblase, lunatic fringe (LFNG; a developmental regulator of Notch in the presomitic mesoderm), and ectonucleotide pyrophosphatase/phosphodiesterase genes 4 and 5. Two of the breakpoint intervals mapped to KRAB C2H2 Zinc finger gene family clusters on chromosome 19. While it is unclear whether any of the breakpoints disrupts a gene, it is intriguing that several of the genes mapping to breakpoint intervals are associated with skeletal development, ossification, and cartilage maturation. Mis-sense mutations of the lunatic fringe gene, for example, are associated with spondylocostal dystosis, which includes vertebral congenital abnormalities of the spine and markedly long, slender fingers (Sparrow et al. 2006
The computational paired end-sequence and cytogenetics-based approaches were highly complementary. Nearly half of the breakpoints could not be recognized using computational methods alone. Most of these regions corresponded to highly repetitive regions of the genome including regions enriched for complex segmental duplications. In these regions, end sequences cannot be mapped unambiguously, and therefore rearrangements are underrepresented. In regions of lower complexity, the paired-end-sequences strategy provided exquisite resolution, allowing more subtle rearrangements to be identified and refining breakpoint intervals. Combined, the two approaches were mutually informative and emphasize the value of cytogenetics-based experimental validation accompanying computational genomics-based approaches for characterizing and verifying the organization of primate genomes. In addition, our analysis revealed four previously unpublished rearrangements. These subtle chromosomal changes involved subtelomeric regions of the genome and are consistent with their proclivity to undergo reciprocal translocations (Flint and Knight 2003
All the 107 synteny breaks we detected could be grouped into two categories: those that actually occurred in NLE or that NLE inherited from its gibbon ancestors (84), and those (23) that occurred in the Hominidae lineages leading to humans. Only human chromosomes 15 (NLE6), 18 (NLE4), 21 (NLE25), and X constitute single, uninterrupted chromosomal segments in NLE. All chromosomes, with the exception of 18 and 21, showed internal rearrangements with respect to the Hominoidea ancestor (Murphy et al. 2001
Comparisons of structural variation between human and various primates predict a logarithmic increase in the number of rearrangements as resolution increases (Chimpanzee Sequencing and Analysis Consortium 2005 Whole-genome shotgun sequencing of the entire NLE genome could be used to further elucidate the molecular bases for chromosomal rearrangementsparticularly the frequency of smaller inversions and intrachromosomal events. Information obtained from such studies could provide valuable insight into the mechanism underlying both germline and somatic chromosomal instability associated with human disease and evolution. In addition, targeted sequencing of the breakpoint intervals will allow the impact of these events in terms of gene and gene structure to be understood in the context of the evolution of the apes. The clone framework and our detailed analysis of homologous synteny breakpoints provide the infrastructure for the sequence and assembly of the gibbon genome.
BES analysis BAC end sequences were generated from the gibbon BAC library, CHORI-271. The BAC library was constructed from lymphocyte blood material obtained from a female Northern white-cheeked gibbon (Nomascus leucogenys leucogenys) kindly provided by Alan Mootnick, Director of Gibbon Conservation, Santa Clarita Zoo, California. To eliminate potential mismapped rearrangements, we required >90% identity, >400 bp in length, and at least 150 bp of unique sequence (as defined by RepeatMasker). All clones were mapped to the human genome, and discordant sites were classified as those that exceeded 3 STD of the mean insert size (<76.5 kb or >277 kb). In principle, this allowed us to detect rearrangements >100 kb in size. Additional algorithmic details may be found in the Supplemental Methods.
FISH analysis
NLE-synteny block definition was obtained by FISH analysis using a set of >950 human BACs, one every
We are grateful to the Washington University Genome Sequencing Center Production Group for access to BAC end sequences through the NIH trace repository and to A.R.M. for providing access to gibbon tissue used in this study. MIUR (Ministero Italiano della Universita e della Ricerca) and the European Commission (INPRIMAT, QLRI-CT-2002-01325) are gratefully acknowledged for financial support. This work was supported, in part, by an NIH grant HG002385 to E.E.E. E.E.E. is an investigator of the Howard Hughes Medical Institute.
6 These authors contributed equally to this work. E-mail rocchi{at}biologia.uniba.it; fax 39-080-544-3386.
E-mail eee{at}gs.washington.edu; fax (206) 221-5795. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6052507
Chimpanzee Sequencing Analysis Consortium 2005. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437: 6987.[CrossRef][Medline] Couturier, J. and Lernould, J.M. 1991. Karyotypic study of four gibbon forms provisionally considered as subspecies of Hylobates (Nomascus) concolor (Primates, Hylobatidae). Folia Primatol. (Basel) 56: 95104. Ferguson-Smith, M.A., Yang, F., Rens, W., and OBrien, P.C. 2005. The impact of chromosome sorting and painting on the comparative analysis of primate genomes. Cytogenet. Genome Res. 108: 112121.[CrossRef][Medline] Flint, J. and Knight, S. 2003. The use of telomere probes to investigate submicroscopic rearrangements associated with mental retardation. Curr. Opin. Genet. Dev. 13: 310316.[CrossRef][Medline] Froenicke, L. 2005. Origins of primate chromosomesAs delineated by Zoo-FISH and alignments of human and mouse draft genome sequences. Cytogenet. Genome Res. 108: 122138.[CrossRef][Medline] Fujiyama, A., Watanabe, H., Toyoda, A., Taylor, T.D., Itoh, T., Tsai, S.F., Park, H.S., Yaspo, M.L., Lehrach, H., and Chen, Z., et al. 2002. Construction and analysis of a humanchimpanzee comparative clone map. Science 295: 131134. Goodman, M. 1999. The genomic record of humankinds evolutionary roots. Am. J. Hum. Genet. 64: 3139.[CrossRef][Medline] Jauch, A., Wienberg, J., Stanyon, R., Arnold, N., Tofanelli, S., Ishida, T., and Cremer, T. 1992. Reconstruction of genomic rearrangements in great apes and gibbons by chromosome painting. Proc. Natl. Acad. Sci. 89: 86118615. Koehler, U., Arnold, N., Wienberg, J., Tofanelli, S., and Stanyon, R. 1995a. Genomic reorganization and disrupted chromosomal synteny in the siamang (Hylobates syndactylus) revealed by fluorescence in situ hybridization. Am. J. Phys. Anthropol. 97: 3747.[CrossRef][Medline] Koehler, U., Bigoni, F., Wienberg, J., and Stanyon, R. 1995b. Genomic reorganization in the concolor gibbon (Hylobates concolor) revealed by chromosome painting. Genomics 30: 287292.[CrossRef][Medline] Linardopoulou, E.V., Williams, E.M., Fan, Y., Friedman, C., Young, J.M., and Trask, B.J. 2005. Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication. Nature 437: 94100.[CrossRef][Medline] Muller, S. and Wienberg, J. 2001. "Bar-coding" primate chromosomes: Molecular cytogenetic screening for the ancestral hominoid karyotype. Hum. Genet. 109: 8594.[CrossRef][Medline] Muller, S., Neusser, M., and Wienberg, J. 2002. Towards unlimited colors for fluorescence in-situ hybridization (FISH). Chromosome Res. 10: 223232.[CrossRef][Medline] Muller, S., Hollatz, M., and Wienberg, J. 2003. Chromosomal phylogeny and evolution of gibbons (Hylobatidae). Hum. Genet. 113: 493501.[CrossRef][Medline] Murphy, W.J., Stanyon, R., and OBrien, S.J. 2001. Evolution of mammalian genome organization inferred from comparative gene mapping. Genome Biol. 2: reviews0005. Newman, T.L., Tuzun, E., Morrison, V.A., Hayden, K.E., Ventura, M., McGrath, S.D., Rocchi, M., and Eichler, E.E. 2005. A genome-wide survey of structural variation between human and chimpanzee. Genome Res. 15: 13441356. Nickerson, E. and Nelson, D.L. 1998. Molecular definition of pericentric inversion breakpoints occurring during the evolution of humans and chimpanzees. Genomics 50: 368372.[CrossRef][Medline] Nie, W., Rens, W., Wang, J., and Yang, F. 2001. Conserved chromosome segments in Hylobates hoolock revealed by human and H. leucogenys paint probes. Cytogenet. Cell Genet. 92: 248253.[CrossRef][Medline] Sparrow, D.B., Chapman, G., Wouters, M.A., Whittock, N.V., Ellard, S., Fatkin, D., Turnpenny, P.D., Kusumi, K., Sillence, D., and Dunwoodie, S.L. 2006. Mutation of the LUNATIC FRINGE gene in humans causes spondylocostal dysostosis with a severe vertebral phenotype. Am. J. Hum. Genet. 78: 2837.[CrossRef][Medline] Tuzun, E., Sharp, A.J., Bailey, J.A., Kaul, R., Morrison, V.A., Pertz, L.M., Haugen, E., Hayden, H., Albertson, D., and Pinkel, D., et al. 2005. Fine-scale structural variation of the human genome. Nat. Genet. 37: 727732.[CrossRef][Medline] Ventura, M., Boniotto, M., Cardone, M.F., Fulizio, L., Archidiacono, N., Rocchi, M., and Crovella, S. 2001. Characterization of a highly repeated DNA sequence family in five species of the genus Eulemur. Gene 275: 305310.[CrossRef][Medline] Ventura, M., Mudge, J.M., Palumbo, V., Burn, S., Blennow, E., Pierluigi, M., Giorda, R., Zuffardi, O., Archidiacono, N., and Jackson, M.S., et al. 2003. Neocentromeres in 15q24-26 map to duplicons which flanked an ancestral centromere in 15q25. Genome Res. 13: 20592068. Wienberg, J. 2005. Fluorescence in situ hybridization to chromosomes as a tool to understand human and primate genome evolution. Cytogenet. Genome Res. 108: 139160.[CrossRef][Medline] Yunis, J.J. and Prakash, O. 1982. The origin of man: A chromosomal pictorial legacy. Science 215: 15251530.
Received October 19, 2006; accepted in revised format November 13, 2006. This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||