|
|
|
|
Published online before print
August 18, 2005, 10.1101/gr.4134305 Genome Res. 15:1307-1314, 2005 ©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00 OPEN ACCESS ARTICLE
Resources The zebrafish gene map defines ancestral vertebrate chromosomes1 Department of Developmental Biology, Stanford University School of Medicine, Stanford, California 94305-5329, USA 2 Institute of Neuroscience, University of Oregon, Eugene, Oregon 97403-1254, USA
Genetic screens in zebrafish (Danio rerio) have identified mutations that define the roles of hundreds of essential vertebrate genes. Genetic maps can link mutant phenotype with gene sequence by providing candidate genes for mutations and polymorphic genetic markers useful in positional cloning projects. Here we report a zebrafish genetic map comprising 4073 polymorphic markers, with more than twice the number of coding sequences localized in previously reported zebrafish genetic maps. We use this map in comparative studies to identify numerous regions of synteny conserved among the genomes of zebrafish, Tetraodon, and human. In addition, we use our map to analyze gene duplication in the zebrafish and Tetraodon genomes. Current evidence suggests that a whole-genome duplication occurred in the teleost lineage after it split from the tetrapod lineage, and that only a subset of the duplicates have been retained in modern teleost genomes. It has been proposed that differential retention of duplicate genes may have facilitated the isolation of nascent species formed during the vast radiation of teleosts. We find that different duplicated genes have been retained in zebrafish and Tetraodon, although similar numbers of duplicates remain in both genomes. Finally, we use comparative mapping data to address the proposal that the common ancestor of vertebrates had a genome consisting of 12 chromosomes. In a three-way comparison between the genomes of zebrafish, Tetraodon, and human, our analysis delineates the gene content for 11 of these 12 proposed ancestral chromosomes.
Genome sequencing projects have revealed a nearly complete picture of the 20,000-30,000 genes that constitute the genomes of human and other vertebrates (Lander et al. 2001
Sequence analysis can identify orthologous genes in different species, and gene maps therefore allow comparisons among the genomes of different species. Conserved synteny (the presence of two or more orthologous gene pairs on a single chromosome in each of two different species) defines regions of ancestral chromosomes that have been maintained through evolution. By allowing the transfer of map information between species, comparative maps increase the pool of mapped genes that can be considered as candidates for mutations. Previous comparative maps (Gates et al. 1999
Comparative mapping can also illuminate the history of chromosome evolution. Previous analyses comparing zebrafish genetic maps with the genomes of other vertebrate species have suggested that a whole-genome duplication occurred in the teleost lineage, after its divergence from the tetrapod lineage (Amores et al. 1998 Here we report an extensive gene-based meiotic map for zebrafish, with more than double the number of genes and ESTs present on previously reported genetic maps. This map will facilitate the molecular identification of mutations, and will therefore constitute a valuable resource in elucidating gene function. Moreover, because it contains a large data set that has been extensively checked for errors, the map will provide an important framework to facilitate the assembly of the zebrafish genome sequence. We use this map in comparisons between the genomes of human and Tetraodon to define numerous regions of conserved synteny, to compare the outcome of gene duplication events in zebrafish and Tetraodon, and to present a comprehensive assessment of the possible composition of the ancestral vertebrate karyotype.
Genetic mapping of zebrafish genes and ESTs By using a previously described homozygous diploid mapping panel (Kelly et al. 2000 We worked to minimize the number of cases in which multiple ESTs representing the same gene were present on the map. First, we assigned 1149 ESTs to putative full-length cDNAs from the zebrafish gene collection (http://zgc.nci.nih.gov), increasing the total number of full-length cDNAs on the map to 1648. Because sequences in this collection are thought to represent full-length mRNAs, assigning matches between ESTs and these sequences decreases the possibility that two ESTs representing non-overlapping portions of the same gene are both present on the map. Second, duplicate entries for the same gene were removed when ESTs matching the same human ortholog (see below) were localized to the same position on the map. Although some of these cases may represent independent genes formed by tandem duplications, removing one member of this type of duplicate pair reduces the possibility that two non-overlapping ESTs from the same gene are retained. Third, 3208 mapped genes and ESTs were assigned to different UniGene clusters (http://www.ncbi.nlm.nih.gov/UniGene, build 83), and these are likely to represent different genes. Eight pairs of genes were assigned to the same UniGene cluster but mapped to different locations; these map positions were confirmed in duplicate genotyping analyses. Genes in the same UniGene cluster found on different chromosomes are likely to be errors in cluster assembly. Finally, the 193 genes and ESTs without UniGene assignments were analyzed via two-way BLAST comparisons with the entire set of mapped genes and ESTs, and no significant overlap was detected. We took numerous steps to maximize the accuracy of map position assignments. First, we identified possible problematic markers by inspecting the data for markers that exhibited high numbers of genotyping failures, that caused the introduction of double recombinants into the map, or that were localized to positions discrepant with those of other zebrafish maps. The positions of markers in these categories were confirmed in additional genotyping analyses. In addition, 91 SSCP markers showing none of these potential problems were selected for random sampling of map accuracy. By using newly generated RFLPs for these markers, 90 of 91 original positions were confirmed. Furthermore, we took cases of different ESTs matching the same UniGene cluster or full-length sequence that mapped to the same region as confirmation of the map position. In total, 1522 of the 3417 genes and ESTs on the map have been confirmed in duplicate genotyping analyses.
Zebrafish-human comparison
Zebrafish-Tetraodon comparison By using the data obtained from the Tetraodon nigroviridis (Tetraodon) sequencing project (Jaillon et al. 2004 2 = 1226, P < 0.001); for example, no boxes containing nine or more gene pairs would be expected in a random distribution, whereas 36 of these boxes were observed in the analysis, and 20 boxes had >17 orthologous gene pairs (Fig. 2B). In many cases, the high degree of conserved synteny between individual Tetraodon chromosomes and zebrafish linkage groups suggested a 1:1 correspondence of chromosomes in the two species, for example Tetraodon chromosome (Tni) 9/LG 23, Tni 18/LG 1, Tni 15/LG 2, Tni 3/LG 3, and Tni 8/LG 16. Supplemental Table 3 contains the complete set of putative zebrafish-Tetraodon orthologous gene pairs, along with chromosome locations for each gene.
Jaillon et al. (2004 Because of the high degree of conserved synteny exhibited between the majority of zebrafish and Tetraodon chromosomes, we examined whether gene order was also conserved in regions of conserved synteny. Figure 3 shows a comparison of gene order in two regions of synteny conserved in zebrafish and Tetraodon. In both cases shown, orthologous gene pairs were distributed along the zebrafish and Tetraodon chromosomes, suggesting that a large group of genes were syntenic in their last common ancestor. Comparison of gene order along individual chromosomes indicated that numerous inversions involving large regions altered gene order. Some smaller clusters of genes have been conserved in both species through evolution, but there is also evidence of some rearrangement within these small groups of genes.
Duplicate genes in zebrafish and Tetraodon
To compare complements of duplicated genes in different teleost species, we searched for duplicate gene pairs in the Tetraodon genome sequence. We identified 3327 pairs of duplicate genes in Tetraodon using criteria identical to those applied for zebrafish (Fig. 4B). Map locations of both duplicate genes were available for only 1543 gene pairs (Supplemental Table 5). Again, the distribution of clusters was highly nonrandom ( 2 = 987, P < 0.001). The locations of significant duplicate clustersfor example, Tni 2/Tni 3, Tni 10/Tni 14, and Tni 9/Tni 11largely agree with those reported by Jaillon et al. (2004 5500 of 27,918 = 20%) were predicted to be incompletely assembled fragments (Jaillon et al. 2004
By analyzing extant duplicated genes in each species, rates of retention of duplicated genes through evolution can be estimated. For example, assuming that 80% of the 3327 Tetraodon duplicate gene pairs identified in our analysis are actually gene duplicates rather than misannotated gene fragments, there are
In addition to estimating the overall frequency of retention after gene duplication, comparisons of retained duplicated genes can determine whether the same duplicates are retained in different species through evolution. For example, our analysis suggests that 60 of 126 (48%) zebrafish duplicate gene pairs are also duplicated in Tetraodon, whereas 66 appear to be represented by a single gene in Tetraodon. This proportion is similar to that observed in a previous comparison of duplicate genes between zebrafish and Takifugu rubripes (Taylor et al. 2003
Genome-wide comparisons and the ancestral karyotype Several previous analyses have employed comparisons between fish and human genomes, as well as between the genomes of different fish species, to attempt a reconstruction of the ancestral vertebrate karyotype. By analyzing the map positions of zebrafish-human orthologous gene pairs, Postlethwait et al. (2000 20-30 chromosomes in Eutherian mammals via chromosome fissions, and to the current number of 25 chromosomes in most teleosts via chromosome duplications. A comparison of syntenic relationships among the genomes of human, zebrafish, and medaka supported this prediction of 12 ancestral chromosomes (Naruse et al. 2004
These analyses differed both in approach and in the scope of the data sets employed. In the zebrafish-medaka study, a relatively small number of mapped genes in these two fish species was analyzed in three-way comparisons with orthologous human genes to reconstruct the putative ancestral karyotype (Naruse et al. 2004 By analyzing the complete genome sequence of Tetraodon along with the most extensive zebrafish map yet produced, we undertook an independent assessment of the predicted ancestral vertebrate karyotype. The goal of this analysis was to compare gene maps of extant tetrapods (i.e., human) and teleosts to identify conserved syntenies, which represent groups of genes that were present on the same chromosome in the last common ancestor and have remained together through the vertebrate radiation. Since ancestral groups of syntenic genes may have been fragmented in one teleost lineage, the likelihood of identifying conserved syntenies is increased by comparing duplicate chromosomal segments within a species and by comparing orthologous chromosomal segments between the two species. Thus a comparative analysis using two teleost species provides a more complete picture of the ancestral gene map than does a comparison of one teleost and one tetrapod species. We used the data in the Oxford grids comparing orthologous chromosome segments between species (Fig. 2), along with paralogous chromosome segments within species (Fig. 4), to reconstruct putative relationships among extant chromosomes within and between the genomes of zebrafish and Tetraodon (Fig. 5). Chromosomes in Figure 5 are color-coded to denote the position in the human genome of fish-human orthologous gene pairs found in our analysis, and genes are rearranged according to human chromosome number, rather than map order, to enhance the clarity of the comparisons. In many cases, the pattern of human orthology clearly supports the Oxford grid data suggesting a 1:1 relationship for many chromosomes between zebrafish and Tetraodon (e.g., Tni 10/LG 17, Tni 14/LG 20, Tni 8/LG 16, Tni 21/LG 19) and between putative duplicate chromosomes within a single fish species (e.g., Tni 10/14, Tni 8/21, LG 17/20, LG 16/19).
Gene content of ancestral chromosomes can be inferred from these relationships. For example, a single ancestral chromosome (J) was likely duplicated in the ancestor of both fish species and gave rise to Tni 10/14 and LG 17/20. In the lineage leading to human, this ancestral chromosome likely fragmented to form large portions of chromosomes 6 and 14, and smaller portions of other chromosomes. Because of interchromosomal rearrangements through evolution, however, not all relationships are as clear. For example, the pattern of chromosomal relationships within and between species suggests that two ancestral chromosomes (E and F) were both duplicated in the teleost lineage, and in Tetraodon the duplicate copies of Tni 5 and Tni 19 fused to form Tni 13, as was proposed in the analysis of the complete Tetraodon genome sequence (Jaillon et al. 2004
Our analysis of map locations of paralogous genes between zebrafish and Tetraodon chromosomes, combined with between-species comparisons of map locations of orthologous genes (Fig. 5), largely agreed with the ancestral karyotype reconstruction proposed by each of the previous studies (Jaillon et al. 2004
Conclusions
The analysis presented here represents the most extensive comparison to date between the genomes of zebrafish and other vertebrate species. The identification of regions of conserved synteny between the zebrafish genome and the extensively annotated genomes of other vertebrate species such as human and Tetraodon will facilitate cloning of mutations based on comparative approaches. Moreover, these comparative studies can highlight processes underlying vertebrate chromosome evolution. Our identification of numerous duplicated chromosome segments involving nearly all zebrafish chromosomes has confirmed and extended data from previous studies (Amores et al. 1998
Linkage analysis Primer design, PCR assays, and linkage analysis were performed as previously described (Kelly et al. 2000
Sequence comparisons
In some cases, phylogenetic trees were constructed to confirm or resolve orthologous relationships derived from reciprocal BLAST searches giving several matches with nearly equivalent scores (delta, msx, nodal, tenm, noggin), or when the reciprocal BLAST analysis suggested an orthology relationship that differed from published data (bmpr1) (Taylor et al. 2003 Duplicate genes in Tetraodon were identified by comparison to human, as described above. The data from the Tetraodon-Tetraodon duplicate grid (Fig. 4B) were examined to determine which chromosome pairs shared the highest number of duplicates (as indicated in Fig. 5). The data from the zebrafish-Tetraodon comparison (Fig. 2B) were used to identify orthologous chromosomes in these species. Duplicate chromosomes in zebrafish (Fig. 4A) were identified as described for Tetraodon. Collectively, these data define pairings of chromosomes both within and between species, and these pairings represent extant chromosomes likely derived from the same ancestral chromosome.
We thank members of our laboratories for helpful discussions, Tom Conlin for expert help with bioinformatics, Greg Cooper and Jon Binkley for assistance with phylogenetic analyses, and Alex Schier for critical comments on the manuscript. I.G.W. was supported by a predoctoral fellowship from the Howard Hughes Medical Institute. This work was supported by NIH grants R01RR10715 (J.H.P.), HD22486 (J.H.P.), RR12349 (W.S.T.), and HG02568 (W.S.T.).
3 Corresponding author. E-mail talbot{at}cmgm.stanford.edu; fax (650) 725-7739. [Supplemental material is available online at www.genome.org.] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.4134305. Article published online before print in August 2005. Freely available online through the Genome Research Immediate Open Access option.
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.L. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402.
Amores, A., Force, A., Yan, Y.L., Joly, L., Amemiya, C., Fritz, A., Ho, R.K., Langeland, J., Prince, V., Wang, Y.L., et al. 1998. Zebrafish hox clusters and vertebrate genome evolution. Science 282: 1711-1714.
Aparicio, S., Chapman, J., Stupka, E., Putnam, N., Chia, J.M., Dehal, P., Christoffels, A., Rash, S., Hoon, S., Smit, A., et al. 2002. Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science 297: 1301-1310.
Barbazuk, W.B., Korf, I., Kadavi, C., Heyen, J., Tate, S., Wun, E., Bedell, J.A., McPherson, J.D., and Johnson, S.L. 2000. The syntenic relationship of the zebrafish and human genomes. Genome Res. 10: 1351-1358. Birely, J., Schneider, V.A., Santana, E., Dosch, R., Wagner, D.S., Mullins, M.C., and Granato, M. 2005. Genetic screens for genes controlling motor nerve-muscle development and interactions. Dev. Biol. 280: 162-176.[CrossRef][Medline] Donovan, A., Brownlie, A., Zhou, Y., Shepard, J., Pratt, S.J., Moynihan, J., Paw, B.H., Drejer, A., Barut, B., Zapata, A., et al. 2000. Positional cloning of zebrafish ferroportin1 identifies a conserved vertebrate iron exporter. Nature 403: 776-781.[CrossRef][Medline] Driever, W., Solnica-Krezel, L., Schier, A.F., Neuhauss, S.C., Malicki, J., Stemple, D.L., Stainier, D.Y., Zwartkruis, F., Abdelilah, S., Rangini, Z., et al. 1996. A genetic screen for mutations affecting embryogenesis in zebrafish. Development 123: 37-46.[Abstract] Edwards, J.H. 1991. The Oxford Grid. Ann. Hum. Genet. 55: 17-31.[Medline]
Force, A., Lynch, M., Pickett, F.B., Amores, A., Yan, Y.L., and Postlethwait, J. 1999. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151: 1531-1545.
Gates, M.A., Kim, L., Egan, E.S., Cardozo, T., Sirotkin, H.I., Dougan, S.T., Lashkari, D., Abagyan, R., Schier, A.F., and Talbot, W.S. 1999. A genetic linkage map for zebrafish: Comparative analysis and localization of genes and expressed sequences. Genome Res. 9: 334-347. Gibbs, R.A., Weinstock, G.M., Metzker, M.L., Muzny, D.M., Sodergren, E.J., Scherer, S., Scott, G., Steffen, D., Worley, K.C., Burch, P.E., et al. 2004. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 428: 493-521.[CrossRef][Medline] Haffter, P., Granato, M., Brand, M., Mullins, M.C., Hammerschmidt, M., Kane, D.A., Odenthal, J., van Eeden, F.J., Jiang, Y.J., Heisenberg, C.P., et al. 1996. The identification of genes with unique and essential functions in the development of the zebrafish, Danio rerio. Development 123: 1-36.[Abstract] Jaillon, O., Aury, J.M., Brunet, F., Petit, J.L., Stange-Thomann, N., Mauceli, E., Bouneau, L., Fischer, C., Ozouf-Costaz, C., Bernot, A., et al. 2004. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature 431: 946-957.[CrossRef][Medline]
Karlstrom, R.O., Talbot, W.S., and Schier, A.F. 1999. Comparative synteny cloning of zebrafish you-too: Mutations in the Hedgehog target gli2 affect ventral forebrain patterning. Genes & Dev. 13: 388-393.
Kelly, P.D., Chu, F., Woods, I.G., Ngo-Hazelett, P., Cardozo, T., Huang, H., Kimm, F., Liao, L., Yan, Y.L., Zhou, Y., et al. 2000. Genetic linkage mapping of zebrafish genes and ESTs. Genome Res. 10: 558-567.
Kirkness, E.F., Bafna, V., Halpern, A.L., Levy, S., Remington, K., Rusch, D.B., Delcher, A.L., Pop, M., Wang, W., Fraser, C.M., et al. 2003. The dog genome: Survey sequencing and comparative analysis. Science 301: 1898-1903. Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860-921.[CrossRef][Medline]
Lynch, M. and Conery, J.S. 2000. The evolutionary fate and consequences of duplicate genes. Science 290: 1151-1155.
Lynch, M. and Force, A. 2000. The probability of duplicate gene preservation by subfunctionalization. Genetics 154: 459-473. Lyons, D.A., Pogoda, H.M., Voas, M.G., Woods, I.G., Diamond, B., Nix, R., Arana, A., Jacobs, J., and Talbot, W.S. 2005. erbb3 and erbb2 are essential for schwann cell migration and myelination in zebrafish. Curr. Biol. 15: 513-524.[CrossRef][Medline] Manly, K.F. 1993. A Macintosh program for storage and analysis of experimental genetic mapping data. Mamm. Genome 4: 303-313.[CrossRef][Medline] Meyer, A. and Schartl, M. 1999. Gene and genome duplications in vertebrates: The one-to-four (-to-eight in fish) rule and the evolution of novel gene functions. Curr. Opin. Cell Biol. 11: 699-704.[CrossRef][Medline] Miller, C.T., Schilling, T.F., Lee, K., Parker, J., and Kimmel, C.B. 2000. Sucker encodes a zebrafish endothelin-1 required for ventral pharyngeal arch development. Development 127: 3815-3828.[Abstract]
Naruse, K., Fukamachi, S., Mitani, H., Kondo, M., Matsuoka, T., Kondo, S., Hanamura, N., Morita, Y., Hasegawa, K., Nishigaki, R., et al. 2000. A detailed linkage map of medaka, Oryzias latipes: Comparative genomics and genome evolution. Genetics 154: 1773-1784.
Naruse, K., Tanaka, M., Mita, K., Shima, A., Postlethwait, J., and Mitani, H. 2004. A medaka gene map: The trace of ancestral vertebrate proto-chromosomes revealed by comparative gene mapping. Genome Res. 14: 820-828. Postlethwait, J.H., Yan, Y.L., Gates, M.A., Horne, S., Amores, A., Brownlie, A., Donovan, A., Egan, E.S., Force, A., Gong, Z., et al. 1998. Vertebrate genome evolution and the zebrafish gene map. Nat. Genet. 18: 345-349.[CrossRef][Medline] Postlethwait, J.H., Amores, A., Force, A., and Yan, Y.L. 1999. The zebrafish genome. Methods Cell Biol. 60: 149-163.[Medline]
Postlethwait, J.H., Woods, I.G., Ngo-Hazelett, P., Yan, Y.L., Kelly, P.D., Chu, F., Huang, H., Hill-Force, A., and Talbot, W.S. 2000. Zebrafish comparative genomics and the origins of vertebrate chromosomes. Genome Res. 10: 1890-1902. Postlethwait, J.H., Amores, A., Yan, Y.L., and Austin, C.A. 2002. Duplication of a portion of human chromosome 20q containing topoisomerase (top1) and snail genes provides evidence on genome expansion and the radiation of teleost fish. In Aquatic genomics (eds. N. Schimizu, et al.), pp 20-31. Springer, Tokyo. Postlethwait, J., Amores, A., Cresko, W., Singer, A., and Yan, Y.L. 2004. Subfunction partitioning, the teleost radiation and the annotation of the human genome. Trends Genet. 20: 481-490.[CrossRef][Medline] Shimoda, N., Knapik, E.W., Ziniti, J., Sim, C., Yamada, E., Kaplan, S., Jackson, D., de Sauvage, F., Jacob, H., and Fishman, M.C. 1999. Zebrafish genetic map with 2000 microsatellite markers. Genomics 58: 219-232.[CrossRef][Medline]
Taylor, J.S., Braasch, I., Frickey, T., Meyer, A., and Van de Peer, Y. 2003. Genome duplication: A trait shared by 22,000 species of ray-finned fish. Genome Res. 13: 382-390. Varga, Z.M., Amores, A., Lewis, K.E., Yan, Y.L., Postlethwait, J.H., Eisen, J.S., and Westerfield, M. 2001. Zebrafish smoothened functions in ventral neural tube specification and axon tract formation. Development 128: 3497-3509.
Venter, J.C., Adams, M.D., Myers, E.W., Li, P.W., Mural, R.J., Sutton, G.G., Smith, H.O., Yandell, M., Evans, C.A., Holt, R.A., et al. 2001. The sequence of the human genome. Science 291: 1304-1351. Waterston, R.H., Lindblad-Toh, K., Birney, E., Rogers, J., Abril, J.F., Agarwal, P., Agarwala, R., Ainscough, R., Alexandersson, M., An, P., et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420: 520-562.[CrossRef][Medline]
Woods, I.G., Kelly, P.D., Chu, F., Ngo-Hazelett, P., Yan, Y.L., Huang, H., Postlethwait, J.H., and Talbot, W.S. 2000. A comparative map of the zebrafish genome. Genome Res. 10: 1903-1914.
http://zgc.nci.nih.gov; zebrafish gene collection. http://www.ncbi.nlm.nih.gov/UniGene, build 83; UniGene clusters. http://trace.ensembl.org; zebrafish whole-genome shotgun sequencing project. http://zebrafish.stanford.edu; the complete genotype data set. http://www.ensembl.org; Ensembl genome browser. http://probcons.stanford.edu; probcons. http://mendel.stanford.edu:16080/SidowLab/; Protein Alignment Editor. http://www.cs.huji.ac.il/~nir/SEMPHY/; semphy. http://zfin.org/zf_info/nomen.html; zebrafish nomenclature guidelines. http://www.ensembl.org/Tetraodon_nigroviridis/; map positions for Tetraodon.
Received May 14, 2005; accepted in revised format June 27, 2005. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||