|
|
|
|
Genome Res. 14:567-573, 2004 ©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00 Letter Insertions and Deletions Are Male Biased Too: A Whole-Genome Analysis in Rodents1 Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA 2 Departments of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA 3 Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA 4 Department of Health Evaluation Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA 5 Department of the Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
It is presently accepted that, in mammals, due to the greater number of cell divisions in the male germline than in the female germline, nucleotide substitutions occur more frequently in males. The data on mutation bias in insertions and deletions (indels) are contradictory, with some studies indicating no sex bias and others indicating either female or male bias. The sequenced rat and mouse genomes provide a unique opportunity to investigate a potential sex bias for different types of mutations. Indeed, mutation rates can be accurately estimated from a large number of orthologous loci in organisms similar in generation time and in the number of germline cell divisions. Here we compare the mutation rates between chromosome X and autosomes for likely neutral sites in eutherian ancestral interspersed repetitive elements present at orthologous locations in the rat and mouse genomes. We find that small indels are male biased: The male-to-female mutation rate ratio ( ) for indels in rodents is 2. Similarly, our whole-genome analysis in rodents indicates an approximately twofold excess of nucleotide substitutions originating in males over that in females. This is the same as the male-to-female ratio of the number of germline cell divisions in rat and mouse. Thus, this is consistent with nucleotide substitutions and small indels occurring primarily during DNA replication.
In mammals the male germline undergoes more cell divisions (and more DNA replications) than does the female germline. If germline mutations are replication-driven, then (1) mutations should originate more frequently in males than in females, and (2) the male-to-female mutation rate ratio ( ) should be equal to the male-to-female ratio of the number of germline cell divisions (c). If is smaller than c, then the role of replication-independent factors (e.g., environmental damaging agents such as oxygen radicals) is significant in generating mutations. Thus, knowing the precise value of is critical for assessing whether germline mutations are caused by errors in DNA replication.
Two lines of evidence indicate that nucleotide substitutions are more frequent in males than in females. First, de novo point mutations leading to human genetic diseases are predominantly of paternal origin (for review, see Crow 2000
Despite the growing support of the male-mutation bias hypothesis in mammals, the exact value of
In rodents, c is approximately two (Chang et al. 1994
A potential sex bias in mutations other than nucleotide substitutions has not been investigated extensively. Do insertions and deletions (indels) occur predominantly in males compared with females? The published data addressing this question are contradictory. First, small indels causing some human genetic diseases originate with the same frequency in males and females (e.g., in tuberous sclerosis; Roberts et al. 2002
The availability of the rat and mouse genomic sequences provides a unique opportunity to test the hypothesis of male-mutation bias and to estimate the male-to-female mutation rate ratio in rodents. First, the reproductive physiology of both rat and mouse has been well studied, and the two organisms have a similar ratio of cell divisions between male and female germlines by the time of reproduction (Chang et al. 1994
In this study we estimate the male-to-female mutation rate ratio for indels and nucleotide substitutions in rodents. As female rat and female mouse have been selected for the Rat Genome Project (Rat Genome Sequencing Project Consortium 2004
We focus on transposable elements fixed in the common ancestor of primates and rodents. It has been suggested that the ARs represent a suitable model of neutral evolution (Waterston et al. 2002
ARs Used to Model Neutral Evolution Below, we examine two sets of the ARs at orthologous locations in the rat and mouse genomes. The first set contains all ARs detected in the four-way (humanratmouse-repeat consensus) alignments (a total of 5.5 Mb on chromosome X and 100.4 Mb on autosomes). It includes 3 families of SINEs, 64 families of LINEs, 156 families of LTRs, and 100 families of DNA elements. Because of a high mutation rate in rodents, a significant portion of the ARs of more ancient origin is usually undetected in their genomes (Waterston et al. 2002
Focusing only on the portion of the genome occupied by the AR set described above may introduce some biases (perhaps, this part of the rodent genome is particularly conserved as it is alignable with the human genome). Thus, we also perform our calculations on the set of all sites alignable between rat and mouse; there are a total of 1691.0 Mb on autosomes, and 78.3 Mb on X were examined.
Indels
The higher prevalence of indels on autosomes compared with X is observed for both insertions and deletions (Table 1). Interestingly, the male-to-female ratio is higher for deletions (
The frequency of indels is significantly lower on chromosome X than on any rat autosome (Table 2; P < 0.001,
When all sites aligned between rat and mouse are analyzed, we once again observe a lower indel rate on X than on autosomes. The resulting X/A = 0.894, leading to = 1.930, comparable to obtained from the analysis of the ARs.
Nucleotide Substitutions
When the analysis is restricted to ARs active shortly before the rodentprimate speciation, the results are comparable (Table 3). Namely, the average substitution rate is significantly lower on chromosome X than on the autosomes (P < 0.001, unpaired t-test). The resulting ratio of the substitution rates between chromosome X and autosomes is 0.895, which leads to = 1.914 (the bootstrap-based 95% CI is 1.651 to 2.271). Thus, a higher proportion of older ARs does not introduce a bias in the estimates for nucleotide substitutions, similar to what we observed for indels.
It is known that substitution rates correlate with fluctuations in GC content (see Castresana 2002 Similar to the indel rate, the mouserat substitution rate is significantly lower on X than on any autosome (P < 0.001, unpaired t-test; Table 4). The significance remains after applying the Bonferroni correction for multiple comparisons.
A genome-wide comparison of substitution rates for all sites confirmed our findings from the ARs. Indeed, there is a lower substitution rate on chromosome X than on autosomes when all sites aligned between rat and mouse are compared. The average nucleotide substitution rate (weighted by the length of individual alignments) is 0.152 per site for chromosome X and 0.167 per site for autosomes. Hence, the X/autosome substitution rate ratio is 0.908, so = 1.760.
In this study, we conducted a whole-genome comparison of mutation rates between chromosome X and autosomes in rat and mouse and came to the following conclusions. First, there is strong evidence of male mutation bias for indel mutations in rodents. To our knowledge, this is the first report of the sex-specific bias for indels in mammals. Second, we confirm male mutation bias for nucleotide substitutions in rodents. Remarkably, for both types of mutations, the male-to-female mutation rate ratio is approximately two, similar to the male-to-female ratio in the number of germline cell divisions. This strongly supports the replication-driven origin for both indels and nucleotide substitutions. One should be aware that an estimate of c 2 in rodents is an approximation (Chang et al. 1994
Previous studies indicated that indel mutations might occur during meiosis (Ketterling et al. 1994
The recombination origin of indels was hypothesized because of maternal mutation bias for large indels (usually deletions) causing some human genetic diseases (e.g., in neurofibromatosis type 1 and hemophilia B; Lazaro et al. 1996
Recently, it has been shown that there is a substantial variation in mutation rates within chromosomes, and point mutation rates, insertion rates of repetitive elements, and recombination rates covary locally (Lercher et al. 2001
Here, we use orthologous ARs as a data source for modeling neutral evolution. A potential problem associated with using ARs is in gene conversion. Roy et al. (2000
Our results based on a comparison of rates between chromosome X and autosomes await confirmation from the comparisons of mutation rates between chromosome Y and autosomes as well as between the two sex chromosomes. These latter studies will become possible once the complete sequences of the mouse and rat Y chromosome are available. Differing male-to-female mutation rate ratios among the three types of comparisons would indicate that other factors, in addition to the number of DNA replications, determine differences in mutation rates between sex chromosomes and between sex chromosomes and autosomes. One of such factors could be a specifically reduced mutation rate of the X chromosome (McVean and Hurst 1997
Conclusions
Alignments The mutation rates between orthologous rat and mouse ARs were estimated from four-way multiple alignments. The April 2003 human genome assembly (hg15), the February 2003 mouse genome assembly (mm3), and the June 2003 rat genome assembly (rn3) were aligned along with the consensus sequences for the ARs, as described in Blanchette et al. 2004
The mutation rates at all sites were studied from the whole-genome alignments of rat and mouse. Namely, the February 2003 mouse genome assembly (mm3) and the June 2003 rat genome assembly (rn3) were aligned with BLASTZ (Schwartz et al. 2003
Estimating the Male-to-Female Mutation Rate Ratio
Indels
The significance in the difference of indel frequencies between chromosome X and autosomes was tested by a
To obtain confidence interval for the estimates of
The indel frequency spectrum was compared between chromosome X and autosomes by the Kolmogorov-Smirnov test on the two distributions of indels (by length) for chromosome X and pooled for all autosomes. To test a hypothesis of a more pronounced male bias for 1-bp indels, we performed a
Nucleotide Substitutions
An unpaired t-test was used to compare the mean substitution rates between chromosome X and autosomes. Later, the same test was used to compare the mean substitution rates between chromosome X and each autosome, applying the Bonferroni correction for multiple tests. A weighted estimate of the variance for chromosome X and autosomes was calculated as suggested by Erlandsson et al. (2000
To assess the effects of GC content on nucleotide substitution rate, we proceeded as in Hardison et al. (2003 Statistical tests and the quadratic regression were performed with the MINITAB software package.
We thank Webb Miller and Ross Hardison for comments improving the manuscript and for helpful discussions. Thanks to Hans Ellegren for suggestions on the manuscript. The sequence alignments used in this project were generated on a 1024-node Pentium III cluster at the University of California at Santa Cruz (UCSC). We acknowledge David Haussler, Krishna Roskin, and the UCSC Genome Browser staff for generating these alignments. Scott Schwartz helped with managing the alignments at Penn State. We are thankful to Paula Goetting-Minesky and Laura Elnitski for critical readings of the manuscript. We are grateful to Michael Jensen-Seaman for sharing his results before publication. This study was supported by the startup funds available to K.D.M. from the Eberly College of Sciences at Penn State University. The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1971104.
6 Corresponding author.
Blanchette, M., Kent, W.J., Riemer, C., Elnitski, L., Smit, A.F.A., Roskin, K.M., Baertsch, R., Rosenbloom, K., Clawson, H., Green, E.D., et al. 2004. Aligning muliple genomic sequences with the threaded blockset aligner. Genome Res. (this issue). Bohossian, H.B., Skaletsky, H., and Page, D.C. 2000. Unexpectedly similar rates of nucleotide substitution found in male and female hominids. Nature 406: 622-625.[CrossRef][Medline]
Castresana, J. 2002. Genes on human chromosome 19 show extreme divergence from the mouse orthologs and a high GC content. Nucleic Acids Res. 30: 1751-1756. Chang, B.H.J. and Li, W.-H. 1995. Estimating the intensity of male-driven evolution in rodents by using X-linked and Y-linked Ube 1 genes and pseudogenes. J. Mol. Evol. 40: 70-77.[CrossRef][Medline]
Chang, B.H.J., Shimmin, L.C., Shyue, S.-K., Hewett-Emmett, D., and Li, W.-H. 1994. Weak male-driven evolution in rodents. Proc. Natl. Acad. Sci. 91: 827-831. Chang, B.-H., Hewett-Emmett, D., and Li, W.-H. 1996. Male-to-female ratios of mutation rate in higher primates estimated from intron sequences. Zool. Stud. 35: 36-48. Chiaromonte, F., Yap, V.B., and Miller, W. 2002. Scoring pairwise genomic sequence alignments. Pac. Symp. Biocomput. 115-126. Chuzhanova, N.A., Annassis, E.J., Ball, E.V., Krawczak, M., and Cooper, D.N. 2002. Meta-analysis of indels causing human genetic disease: Mechanisms of mutagenesis and the role of local DNA sequence complexity. Hum. Mutat. 21: 28-44. Crow, J.F. 2000. The origins, patterns and implications of human spontaneous mutation. Nat. Rev. Genet. 1: 40-47.[Medline] Dermitzakis, E.T., Reymond, A., Lyle, R., Scamuffa, N., Ucla, C., Deutsch, S., Stevenson, B.J., Flegel, V., Bucher, P., Jongeneel, C.V., et al. 2002. Numerous potentially functional but non-genic conserved sequences on human chromosome 21. Nature 420: 578-582.[CrossRef][Medline] Drouin, G., Prat, F., Ell, M., and Clarke, G.D. 1999. Detecting and characterizing gene conversions between multigene family members. Mol. Biol. Evol. 16: 1369-1390.[Abstract] Ebersberger, I., Metzler, D., Schawarts, C., and Pääbo, S. 2002. Genomewide comparison of DNA sequences between humans and chimpanzees. Am. J. Hum. Genet. 70: 1490-1497.[CrossRef][Medline]
Erlandsson, R., Wilson, J.F., and Pääbo, S. 2000. Sex chromosomal transposable element accumulation and male-driven substitutional evolution in humans. Mol. Biol. Evol. 17: 804-812.
Hardison, R.C., Roskin, K., Yang, S., Diekhans, M., Kent, W.J., Weber, R., Elnitski, L., Li, J., O'Connor, M., Kolbe, D., et al. 2003. Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. Genome Res. 13: 13-26. Hurst, L.D. and Ellegren, H. 1998. Sex biases in the mutation rate. Trends Genet. 14: 446-452.[CrossRef][Medline] Hurst, L.D. and Williams, E.J.B. 2000. Covariation of GC content and the silent site substitution rate in rodents: Implications for methodology and for the evolution of isochores. Gene 261: 107-114.[CrossRef][Medline] Jensen-Seaman, M.I., Furey, T.S., Payseur, B.A., Lu, Y., Roskin, K.M., Chen, C-F., Thomas, M.A., Haussler, D., and Jacob, H.J. 2004. Comparative recombination rates in rat, mouse, and human. Genome Res. (this issue). Ketterling, R.P., Vielhaber, E.L., Lind, T.J., Thorland, E.C., and Sommer, S.S. 1994. The rates and patterns of deletions in the human factor IX gene. Am. J. Hum. Genet. 54: 201-213.[Medline] Kluwe, L., Mautner, V., Parry, D.M., Jacoby, L.B., Baser, M., Gusella, J., Davis, K., Stavrou, D., and MacCollin, M. 2000. The parental origin of new mutations in neurofibromatosis 2. Neurogenetics 3: 17-24.[CrossRef][Medline]
Kolb, A. 2003. The first intron of the murine Kong, A., Gudbjartsson, D.F., Sainz, J., Jonsdottir, G.M., Gudjonsson, S.A., Richardsson, B., Sigurdardottir, S., Barnard, J., Hallbeck, B., Masson, G., et al. 2002. A high-resolution recombination map of the human genome. Nat. Genet. 31: 241-247.[CrossRef][Medline] Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860-921.[CrossRef][Medline] Lazaro, C., Gaona, A., Ainsworth, P., Tenconi, R., Vidaud, D., Kruyer, H., Ars, E., Volpini, V., and Estivill, X. 1996. Sex differences in mutational rate and mutational mechanism in the NF1 gene in neurofibromatosis type 1 patients. Hum. Genet. 98: 696-699.[CrossRef][Medline]
Lercher, M.J., Williams, E.J.B., and Hurst, L.D. 2001. Local similarity in evolutionary rates extends over whole chromosomes in humanrodent and mouserat comparisons: Implications for understanding the mechanistic basis of the male mutation bias. Mol. Biol. Evol. 18: 2032-2039. Li, W.-H., Yi, S., and Makova, K.D. 2002. Male-driven evolution. Curr. Opin. Genet. Dev. 12: 650-656.[CrossRef][Medline] Makova, K.D. and Li, W.-H. 2002. Strong male-driven evolution of DNA sequences in humans and apes. Nature 416: 624-626.[CrossRef][Medline] Malcom, C.M., Wyckoff, G.J., and Lahn, B.T. 2004. Genic mutation rates in mammals: Local similarity, chromosomal heterogeneity, and X-versus-autosome disparity. Mol. Biol. Evol. 20: 1633-1641. McVean, G.T. and Hurst, L.D. 1997. Evidence for a selectively favourable reduction in the mutation rate of the X chromosome. Nature 386: 388-392.[CrossRef][Medline] Miyata, T., Hayashida, H., Kuma, K., Mitsuyasu, K., and Yasunaga, T. 1987. Male-driven evolution: A model and nucleotide sequence analysis. Cold Spring Harb. Symp. Quant. Biol. LII: 863-867. Modrich, P. and Lahue, R. 1996. Mismatch repair in replication fidelity, genetic recombination and cancer biology. Annu. Rev. Biochem. 65: 101-133.[CrossRef][Medline] Rat Genome Sequencing Project Consortium. 2004. Genome sequence of the Brown Norway Rat yields insights into mammalian evolution. Nature (in press). Roberts, P.S., Chung, J., Jozwiak, S., Dabora, S., Franz, D.N., Thiele, E.A., and Kwiatkowski, D. 2002. SNP identification, haplotype analysis, and parental origin of mutations in TSC2. Hum. Genet. 111: 96-101.[CrossRef][Medline]
Roy, A.M., Carroll, M.L., Nguyen, S.N., Salem, A.-H., Oldridge, M., Wilkie, A.O.M., Batzer, M., and Deininger, P.L. 2000. Potential gene conversion and source genes for recently integrated Alu elements. Genome Res. 10: 1485-1495. Roy-Engel, A.M., Carroll, M.L., El-Sawy, M., Salem, A.-H., Garber, R.K., Nguyen, S.V., Deininger, P.L., and Batzer, M.A. 2002. Non-traditional Alu evolution and primate genomic diversity. J. Mol. Biol. 316: 1033-1040.[CrossRef][Medline]
Salem, A.-H., Kilroy, G.E., Watkins, W.S., Jorde, L.B., and Batzer, M.A. 2003. Recently integrated Alu elements and human genomic diversity. Mol. Biol. Evol. 20: 1349-1361. Schwartz, S., Kent, W.J., Smit, A., Zhang, Z., Baertsch, R., Hardison, R.C., Haussler, D., and Miller, W. 2003. Humanmouse alignments with BLASTZ. Genome Res. 13: 103-107. Shimmin, L.C., Chang, B.H., and Li, W.-H. 1993. Male-driven evolution of DNA sequences. Nature 362: 745-747.[CrossRef][Medline]
Smith, N.G.C. and Hurst, L.D. 1999. The causes of synonymous rate variation in the rodent genome: Can substitution rates be used to estimate the sex bias in mutation rate? Genetics 152: 661-673. Sommer, S.S., Scaringe, W.A., and Hill, K.A. 2001. Human germline mutation in the factor IX gene. Mutat. Res. 487: 1-17.[Medline] Stenson, P.D., Ball, E.V., Mort, M., Phillips, A.D., Shiel, J.A., Thomas, N.S., Abeysinghe, S., Krawczak, M., and Cooper, D.N. 2003. Human Gene Mutation Database (HGMD): 2003 update. Hum. Mutat. 21: 577-581.[CrossRef][Medline]
Sundstrom, H., Webster, M.T., and Ellegren, H. 2003. Is the rate of insertion and deletion mutation male biased?: Molecular evolutionary analysis of avian and primate sex chromosome sequences. Genetics 164: 259-268. Trappe, R., Laccone, F., Cobilanschi, J., Meins, M., Huppke, P., Hanefeld, F., and Engel, W. 2001. MECP2 mutations in sporadic cases of Rett syndrome are almost exclusively of paternal origin. Am. J. Hum. Genet. 68: 1093-1101.[CrossRef][Medline] Waterston, R.H., Lindblad-Toh, K., Birney, E., Rogers, J., Abril, J.F., Agarwal, P., Agarwala, R., Ainscough, R., Alexandersson, M., An, P., et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420: 520-562.[CrossRef][Medline] Whelan, S., Lio, P., and Goldman, N. 2001. Molecular phylogenetics: State-of-the-art methods for looking into the past. TIG 17: 262-272. Wolfe, K.H. and Sharp, P.M. 1993. Mammalian gene evolution: Nucleotide sequence divergence between mouse and rat. J. Mol. Evol. 37: 441-456.[Medline]
Yang, Z. 1997. PAML: A program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13: 555-556.
http://ftp.genome.washington.edu/RM/RepeatMasker.html; RepeatMasker.
Received September 12, 2003;
accepted in revised format December 9, 2003.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||