|
|
|
|
Published online before print
January 2, 2007, 10.1101/gr.5457707 Genome Res. 17:184-191, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Letter Highly efficient concerted evolution in the ribosomal DNA repeats: Total rDNA repeat variation revealed by whole-genome shotgun sequence data1 National Institute for Basic Biology, Okazaki, 444-8585 Japan; 2 SOKENDAI, Okazaki, 444-8585 Japan; 3 Department of Biology, Duke University, Durham, North Carolina 27708, USA
Repeat families within genomes are often maintained with similar sequences. Traditionally, this has been explained by concerted evolution, where repeats in an array evolve "in concert" with the same sequence via continual turnover of repeats by recombination. Another form of evolution, birth-and-death evolution, can also explain this pattern, although in this case selection is the critical force maintaining the repeats. The level of intragenomic variation is the key difference between these two forms of evolution. The prohibitive size and repetitive nature of large repeat arrays have made determination of the absolute level of intragenomic repeat variability difficult, thus there is little evidence to support concerted evolution over birth-and-death evolution for many large repeat arrays. Here we use whole-genome shotgun sequence data from the genome projects of five fungal species to reveal absolute levels of sequence variation within the ribosomal RNA gene repeats (rDNA). The level of sequence variation is remarkably low. Furthermore, the polymorphisms that are detected are not functionally constrained and seem to exist beneath the level of selection. These results suggest the rDNA is evolving via concerted evolution. Comparisons with a repeat array undergoing birth-and-death evolution provide a clear contrast in the level of repeat array variation between these two forms of evolution, confirming that the rDNA indeed does evolve via concerted evolution. These low levels of intra-genomic variation are consistent with a model of concerted evolution in which homogenization is very rapid and efficiently maintains highly similar repeat arrays.
Repetitive elements are an abundant feature of genomes (Britten and Kohne 1968
The major problems in detecting variation within large tandem-repeat arrays are the prohibitive size and high levels of similarity of these arrays. Therefore, they represent uncharted territories in the genome, and are usually assumed to be comprised of repeats with identical sequences (e.g., Goffeau et al. 1996
The ribosomal RNA repeats (rDNA) are an extensively studied repetitive gene family. Each repeat unit contains three ribosomal RNA (rRNA) genes, the large subunit (LSU), small subunit (SSU), and 5.8S rRNA genes, as well as two transcribed spacers (the ITS1 and ITS2) and a large intergenic spacer (IGS) (Long and Dawid 1980
Fungi are an ideal group of eukaryotes to study at a genomic level due to their relative small genome sizes. We chose five fungi encompassing a diverse range of species for which WGSS data were available. These species were chosen as all have a simple rDNA organization, i.e., a single rDNA array. The first four are members of the phylum Ascomycota. Saccharomyces cerevisiae is a budding yeast from the family Saccharomycetaceae, and is a widely used model organism for biological research (the other model yeast, Schizosaccharomyces pombe was not included as it contains two rDNA arrays). The haploid genome sequence was completed by the Broad Institute (http://www.broad.mit.edu/annotation/genome/saccharomyces_cerevisiae/Home.html). Saccharomyces paradoxus is a close relative of S. cerevisiae. The diploid genome sequence was completed by the Broad Institute as part of a comparative study of Saccharomycetaceae genomes (Kellis et al. 2003 In this study we identified all of the rDNA reads from WGSS projects of these five fungal species and assembled these reads and scanned them for polymorphisms to identify the total level of variation in the rDNA. The results give us the first complete picture of the level of variation within a large, tandemly arrayed repeat family, and this variation is very low. We discuss the implications of these results for the evolution of the rDNA repeats.
The rDNA reads from the five genome projects were each assembled into a rDNA unit alignment. The high similarity of the repeats in the rDNA means there is no way to determine which repeat in the array any given sequence read comes from (e.g., whether a given read comes from repeat #1 or repeat #100). Instead, the sequence data collapse down to a single rDNA unit alignment whose sequence coverage is a product of the genome coverage level and rDNA copy number. We performed automated searches, followed by manual corrections on these alignments to identify polymorphisms (see Methods for details). The polymorphisms that were identified were put into two classes, high-confidence and low-confidence polymorphisms, and results for both are presented. The low-confidence polymorphisms occur in areas of low-sequence quality, and thus, we believe that most of them are unlikely to be real, but we have included them for completeness. We found no evidence for any rearranged rDNA units, such as those found recently in humans (Caburet et al. 2005
Low-sequence variation within the rDNA repeats
To make meaningful interpretations of these polymorphism levels, it is necessary to relate the polymorphism level to the copy number of the rDNA (as more copies are expected to harbor a greater absolute polymorphism level). Therefore, we determined rDNA copy number for these species. A. gossypii was previously reported to contain 50 rDNA copies (Wendland et al. 1999 150 copies (Kobayashi et al. 1998 850 kb; A. nidulans = 360 kb; and C. neoformans = 440 kb. From these values, we calculated rDNA copy number: S. paradoxus contains 90 rDNA copies (per haploid genome); A. nidulans 45 copies; and C. neoformans 55 copies. Two sharp bands are present in S. paradoxus, consistent with slight allelic variation in rDNA copy number in this diploid. The rDNA band is much more diffuse in A. nidulans, consistent with more copy-number variation. This is expected, as this fungus is filamentous; therefore, the time back to the last single nucleus is expected to be large, resulting in greater variation as a result of the continuous copy-number changes that occur during growth (Cowen et al. 2000
To put these levels of variation in perspective, we compared the level of variation found here in the rDNA with the level of variation present in a repeat family undergoing birth-and-death evolution. We used the nucleotide diversity ( ) statistic (Nei and Li 1979 within each array was calculated for each species for both the rDNA and poly-u repeats (Table 1). The contrast in variation between the rDNA and poly-u repeats is stark, with the poly-u repeats showing three to four orders of magnitude more diversity than the rDNA repeats. Many of the rDNA polymorphisms we observe are present on only a single sequence read, and these are probably sequence errors (see Discussion). If so, they artificially inflate the level of , and therefore, we have also calculated for the rDNA polymorphisms using only those polymorphisms present in more than one read (Table 1). To rule out drastic differences in evolutionary rates being responsible for these results (i.e., to rule out the possibility that the lower rate of intra-genomic variation in the rDNA is the result of sequence change intolerance), we measured the inter-specific divergences for these two loci. These were calculated by comparing the consensus sequences between all five species separately for both loci. The levels of divergence are very similar (average pairwise nucleotide similarity between the five species is 79% for the polyubiquitin gene repeat vs. 76% for the rDNA repeat). Therefore, the overall level of sequence constraint in these two loci is very similar, but the intra-genomic variation is markedly different, demonstrating that the within-array evolutionary forces of these two loci differ greatly.
Polymorphisms exist beneath the radar of selection We next looked to see how the polymorphisms we detected are distributed with respect to functional constraint. If selection acts directly on rDNA mutations, we would expect to preferentially find polymorphisms in rDNA residues that show low levels of constraint. Visual inspection of the positions of the polymorphisms (Fig. 2) reveals no obvious pattern of localization. Importantly, there is no bias of polymorphisms toward the IGS regions, which are the least selectively constrained regions of the rDNA. However, it is still possible that the polymorphisms found in the rRNA genes mostly fall on residues with low selective constraint. To test this, we used the LSU and SSU variability maps from the European Ribosomal RNA Database (Van de Peer et al. 1997
Mutational spectrum of the rDNA polymorphisms If the polymorphisms we observe are appearing beneath the level of selection, they may give us insight into the spectrum of mutations offered by the mutation process. Therefore, we looked at the types of mutations that have arisen. Polymorphisms were divided into five classes of mutation (transitions, transversions, insertions, deletions, and complex mutations), and the proportions of these for each species (except A. gossypii and S. cerevisiae, as there are too few polymorphisms) are presented (for high-confidence polymorphisms) (Fig. 4). Each species has its own unique profile of mutation, although the polymorphism levels in S. paradoxus and A. nidulans are too low for strong conclusions. However, we find, as expected, that transitions are the most common form of substitution in all species. The most striking pattern is that of C. neoformans, which shows a very strong deletion bias (62% of all polymorphisms are deletions). If deletion/insertion bias predicts the direction of genome size evolution (Mira et al. 2001
C. neoformans is also characterized by several highly complex mutations. These mutations typically involve replacement of one large sequence tract (1019 bp) with a similar-sized tract of unrelated sequence (1028 bp). We speculated that these mutations were the result of sequence substitution mutations (Yoshiyama et al. 2001
This study has given us the first quantitative picture of the level of polymorphism present within rDNA arrays. Combining the levels of polymorphism with rDNA copy number data, it is clear the level of variation across the rDNA arrays in all five fungal species is extremely low. Indeed, in each species many repeats must be identical in sequence across the entire unit ( 7.59 kb of sequence), as the total number of polymorphisms is less than the rDNA copy number in every species. We also show that this level of variation is orders of magnitude lower than that of a repeat family undergoing birth-and-death evolution. Furthermore, there is no bias of the few observed polymorphisms to areas of low selective constraint, and polymorphisms present in a few copies seem to exist beneath the level of selection. Together these results demonstrate that the rDNA is evolving via concerted evolution, rather than birth-and-death evolution, and suggest that homogenization is highly efficient at maintaining the rDNA with near-identical repeats. At first glance it may seem paradoxical that variation within the rDNA is very low when some regions of the rDNA (notably the IGS) evolve very rapidly. However, these features are easily reconciled under a model of rapid homogenization (Fig. 5). In this model there are three phases of homogenization in a hypothetical repeat array. In the first phase (mutation), mutations can occur stochastically anywhere within the repeat unit. The rDNA is highly redundant, so no selective pressure acts on these "unique" mutations and they can persist for some time. Indeed, most polymorphisms we observe seem to fall into this class: low-frequency polymorphisms that are located randomly throughout the repeat unit, irrespective of the level of constraint. In the second phase (transition), continual repeat turnover by homogenization (unequal recombination) results in mutated repeats being either deleted or duplicated (again stochastically). Deletion obviously removes that mutation from the array (homogenizes the array). Duplication starts the mutated repeat on a process where it may increase in copy number through successive duplications. This is where natural selection comes in. A deleterious mutation will only be able to increase in copy number up to a certain threshold, above which the mutation will compromise fitness. Therefore, only mutations tolerated by natural selection can increase to high copy numbers in the array. Interestingly, the three high copy-number polymorphisms found in this study (two from S. cerevisiae and one from S. paradoxus) (Fig. 2) are present in the IGS and ITS, the regions of the rDNA with the lowest selective constraint, fully consistent with them being neutral variants that are being spread by homogenization. All other polymorphisms are present at low copy number, and therefore are likely to have arisen recently or are unable to spread to high copy number because of functional constraint. The final phase is fixation, where a "tolerated" mutant repeat completely replaces the previous repeats. Thus a new, variant sequence becomes homogenized in the array.
A rapid homogenization model of concerted evolution is consistent with the results from numerous previous studies (Liao et al. 1997 One unusual feature of our data, as noted in the Results section, is that the majority of polymorphisms are found in only one read. Given that these genomes were sequenced to multiple levels of coverage, we expect "true" polymorphisms to be present in multiple reads at a frequency similar to the coverage level (i.e., for a threefold coverage level, we expect to find single-copy polymorphisms in approximately three reads). We suspect that these are the result of unknown sequencing errors. The possibility that these polymorphisms result from heavily methylated, inactive rDNA copies that are biased against in the genomic libraries, and which contain many mutations, was ruled out by digestion of S. paradoxus, A. nidulans, and C. neoformans genomic DNA with methylation sensitive/insensitive isoschizomers. No obvious rDNA methylation was revealed in any of these species (results not shown). Another possibility is that these polymorphisms are real and are the result of cell-to-cell variation due to ongoing mutation in the rDNA during cell growth, resulting in polymorphisms between cells in a colony. In either case, only counting polymorphisms present on more than one read will be a more realistic measure of total array variation.
No fixed (allelic) differences were found between the homologous arrays in S. paradoxus, even though fixed differences are expected in diploid organisms with low rates of sexual recombination (K. Klein, pers. comm.). We did observe one high-frequency IGS polymorphism in this species, but the polymorphism is present equally on both homologous rDNA arrays (see Supplemental Fig. 1). Although this is consistent with the idea of the cohesive spread of variants through populations by concerted evolution (Ohta and Dover 1984
Some differences between the five species were observed, particularly the high proportion of polymorphisms found on multiple reads in S. paradoxus. However, if single-read polymorphisms are excluded, the results for all five species are similar. All five species have a single rDNA array, and it will be interesting to see what pattern emerges from species with more complicated rDNA organizations. Also, sex in these species is thought to be rare or absent, and they all have a history of vegetative lab cultivation, features that will limit variation (K. Klein, pers. comm.). Finally, variation in the structural arrangement of repeat units has been detected within some species (Caburet et al. 2005
In summary, this study shows that the level of intragenomic variation in the rDNA arrays of these five fungal species is extremely low. This low level of variation provides a clear distinction between repeats evolving by concerted evolution and those evolving by birth-and-death evolution, and suggests that concerted evolution is very dynamic and efficient. The WGSS data methodology used here can be applied to other repeat arrays as well as to the rDNA of other species. Furthermore, the contrast between the variation observed in repeats undergoing concerted evolution versus repeats undergoing birth-and-death evolution provides a framework for determining the evolutionary behavior of other repeat families. These results are reassuring for researchers using the rDNA for phylogenetic purposes, as there is no evidence for major cryptic variation within the repeats, although the situation may be different in species with more complicated rDNA arrangements (e.g., multiple loci), and different life-history traits (e.g., higher rates of sexual recombination). Extending these analyses to more and varied species will help clarify this. Our results also provide critical data for the formulation of theoretical models of concerted evolution, as now both the rate of rDNA recombination (e.g., Szostak and Wu 1980
Identifying polymorphisms rDNA sequence reads were either obtained directly from the genome sequencing center (A. gossypii and C. neoformans) or (for S. paradoxus and A. nidulans) identified as follows. First, complete rDNA unit sequences were constructed by taking a portion of the rDNA (from GenBank), BLASTing (Altschul et al. 1997 3.3-fold coverage of the rDNA; S. cerevisiae, 10,764 reads for 5.4-fold coverage; S. paradoxus, 5571 reads for 1.7-fold coverage (diploid coverage level); A. nidulans, 4996 reads for 7.7-fold coverage; and C. neoformans, 4206 reads for 4.7-fold coverage. rDNA coverage level was calculated using total rDNA array length (from rDNA copy-number results) and the total rDNA read length (obtained by multiplying rDNA read number by average read length). Average read length was obtained either directly from the genome sequencing center or calculated from total read number, sequenced genome length, and coverage level. This was 500 bp for A. gossypii, S. cerevisiae, and S. paradoxus, and 534 bp for A. nidulans. The value for C. neoformans is not known, but we used 500 bp as an estimate. rDNA coverage level calculated in this way was consistently lower than overall genome coverage level, but this is not surprising as regions of unusual chromatin structure such as centromeres and telomeres are usually underrepresented in genomic libraries (e.g., Mais et al. 2005rDNA reads from each genome were assembled into a single unit with a high density of sequence reads using the Phred/Phrap/Consed software cluster (http://www.phrap.org/phredphrapconsed.html). Due to the large amount of data, this often required assemblies of subsets of the data. Any minor contigs formed were manually checked, and any reads that were real rDNA reads were subsequently incorporated into the main contig. Polymorphisms were then automatically identified using Consed, and every polymorphic chromatogram was checked by eye. From this, "true" polymorphisms were identified and put subjectively into two classes: high-confidence, and low-confidence polymorphisms. Reads containing polymorphisms are presented in Supplemental Table 1.
Molecular biology techniques
Determination of rDNA copy number
Nucleotide diversity
Particular thanks are owed to Fred Dietrich (Duke Univ. Medical Center) for the inspiration for this project, and for providing access to the A. gossypii genome sequence data. We also thank Eula Fung (Stanford Genome Tech. Center) for providing the C. neoformans rDNA reads and for helpful discussions; Alexey Egorov (NCBI Trace Archive) for retrieving the rDNA reads for A. nidulans and S. paradoxus; and Tim James and Rytas Vilgalys (Duke Univ.), and Takashi Horiuchi (Natl. Inst. Basic Biol.) for helpful discussions. Thanks are also due to K. Klein (Minnesota State Univ.) for helpful comments on the manuscript, and the communication of unpublished results. This work was supported by grants from the Clark Fellowship for Molecular Evolution and Comparative Genomics and the Japan Society for the Promotion of Science to ARDG, grants 17080010, 17370065, and 18207013 from the Ministry of Education, Science and Culture, Japan, and by a Human Frontier Science Program grant to T.K.
4 Present address: Division of Cytogenetics, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan.
E-mail gausten{at}lab.nig.ac.jp; fax 81-055-981-6881. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5457707
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 33893402. Andersson, D.I., Slechta, E.S., and Roth, J.R. 1998. Evidence that gene amplification underlies adaptive mutability of the bacterial lac operon. Science 282: 11331135. Averbeck, K.T. and Eickbush, T.H. 2005. Monitoring the mode and tempo of concerted evolution in the Drosophila melanogaster rDNA locus. Genetics 171: 18371846. Ben Ali, A., Wuyts, J., De Wachter, R., Meyer, A., and de Van Peer, Y. 1999. Construction of a variability map for eukaryotic large subunit ribosomal RNA. Nucleic Acids Res. 27: 28252831. Berbee, M.L. and Taylor, J.W. 2001. Fungal molecular evolution: Gene trees and geologic time. In The Mycota (eds. D.J. McLaughlin, E. McLaughlin, and P.A. Lemke,), pp. 229246. Springer, Berlin. Birren, B., Green, E.D., Klapholz, S., Myers, R.M., and Roskams, J. 1997. Analyzing DNA. In Genome analysis: A laboratory manual. (eds. E.D. Green, B. Birren, S. Klapholz, R.M. Myers, and P. Hieter,), pp. 125127. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. Britten, R.J. and Kohne, D.E. 1968. Repeated sequences in DNA. Science 161: 529540. Brown, D.D., Wensink, P.C., and Jordan, E. 1972. A comparison of the ribosomal DNAs of Xenopus laevis and Xenopus mulleri: The evolution of tandem genes. J. Mol. Biol. 63: 5773.[CrossRef][Medline] Caburet, S., Conti, C., Schurra, C., Lebofsky, R., Edelstein, S.J., and Bensimon, A. 2005. Human ribosomal RNA gene arrays display a broad range of palindromic structures. Genome Res. 15: 10791085. Carbone, I. and Kohn, L.M. 2001. A microbial population-species interface: Nested cladistic and coalescent inference with multilocus data. Mol. Ecol. 10: 947964.[CrossRef][Medline] Copenhaver, G.P. and Pikaard, C.S. 1996. Two-dimensional RFLP analyses reveal megabase-sized clusters of rRNA gene variants in Arabidopsis thaliana, suggesting local spreading of variants as the mode for gene homogenization during concerted evolution. Plant J. 9: 273282.[CrossRef][Medline] Cowen, L.E., Sanglard, D., Calabrese, D., Sirjusingh, C., Anderson, J.B., and Kohn, L.M. 2000. Evolution of drug resistance in experimental populations of Candida albicans. J. Bacteriol. 182: 15151522. Dietrich, F.S., Voegeli, S., Brachat, S., Lerch, A., Gates, K., Steiner, S., Mohr, C., Pöhlmann, R., Luedi, P., and Choi, S., et al. 2004. The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome. Science 304: 304307. Dover, G.A. 1982. Molecular drive: A cohesive mode of species evolution. Nature 299: 111117.[CrossRef][Medline] Gangloff, S., Zou, H., and Rothstein, R. 1996. Gene conversion plays the major role in controlling the stability of large tandem repeats in yeast. EMBO J. 15: 17151725.[Medline] Ganley, A.R.D. and Scott, B. 1998. Extraordinary ribosomal spacer length heterogeneity in a Neotyphodium endophyte hybrid: Implications for concerted evolution. Genetics 150: 16251637. Ganley, A.R.D. and Scott, B. 2002. Concerted evolution in the ribosomal RNA genes of an Epichloë endophyte hybrid: Comparison between tandemly-arranged rDNA and dispersed 5S rrn genes. Fungal Genet. Biol. 35: 3951.[CrossRef][Medline] Ganley, A.R.D., Hayashi, K., Horiuchi, T., and Kobayashi, T. 2005. Identifying gene-independent noncoding functional elements in the yeast ribosomal DNA by phylogenetic footprinting. Proc. Natl. Acad. Sci. 102: 1178711792. Goffeau, A., Barrell, B.G., Bussey, H., Davis, R.W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J.D., Jacq, C., and Johnston, M., et al. 1996. Life with 6000 genes. Science 274: 546567. Gonzalez, I.L. and Sylvester, J.E. 2001. Human rDNA: Evolutionary patterns within the genes and tandem arrays derived from multiple chromosomes. Genomics 73: 255263.[CrossRef][Medline] Hadjiolov, A.A. 1984. The nucleolus and ribosome biogenesis. In Cell biology monographs (eds. M. Alfert, et al.). Vol. 12. Springer-Verlag Wien, New York. Hoffman, C.S. and Winston, F. 1987. A ten-minute preparation from yeast efficiently releases autonomous plasmids for transformation of Escherichia coli. Gene 57: 267272.[CrossRef][Medline] Hollocher, H. and Templeton, A.R. 1994. The molecular through ecological genetics of abnormal abdomen in Drosophila mercatorum. VI. The non-neutrality of the Y chromosome rDNA polymorphism. Genetics 136: 13731384.[Abstract] James, T.J., Moncalvo, J.-M., Li, S., and Vilgalys, R. 2001. Polymorphism at the ribosomal DNA spacers and its relation to breeding structure of the widespread mushroom Schizophyllum commune. Genetics 157: 149161. Johnson, L.J., Koufopanou, V., Goddard, M.R., Hetherington, R., Schäfer, S.M., and Burt, A. 2004. Population genetics of the wild yeast Saccharomyces paradoxus. Genetics 166: 4352. Kellis, M., Patterson, N., Endrizzi, M., Birren, B., and Lander, E.S. 2003. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature 423: 241254.[CrossRef][Medline] Kobayashi, T., Heck, D.J., Nomura, M., and Horiuchi, T. 1998. Expansion and contraction of ribosomal DNA repeats in Saccharomyces cerevisiae: Requirement of replication fork blocking (Fob1) protein and the role of RNA polymerase I. Genes & Dev. 12: 38213830. Kovarik, A., Pires, J.C., Leitch, A.R., Lim, K.Y., Sherwood, A.M., Matyasek, R., Rocca, J., Soltis, D.E., and Soltis, P.S. 2005. Rapid concerted evolution of nuclear ribosomal DNA in two Tragopogon allopolyploids of recent and recurrent origin. Genetics 169: 931944. Liao, D., Pavelitz, T., Kidd, J.R., Kidd, K.K., and Weiner, A.M. 1997. Concerted evolution of the tandemly repeated genes encoding human U2 snRNA (the RNU2 locus) involves rapid intrachromosomal homogenization and rare interchromosomal gene conversion. EMBO J. 16: 588598.[CrossRef][Medline] Loftus, B.J., Fung, E., Roncaglia, P., Rowley, D., Amedeo, P., Bruno, D., Vamathevan, J., Miranda, M., Anderson, I.J., and Fraser, J.A., et al. 2005. The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science 307: 13211324. Long, E.O. and Dawid, I.B. 1980. Repeated genes in eukaryotes. Annu. Rev. Biochem. 49: 727764.[CrossRef][Medline] Mais, C., Wright, J.E., Prieto, J.-L., Raggett, S.L., and McStay, B. 2005. UBF-binding site arrays form pseudo-NORs and sequester the RNA polymerase I transcription machinery. Genes & Dev. 19: 5064. Mira, A., Ochman, H., and Moran, N.A. 2001. Deletional bias and the evolution of bacterial genomes. Trends Genet. 17: 589596.[CrossRef][Medline] Nei, M. and Li, W.-H. 1979. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. 76: 52695273. Nei, M., Gu, X., and Sitnikova, T. 1997. Evolution by the birth-and-death process in multigene families of the vertebrate immune system. Proc. Natl. Acad. Sci. 94: 77997806. Nei, M., Rogozin, I.B., and Piontkivska, H. 2000. Purifying selection and birth-and-death evolution in the ubiquitin gene family. Proc. Natl. Acad. Sci. 97: 1086610871. ODonnell, K. and Cigelnik, E. 1997. Two divergent intragenomic rDNA ITS2 types within a monophyletic lineage of the fungus Fusarium are nonorthologous. Mol. Phylogenet. Evol. 7: 103116.[CrossRef][Medline] Ohta, T. and Dover, G.A. 1984. The cohesive population genetics of molecular drive. Genetics 108: 501521. Pérez-González, C.E. and Eickbush, T.H. 2002. Rates of R1 and R2 retrotransposition and elimination from the rDNA locus of Drosophila melanogaster. Genetics 162: 799811. Prokopowich, C.D., Gregory, T.R., and Crease, T.J. 2003. The correlation between rDNA copy number and genome size in eukaryotes. Genome 46: 4850.[Medline] Ritossa, F.M., Atwood, K.C., and Spiegelman, S. 1966. A molecular explanation of the bobbed mutants of Drosophila as partial deficiencies of "ribosomal DNA.". Genetics 54: 819834. Rooney, A.P. and Ward, T.J. 2005. Evolution of a large ribosomal RNA multigene family in filamentous fungi: Birth and death of a concerted evolution paradigm. Proc. Natl. Acad. Sci. 102: 50845089. Rozas, J., Sánchez-DelBarrio, J.C., Messeguer, X., and Rozas, R. 2003. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19: 24962497. Russell, P.J. and Rodland, K.D. 1986. Magnification of rRNA gene number in a Neurospora crassa strain with a partial deletion of the nucleolus organizer. Chromosoma 93: 337340.[CrossRef][Medline] Sambrook, J. and Russell, D.W. 2001. Molecular cloning: A laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. 3rd edition. Scott, B.R. and Kafer, E. 1982. Aspergillus nidulans: An organism for detecting a range of genetic damage. In Chemical mutagens (eds. F.J. deSerres, and A. Hollaender,). Vol. 7, pp. 447479. Plenum, New York. Shapiro, J.A. and von Sternberg, R. 2005. Why repetitive DNA is essential to genome function. Biol. Rev. Camb. Philos. Soc. 80: 227250.[Medline] Skalicka, K., Lim, K.Y., Matyásek, R., Koukalová, B., Leitch, A.R., and Kovarìk, A. 2003. Rapid evolution of parental rDNA in a synthetic tobacco allotetraploid line. Am. J. Bot. 90: 988996. Smith, G.P. 1973. Unequal crossover and the evolution of multigene families. Cold Spring Harb. Symp. Quant. Biol. 38: 507513. Smith, C.L., Klco, S.R., and Cantor, C.R. 1988. Pulsed field gel electrophoresis and the technology of large DNA molecules. In Genome analysis: A practical approach (eds. K.E. Davies, et al.), pp. 4172. IRL Press, Oxford. Szostak, J.W. and Wu, R. 1980. Unequal crossing over in the ribosomal DNA of Saccharomyces cerevisiae. Nature 284: 426430.[CrossRef][Medline] Takeuchi, Y., Horiuchi, T., and Kobayashi, T. 2003. Transcription-dependent recombination and the role of fork collision in yeast rDNA. Genes & Dev. 17: 14971506. de Van Peer, Y., Chapelle, S., and De Wachter, R. 1996. A quantitative map of nucleotide substitution rates in bacterial rRNA. Nucleic Acids Res. 24: 33813391. de Van Peer, Y., Jansen, J., De Rijk, P., and De Wachter, R. 1997. Database on the structure of small ribosomal subunit RNA. Nucleic Acids Res. 25: 111116. Wendland, J., Pöhlmann, R., Dietrich, F., Steiner, S., Mohr, C., and Philippsen, P. 1999. Compact organization of rRNA genes in the filamentous fungus Ashbya gossypii. Curr. Genet. 35: 618625.[CrossRef][Medline] Westenberger, S.J., Sturm, N.R., and Campbell, D.A. 2006. Trypanosoma cruzi 5S rRNA arrays define five groups and indicate geographic origins of an ancestor of the heterozygous hybrids. Int. J. Parasitol. 36: 337346.[CrossRef][Medline] Williams, S.M., Robbins, L.G., Cluster, P.D., Allard, R.W., and Strobeck, C. 1990. Superstructure of the Drosophila ribosomal gene family. Proc. Natl. Acad. Sci. 87: 31563160. Yelton, M.M., Hamer, J.E., and Timberlake, W.E. 1984. Transformation of Aspergillus nidulans by using a trpC plasmid. Proc. Natl. Acad. Sci. 81: 14701474. Yoshiyama, K., Higuchi, K., Matsumura, H., and Maki, H. 2001. Directionality of DNA replication fork movement strongly affects the generation of spontaneous mutations in Escherichia coli. J. Mol. Biol. 307: 11951206.[CrossRef][Medline] Zimmer, E.A., Martin, S.L., Beverley, S.M., Kan, Y.W., and Wilson, A.C. 1980. Rapid duplication and loss of genes coding for the
Received May 1, 2006; accepted in revised format November 16, 2006. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||