|
|
|
|
Vol. 10, Issue 1, 72-80, January 2000
LETTER
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Microsatellite enrichment is an excess of repetitive sequences characteristic to all studied eukaryotes. It is thought to result from the accumulated effects of replication slippage mutations. Enrichment is commonly measured as the ratio of the observed frequency of microsatellites to the frequency expected to result from random association of nucleotides. We have compared enrichment of specific types of microsatellites in coding sequences with those in noncoding sequences across seven eukaryotic clades. The results reveal consistent differences between coding and noncoding regions, in terms of both the quantity of repetitive DNA and the types present. In noncoding regions, all types of microsatellite (mono-, di-, tri-, tetra-, penta-, and hexanucleotide repeats) are found in excess, and in all cases, these excesses scale in a similar exponential fashion with the length of the microsatellite. This suggests that all types of noncoding repeats are subject to similar mutational and selective processes. Coding repeats, however, appear to be under much stronger and more specific constraints. Tri- and hexanucleotide repeats are found in consistent and significant excess over a wide range of lengths in both coding and noncoding sequences, but other repeat types are much less frequent in coding regions than in noncoding regions. These findings suggest that the differences between coding and noncoding microsatellite frequencies arise from specific selection against frameshift mutations in coding regions resulting from length changes in nontriplet repeats. Furthermore, the excesses of tri- and hexanucleotide coding repeats appear to be controlled primarily by mutation pressure.
| |
INTRODUCTION |
|---|
|
|
|---|
Microsatellites are repetitive DNAs consisting of
directly repeated short motifs of 1-6 nucleotides. These repeats are
subject to a high rate of single-motif insertion and deletion
mutations, through the process of replication slippage (Levinson and
Gutman 1987
; Tautz and Schlotterer 1994
). Over time, the cumulative
effect of these mutations generates highly significant excesses of
repetitive tracts in the genomes of eukaryotes (Hancock 1995
; Cox and
Mirkin 1997
). Because of their high rate of mutation, eukaryotic
microsatellites have become a primary source of polymorphic markers for
studies of population genetics and linkage analysis (Slatkin 1995
).
Microsatellites, and their associated mutational processes, have also
been implicated in a variety of human genetic maladies, including
myotonic dystrophy and Fragile X syndrome (Caskey et al. 1992
).
A recent analysis of microsatellite distributions in whole genomic DNA
of Saccharomyces cerevisiae has shown that enrichment of all
types of microsatellites with motifs of from 1 to 5 nucleotides is
primarily dependent on the number of base pairs contained in a repeat
sequence, as opposed to the length or number of individual motifs
(Pupko and Graur 1999
). We observed similar patterns of enrichment in
the noncoding sequences of all eukaryotic clades analyzed in this paper
(for primates and plants, this relationship can be seen in Fig.
1). This suggests that microsatellites of a variety
of motif types undergo similar rates and types of length mutation (or
the fixation thereof) in both genomic DNA considered as a whole and in
noncoding regions analyzed separately. This relationship between
microsatellite length and genomic enrichment has been suggested to
derive from the fact that replication slippage is dependent on
secondary structure formation, which is itself dependent on sequence
length (Cox and Mirkin 1997
).
|
It has been noted for S. cerevisiae that noncoding regions
have more long mono- and dinucleotide repeats (more than eight motifs
in length) than coding regions, whereas trinucleotide repeat abundances
are similar in coding and noncoding regions (Field and Wills 1998
).
Others have also recognized general differences in repeat frequencies
between coding and noncoding regions (Tautz et al. 1986
; Hancock 1996
),
and it has been suggested that these differences arise from increased
selection in coding sequences. Because microsatellites are
characterized by length mutations, we hypothesized that limitation of
microsatellite expansion occurs in coding regions as a direct result of
selection against frameshift mutations. To address this hypothesis, we
have examined the differences between coding and noncoding
microsatellite enrichment for all microsatellite motif lengths, in
several widely diverged eukaryotic clades.
The hypothesis predicts that microsatellites with a motif length not divisible by three (that is, mono-, di-, tetra-, and pentanucleotide repeats) would be enriched less in coding regions than in noncoding regions. Length mutations in these nontriplet microsatellite types would lead to frameshift mutations when they occurred in coding DNA, and selection against these mutations would reduce their chances of fixation. Enrichment, being dependent on the rate of fixation, would likewise be reduced in coding regions. In contrast, microsatellites with a repeat length divisible by three (tri- and hexanucleotide repeats) should not generate frameshift mutations. Fixation of mutants (and further expansion) of these microsatellites should not be prevented by selection against frameshift events.
To test these predictions, we compared the excess of observed over expected frequencies of microsatellites in coding and noncoding regions. This comparison was carried out with DNA libraries representing a variety of widely diverged eukaryotes, including S. cerevisiae, Caenorhabditis elegans, Schizosaccharomyces pombe, Mus, Drosophila, plants, and primates. Observed and expected frequencies of microsatellites in each library were computed independently for each motif size (1-6 bp) and all possible lengths from 1 to 30 bp. Expected frequencies of repeats were primarily derived from nucleotide frequencies, but we controlled for di- and trinucleotide (codon) biases by recalculating statistics for di- and trinucleotide repeats on the basis of motif frequencies. Exponential regression analysis was used to define the relationship between microsatellite excess and length, and nonparametric statistics were used to determine the overall significance of the effect of functional status (coding vs. noncoding) on the excess of each type of repeat.
| |
RESULTS |
|---|
|
|
|---|
Table 1 shows the raw data from the primate
libraries. Raw data for other organisms is not shown, but demonstrated
similar patterns. Figure 1 summarizes the data for triplet and
nontriplet repeats with motifs of 1-6 nucleotides in primates and
plants, depicting the proportional excess of microsatellite DNA as a
function of length for both coding and noncoding sequences. In Figure
1, gaps in the distribution are treated as missing data.
|
All nontriplet microsatellite types (mono-, di-, tetra-, and pentanucleotide repeats) were found at least once in the noncoding primate sequence library for all possible lengths up to 30 bp. The same was true for plants, with the exception of the longest di- and tetranucleotides (30 and 28 bp, respectively). In contrast, most types of nontriplets were not consistently represented at all lengths in the coding libraries (e.g., we found no primate coding mononucleotide runs with lengths of 17-20 bp, nor are there any of 23-30 bp). It is clear from the data shown in Table 1 that nontriplet microsatellites are far less common and less consistently represented across the range of lengths in coding regions than in noncoding regions.
There are no gaps in the length distributions of either coding or noncoding triplet repeats from primates and plants, and there appears to be very little difference in the length distributions of these repeats between coding and noncoding regions. Similar patterns for both triplet and nontriplet repeats were obtained for the other organisms.
For each organism and each type of repeat, the slope (growth
coefficient) of the exponential regression of excess as a function of
length is shown in Tables 2 and 3. These slopes represent the regression lines fitted to the data in the
form shown in Figure 1. To avoid the use of zero values in exponential
regressions, these analyses were carried out by treating gaps in the
distribution (zero frequencies) as missing data. This treatment of gaps
should be conservative with respect to our hypothesis, because zero
frequencies occur primarily in coding nontriplet microsatellites,
and if they could be included, they would lower the value of the
growth coefficient. The fit of the equation was excellent in all cases.
The mean fit and its standard deviation for all 84 exponential
regressions was R = 0.9621 ± 0.09518.
|
|
|
|
The regression slope for each type of nontriplet microsatellite, averaged across all organisms, was lower in coding regions than in noncoding regions. This relationship is independently significant for all polynucleotide nontriplet repeats (di-, tetra-, and pentanucleotides) and strongly significant for nontriplet repeats as a whole, although not for mononucleotide repeats considered independently. Wilcoxon P values for each type of microsatellite, as well as for sets of all triplet and nontriplet microsatellites, are shown in Tables 2 and 3.
The regression slopes of triplet microsatellites, in contrast, were not significantly different between coding and noncoding regions. No differences were found across the range of organisms for tri- or hexanucleotide repeats considered independently or for both types of triplet repeat considered together.
In Table 4, slope data are shown for analyses carried
out on di- and trinucleotide microsatellite frequencies with the motif frequencies instead of the nucleotide frequencies to generate expected
values. This data, and the associated Wilcoxon P value statistics, demonstrate that dinucleotide enrichment is still significantly decreased in coding regions compared with noncoding regions when dinucleotide biases are controlled for, and trinucleotide enrichment still shows no significant difference between coding and
noncoding regions when trinucleotide biases are controlled for.
|
Table 5 shows growth coefficient data for analyses
carried out with only in-frame trinucleotide frequencies to generate
observed and expected frequencies. This procedure, which controls for
the effects of amino acid and codon biases, still fails to reveal any
significant difference between coding and noncoding trinucleotide microsatellite frequencies.
|
In Figure 1, it can be seen that the excesses of primate noncoding mono- and pentanucleotide repeats appear to be somewhat depressed relative to noncoding di-, tri, tetra, and hexanucleotide repeats at lengths above 20 bp. This relationship appeared consistently for mononucleotide repeats across all libraries analyzed here, but is not apparent for pentanucleotides in several libraries.
The only observed deficiencies of repetitive DNA occurred among primates, Mus, and Drosophila, and involved mononucleotide repeats between the lengths of 5 and 15 bp. An example of this can be seen in the Primate Nontriplet graph in Figure 1.
It can also be seen that noncoding dinucleotide repeat excesses
consistently give greater growth coefficients than other types of
noncoding repeats. This effect is significant for all comparisons of
dinucleotides to other repeat types on the basis of two-tailed Wilcoxon-paired sample tests (P
0.02 in all cases).
| |
DISCUSSION |
|---|
|
|
|---|
The data support the hypothesis that selection against frameshift mutations limits expansion of nontriplet microsatellites in coding regions. The rate of generation of excess microsatellites, as represented by the slope of the exponential regression line fit to proportional microsatellite excess as a function of length, is significantly lower for nontriplet microsatellites in coding regions than it is in noncoding regions. The most probable explanation for this difference is that coding nontriplet microsatellites generate negatively selected frameshifts when they undergo length mutations involving single motifs, and that this type of mutation is the most common variety contributing to changes in microsatellite length, and therefore to enrichment.
In contrast, triplet microsatellites do not generate frameshifts through single-motif length mutations. The results show that the rate of generation of excess triplet microsatellites is not significantly affected by coding status. This suggests that both coding and noncoding triplet microsatellites are subject to similar rates of repeat expansion. More specifically, expansion of triplet microsatellites is not subject to differential selective pressures in coding and noncoding regions, whereas nontriplet microsatellites are subject to greater purifying selection in coding regions. An important inference from this observation is that the excess of triplet microsatellites in both coding and noncoding regions is primarily the result of mutation pressure.
Among noncoding repeats, dinucleotides showed significantly higher growth coefficients than other types of microsatellites. This finding appears primarily to be the result of pervasive noncoding-region dinucleotide biases, which increase the likelihood of dinucleotide runs above the frequencies expected on the basis of nucleotide frequencies alone.
Among primates, Mus, and Drosophila, a deficiency of mid-length (5-15 bp) coding mononucleotide runs was seen. This effect cannot be explained by selection against the products of replication slippage, which in theory should act only to prevent enrichment. Such deficiencies might represent selection against the targets rather than the products of replication slippage, or selection against mononucleotide runs resulting from pressures unrelated to length mutability. Alternatively, these mononucleotide repeats could be subject to different patterns of mutation than other repeats.
| |
METHODS |
|---|
|
|
|---|
Library Construction
We obtained sequence data directly from nonredundant and
whole-genome collections in GenBank (release 105.0, February 15, 1998).
Sequences were collected for each library (see Table
6) by downloading files that contained, in the
associated annotation, the name of the appropriate clade and functional
status indicator (coding or noncoding). Library construction was
accomplished with AWK programs (The AWK Programming Language; A.V. Aho,
P.J. Weinberger, B.W. Kernighan, Prentice Hall, Englewood Cliffs, NJ)
developed by J. Bytof. For each library, we randomly collected up to
10,000 sequences. Library sizes and other parameters are shown in Table 6. In some cases, sequence availability limited library size. In cases
for which specific-species data was limited, we combined species to
form higher-level libraries (primate and plant libraries). Because we
are not comparing groups to one another, but rather using groups as
independent samples to address a phylogeny-independent hypothesis, any
evolutionarily defensible grouping strategy would suffice, so long as
none of the groups overlap. The types of sequence used to construct the
noncoding libraries also varied by nature of their availability. For
S. cerevisiae, which has very few introns, the noncoding
library consisted primarily of intergenic sequence, whereas for the
other clades it consisted entirely of intronic sequences. Previous work
has shown that both varieties of noncoding DNA tend to be more
repetitive than nonspecific coding DNA in eukaryotic genomes (Hancock
1995
), and they are equivalent in terms of our hypothesis.
|
The assignment of coding or noncoding status relies on the accuracy of the annotation in GenBank. This annotation is partially based on predictive statistics, as opposed to known functionality. Any errors in assignment, however, should be conservative with regard to the hypothesis because such errors would tend to reduce the statistical significance of observed differences between coding and noncoding regions.
It has been shown that individual regions of high repetitivity are not
conserved, even in otherwise highly conserved coding regions, at
evolutionary distances as short as those between chickens and rats
(Tautz et al. 1986
). In this work the two most closely related clades
are Mus and primates, and we did not find significant evidence
of microsatellite conservation between these even more closely related
groups. We therefore consider that the frequencies of repetitive DNA in
the widely diverged clades that are compared in this work are
effectively independent.
Generation of Observed and Expected Values
The libraries were surveyed to determine the observed and expected
numbers of all mono- through hexanucleotide repeats. Expected numbers
of these repeats were calculated according to the method of deWachter
(1981)
.
To control for local variation in base content, sequences were analyzed individually to obtain observed and expected repeat frequencies. The values obtained for individual sequences were summed for each library.
Observed numbers were generated as follows: A repeat motif M is defined as a specific sequence of bases. If a repeat motif appears in a sequence, but the bases following it do not consist of the same motif, the repeat has length one. If the sequence of bases that follows M is the same as that found in the motif, but the repeat ends at this point, the repeat has length two, and so on. There is redundancy in this measure; for example, if a triplet repeat CATCATCATCAT occurs in the sequence, the algorithm will also count repeats ATCATCATC and TCATCATCA. However, this does not effect the direct comparison of observed and expected numbers of repeats, as the same redundancy applies to both observed and expected figures.
We define the length of a repeat as the total number of nucleotides in
the repeat sequence. Recent work has shown that the excess of observed
over expected frequencies of repeats is a function of the absolute
length of the repeat (in nucleotides) rather than the number of times
the motif is repeated (Pupko and Graur 1999
). In the present analysis,
both observed and expected numbers of all repeats up to a length of 30 were determined. That is, we examined mononucleotide runs of up to 30 repeats, dinucleotide runs of up to 15 repeats, trinucleotide runs of
up to 10 repeats, and so on.
A correction factor was applied to the data to prevent autocorrelation between repeat types. An (AA)3 repeat, for example, could also be counted as an (A)6 repeat. Therefore, runs in which the first base of the repeat motif is the same as the second were excluded from the dinucleotide totals. Corrections to remove similar ambiguities were applied to tri- through hexanucleotide repeats.
To obtain the expected numbers, all possible motifs of 1 to 6 bases
were examined in turn. For each DNA segment made up of N
bases, the expected numbers of a repeat of motif M and repeat number t is given by
|
(1) |
|
(2) |
As with the observed numbers of repeats, a correction factor was applied to the data to prevent autocorrelation between repeat types. For example, probabilities for dinucleotide repeats that were also mononucleotide repeats were counted only in mononucleotide totals, not dinucleotide totals.
Dinucleotide biases can generate apparent microsatellite excesses that
disappear when dinucleotide rather than mononucleotide frequencies are
used to generate the expected frequencies, and this false-positive
effect primarily influences apparent excesses of dinucleotide repeats
(Hancock and Armstrong 1994
). The frequency of trinucleotide repeats in
coding regions is likely to be influenced by a similar effect, due to
amino acid and codon biases that generate nonrandom trinucleotide
frequencies with respect to the underlying nucleotide frequencies.
To control for apparent excesses of di- and trinucleotide repeats that result from biased di- and trinucleotide frequencies, as opposed to replication slippage, similar calculations to those described above were performed to generate repeat frequency expectations on the basis of motif usage instead of nucleotide usage. Motif frequencies were based on the averaged frequencies of all possible motifs (16 in the case of dinucleotides, 64 in the case of tris) in each of the possible frames for that motif (two for dinucleotides and three for tris). Observed values were then compared with these adjusted expected values (see Table 4).
To control specifically for codon biases in the reading frame, a third set of calculations were carried out for trinucleotide repeats, with in-frame codon frequencies to generate f(M) for coding trinucleotides, and one-frame-only trinucleotide frequencies (arbitrarily starting at the first base) for noncoding trinucleotides. In this case, observed microsatellites were limited to those occurring in frame (codon repeats), but otherwise the analysis was performed in an identical manner to that described above with nucleotide frequencies (see Table 5).
The above protocols for calculation of both observed and expected frequencies of repeats were combined in a TrueBasic for the Macintosh program available from C. Wills (cwills{at}ucsd.edu).
Analysis of Repeat Frequencies
The proportional excess of a particular type of microsatellite,
generated by dividing the observed number of repeats of a particular
length by the number expected to occur by chance, increases exponentially with length in eukaryotic genomes (Cox and Mirkin 1997
;
Pupko and Graur 1999
). This relationship is predicted under a Markov
chain model invoking the combined effects of base-pair mutation and
replication slippage (Kruglyak et al. 1998
). We fit our data to an
exponential model:
|
(3) |
i is the expected value of the
ratio of the observed over expected microsatellite frequencies, for
repeats of length i, and Xi is the growth
coefficient. The slope of this relationship on a semilog plot, defined
as the exponent (Xi) of the exponential regression
of excess against length, is used here as a proxy for the rate of
fixation of mutations. A description of the logic behind this statistic
is as follows:
Let us call microsatellites that are generated by recursive slippage
processes (as opposed to those expected to arise by chance association
of nucleotides) "excess microsatellites". Microsatellite mutations
have been shown to be primarily insertions and deletions of single
motifs in both empirical studies (Henderson and Petes 1992
) and
distributional analyses (Bell and Jurka 1997
). Therefore, excess
microsatellites consisting of a particular number of repeats (X) will generally arise through either (1) an insertion event affecting a microsatellite of X
1 repeats or (2) a
deletion event affecting a microsatellite of X + 1 repeats.
In addition, the original microsatellite in case 2 must have, at one
time, arisen from the growth of a microsatellite of X repeats
and is therefore replacing itself in this length category. That is, the first time it reached length X, it did so by mechanism 1. Hence, excess microsatellites are assumed to have arisen at some point in the past from a microsatellite of the next shortest category. The
rate of fixation of mutations determines how many microsatellites of
length X will evolve to length X + 1, and so on.
The general exponential nature of the relationship between
microsatellite excess and length is primarily driven by the exponential decrease in expected values with increasing length in nucleotides (Cox
and Mirkin 1997
) (e.g., given unbiased nucleotide frequencies, any
sequence is expected to occur by chance at a rate of 0.25 raised to the
power of its length in nucleotides). For any organism, the expected
value for a particular length (in base pairs) is the same for
microsatellites of all motif sizes. We controlled for this similarity
by representing all excess values as a function of length, in the form
of an exponential regression equation of excess against length.
Differences between the slopes (growth coefficients) of these
regression lines are assumed to represent differences in the rate at
which microsatellites with different motif lengths evolve (through
mutation and fixation) to generate excess microsatellites.
If comparable mutation pressures are affecting microsatellite expansion, then growth coefficients should be the same in coding and noncoding regions, at least for a particular type of microsatellite in a specific organism or clade. However, coding microsatellites with a motif length not divisible by three are expected to generate frameshifts, and length variants of these microsatellites should be subject to purifying selection. Hence, the model predicts that the rate of fixation of mutations in nontriplet microsatellites should be higher in noncoding regions than in coding regions, and therefore the related growth coefficients should be significantly higher for noncoding, as opposed to coding, sequences. In contrast, for both coding and noncoding triplet microsatellites, the growth coefficients should be the same.
To test this hypothesis, we used the two-tailed Wilcoxon paired-sample
test (Zar 1984
) (H0: Coding slope = Noncoding
slope) to analyze the effect of functional status (coding vs.
noncoding) on the growth coefficients of triplet and nontriplet
microsatellites. A one-tailed test might have been appropriate for the
nontriplet microsatellites, because the hypothesis predicts a specific
directional effect of coding status, but would not have been
appropriate for triplets, for which the hypothesis predicts no
difference between coding and noncoding regions. Two-tailed tests were
used throughout to maintain comparability of statistical parameters. We
also analyzed each type of microsatellite independently for effect of
functional status on microsatellite enrichment.
| |
ACKNOWLEDGMENTS |
|---|
D.M. was supported by a fellowship from the NASA NSCORT program. This work was personally supported by C.W. We thank Christian Hansen for help with statistical analyses.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Corresponding author.
E-MAIL cwills{at}ucsd.edu; FAX (619) 534-7108.
| |
REFERENCES |
|---|
|
|
|---|
Received September 7, 1999; accepted in revised form November 16, 1999.
This article has been cited by other articles:
![]() |
N. Karaiskou, L. Buggiotti, E. Leder, and C. R. Primmer High Degree of Transferability of 86 Newly Developed Zebra Finch EST-Linked Microsatellite Markers in 8 Bird Species J. Hered., June 26, 2008; (2008) esn052v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. S. Kim, S. T. Ratcliffe, B. W. French, L. Liu, and T. W. Sappington Utility of EST-Derived SSRs as Population Genetics Markers in a Beetle J. Hered., March 1, 2008; 99(2): 112 - 124. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Yatabe, N. C. Kane, C. Scotti-Saintagne, and L. H. Rieseberg Rampant Gene Exchange Across a Strong Reproductive Barrier Between the Annual Sunflowers, Helianthus annuus and H. petiolaris Genetics, April 1, 2007; 175(4): 1883 - 1893. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sampaio, L. Gusmao, A. Correia, C. Alves, A. G. Rodrigues, C. Pina-Vaz, A. Amorim, and C. Pais New Microsatellite Multiplex PCR for Candida albicans Strain Typing Reveals Microevolutionary Changes J. Clin. Microbiol., August 1, 2005; 43(8): 3869 - 3876. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Karaoglu, C. M. Y. Lee, and W. Meyer Survey of Simple Sequence Repeats in Completed Fungal Genomes Mol. Biol. Evol., March 1, 2005; 22(3): 639 - 649. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Perez, F. Cruz, and P. Presa Distribution Properties of Polymononucleotide Repeats in Molluscan Genomes J. Hered., January 1, 2005; 96(1): 40 - 51. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Almeida and C. Penha-Goncalves Long Perfect Dinucleotide Repeats Are Typical of Vertebrates, Show Motif Preferences and Size Convergence Mol. Biol. Evol., July 1, 2004; 21(7): 1226 - 1233. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-C. Li, A. B. Korol, T. Fahima, and E. Nevo Microsatellites Within Genes: Structure, Function, and Evolution Mol. Biol. Evol., June 1, 2004; 21(6): 991 - 1007. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sampaio, L. Gusmao, C. Alves, C. Pina-Vaz, A. Amorim, and C. Pais Highly Polymorphic Microsatellite for Identification of Candida albicans Strains J. Clin. Microbiol., February 1, 2003; 41(2): 552 - 557. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Borstnik and D. Pumpernik Tandem Repeats in Protein Coding Regions of Primate Genes Genome Res., June 1, 2002; 12(6): 909 - 915. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Metzgar, L. Liu, C. Hansen, K. Dybvig, and C. Wills Domain-Level Differences in Microsatellite Distribution and Content Result from Different Relative Rates of Insertion and Deletion Mutations Genome Res., March 1, 2002; 12(3): 408 - 413. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. K. Chang, D. Metzgar, C. Wills, and C. R. Boland Microsatellites in the Eukaryotic DNA Mismatch Repair Genes as Modulators of Evolutionary Mutation Rate Genome Res., July 1, 2001; 11(7): 1145 - 1146. [Full Text] [PDF] |
||||
![]() |
F. W. VAN LEEUWEN, E. M. HOL, R. W. H. HERMANUSSEN, M. A. F. SONNEMANS, E. MORAAL, D. F. FISCHER, D. A. P. EVANS, K.-F. CHOOI, J. P. H. BURBACH, and D. MURPHY Molecular misreading in non-neuronal cells FASEB J, August 1, 2000; 14(11): 1595 - 1602. [Abstract] [Full Text] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||