|
|
|
|
Vol. 9, Issue 9, 878-887, September 1999
RESOURCE
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
To facilitate whole-genome scan experiments, we selected a panel of 128 microsatellite markers on the basis of spacing and polymorphism in the strains DBA/2, BALB/c, AKR, C57BL/6, C57BL/10, A/J, C3H, 129/J, SJL/J, JF1, and PWB. Many of the primer pairs were redesigned for better performance. The last four strains were not characterized previously using these markers. JF1 and PWB are particularly interesting for intersubspecific crosses offering high polymorphism. We provide allele size data for the markers on these strains and add them to the emerging radiation hybrid framework map, which is not continuous except for chromosome 17 and 13. Information on the interrelationships of strains is useful both because of the importance of polymorphism in designing crosses and the background in assessing phenotypes. Microsatellites offer a widely dispersed, selectively neutral set of characters that lends itself conceptually to parsimony methods of analysis. The microsatellite allele size data were recoded as binary discrete characters in such a way that adjacent sizes differ by one step. Trees were generated using a Wagner parsimony method. As expected, the non-Mus domesticus strains, PWB (musculus) and JF1 (molossinus), are excluded from the domesticus strains. Among the domesticus strains, C57BL/6 and C57BL/10 (derived from the same founding pair) form a strongly supported group, as do C3H, A/J, and BALB/c (derived from the Bagg albino stock). No unique branching order for SJL/J, AKR, and DBA/2 is strongly supported, which may reflect a complicated history. Strain 129/J is clearly placed as the most deeply diverged of the domesticus strains represented.
| |
INTRODUCTION |
|---|
|
|
|---|
To map quantitative trait loci it is necessary to
test the genotypes of cross progeny with a panel of
polymorphic markers spaced at 10- to 20-cM intervals. For example, with
52 backcross animals, linkage across 15 cM can be detected at 95%
confidence. Therefore, in principle, a set of markers no more than 15 cM from chromosome ends and 30 cM from each other would suffice (Silver 1995
). Although there are now many microsatellite markers available (primarily MIT markers from Dietrich et al. 1996
) and the average density is about four markers/cM, it is still often difficult to find
sufficient markers with differing alleles for a particular pair of
inbred strains. This is especially true if these are both derived from
Mus musculus domesticus. In addition, although the quality of
published microsatellite markers is good, most of them are
mass-produced and have not been extensively characterized, and some
primers are not robust. For the Dietrich et al. (1996)
markers, allele
size data are available for 12 strains, including a Mus
spretus and a Mus musculus castaneus strain. However, for any given pair of strains it is a substantial job to identify a
suitable marker panel and verify that these are detectably polymorphic.
We have selected a set of highly polymorphic markers for the Mus
musculus strains DBA/2, BALB/c, AKR, C57BL/6, A/J, and C3H (typed
by Dietrich et al. 1996
), redesigned many of the primers, and tested
them on these strains, as well as C57BL/10, SJL/J, 129/J, JF1, and PWB.
PWB is an inbred strain derived from a wild-derived Czech Mus
musculus musculus isolate (Forejt 1991
). JF1 is a recently established inbred strain of Japanese Mus musculus molossinus origin (a natural hybrid of M. m. musculus and M. m.
castaneus; Bonhomme and Guénet 1996
), which should also
differ at many loci from the standard strains (Koide et al. 1998
). The
remaining strains are classic inbred strains that are quite closely
related (Silver 1995
). 129/J in particular is a widely used strain for
which allele size data are not provided in the MIT data set. Data for
209 simple sequence-length polymorphisms (SSLPs) on strain 129 were
published recently (Matin et al. 1998
), only 2 of which are in common
with the work reported here.
It would be useful to have a clear picture of the degrees of
relatedness of mouse strains to each other. The breeding histories of
the classical inbred strains are to some extent known (Festing 1996
),
but there are little data on the outbred progenitor populations. The
relationships of the inbred strains of mice were investigated using
parsimony by Atchley and Fitch (1991)
, employing classical genotypes at
144 loci over the whole genome. At each locus, possession by two
strains of the same-sized allele indicates probable similarity by
descent of that sequence. Using many loci spread over the genome, it
should be possible to reconstruct a phylogeny of the genome as a whole,
at least to the extent that the history of these strains can be
represented by a tree (i.e., no recent interbreeding, which would give
a net). Atchley and Fitch (1991)
obtained a tree that reflected the
groups of strains known to be closely related: the C57 and C58 strains,
and the Bagg albino descendants BALB/c, A, and C3H. A somewhat deeper
clade united many of the others, including SJL, DBA, and AKR. The main
findings were reproduced by a subset of the data identified as
"protein loci" but not by smaller subsets of "immune" and
"viral." One discrepancy between the overall and protein trees is
the position of strain 129, which is a specific relative of the C57 and
C58 group in the former and of the SJL/DBA/AKR group in the latter.
Radiation hybrid (RH) panels are an attractive tool for mapping
arbitrary sequences (such as ESTs) without the need for polymorphism and high resolution, and have been important in making a very gene-rich
human map (Deloukas et al. 1998
). The T31 panel has been characterized
sufficiently to indicate that it will be a useful mapping tool. Typings
of 472 markers on the panel are available from the public database
(http://www.jax.org/resources/documents/cmdata/rhmap/rhdata.html). This
database includes the initial characterization (McCarthy et al. 1997
)
and detailed maps of chromosome 17 (Schalkwyk et al. 1998
) and
chromosome 13 (Elliott et al. 1999
). However, these data do not provide
a framework suitably dense to allow unknown markers to be mapped with
certainty. We have typed the T31 RH panel (McCarthy et al. 1997
) with
the same panel of markers and thus contributed to an emerging map
framework for this panel.
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
The panel of microsatellite markers was selected using published
genetic positions and allele sizes (Breen et al. 1994
; Dietrich et al.
1996
) with two objectives: maximizing polymorphism in our chosen list
of strains, and approximating a 10-cM spacing. This resulted in a
preliminary list of 148 markers, 114 of which were selected. The
remaining markers did not produce the expected PCR performance,
polymorphism, or position. Another 14 markers were selected to patch
the resulting gaps. Primer sequences were inspected for potential
problems and redesigned where possible to eliminate cases where the
3' end contained one or more CA dinucleotide sequences, which
experience indicates leads to stuttering artifacts. Other primer
sequences were redesigned to adjust the sizes of the products to
facilitate multiplexing of the markers within chromosomes.
Mouse Strain Panel Data
The 11 strains tested here (allele size data presented in Table
1) give rise to 55 pairs of strains that could be considered for a cross. For different
pairs, the present work provides between 31 and 120 polymorphic markers
(Table 2)
in many cases, a nearly complete, tested panel of
markers. The allele size data, as well as lists of
polymorphic markers for each possible pair, are available at
http://www.mpimg-berlin-dahlem.mpg.de/~rodent/bin/polymarkerleo.cgi. These markers were also tested on M. spretus strain SMZ. The
animal tested was heterozygous at 32 of the loci (data not shown).
|
|
Allele sizes for seven of the strains were also estimated by Dietrich
et al. (1996)
, using conventional sequencing acrylamide gels on which
32P kinase-labeled PCR products were separated. This method,
using size standards run separately, is likely to be less accurate for absolute sizes than the fluorescent labeling and internal standard method used here. Nonetheless, relative fragment sizes should be
robust. Taking this into account, there is agreement in allele sizes.
There is perfect agreement for only 4 markers, and in 28 other cases
the relative sizes of the alleles agree, but the absolute sizes differ.
In 16 cases, one allele size disagrees, and in 13 cases two allele
sizes disagree. These categories are indicated in the last column of
Table 1 and are adjusted in those cases where redesigned primers change
the expected product size. In the cases of C57BL/6, C57BL/10, AKR, and
A/J, some fraction of the differences may be attributed to mutations occurring
since the divergence of the different sublines used in the two studies.
The number of polymorphic markers follows the known relatedness of the
strains in question (Table 2). The highest numbers come from the pairs
involving PWB and JF1, reflecting the divisions between the three
subspecies musculus (PWB), molossinus (JF1), and
domesticus (the remaining strains). At the opposite extreme are the highly similar strains C57BL/6 and C57BL/10 (both derived from
the same founding pair) and the group BALB/c, C3H, A/J, which are all
derived from the Bagg Albino stock (Festing 1996
).
Microsatellite Tree
Because inbred mouse strains are derived from a common heritage in the M. musculus complex species and have not been allowed to interbreed, it is plausible to treat them as if they were species and it is interesting to determine whether their history can be reconstructed. One important issue for which this information gives some insight is the effect of genetic background in assessing phenotypes. Microsatellites are a dispersed set of selectively neutral, highly variable samples from across the genome. Although collected for a different purpose they offer an outstanding source of information about strain histories. Given sufficient data, the histories of specific portions of the genome could probably be reconstructed, which would not necessarily be the same as the whole-genome average derived here.
Microsatellite data lend itself conceptually to parsimony analysis
because alleles differ by discrete steps, and in those cases where it
has been observed directly, they mutate predominantly by gain or loss
of one repeat unit (Weber and Wong 1993
; Amos et al. 1996
; Primmer et
al. 1998
). Compared to methods based on a distance matrix, parsimony
allows more information to be considered in the analysis (specific
mutation steps rather than simple proportion of alleles of the same size).
To do the parsimony analysis with PHYLIP 3.5c (Felsenstein 1989
) which
accepts only binary discrete characters, the microsatellite allele size
data were recoded as n
1 binary discrete characters, where n is the number of alleles (ranging from 2 to 9), in
such a way that adjacent sizes differ by one step. This gives an order to the mutational changes between alleles but does not presuppose that
a particular allele is ancestral. Thus, for example, if a primer pair
gives products sized 102, 104, and 108 bp on our panel of strains, the
alleles would be coded using two bits: 00, 01, and 11, respectively.
This yields a vector (binary string) of length 520 for each strain.
Wagner parsimony analysis was done with the MIX program of PHYLIP
(Felsenstein 1989
). We tested the robustness of the results by running
the analysis on 100 different subsamples of the data (jackknife
replicates generated with seqboot), and the consensus tree is shown in
Figure 1. The Wagner parsimony method does not assume
that the ancestral state is known or that mutation in one direction is
more probable than the other. It does, however, assume that species and
characters both evolve independently, which perhaps cannot be said of
multiple characters (bits) derived from a single microsatellite.
Nonetheless, the tree derived is largely congruent with expectations
from the known history of the strains and with the analysis of Atchley
and Fitch (1991)
of 144 single locus genotypes. Most of the branches
are highly robust as indicated by jackknife resampling, showing that the observed branching order is not the result of a subset of the
recoded data. As expected, the non-domesticus strains, PWB (musculus) and JF1 (molossinus), are distant from
each other and from the remaining domesticus strains. Among
the domesticus strains, C57BL/6 and C57BL/10 (derived from the
same founding pair) form a strongly supported group, as do C3H, A/J,
and BALB/c. No unique branching order for SJL/J, AKR, and DBA/2 is
supported strongly, which may reflect a more complicated history.
Strain 129/J is clearly placed as the most deeply diverged of the
domesticus strains represented. In fact these results provide
no evidence that 129/J belongs to the M.m. domesticus group at
all, as the tree is unrooted. This nominates 129/J as a good partner for any
of the other domesticus strains in terms of polymorphism rate.
|
RH Data
In Table 3, RH linkage, lod score, and distance estimates are
presented for each marker relative to the nearest markers in the public
RH data, along with the positions of the markers on the genetic map.
The high resolution of RH mapping is a consequence of
the high break frequency and also means that linkage can be clearly
demonstrated only over short distances. Although lod scores are
calculated for RH mapping in a way similar to that used for genetic
mapping, the data obtained for chromosomes 17 and 13 suggests that high
lod scores for linkage are also obtained between markers on different
chromosomes. Based on this information, a practical lower linkage lod
score limit for placement of unknown markers would be
10. A
framework having each marker linked to the next with a lod score of 10 would thus be desirable and would require a total of
1000 mapped
markers. Here we contribute 128 additional markers to this emerging
framework and subdivide many of the larger intervals in the McCarthy et
al. (1997)
data. If independent chromosome assignment is available, the
current framework can be used readily to assign markers to the
intervals already defined, which correspond to <1 cM for chromosomes
17 and 13 and ~10 cM for the rest of the genome.
|
| |
METHODS |
|---|
|
|
|---|
Primers
Oligonucleotides were from TIB MOLBIOL (Berlin, Germany). The reverse primer of each pair was 5' labeled with the fluorescent dye indicated in Table 1. Many of the primers were redesigned (72 loci marked with an asterisk in the first column of Table 1; primer sequences available http://www.mpimg-berlin-dahlem.mpg.de/~schalkwy/primer.html). The most common change was the deletion of one or more nucleotides from the 3' end to avoid overlap with the microsatellite sequence and attendant stuttering artifacts.
DNAs
Mouse T31 RH panel DNAs were from Research Genetics (Huntsville, AL). C3H, SJL/J, and 129/J (stock no. 000690) DNAs were from Jackson Laboratories. PWB and C57BL/10 DNAs were a gift from J. Forejt (Czech Academy of Sciences, Prague). JF1 DNA was a gift from M. Hrabé de Angelis (GSF, Munich, Germany). C57BL/6JOlaHsd (hereafter C57BL/6), AKR/OlaHsd, and A/JOlaHsd were from Harlan Winkelmann (Borchen, Germany). BALB/cJ and DBA/2 DNAs were prepared from mice obtained from OLAC (UK).
For the mouse strain panel, PCR assays were done in a MJ tetrad cycler, using 20-sec denaturation, 20-sec annealing at the temperature given in Table 1, and 20-sec extension at 72°C. Samples with different dyes were pooled and products separated on an ABI 377XL DNA sequencer using internal length standards in every lane. Analysis was with Genescan version 3.0 and Genotyper version 2.1 software from ABI.
RH panel assays were performed as in Schalkwyk et al. (1998)
:
Thirty-microliter reactions with 1.5 mM Mg2+ and
6.6 pmoles of each primer were amplified with a 55° touchdown PCR
program. For the few primers that did not give easily detectable products under these conditions, amplification was repeated with 3.0 mM Mg2+. The products were detected on Southern
blots of agarose gels probed with 32P-labeled (CA)20.
For D8Mit7, RH linkage indicates that the locus is actually on chromosome 9. Genetic mapping gives the same result (J. Walter, unpubl.). This fact may have consequences for the placement of other markers on the MIT map because D8Mit7 is identified as a framework marker.
| |
ACKNOWLEDGMENTS |
|---|
Parts of this work were funded by the Max-Planck-Gesellschaft and European Union grants BMH4-CT96-0050 (to J.W.) and CT94-0079 (to H.L.).
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
1 Corresponding author.
Present addresses: 3Ingenium Pharmaceuticals AG, D-82152 Martinsried, Germany; 5Genome Pharmaceuticals Corporation AG (GPC AG), Martinsried, Germany.
E-MAIL schalkwy{at}mpimg-berlin-dahlem.mpg.de; FAX 49 30 8413 1380.
| |
REFERENCES |
|---|
|
|
|---|
A facility for ordering markers to 0.1 cM resolution.
Hum. Mol. Genet.
3:
621-627
Phylogeny inference package (v. 3.2).
Cladistics
5:
164-166.Received May 12, 1999; accepted in revised form July 8, 1999.
This article has been cited by other articles:
![]() |
F. Y. Ideraabdullah, E. de la Casa-Esperon, T. A. Bell, D. A. Detwiler, T. Magnuson, C. Sapienza, and F. P.-M. de Villena Genetic and Haplotype Diversity Among Wild-Derived Mouse Inbred Strains Genome Res., October 1, 2004; 14(10a): 1880 - 1887. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Petkov, Y. Ding, M. A. Cassell, W. Zhang, G. Wagner, E. E. Sargent, S. Asquith, V. Crew, K. A. Johnson, P. Robinson, et al. An Efficient SNP System for Mouse Genome Scanning and Elucidating Strain Relationships Genome Res., September 1, 2004; 14(9): 1806 - 1811. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Pemberton, P. A. Knight, J. Gamble, W. H. Colledge, J.-K. Lee, M. Pierce, and H. R. P. Miller Innate BALB/c Enteric Epithelial Responses to Trichinella spiralis: Inducible Expression of a Novel Goblet Cell Lectin, Intelectin-2, and Its Natural Deletion in C57BL/10 Mice J. Immunol., August 1, 2004; 173(3): 1894 - 1901. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. W. E. Meyer, D. Korthaus, W. Jagla, E. Cornali, J. Grosse, H. Fuchs, M. Klingenspor, S. Roemheld, M. Tschop, G. Heldmaier, et al. A Novel Missense Mutation in the Mouse Growth Hormone Gene Causes Semidominant Dwarfism, Hyperghrelinemia, and Obesity Endocrinology, May 1, 2004; 145(5): 2531 - 2541. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. Witmer, K. F. Doheny, M. K. Adams, C. D. Boehm, J. S. Dizon, J. L. Goldstein, T. M. Templeton, A. M. Wheaton, P. N. Dong, E. W. Pugh, et al. The Development of a Highly Informative Mouse Simple Sequence Length Polymorphism (SSLP) Marker Set and Construction of a Mouse Family Tree Using Parsimony Analysis Genome Res., March 1, 2003; 13(3): 485 - 491. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Yoshikawa, A. Watanabe, Y. Ishitsuka, A. Nakaya, and N. Nakatani Identification of Multiple Genetic Loci Linked to the Propensity for ""Behavioral Despair"" in Mice Genome Res., March 1, 2002; 12(3): 357 - 366. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Makrigiannis, A. T. Pau, A. Saleh, R. Winkler-Pickett, J. R. Ortaldo, and S. K. Anderson Class I MHC-Binding Characteristics of the 129/J Ly49 Repertoire J. Immunol., April 15, 2001; 166(8): 5034 - 5043. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Valenza-Schaerly, B. Pickard, J. Walter, M. Jung, L. Pourcel, W. Reik, D. Gauguier, G. Vergnaud, and C. Pourcel A Dominant Modifier of Transgene Methylation Is Mapped by QTL Analysis to Mouse Chromosome 13 Genome Res., March 1, 2001; 11(3): 382 - 388. [Abstract] [Full Text] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||