Genome Research cityscape

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ma, C.-X.
Right arrow Articles by Wu, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ma, C.-X.
Right arrow Articles by Wu, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 12, 1974-1981, December 2002

METHODS
A Unified Framework for Mapping Quantitative Trait Loci in Bivalent Tetraploids Using Single-dose Restriction Fragments: A Case Study from Alfalfa

Chang-Xing Ma,1 George Casella,1 Zuo-Jun Shen,2 Thomas C. Osborn,3 and Rongling Wu1,3

1 Department of Statistics, 2 Department of Industrial and Systems Engineering, University of Florida, Gainesville, Florida 32611, USA; 3 Department of Agronomy, University of Wisconsin, Madison, Wisconsin 53706, USA


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
METHODS
REFERENCES

The development of statistical methodologies for quantitative trait locus (QTL) mapping in polyploids is complicated by complex polysomic inheritance. In this article, we propose a statistical method for mapping QTL in tetraploids undergoing bivalent formation at meiosis by using single-dose restriction fragments. Our method is based on a unified framework, one that uses chromosome bivalent pairing configuration and gametic recombination to discern different mechanisms of gamete formation. Our bivalent polyploid model can not only provide a simultaneous estimation of the linkage and chromosome pairing configuration---a cytological parameter of evolutionary and systematic interest---but also enhances the precision of estimating QTL effects and position by correctly characterizing gene segregation during polyploid meiosis. By using our method and a linkage map constructed in a previous study, we successfully identify several QTL affecting winter hardiness in bivalent tetraploid alfalfa. Moreover, our results reveal significant preferential chromosome pairing at meiosis in an F1 hybrid population, which indicates the importance of reassessing the traditional view of random chromosome segregation in alfalfa.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
METHODS
REFERENCES

Statistical strategies and techniques for genomic mapping are well developed for diploid species (Lander and Botstein 1989; Wu 1999) but are lagging in the more complex polyploids. Polyploids include many important agricultural crops such as alfalfa, potato, and sugarcane (Zeven 1979; Averett 1980; Hilu 1993) and are recognized to play a pivotal role in the evolution of flowering plants (Ramsey and Schemske 1998; Ronfort et al. 1998; Otto and Whitton 2000; Soltis and Soltis 2000). The genomic mapping of polyploids, in which the genome number is higher than two, is complicated for many factors, such as: (1) uncertainty about the genotype-phenotype correspondence owing to unknown ploidy level, unknown number of gene copies (known as the dosage; Burner 1997), and unknown allelic configuration (Luo et al. 2001); (2) complex pairing behaviors undergoing gamete formation during meiosis (Bever and Felber 1992); (3) heterozygous genome structures resulting from predominantly outcrossing mating systems (Soltis and Soltis 2000); and (4) increased allelic and nonallelic combinations because of the increased number of chromosomes in the homologous set (Kempthorne 1957). The first three factors make it difficult to predict the pattern of gene segregation in a progeny family from its parental genotypes (Grivet et al. 1996; Ming et al. 1998; 2001), whereas the fourth factor leads to an exponential increase of unknown parameters, thus reducing the efficacy of the underlying model. All of them must influence the estimation of genetic parameters, including the recombination fraction and gene effects of quantitative trait loci (QTLs) on phenotypes, which thus deserve an in-depth exploration and should be incorporated into the framework of polyploid genome mapping.

We will first formulate statistical models for QTL mapping in polyploids by specifically considering different gamete formation mechanisms (factor 2). Models for incorporating the other factors will be proposed subsequently. Unlike the other factors, gamete formation mechanisms are polyploid dependent. Polyploids are traditionally classified either as allopolyploids derived from distinct genomes or as autopolyploids from genetically similar genomes (Bever and Felber 1992). But from a viewpoint of meiotic configurations, the nature of polyploids can be better described by bivalent polyploids and multivalent polyploids (R. Wu et al. 2001; S. Wu et al. 2001). In bivalent polyploids, only two chromosomes pair during meiosis at a time so that each bivalent pair contributes one chromosome to the chromosomal set in each gamete. In contrast, in multivalent polyploids, multiple chromosomes pair simultaneously, during which a gamete is formed owing to a free combination of all chromosomes in the set. Different chromosome pairing mechanisms make these two groups of polyploids different from one another in gene segregation (R. Wu et al. 2001).

In this article, we propose a new statistical method for mapping QTL in bivalent polyploids. Currently, there are only three papers that address QTL mapping methodologies for bivalent polyploids (Doerge and Craig 2000; Xie and Xu 2000; Hackett et al. 2001). As noted by Hackett (2001), the statistical model of Xie and Xu (2000) was not based on a proper biological model of polyploid meiosis. The other two papers also have limits in theory and applications. Doerge and Craig (2000) assumed preferential pairings; that is, pairings occur strictly between the same chromosomes in the set. In the paper by Hackett et al. (2001), random chromosome pairings are assumed; that is, all chromosomes have an equal opportunity to pair with one another. These two assumptions help to simplify the model derivations but may not reflect biological reality. In real life, there are a number of intermediate types between these two assumptions (Allendorf and Danzmann 1997; Fjellstrom et al. 2001), in which the probability of pairings may be higher between more similar chromosomes than between less similar chromosomes. Such a difference of pairing probability is described by the preferential pairing factor (Sygenba 1994, 1995).

In our statistical model for QTL mapping in bivalent polyploids, the preferential pairing factor specifying bivalent pairing behaviors is incorporated. To facilitate our analysis, we focus on the performance and robustness of the bivalent polyploid model built on single-dose restriction fragments (simplex). Simplex markers, as used for QTL mapping, have two major advantages: (1) they are economically cheap and readily characterized, and (2) they are abundant in many polyploids. For example, simplex markers represent 70% of the detectable polymorphic loci resulting from the segregation of alleles of different dosages (Da Silva 1993). The statistical aspects of linkage analysis in polyploids based on simplex markers have been discussed by Wu et al. (1992), Hackett et al. (1998), Ripol et al. (1999), and Skinner et al. (2000). Here, we explore the influences of gamete formation mechanisms on polyploid linkage mapping by using the simplex markers. Our new mapping model incorporating preferential chromosome pairings will be validated by a case study in autotetraploid alfalfa.

Alfalfa, as one of the most important perennial forage crops in the world, offers an excellent model system for testing our theoretical model for QTL mapping in bivalent polyploids. First, chromosomes in alfalfa predominantly pair as bivalents, but display polysomic inheritance owing to its autopolyploid nature (Bingham and McCoy 1988). Earlier studies all assume that chromosome segregation in alfalfa is random (Yu and Pauls 1993). This assumption is likely violated when a genome analysis is based on an F1 hybrid progeny derived from two different species or populations. Second, alfalfa has diploid relatives; thus, results between polyploid alfalfa and its diploid relatives can be compared. Third, a few genetic linkage maps of molecular markers have been constructed in alfalfa (Yu and Pauls 1993; Brouwer and Osborn 1999; Diwan et al. 2000), providing a foundation for the genetic analysis of complex traits and marker-assisted selection.


    RESULTS
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
METHODS
REFERENCES

We derive theoretical models for mapping QTLs in bivalent tetraploids using single-dose restriction fragments (simplex) by incorporating the preferential pairing factor defined to describe bivalent chromosome behavior (Sybenga 1994, 1995, 1996). These models are then applied to map QTLs affecting winter hardiness traits in alfalfa. The statistical methods for estimating the linkage, preferential pairing factor and QTL effects are presented in the Methods section.

The statistical model proposed in this article is used to map QTLs affecting winter hardiness based on simplex markers in a published data set of tetraploid alfalfa (Brouwer and Osborn 1999). Alfalfa is regarded as an autotetraploid, in which bivalent pairings are a predominate process during meiosis (Bingham and McCoy 1988). Earlier linkage analyses assumed random chromosome segregations, although this may deviate from biological reality (Yu and Pauls 1993; Brouwer and Osborn 1999). This assumption will be relaxed in our analysis by providing a direct estimate of the preferential pairing factor denoted as p. According to Sybenga (1994), p is defined as two-thirds of the difference between the pairing frequencies of more similar chromosomes and of less similar chromosomes, plus a constant one-third. Thus, when p = <FR><NU>2</NU><DE>3</DE></FR>, less similar chromosomes do not pair; that is, chromosomal pairings happen strictly between the homologs. When p = 0, all the four chromosomes are homologous, and they will pair randomly. The value of p theoretically ranges from 0 to <FR><NU>2</NU><DE>3</DE></FR>.

Two contrasting tetraploid plants, winter-hardy Blazer XL (B17) and winter-sensitive Peruvian 13 (P13), were crossed to generate an F1 hybrid population, which was then backcrossed to each parent (Brouwer and Osborn 1999). Each backcross obtains 101 progenies used for mapping. Two hardiness traits, freezing injury measured by electrical conductivity and winter injury, were measured in two successive years (Brouwer et al. 2000). Because the two original parents are not pure inbred lines, the two-way backcrosses virtually present a full-sib family in which many different marker types may be segregating (Wu et al. 2002). Brouwer and Osborn (1999) used 82 testcross (pseudo-test backcross) markers derived from single-dose restriction fragment length polymorphisms to construct two genetic linkage maps for each backcross population. In total, four homologous coupling-phase cosegregation groups, of which two were derived from the backcross to B17 (A and B) and the other two from the backcross to P13 (C and D), were detected for seven of the eight linkage groups. In a previous regression analysis, Brouwer et al. (2000) found that there was a higher probability of detecting significant QTLs for winter hardiness on cosegregation groups A and B than on C and D. Thus, groups A and B are used as an example to test and validate our statistical method for mapping QTLs affecting complex traits in alfalfa. The QTLs mapped are statistically tested on the basis of a critical threshold value at the significance level 5% calculated from 200 permutation tests (Churchill and Doerge 1994).

By using our newly developed method, we successfully detect five and four significant QTLs responsible for freezing injury and winter injury, respectively, in the backcross to B17. The detection of these QTLs was based on a largest likelihood value at a particular preferential pairing factor under a most likely marker-QTL linkage phase. Tables 1 and 2 give the estimates of the QTL chromosomal locations and allelic effects on the two injury traits. We present an example of the detection of the QTL for each trait, in which the peaks of the profiles of the log-likelihood ratio test statistics correspond to a likely position of the QTL detected (Fig. 1).

                              
View this table:
[in this window]
[in a new window]
 
Table 1.   The Locations and Effects of QTL Affecting Freezing Injuries Measured by Electrical Conductivity in Two Successive Years (1995 and 1996) for Bivalent Tetraploida Alfalfa



                              
View this table:
[in this window]
[in a new window]
 
Table 2.   The Locations and Effects of QTL Affecting Winter Injuries in Two Successive Years (1996 and 1997) for Bivalent Tetraploid Alfalfa


View larger version (21K):
[in this window]
[in a new window]
 
Figure 1   The profiles of the likelihood ratio test statistics across linkage group 8A in bivalent tetraploid alfalfa. Two horizonal lines indicate the thresholds at the significance level 5% calculated from 200 permutation tests (Churchill and Doerge 1994). QTL for freezing injuries (A) measured in 1995 (blue) and 1996 (pink); QTL for winter injuries (B) measured in 1996 (blue) and 1996 (pink).

Of the five QTLs detected for freezing injury, four mapped to F1-specific linkage groups 4A, 5A, 6A and 8A, whereas only one mapped to B17-specific linkage group 5B. For linkage group A, the positive allelic effect of a QTL indicates that parent P13 contributes an increasing allele for injury trait values. Our estimates of positive allelic effects (Table 1) indicate that parents B17 and P13 contribute cold-tolerant and cold-sensitive alleles to their F1 hybrids, respectively, conforming to the biological attributes of these two parents. But the positive allelic effect of a QTL on 5B implies that parent B17 may also contribute cold-sensitive alleles.

According to our estimate, the marker-QTL linkage phase with the largest probability is one for which the presence of the simplex markers (P13 alleles) is in repulsion phase with the QTL allele, leading to smaller trait values and therefore larger hardiness. We found strong evidence for the change of QTL activity over different ages. More significant QTLs were detected in the second year than in first year (Table 1). A same marker interval on linkage group 8A carries a QTL responsible for freezing injury in both years, with an increased LR value for the second year than for first year.

Similar patterns of QTL expression were also observed for winter injury in alfalfa (Table 2). But most of the QTLs detected are different between freezing and winter injuries. Two chromosomal segments on 5A and 8A detected to affect both freezing and winter injuries may contribute to their moderate correlation (Brouwer et al. 2000).

One of the major advantages of our method is that it can estimate the preferential pairing factor during polyploid meiosis. The estimated preferential pairing factor, p = 0.6 (0 <=  p <=  <FR><NU>2</NU><DE>3</DE></FR>), consistently obtained from most marker intervals regardless of the significance of their association with a QTL (Tables 1, 2), indicates that chromosome pairings of tetraploid alfalfa are actually not random.

A Simulation Study

We performed a simulation study to test the performance and robustness of our bivalent polyploid model incorporating the preferential pairing factor. Our interest was to investigate the effects of two major assumptions, completely preferential pairing, as assumed in Doerge and Craig (2000), and random segregation, as assumed in Hackett et al. (2001), on the precision of parameter estimation. We simulated two interval markers and one QTL, determining a normally distributed trait for a pseudo-test backcross population of 200 offspring. The two markers and the QTL are assumed in coupling phase. The two markers are separated 20 cM from each other, between which the QTL is located at 5 cM from the left marker. The Kosambi map function is used to convert the map distance in the corresponding recombination fraction. The QTL is hypothesized to have the additive effect of 0.5 and to explain 20% of the total phenotypic variance. Based on these conditions, a data set of markers and phenotypes are simulated under the assumption of p = 0.33 using the genotype frequencies given in Table 3.

                              
View this table:
[in this window]
[in a new window]
 
Table 3.   Joint Probabilities of Three-Locus Genotypes for a Putative QTL Bracketed by Two Simplex Markers Under Different QTL-Marker Linkage Phases as Given in Expressions 2 Through 4 in a Pseudo-Test Backcross Design.

Three methods are used to analyze the simulated data set, the first being Doerge and Craig's method of assuming completely preferential pairings, the second being Hackett et al.'s method of assuming random segregation, and the third being our method as proposed in this article. Our method takes into account all possible cases of chromosome bivalent pairings by estimating the preferential pairing factor p. The results from our analyses are summarized as follows: (1) Doerge and Criag's method gave the most biased estimates for all QTL and model parameters, although it is computationally fast; (2) Hackett et al.'s method also had significant biases for QTL position and effect estimates (biased by 10% to 20%); and (3) as expected, our method displayed reasonable estimation accuracy and precision for all parameters. An additional important advantage of our method is that it provides a direct estimate of the preferential pairing factor that is of typical interest to evolutionary and systematic biologists.


    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
METHODS
REFERENCES

We have for the first time devised a statistical method for mapping QTLs in recalcitrant polyploids by considering the chromosome pairing mechanism of polyploid meiosis. The pairing mechanisms in polyploids include two types, bivalent and multivalent configurations. In this article, bivalent chromosome pairings are considered. Our bivalent polyploid model based on maximum-likelihood methods can provide not only the estimates of the map position of QTL, its effect, and inheritance mode but also the estimate of the preferential pairing factor (p), a cytological parameter of evolutionary and systematic importance. In addition, our model incorporating bivalent pairing mechanisms can enhance the estimation precision of QTL parameters in polyploids. As demonstrated by a simulation study, greater-bias parameter estimates will be obtained if the preferential pairing factor is not considered, as assumed by Doerge and Craig (2000) and Hackett et al. (2001).

In earlier analyses of alfalfa by Brouwer and Osborn (1999) and Brouwer et al. (2000), random chromosome pairing was assumed. But our current result reveals significant preferential pairings at meiosis in the same material (p = 0.60). Our result can be regarded as being closer to biological reality for three reasons. First, the assumption of random chromosome pairings is obtained from more traditional cytological approaches that may not be accurate enough to make exclusive conclusions (Sybenga 1994). Molecular markers specifying a small chromosomal segment are indicated to have more power of detecting chromosome pairing behaviors at meiosis. Second, our model takes into account the general meiotic property of a polyploid, which can cover random chromosome pairings. As long as a polyploid undergoes random bivalent pairings, they can be diagnosed by our model.

Third and most important, our model has been validated by a real-world example. North American alfalfa cultivars have been bred from nine sources, most of which are categorized as Medicago sativa spp. sativa; however, one is considered a distinct subspecies, M. sativa spp. falcata (Barnes et al. 1988). Although these germ plasm sources have been intermated and selected to derive alfalfa cultivars, the nine original sources have been maintained separately. A previous analysis showed that seven of the nine germ plasm sources were genetically very similar, one M. sativa spp. sativa source (Peruvian) was somewhat distinct, and the M. sativa spp. falcata source was very distinct (Kidwell et al. 1994). Two tetraploid plants, Blazer XL 17 and Peruvian 13, derived from these different sources (Peruvian and Falcata) likely display preferential chromosome segregation behavior because they are genetically distinct from each other.

Because no statistically powerful and biologically relevant approach is available in the current literature, QTL mapping in polyploids was performed by using a regression-based analysis of variance (Brouwer et al. 2000; Ming et al. 2001). Based on the alfalfa mapping material used by Brouwer et al. (2000), we detected several significant QTLs affecting winter hardiness. But only one of the QTLs detected from our newly developed model is consistent with the result from the analysis of variance approach. This is not surprising given that this concordant QTL, located on linkage group A, exhibits a large additive effect. Theoretically, a large QTL can be relatively easily monitored, even by a less powerful approach. Although we should be cautious with the inconsistency of most of the QTLs detected by our method and by analysis of variance, the inherited limits of analysis of variance may give us good reasons to favor our findings. Basically, the marker-associated analysis of variance cannot clearly distinguish between large-sized but distantly localized QTLs and small-sized but closely localized QTLs. Also, it is not easy to incorporate meiotic mechanisms into analysis of variance, another reason that the results from analysis of variance may not well reflect biological reality.

We have devised a powerful statistical method for QTL mapping in tetraploids by using single-dose restriction fragments, but it is crucial to modify this method to other different situations. In this article, we assumed the meiotic mechanism of bivalent pairings. Many species also undergo multivalent formation, from which a particular genetic phenomenon called double reduction results (Darlington 1929; Butruille and Boiteux 2000). Our method can be modified to consider the mechanism of multivalent formation. In addition, the model should be extended to consider double- (duplex) or multiple-dose restriction fragments that are often used in several polyploid studies (Ming et al. 1998, 2001). For dominant duplex markers, at which there are two genotypes segregating 5:1 in a tetraploid pseudo-test backcross, we will need to derive new conditional probabilities of the QTL genotypes to fit segregation patterns of the duplex marker interval. For codominant duplex markers that segregate a 1:4:1 ratio, we will need one more parameter to model the dominant effect of a QTL. We assumed that the markers and the QTL have the same dosage level. But it is possible that simplex markers are linked with a duplex QTL or that a simplex QTL is bracketed by two duplex markers (Skinner et al. 2000). Our analysis is based on the simplest pseudo-test backcross design (Grattapagalia and Sederoff 1994) and should be extended to consider a full-sib polyploid family, in which there may be many more complicated cross types, as shown in Wu et al. (2002). A general model for simultaneously using all different marker types to map QTLs should be developed. Our model integrates the linkage and linkage phase estimation into a unified framework, displaying an advantage that it overcomes the problem owing to poor estimation for the linkage between different markers and QTL in a repulsion phase (see Hackett et al. 1998). Yet, this integration requires more powerful computational algorithms. We are now implementing new algorithms, such as genetic algorithms (Gaspin and Schier 1998), in our linkage analysis model of polyploids. After all of these extensions are developed, we will have more power to tackle complicated problems of QTL mapping resulting from the polysomic inheritance of polyploids.


    METHODS
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
METHODS
REFERENCES

The Mixture Model

A fundamental model for QTL mapping is the statistical mixture model (McLachlan and Peel 2000). In this mixture model, each observation yi is assumed to have arisen from one of n (n possibly unknown but finite) components, each component being modelled by a density from the parametric family f:
p(y<SUB>i</SUB>‖&pgr;, &phgr;, &eegr;) = &pgr;<SUB>1</SUB>f(y<SUB>i</SUB><IT>;</IT> &phgr;<SUB>1</SUB>, &eegr;) + … + &pgr;<SUB>n</SUB>f(y<SUB>i</SUB><IT>;</IT> &phgr;<SUB>n</SUB><IT>, </IT>&eegr;) (1)
where pi  = (pi 1,...,pi n) are the mixture proportions that are constrained to be nonnegative and sum to unity; phi  = (phi 1,...,phi n) are the component specific parameters, with phi j being specific to component j; and eta  is a common parameter which is common to all components.

For the mixture model used in genetic mapping (Lander and Botstein 1989), each component represents a class of QTL genotypes, and thus, the mixture model provides a framework by which observations may be clustered together into different classes of QTL genotypes. The mixture proportions represent the relative frequency of occurrence of each QTL genotype in the population. Within a particular marker genotype, the relative frequency of each QTL genotype is its conditional probability on the marker genotype.

For a pseudo-test backcross tetraploid population, there are two groups of genotypes at a single gene. Thus, the mixture model of polyploids contains two components of QTL genotypes that are predicted by four marker genotypes at a marker interval. The proportions of mixtures pi k present the probabilities of QTL genotypes conditional on marker genotypes, which have been derived in Table 1. As seen from the table, the conditional probabilities contain the information of QTL position. Each mixture is assumed to follow a normal distribution fk(yi), with the expected mean specified by the genotypic value of the corresponding QTL genotype and the common residual variance sigma 2. The genotypic values of the two QTL genotypes are expressed as µ1 = µ + 1/2a for Qqqq and µ1 = µ - 1/2a for qqqq. In quantitative genetics, µ is the overall mean, and a is the additive effect of allele Q, which is the effect of substituting q by Q.

Conditional Probabilities

For species like polyploids, in which it is difficult to generate classical pure inbred lines, we generally use a pseudo-test backcross design, derived from two outcrossing parents, for linkage mapping (Grattapaglia and Sederoff 1994). We are interested in those markers that are heterozygous in one parent but homozygous in the second. For a simplex marker, a 1:1 segregation ratio is expected in an F1 tetraploid hybrid family if one parent is heterozygous (1000), whereas the other is null (0000). Consider two simplex markers for a heterozygous bivalent tetraploid with four chromosomes---labeled by 1, 2, 3, and 4---in a set. If these four chromosomes are completely identical, the allelic configurations of the two simplex markers can be described by a coupling phase or repulsion phase (Hackett et al. 1998). But if these four chromosomes are different, as considered in this article, with chromosome pairs 1 and 2, and 3 and 4 (homologous) being more similar than chromosomes pairs 1 and 3, 2 and 4, 1 and 4, and 2 and 3 (homoeologous), then the repulsion phase of the two simplex markers have two types: (1) homologous repulsion and (2) homoeologous repulsion.

Now, consider a putative QTL for a quantitative trait that is bracketed by the two simplex markers. Two alternative alleles of this QTL, denoted by Q and q, form a genotype Qqqq in the heterozygous parent and qqqq in the homozygous parent. When the two markers are in a coupling phase, we have three different phases between the QTL and markers:
(2)

where the lines denote chromosomes 1, 2, 3, and 4 in order. In Equation 2A1, the QTL and markers are in a coupling phase, whereas in Equations 2A2 and 2A3, the QTL is in a homologous and homoeologous repulsion phase with the markers, respectively. Similar QTL-marker phase types can be detected as
(3)

for the marker homologous repulsion phase, and as
(4)

for the marker homoeologous repulsion phase.

Wu et al. (2002) have given a 6 × 6 gametic probability matrix of two fully informative markers generated by a tetraploid undergoing bivalent pairings. The gametic probabilities are a function of not only the recombination fraction r (as is the case in a diploid population, or has been assumed in previous polyploid mapping studies) but also the preferential pairing factor p. The gamete probability matrix of fully informative markers can be collapsed into a 2 × 2 matrix if both of the markers are simplex. Such a collapsed matrix, however, will have different structures, when different marker linkage phases (Equations 2-4) are considered. When a QTL is tested on the interval of the two fully informative markers, we will have a 36 × 6 matrix for the conditional probabilities of six QTL gamete genotypes on 36 marker gamete genotypes formed by a bivalent tetraploid. Similarly, this full conditional probability can be collapsed into a 4 × 2 matrix when two simplex markers are used to predict a biallelic QTL (Table 3). The structures of the collapsed matrix differ depending on different marker-QTL linkage phases (Equations 2-4; Table 3).

Generally, the linkage phase between two flanking markers is known before they are used to estimate QTL effects and position. Thus, our question for QTL mapping will be reduced to detect a most likely QTL-marker linkage phase from Equation 2 when the two markers are in a coupling phase, from Equation 3 when the two markers are in a homologous repulsion phase, or from Equation 4 when the two markers are in a homoeologous repulsion phase. S. Wu et al. (2001) used Bayes' theorem to characterize the most likely linkage phase based on a separate likelihood analysis of all possible phases. The estimation of the recombination fraction is then based on the most likely linkage phase detected. Using this approach, however, we cannot simultaneously use the information of all linkage phases. Here, all possible linkage phases will be incorporated within an integrated framework of the mixture QTL mapping model.

Assume that the probabilities of the three phases in Equation 2 are denoted by phi1 (A1), phi2 (A2), and phi3 (A3) (phi1 + phi2 + phi3 = 1). Thus, a simple mixture model (Equation 1), as used for regular QTL mapping (Lander and Botestein 1989), is changed into a two-stage hierarchical mixture model that combines the phase probabilities and conditional probabilities of QTL genotypes
(5)

where pi jk is the conditional probability of the kth QTL genotype under linkage phase j (Table 3), k = 1 for QTL genotype Qqqq and k = 2 for QTL genotype qqqq; j = 1,2,3. From Equation 5, the proportions (psi k = phi1pi 1k + phi2pi 2k phi3pi 3k) of two QTL genotypes are the combinations of the conditional probabilities weighted by the phase probabilities phi1 - phi3.

Computational Algorithm

We formulate the EM algorithm (Dempster et al. 1977; Meng and Rubin 1993) to estimate the preferential pairing factor, QTL effects, and position in a full-sib family derived from two outcrossing tetraploids. The likelihood of the phenotypes (y) for N offspring in the full-sib family is expressed as
L(&OHgr;) = <LIM><OP>∑</OP><LL>i=1</LL><UL>N</UL></LIM>[&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)], (6)

where Omega  = (µ, a, r1 or r2, sigma 2, p, phi1, phi2) is the vector of unknown parameters containing the overall mean, QTL effects, QTL position, residual variance, the preferential pairing factor and the phase probabilities. The log-likelihood is given by
<UP>log</UP> L(&OHgr;) = <LIM><OP>∑</OP><LL>i=1</LL><UL>N</UL></LIM><UP>log</UP>[&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)], (7)

with derivatives for each unknown Omega m:
<FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR><UP> log </UP>L(&OHgr;) = <LIM><OP>∑</OP><LL>i=1</LL><UL>N</UL></LIM><FENCE><FR><NU>&psgr;<SUB>1</SUB><FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR>f<SUB>1</SUB>(y<SUB>i</SUB>)</NU><DE>&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)</DE></FR> + <FR><NU>&psgr;<SUB>2</SUB><FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR>f<SUB>2</SUB>(y<SUB>i</SUB>)</NU><DE>&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)</DE></FR></FENCE> =

 <LIM><OP>∑</OP><LL>i=1</LL><UL>N</UL></LIM><FENCE><FR><NU>&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>)</NU><DE>&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)</DE></FR> <FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR> <UP>log</UP> f<SUB>1</SUB>(y<SUB>i</SUB>) + <FR><NU>&psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)</NU><DE>&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)</DE></FR> <FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR> <UP>log</UP> f<SUB>2</SUB>(y<SUB>i</SUB>)</FENCE> =

 <LIM><OP>∑</OP><LL>i=1</LL><UL>N</UL></LIM><FENCE>&PSgr;<SUB>1i</SUB> <FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR> <UP>log</UP> f<SUB>1</SUB>(y<SUB>i</SUB>) + &PSgr;<SUB>2i</SUB> <FR><NU>∂</NU><DE>∂&OHgr;<SUB>m</SUB></DE></FR> <UP>log</UP> f<SUB>2</SUB>(y<SUB>i</SUB>)</FENCE>

where we define
&PSgr;<SUB>ki</SUB> = <FR><NU>&psgr;<SUB>k</SUB>f<SUB>k</SUB>(y<SUB>i</SUB>)</NU><DE>&psgr;<SUB>1</SUB>f<SUB>1</SUB>(y<SUB>i</SUB>) + &psgr;<SUB>2</SUB>f<SUB>2</SUB>(y<SUB>i</SUB>)</DE></FR>, (8)

which could be thought of as a posterior probability that progeny i have QTL genotype k. We then implement the EM algorithm with the expanded parameter set {Omega , Psi }, where Psi  = {Psi k, k = 1, 2}. Conditional on Psi , we solve for the zeros of (partial /partial Omega m) log L(Omega ) to get our estimates of Omega  (the M step). The estimates are then used to update Psi  (the E step), and the process is repeated until convergence. The values at convergence are the MLEs.

Unlike the treatment of characterizing a most likely linkage phase by Wu et al. (2002), we implement additional parameters, phase probabilities, within our estimation model. Because it is difficult to derive the maximum likelihood estimators from the mixture model (5) of the phase probabilities phi's, preferential pairing factor p and recombination fraction r1 or r2, a grid approach is used to obtain their MLEs by taking all of their possible values. For phi's, we increase them by every 0.1 from the range 0-1 under the constraint phi1 + phi2 + phi3 = 1. The values of phi's that lead to a maximum likelihood are regarded as their MLEs. Similarly, the MLE of p is estimated by increasing it by every 0.05 in the range from 0 to <FR><NU>2</NU><DE>3</DE></FR>. By moving the assumed position of the QTL every 0.05 cM within a marker interval, the MLE of QTL position is estimated. The program that implements the proposed method can be obtained from http: //www.ifasstat.ufl.edu/genetics/alfalfa.html.


    ACKNOWLEDGMENTS

We thank Professors Sarah Otto and J. Sybenga for clarifying some ambiguities about the biological process of polyploid meiosis. This work is partially supported by an Outstanding Young Investigator Award (30128017) of the National Natural Science Foundation of China and the University of Florida Research Opportunity Fund (02050259) to R.W. The publication of this manuscript is approved as a Journal Series No. R-08796 by the Florida Agricultural Experiment Station.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.


    FOOTNOTES

3 Corresponding author.

E-MAIL rwu{at}stat.ufl.edu; FAX (352) 392-8555.

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.320202.


    REFERENCES
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
METHODS
REFERENCES

  • Allendorf, F.W. and Danzmann, R.G. 1997. Secondary tetrasomic segregation of MDH-B and preferential pairing of homeologs in rainbow trout. Genetics 145: 1083-1092[Abstract].
  • Averett, J.E. 1980. Polyploidy in plant taxa: Summary. In Polyploidy: Biological relevance (ed. W.H. Lewis), pp. 269-273. Plenum Press, New York, NY.
  • Barnes, D.K., Goplen, B.P., and Baylor, J.E. 1988. Highlights in the USA and Canada. In Alfalfa and Alfalfa improvement (ed. A.A. Hanson et al.), pp. 1-24. ASA, CSSA and SSSA, Madison, WI.
  • Bever, J.D. and Felber, F. 1992. The theoretical population genetics of autopolyploidy. Oxford Surv. Evol. Biol. 8: 185-217.
  • Bingham, E.T. and McCoy, T.J. 1988. Cytology and cytogenetics of alfalfa. In Alfalfa and Alfalfa improvement (ed. A.A. Hanson et al.), pp. 737-776. ASA, CSSA and SSSA, Madison, WI.
  • Brouwer, D.J. and Osborn, T.C. 1999. A molecular marker linkage map of tetraploid alfalfa (Medicago sativa L.). Theor. Appl. Genet. 99: 1194-1200[CrossRef].
  • Brouwer, D.J., Duke, S.H., and Osborn, T.C. 2000. Mapping genetic factors associated with winter hardiness, fall growth, and freezing injury in autotetraploid alfalfa. Crop Sci. 40: 1387-1396[Abstract/Free Full Text].
  • Burner, D.M. 1997. Chromosome transmission and meiotic behavior in various sugarcane crosses. J. Am. Soc. Sugarcane Technologists 17: 38-50.
  • Butruille, D.V. and Boiteux, L.S. 2000. Selection-mutation balance in polysomic tetraploids: Impact of double reduction and gametophytic selection on the frequency and subchromosomal localization of deleterious mutations. Proc. Natl. Acad. Sci. 97: 6608-6613[Abstract/Free Full Text].
  • Churchill, G.A. and Doerge, R.W. 1994. Empirical threshold values for quantitative trait mapping. Genetics 138: 963-971[Abstract].
  • Darlington, C.D. 1929. Chromosome behaviour and structural hybridity in the Tradescantiae. J. Genet. 21: 207-286.
  • da Silva, J. 1993. "A methodology for genome mapping of autopolyploids and its application to sugarcane Saccharum spp.)," Ph.D. dissertation Cornell University, Ithaca, NY.
  • Dempster, A.P., Laird, N.M., and Rubin, D.B. 1977. Maximum likelihood from incomplete data via EM algorithm. J. R. Stat. Soc. Ser. B 39: 1-38.
  • Diwan, N., Bouton, J.H., Kochert, G., and Cregan, P.B. 2000. Mapping of simple sequence repeat (SSR) DNA markers in diploid and tetraploid alfalfa. Theor. Appl. Genet. 101: 165-172[CrossRef].
  • Doerge, R.W. and Craig, B.A. 2000. Model selection for quantitative trait locus analysis in polyploids. Proc. Natl. Acad. Sci. 97: 7951-7956[Abstract/Free Full Text].
  • Fjellstrom, R.G., Beuselinck, P.R., and Steiner, J.J. 2001. RFLP marker analysis supports tetrasomic inheritance in Lotus corniculatus L. Theor. Appl. Genet. 102: 718-725[CrossRef].
  • Gaspin, C. and Schier, T. 1998. Genetic algorithms for genetic mapping. Lect. Notes Comput. Sci. 1363: 145-155.
  • Grattapaglia, D. and Sederoff, R. 1994. Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: Mapping strategy and RAPD markers. Genetics 137: 1121-1137[Abstract].
  • Grivet, L., D'Hont, A., Roques, D., Feldmann, P., Lanaud, C., and Glaszmann, J.C. 1996. RFLP mapping in cultivated sugarcane (Saccharum spp): Genome organization in a highly polyploid and aneuploid interspecific hybrid. Genetics 142: 987-1000[Abstract].
  • Hackett, C.A. 2001. A comment on Xie and Xu: Mapping quantitative trait loci in tetraploid species. Genet. Res. 78: 187-189[Medline].
  • Hackett, C.A., Bradshaw, J.E., Meyer, R.C., McNicol, J.W., Milbourne, D., and Waugh, R. 1998. Linkage analysis in tetraploid species: A simulation study. Genet. Res. 71: 143-154[CrossRef].
  • Hackett, C.A., Bradshaw, J.E., and McNicol, J.W. 2001. Interval mapping of quantitative trait loci in autotetraploid species. Genetics 159: 1819-1832[Abstract/Free Full Text].
  • Hilu, K.W. 1993. Polyploidy and the evolution of domesticated plants. Am. J. Bot. 80: 1491-1499.
  • Kempthorne, O. 1957. An introduction to genetic statistics. John Wiley & Sons, New York, NY.
  • Kidwell, K.K., Austin, D., and Osborn, T.C. 1994. RFLP evaluation of nine Medicago accessions representing original germ plasm sources for North American alfalfa. Crop Sci. 34: 230-236[Abstract/Free Full Text].
  • Lander, E.S. and Botstein, D. 1989. Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121: 185-199[Abstract/Free Full Text].
  • Luo, Z.W., Hackett, C.A., Bradshaw, J.E., McNicol, J.W., and Milbourne, D. 2001. Construction of a genetic linkage map in tetraploid species using molecular markers. Genetics 157: 1369-1385[Abstract/Free Full Text].
  • McLachlan, G.J. and Peel, D. 2000. Finite mixture models. Wiley, New York, NY.
  • Meng, X.L. and Rubin, D.B. 1993. Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 80: 267-278[Abstract/Free Full Text].
  • Ming, R., Liu, S.C., Lin, Y.R., da Silva, J., Wilson, W., Braga, D., van Deynze, A., Wenslaff, T.F., Wu, K.K., Moore, P.H. 1998. Detailed alignment of Saccharum and Sorghum chromosomes: Comparative organization of closely related diploid and polyploid genomes. Genetics 150: 1663-1682[Abstract/Free Full Text].
  • Ming, R., Liu, S.C., Moore, P.H., Irvine, J.E., and Paterson, A.H. 2001. QTL analysis in a complex autopolyploid: Genetic control of sugar content in sugarcane. Genome Res. 11: 2075-2084[Abstract/Free Full Text].
  • Otto, S.P. and Whitton, J. 2000. Polyploid incidence and evolution. Annu. Rev. Genet. 34: 401-437[CrossRef][Medline].
  • Ramsey, J. and Schemske, D.W. 1998. Pathways, mechanisms, and rates of polyploid formation in flowering plants. Annu. Rev. Ecol. Syst. 29: 467-501[CrossRef].
  • Ripol, M.I, Churchill, G.A., da Silva, J.A.G., and Sorrells, M. 1999. Statistical aspects of genetic mapping in autopolyploids. Gene 235: 31-41[CrossRef][Medline].
  • Ronfort, J., Jenczewski, E., Bataillon, T., and Rousset, F. 1998. Analysis of population structure in autotetraploid species. Genetics 150: 921-930[Abstract/Free Full Text].
  • Skinner, D.Z., Loughin, T., and Obert, D.E. 2000. Segregation and conditional probability association of molecular markers with traits in autotetraploid alfalfa. Mol. Breeding 6: 295-306[CrossRef].
  • Soltis, P.S. and Soltis, D.E. 2000. The role of genetic and genomic attributes in the success of polyploids. Proc. Natl. Acad. Sci. 97: 7051-7057[Abstract/Free Full Text].
  • Sybenga, A. 1994. Preferential pairing estimates from multivalent frequencies in tetraploids. Genome 37: 1045-1055.
  • -----. 1995. Meiotic pairing in autohexaploid Lathyrus: A mathematical model. Heredity 75: 343-350.
  • -----. 1996. Chromosome pairing affinity and quadrivalent formation in polyploids: Do segmental allopolyploids exist? Genome 39: 1176-1184.
  • Wu, K.K., Burnquist, W., Sorrells, M.E., Tew, T.L., Moore, P.H., and Tanksley, S.D. 1992. The detection and estimation of linkage in polyploids using single-dose restriction fragments. Theor. Appl. Genet. 83: 294-300.
  • Wu, R.L. 1999. Mapping quantitative trait loci by genotyping haploid tissues. Genetics 152: 1741-1752[Abstract/Free Full Text].
  • Wu, R.L., Gallo-Meagher, M., Littell, R.C., and Zeng, Z.B. 2001. A general polyploid model for analyzing gene segregation in outcrossing tetraploid species. Genetics 159: 869-882[Abstract/Free Full Text].
  • Wu, R.L., Ma, C.X., and Casella, G. 2002. A bivalent polyploid model of linkage analysis in outcrossing tetraploids. Theor. Pop. Biol. 62: 129-151[CrossRef][Medline].
  • Wu, S.S., Wu, R.L., Ma, C.X., Zeng, Z.B., Yang, M.C.K., and Casella, G. 2001. A multivalent pairing model of linkage analysis in autotetraploids. Genetics 159: 1339-1350[Abstract/Free Full Text].
  • Xie, C.G. and Xu, S.H. 2000. Mapping quantitative trait loci in tetraploid populations. Genet. Res. 76: 105-115[CrossRef][Medline].
  • Yu, K.F. and Pauls, K.P. 1993. Segregation of random amplified polymorphic DNA markers and strategies for molecular mapping in tetraploid alfalfa. Genome 36: 844-851.
  • Zeven, A.C. 1979. Polyploidy and domestication: The origin and survival of polyploids in cytotype mixtures. In Polyploidy: Biological relevance (ed. W.H. Lewis), pp. 385-408. Plenum Press, New York, NY.

Received March 27, 2002; accepted in revised form September 30, 2002.


12:1974-1981 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
R. Wu and C.-X. Ma
A General Framework for Statistical Linkage Analysis in Multivalent Tetraploids
Genetics, June 1, 2005; 170(2): 899 - 907.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. Cao, B. A. Craig, and R. W. Doerge
A Model Selection-Based Interval-Mapping Method for Autopolyploids
Genetics, April 1, 2005; 169(4): 2371 - 2382.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
Z. W. Luo, R. M. Zhang, and M. J. Kearsey
Theoretical basis for genetic linkage analysis in autotetraploid species
PNAS, May 4, 2004; 101(18): 7040 - 7045.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
D. Cao, T. C. Osborn, and R. W. Doerge
Correct Estimation of Preferential Chromosome Pairing in Autotetraploids
Genome Res., March 1, 2004; 14(3): 459 - 462.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire