|
|
|
|
Published online before print
August 18, 2005, 10.1101/gr.3907305 Genome Res. 15:1198-1210, 2005 ©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00
Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy1 Plant Genome Mapping Laboratory, Life Sciences Building, University of Georgia, Athens, Georgia 30602, USA 2 Biochemistry and Molecular Biology, Life Sciences Building, University of Georgia, Athens, Georgia 30602, USA
Both ancient and recent polyploidy, together with post-polyploidization loss of many duplicated gene copies, complicates angiosperm comparative genomics. To explore an approach by which these challenges might be mitigated, genetic maps of extant diploid and tetraploid cottons (Gossypium spp.) were used to infer the approximate order of 3016 loci along the chromosomes of their hypothetical common ancestor. The inferred Gossypium gene order corresponded more closely than the original maps did to a similarly inferred ancestral gene order predating an independent paleopolyploidization ( ) in Arabidopsis. At least 59% of the cotton map and 53% of the Arabidopsis transcriptome showed correspondence in multilocus gene arrangements based on one or both of two software packages (CrimeStatII, FISH). Genomic regions in which chromosome structural rearrangement has been rapid (obscuring gene order correspondence) have also been subject to greater divergence of individual gene sequences. About 26%-44% of corresponding regions involved multiple Arabidopsis or cotton chromosomes, in some cases consistent with known, more ancient, duplications. The genomic distributions of multiple-locus probes provided early insight into the consequences for chromosome structure of an ancient large-scale duplication in cotton. Inferences that mitigate the consequences of ancient duplications improve leveraging of genomic information for model organisms in the study of more complex genomes.
The generally more-rapid evolution of gene arrangement in angiosperms (flowering plants), relative to other higher eukaryotes such as animals, appears to be at least in part due to polyploidy and its consequences (Bowers et al. 2003 ) or before ( ) the monocot-dicot divergence suggest that virtually all angiosperms are paleopolyploids (Bowers et al. 2003
Burgeoning genomic data are revealing that not only do most angiosperms share a few ancient whole-genome duplications, but many have also undergone more recent lineage-specific duplications. For example, a duplication detected in the rice sequence (Goff et al. 2002
Better understanding of polyploidy and its consequences are central to comparative biology. Traditional models (Ohno 1970
The high frequency of duplications (especially in comparison to dioecious organisms such as most animals) and experimental facility of angiosperms, together with their much smaller Ne than yeast, make them attractive models for dissecting the evolutionary consequences of polyploidy in higher eukaryotes. These consequences may have many dimensionsfor example, patterns of distribution of QTLs within a nucleus, across taxa, and across environments (Jiang et al. 1998
The complicated genome structure that results from repeated cycles of genome duplication, each followed by loss of one member of many duplicated gene pairs ("diploidization"[Eckardt 2001b
Gossypium (cotton) is an especially appropriate system in which to explore the comparative genomics of paleopolyploids. While the cereals have often been preferred models for angiosperm comparative genomics, the relative lack of knowledge of non-Poaceae monocots is presently a hindrance in studying ancient genomic duplications. The Malvales (including cotton) are presently the nearest relative to Arabidopsis outside of the Brassicales, for which a detailed genetic map has been described. Dating of the
As a further step toward unraveling the complexities that polyploidy introduces into comparative genomics, we explore in detail the comparative chromosome structural evolution of Gossypium and Arabidopsis. Detailed genetic linkage maps of the tetraploid and D-diploid cotton genomes (Rong et al. 2004
Inferring gene order along the chromosomes of the hypothetical ancestor of the A and D genomes of Gossypium To improve our ability to identify intragenomic duplication in cotton, existing genetic maps (Rong et al. 2004
A total of 781 probes were mapped to two or more loci in cotton. Alignments among homoeologous At, Dt, and D chromosomes were readily established based on 333 pairs of loci arranged in extensive blocks with corresponding order and orientation. An additional 87 pairs of loci revealed blocks of corresponding order but opposite orientation (reflecting inversions). Homoeologous reference loci were used as a framework to interpolate the probable locations of additional markers that could be mapped in only a subset of the homoeologs. In the vast majority of cases, this reflects lack of genetic polymorphism. Even in interspecific crosses, per-locus polymorphism rates in tetraploid cotton are modest (e.g., Rong et al. 2004
Several lines of evidence show that gene loss per se is infrequent in the At and Dt genomes. First, the vast majority of cDNA or other low-copy probes detect multiple "alloallelic" restriction fragments in tetraploid cotton, which comigrate with fragments in diploid progenitors (Reinisch et al. 1994
In the absence of appreciable gene loss, the same principles that apply to the inference of nonpolymorphic sites across the At, Dt, and D genomes are appropriate for inference of approximate gene arrangement along the chromosomes of their hypothetical common ancestor. To infer the probable gene order along the chromosomes of a hypothetical common ancestor of the At, Dt, and D genomes, the Dt genome was used as the primary map into which the other maps were merged. We chose Dt as the reference map because (1) it is virtually collinear with the D diploid map, in contrast to extensive rearrangement of the At genome relative to its A ancestor (Brubaker et al. 1999
Differences in polymorphism associated with terminal markers may cause the respective maps to have somewhat different coverage of their respective chromosomes. As a best estimate of the coverage of the respective maps, we considered the centiMorgan length of the respective maps from the last common marker to the terminus. If the ends of the At or D maps were longer than the corresponding segments of the Dt map, the longer segment was used as the primary map for the affected end. The Dt map was the longest for only three chromosomes (2, 4, 6), thus, the maps of the other 10 chromosomes are therefore somewhat longer than the Dt map (Supplemental Fig. 1; Table 1). To deal with translocations between tetraploid (At) Chrs. 2 and 3, and 4 and 5, the affected chromosomes were split, with the appropriate regions merged into different D genome homologs (Supplemental Table 1). To implement the rules that we outline above for inferring the approximate gene arrangement along the chromosomes of a hypothetical common ancestor of the At, Dt, and D genomes, a computer program was written in PHP. In partial summary, we assembled 13 maps representing the inferred gene orders along the chromosomes of the hypothetical common ancestor of the A and D genomes of Gossypium. Each consensus map was named according to the name of the diploid D genome chromosome, but with the prefix C ("consensus"). The inferred map of the hypothetical ancestral cotton chromosomes included 3016 loci, spanning 2324.7 cM, with the largest gap being 14.5 cM (Table 1).
Cotton sequences having Arabidopsis homologs A total of 2162 (92.5%) of probes detecting 2800 (92.8%) of loci in the inferred ancestral cotton map could be sequenced (Table 1; Supplemental Table 1). Among these, 1738 (62.1%) of the sequenced loci had one or more unambiguous homologs in the Arabidopsis genome based on 7437 BLAST matches that met a threshold of E < 10-10 (listed in Supplemental Table 2). A total of 1005 cotton loci matched 3106 Arabidopsis genes that fell within 30 genes of one another along the inferred pre- gene orders, and thus, were considered tandem or proximal duplicates likely to have arisen by illegitimate recombination and/or transposition. Similarly, 386 Arabidopsis genes matched pairs of cotton loci <5 cM apart, which were also considered proximal duplicates (the rationale for setting these proximity thresholds are in Methods). To avoid the detection of false correspondence based on multiple matches to proximal duplications, only one member of these groups of genes was kept for further analysis. Removal of 2305 proximal duplicates from the match file left 5132 matching pairs that were considered "potential orthologs" and used in comparative analyses.
Distribution of Arabidopsis putative orthologs along inferred ancestral cotton chromosomes
Inferred ancestral Arabidopsis gene arrangement shows more conserved synteny with cotton than modern Arabidopsis gene order
To explore the consequences of duplication/diploidization in Arabidopsis for genomic comparisons, we compared the inferred ancestral cotton map with both the modern Arabidopsis genome and with inferred pre-
Patterns of conserved synteny between cotton and Arabidopsis
In most cases (81.1% with CS2, 96.8% with FISH), a single genomic region in the hypothetical ancestral cotton map corresponded to only one to two Arabidopsis-duplicated segments. These percentages varied among chromosomesfor example, over 80% of correspondence on C01 and C10 involved only one Arabidopsis
Ancient duplication in the cotton genome since its divergence from a common ancestor shared with Arabidopsis Previously (Rong et al. 2004 event, but needs further investigation. Similar patterns of distribution of Ks values were recently reported by others (Blanc and Wolfe 2004To explore the consequences of possible paleopolyploidy of cotton at the level of chromosome structure, loci that were duplicated on nonhomoeologous chromosomes and/or within the same subgenome were mapped to each of their multiple locations on the inferred ancestral map and identified by the addition of a lower-case letter at the end of the probe name (Fig. 1; Supplemental Fig. 1; Supplemental Table 1). FISH and CS2 were used to analyze the distribution of these multiple loci. CS2 showed correspondence over 1200.1 cM, or 51.8% of the genome, more than twice the correspondence detected with FISH (564.1 cM, 24.3%: Supplemental Table 5; Fig. 4). Individual corresponding regions detected with CS2 were generally larger and often included the regions detected by FISH (Supplemental Table 5; Fig. 4). Individual chromosomes varied widely in the portion for which correspondence to other parts of the genome could be identified, from 7.4% (C10 with FISH) to 68% (C02 with CS2).
Both FISH and CS2 suggested occasional correspondence of individual regions of one subgenome to multiple regions of the other, an observation that on the surface may indicate still more ancient duplication events in the cotton lineage. However, such associations are suspect in that they accounted for a disproportionately large share of the incongruities between inferences made using the two packages. Overall, the lengths of corresponding regions on individual chromosomes as estimated by CS2 versus FISH were significantly correlated with one another, but only at a modest level (r = 0.28). There was very strong agreement (r = 0.51) in the portions of individual chromosomes that matched only one other region of the genome. However, there was virtually no relationship between the two packages (r = 0.017) in portions matching two other regions. Higher-order matches lacked adequate data for meaningful comparison. A one-to-one relationship was indicated for 50% and 74% of the corresponding regions by CS2 and FISH, respectively. As illustrated in Figure 4, the number of corresponding regions, in particular those detected by CS2, appears to be related to marker density of the consensus map. Postulated centromeric regions appear to be especially high in both marker density and CS2-inferred correspondence among chromosomal segments. However, we question whether the multiple associations among chromosomes in these regions truly represent conserved syntenyother work in our group (Bowers et al. 2005
Further improving the detection of conserved synteny between cotton and Arabidopsis The combination of cotton segments showing paleohomoeology with one another appears likely to further improve the identification of conserved synteny between cotton and Arabidopsis. Two corresponding chromosomal segments from C05 and C09, respectively (Fig. 5), show six corresponding loci in collinear order (Gate2BC05, pAR0050, Gate4CD03, Unig26C03, pAR0945, and Gate2DH05), except for an inversion between pAR0945 and Gate2DH05 at the ends of two chromosome segments. If a consensus gene arrangement is constructed from the two segments, additional conserved syntenies are identified. For example, when each cotton segment was considered separately, no one showed conserved synteny with 03 on Arabidopsis chromosome 1. But, when the two cotton homoeologous segments are merged, correspondences to two new portions of 03 are identified (Fig. 5). Based on a consensus arrangement inferred from interleaving the two segments, a total of 11 additional Arabidopsis segments showed correspondence, with a maximum of two corresponding to the same cotton region. Sufficient data to apply this approach on a genome-wide scale in cotton may greatly extend the length of chromosomal segments over which conserved synteny can be inferred.
The relatively close relationship of cotton and Arabidopsis, detailed genetic map for cotton, and potential importance of using functional genomic information and tools from Arabidopsis to aid in dissecting economically important pathways in cotton make this system an excellent case study for exploring comparisons of gene order among divergent taxonomic families. The inferred map of 3016 loci spanning 2324.7 cM in the genome of a hypothetical common ancestor of the A and D Gossypium genomes, by itself, is a valuable tool for a wide range of applications. Due to the modest levels of DNA polymorphism among modern (AD) polyploid cottons, we can genetically map both members of a homoeologous gene set in only about 20% of cases, even using interspecific crosses. The inferred map predicts the locations of the remaining 80% of homoeologs that cannot be mapped in any one cross, resolving many incongruities between maps of different tetraploid species (which often segregate for polymorphic alleles at different homoeologs). The inferred map is an excellent resource from which markers can be selected for marker assisted selection, identification of introgression lines, QTL mapping, and SNP discovery using sensitive new techniques that permit identification of informative DNA marker alleles in sequence-tagged sites such as these that had previously been mapped as RFLPs. A growing number of such studies have been done or are in progress (Jiang et al. 1998 The inferred ancestral map is especially important for linking the tetraploid cotton genetic map to emerging BAC-based physical maps, in that it mitigates not only the omission of monomorphic loci, but also any gene loss associated with "diploidization" subsequent to AD polyploid formation. Most of the mapped sequences have been used to anchor high-coverage bacterial artificial chromosome (BAC) libraries of G. hirsutum, G. barbadense, and G. raimondii (see http://www.plantgenome.uga.edu/cotton/CottonDBFrames.htm), the latter of which is also being completely fingerprinted (http://www.plantgenome.uga.edu/projects.htm#Cotton). This will permit us to resolve more precisely the true arrangements of loci that are too closely linked to order with confidence based on genetic recombination.
The inferred ancestral map of cotton, together with similar pre-
Herein, we begin the process of unraveling the consequences of an ancient duplication so far only known in the Gossypium genus, beginning to reveal the sizes and locations of duplicated regions. Growing evidence suggests that historical estimates of the role of paleopolyploidy in angiosperm evolution (Stebbins 1966 Much remains to be done. While we have found tentative correspondence of segments covering about half of the genome, we must assume that many small rearrangements within these segments have escaped detection by the coarse resolution afforded by the (albeit relatively large) number of genetically mapped DNA markers in hand. Additional genetic information and eventually genomic sequence will further clarify the probable patterns of gene arrangement that predated this duplication in the cotton lineage.
Although we have concentrated in this work on duplication events that occurred more recently than the divergence of the cotton and Arabidopsis lineages from a common ancestor, the effects of still more ancient duplications shared by the lineages are also evident. In several cases, different Arabidopsis The true properties of the "other half" of the genome, in which we have not yet found even early evidence of ancient duplication, remain to be clarified. For many of these we may simply have too little information to discern a "signal" of structural conservation from among the many factors that may contribute noise to this data set. However, genomic regions in which chromosome structural rearrangement has been rapid (leaving no conserved synteny), appear also to have been subject to rapid divergence of individual gene sequences. In analysis of potential orthologs with FISH and CS2, 802 (46.1%) detected nonrandom correspondence, putatively conserved synteny between cotton and Arabidopsis (Table 1; Supplemental Table 2). To evaluate their patterns of distribution across the genome, the consensus maps were subdivided into bins of 10 cM in length. While all chromosomes (ranging from 89.5% to 95.1%: Table 1) and most bins (Fig. 6) contained very similar proportions of sequenced loci, the percentage of loci showing putative synteny with Arabidopsis was highly variable. We further plotted the percentage of loci for which an Arabidopsis homolog could not be identified (Table 1). There was a strong negative correlation (r = -0.57) between the percentage of loci (per bin) showing conserved synteny with Arabidopsis, and the percentage of loci showing no match in Arabidopsis. This indicates that different genomic regions may tolerate rearrangement at different rates, a hypothesis that has previously been suggested based on the failure to identify ancient duplications in centromeric regions of both Arabidopsis and rice. A fascinating question for further study is whether such rapidly evolving regions contain a disproportionate share of genes that account for morphological or physiological divergence between taxa, including reproductive isolation.
Ongoing improvement of analytical tools may also help to better resolve long-range comparative data. The main difference between CS2 and FISH is the method by which individual points in the data matrix are taken as evidence for conserved synteny. In CS2, Euclidean distance was used and if points are spatially close enough, they will be flagged as correspondence. This algorithm may thus be especially prone to false positives in regions of the genome in which low recombination per unit physical distance results in high marker cosegregation and inability to resolve true orders of genes along the chromosome. As we noted, CS2 tended to detect more correspondence and longer segments in marker-rich genomic regions, but multiple associations in these regions appear likely to include some false positives. In contrast, in FISH, Manhattan distance was used and only the points close enough to meet likelihood thresholds and also in a roughly diagonal line are taken as evidence of correspondence. However, deviations from such a diagonal may be caused by a number of factors. For example, localized inversions (Grant et al. 2000
Assembling an inferred map of the hypothetical cotton ancient genome Genetic linkage maps of the At (tetraploid A sub genome), Dt (tetraploid D sub genome), and D (diploid D genome) chromosomes, reported by Rong et al. (2004
It was previously noted that many probes detected multiple polymorphic loci (Rong et al. 2004
Here, M is the marker located on At or D and intended to be merged into the inferred map. M2 is the relative location (distance from the top of the inferred maps, cM) of the marker (M) in the inferred map (Fig. 1A). M1 is the original location of marker M on At or D. Y1 and Y2 are the location (distance from the top of the each homoeologous chromosome, cM) of the first common marker (Y) on the At (or D) and inferred map, respectively. X1 and X2 are the location of the second common marker (Y) on the At (or D) and inferred map, respectively. To efficiently build the inferred map according to the rules and formula mentioned above, a Web-based computer program was compiled using PHP. The code of the program is available at http://www.plantgenome.uga.edu/MapMerger/.
Computer software The FISH (Calabrese et al. 2003
FISH was designed for Fast Identification of Segmental Homologs (Calabrese et al. 2003
between a pair of points i and j must be less than the threshold distance
where m = number of points, n = number of cells, and T = the probability of having one or more neighbors within a distance less than dTFISH under the assumption that each cell contains a point with probability m/n.
The CS2 package was developed by Ned Levine and Associates (Levine 2002
One CS2 tool useful to genomics is the nearest neighbor hierarchical clustering analysis, which provides a means of delineating significant clusters of points in Euclidean space. This tool identifies clusters by finding nearest neighbors that are separated by a Euclidean distance
that is less than a minimum threshold distance. This threshold distance is defined to be a one-tailed confidence interval around the expected random values for distance to nearest neighbors such that,
where A is the Area being surveyed, N is the number of sampled incidents, and t is the t-value associated with a probability level from the Student's t-distribution. These nearest neighbor sets are joined into hierarchical clusters with a minimum size and probability level set by the user. CS2 will accept cluster sizes as low as two points, and can join points with probability values ranging from P = 0.00001 up to P = 0.999. The output from this analysis is a text file identifying points and their cluster association, as well as an ellipsoid shape file that may be exported to ArcView for visualization of the distribution of clusters.
Identification of ancient duplications in the hypothetical cotton ancestor For CS2, the same set of match data as in FISH was used. Instead of the marker name used in the above match files, the accumulated distances from Chromosome 1 (C01) to C13 were calculated and used as the locus name for all chromosomes. Only one file was therefore created for all 13 chromosomes and is composed of two columns, numbers in each column representing one of the matching loci. The first column was displayed as the x-axis and the second as the y-axis. CS2 analysis used the following parameters: minimum points (pairs of matching loci) per cluster, three; number of standard deviations for ellipses, one; simulation runs, 0. The likelihood was set to 1%. The lengths of conserved syntenic blocks were estimated by calculating the distance between the delimiting markers in the block detected by FISH or in the cluster detected by CS2.
Arabidopsis gene duplication database
Cotton probe DNA sequencing and BLAST search against Arabidopsis sequence database
Evaluation of conserved synteny between cotton and Arabidopsis
Similarly, all 34 Arabidopsis
This work has been supported in part by grants from the USDA National Research Initiative (97-35300-5305), and National Science Foundation Plant Genome Research Program (DBI-9872630, DBI-0211700).
3 Corresponding author. E-mail paterson{at}uga.edu; fax (706) 583-0160. [Supplemental material is available online at www.genome.org.] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3907305. Article published online before print in August 2005.
Adams, K.L., Cronn, R., Percifield, R., and Wendel, J.F. 2003. Genes duplicated by polyploidy show unequal contributions to the transcriptome and organ-specific reciprocal silencing. Proc. Natl. Acad. Sci. 100: 4649-4654. The Arabidopsis Genome Initiative 2000. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796-815.[CrossRef][Medline] Benton, M.J. 1993. The fossil record 2. Chapman and Hall, New York.
Blanc, G. and Wolfe, K.H. 2004. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell 16: 1667-1678.
Blanc, G., Barakat, A., Guyot, R., Cooke, R., and Delseny, I. 2000. Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell 12: 1093-1101. Bowers, J.E., Chapman, B.A., Rong, J., and Paterson, A.H. 2003. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422: 433-438.[CrossRef][Medline] Bowers, J.E., Arias, M.A., Asher, R., Avise, J.A., Ball, R.T., Brewer, G.A., Buss, R.W., Chen, A.H., Edwards, T.M., Estill, J.C., et al. 2005. Comparative physical mapping links retention of microsynteny to chromosome structure and recombination in grasses. Proc. Natl. Acad. Sci. (in press). Brownstein, J.S., Holford, T.R., and Fish, D. 2003. A climate-based model predicts the spatial distribution of the lyme disease vector ixodes scapularis in the United States. Environ. Health Perspect. 111: 1152-1157.[Medline] Brubaker, C.L., Paterson, A.H., and Wendel, J.F. 1999. Comparative genetic mapping of allotetraploid cotton and its diploid progenitors. Genome 42: 184-203.[CrossRef] Calabrese, P.P., Chakravarty, S., and Vision, T.J. 2003. Fast identification and statistical evaluation of segmental homologies in comparative maps. Bioinformatics 19: i74-i80.[Abstract]
Cronn, R.C., Small, R.L., Haselkorn, T., and Wendel, J.F. 2002. Rapid diversification of the cotton genus (Gossypium: Malvaceae) revealed by analysis of sixteen nuclear and chloroplast genes. Amer. J. Bot. 89: 707-725.
Eckardt, N.A. 2001a. Everything in its place: Conservation of gene order among distantly related plant species. Plant Cell 13: 723-725.
____. 2001b. A sense of self: The role of DNA sequence elimination in allopolyploidization. Plant Cell 13: 1699-1704.
Fuss, J. and Linn, S. 2002. Human DNA polymerase
Gaut, B.S., Morton, B.R., McCaig, B.C., and Clegg, M.T. 1996. Substitution rate comparisons between grasses and palms: Synonymous rate differences at the nuclear gene Adh parallel rate differences at the plastid gene rbcL. Proc. Natl. Acad. Sci. 93: 10274-10279.
Goff, S.A., Ricke, D., Lan, T.H., Presting, G., Wang, R.L., Dunn, M., Glazebrook, J., Sessions, A., Oeller, P., Varma, H., et al. 2002. A draft sequence of the rice genome (Oryza sativa L. ssp japonica). Science 296: 92-100.
Grant, D., Cregan, P., and Shoemaker, R.C. 2000. Genome organization in dicots: Genome duplication in Arabidopsis and synteny between soybean and Arabidopsis. Proc. Natl. Acad. Sci. 97: 4168-4173.
Grover, C.E., Kim, H., Wing, R.A., Paterson, A.H., and Wendel, J.F. 2004. Incongruent patterns of local and global genome size evolution in cotton. Genome Res. 14: 1474-1482. Gu, Z., Steinmetz, L.M., Gu, X., Scharfe, C., Davis, R.W., and Li, W-H. 2003. Role of duplicate genes in genetic robustness against null mutations. Nature 421: 63-66.[CrossRef][Medline]
Jiang, C., Wright, R., El-Zik, K., and Paterson, A.H. 1998. Polyploid formation created unique avenues for response to selection in Gossypium (cotton). Proc. Natl. Acad. Sci. 95: 4419-4424. Jiang, C., Wright, R.J., Woo, S.S., DelMonte, T.A., and Paterson, A.H. 2000a. QTL analysis of leaf morphology in tetraploid Gossypium (cotton). Theor. Appl. Genet. 100: 409-418.[CrossRef] Jiang, C.X., Chee, P.W., Draye, X., Morrell, P.L., Smith, C.W., and Paterson, A.H. 2000b. Multilocus interactions restrict gene introgression in interspecific populations of polyploid Gossypium (cotton). Evolution 54: 798-814.[CrossRef][Medline] Kellogg, E.A. 2003. It's all relative. Nature 422: 383-384.[CrossRef][Medline]
Koch, M.A., Haubold, B., and Mitchell-Olds, T. 2000. Comparative evolutionary analysis of chalcone synthase and alcohol dehydrogenase loci in Arabidopsis, Arabis, and related genera (Brassicaceae). Mol. Biol. Evol. 17: 1483-1498. Kowalski, S., Lan, T-H., Feldmann, K., and Paterson, A.H. 1994. Comparative mapping of Arabidopsis thaliana and Brassica oleracea chromosomes reveals islands of conserved gene order. Genetics 138: 499-510.[Abstract]
Ku, H.M., Vision, T., Liu, J.P., and Tanksley, S.D. 2000. Comparing sequenced segments of the tomato and Arabidopsis genomes: Large-scale duplication followed by selective gene loss creates a network of synteny. Proc. Natl. Acad. Sci. 97: 9121-9126. Larkin, J.C., Brown, M.L., and Schiefelbein, J. 2003. How do cells know what they want to be when they grow up? Lessons from epidermal patterning in Arabidopsis. Annu. Rev. Plant Biol. 54: 403-430.[CrossRef][Medline] Levine, N. 2002. CrimeStat: A spatial statistics program for the analysis of crime incident locations (v 2.0). Ned Levine & Associates, Houston, TX, and the National Institute of Justice, Washington, DC. Liu, B., Brubaker, C.L., Mergeai, G., Cronn, R.C., and Wendel, J.F. 2001. Polyploid formation in cotton is not accompanied by rapid genomic changes. Genome 44: 321-330.[Medline]
Lynch, M. and Conery, J.S. 2003. The origins of genome complexity. Science 302: 1401-1404.
Mayer, K., Murphy, G., Tarchini, R., Wambutt, R., Volckaert, G., Pohl, T., Dusterhoft, A., Stiekema, W., Entian, K-D., Terryn, N., et al. 2001. Conservation of microstructure between a sequenced region of the genome of rice and multiple segments of the genome of Arabidopsis thaliana. Genome Res. 11: 1167-1174.
Ming, R., Liu, S-C., Irvine, J.E., and Paterson, A.H. 2001. Comparative QTL analysis in a complex autopolyploid: Candidate genes for determinants of sugar content in Sugarcane. Genome Res. 11: 2075-2084.
Moore, R.C. and Purugganan, M.D. 2003. The early stages of duplicate gene evolution. Proc. Natl. Acad. Sci. 100: 15682-15687. Muravenko, O.V., Fedotov, A.R., Punina, E.O., Fedorova, L.I., Grif, V.G., and Zelenin, A.V. 1998. Comparison of chromosome BrdU-Hoechst-Giemsa banding patterns of the A1 and (AD) (2) genomes of cotton. Genome 41: 616-625.[CrossRef] Ohno, S. 1970. Evolution by gene duplication. Springer, Berlin, Germany. Paterson, A.H., Lan, T.H., Reischmann, K.P., Chang, C., Lin, Y.R., Liu, S.C., Burow, M.D., Kowalski, S.P., Katsar, C.S., DelMonte, T.A., et al. 1996. Toward a unified genetic map of higher plants, transcending the monocot-dicot divergence. Nat. Genet. 14: 380-382.[CrossRef][Medline]
Paterson, A.H., Bowers, J., Burow, M., Draye, X., Elsik, C., Jiang, C., Katsar, C., Lan, T., Lin, Y., Ming, R., et al. 2000. Comparative genomics of plant chromosomes. Plant Cell 12: 1523-1539. Paterson, A., Bowers, J., Peterson, D., Estill, J., and Chapman, B. 2003. Structure and evolution of cereal genomes. Curr. Opin. Genet. Dev. 13: 644-650.[CrossRef][Medline] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||