|
|
|
|
Published online before print
August 10, 2006, 10.1101/gr.5338906 Genome Res. 16:1241-1251, 2006 ©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
Letter Uneven chromosome contraction and expansion in the maize genome1Munich Information Center for Protein Sequences (MIPS), Institute for Bioinformatics, GSF Research Center for Environment and Health, D-85764 Neuherberg, Germany; 2The Plant Genome Initiative at Rutgers (PGIR), Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA; 3Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA; 4Department of Genetics, University of Georgia, Athens, Georgia 30602, USA; 5Arizona Genomics Institute (AGI), University of Arizona, Tucson, Arizona 85721, USA; 6The Institute for Genomic Research (TIGR), Rockville, Maryland 20850, USA; 7Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA
Maize (Zea mays or corn), both a major food source and an important cytogenetic model, evolved from a tetraploid that arose about 4.8 million years ago (Mya). As a result, maize has extensive duplicated regions within its genome. We have sequenced the two copies of one such region, generating 7.8 Mb of sequence spanning 17.4 cM of the short arm of chromosome 1 and 6.6 Mb (25.6 cM) from the long arm of chromosome 9. Rice, which did not undergo a similar whole genome duplication event, has only one orthologous region (4.9 Mb) on the short arm of chromosome 3, and can be used as reference for the maize homoeologous regions. Alignment of the three regions allowed identification of syntenic blocks, and indicated that the maize regions have undergone differential contraction in genic and intergenic regions and expansion by the insertion of retrotransposable elements. Approximately 9% of the predicted genes in each duplicated region are completely missing in the rice genome, and almost 20% have moved to other genomic locations. Predicted genes within these regions tend to be larger in maize than in rice, primarily because of the presence of predicted genes in maize with larger introns. Interestingly, the general gene methylation patterns in the maize homoeologous regions do not appear to have changed with contraction or expansion of their chromosomes. In addition, no differences in methylation of single genes and tandemly repeated gene copies have been detected. These results, therefore, provide new insights into the diploidization of polyploid species.
Maize, Zea mays L., is one of the most productive crops on earth. It is a critical source of animal feed, a staple food for many countries in Latin America and Africa, and has many industrial uses. It serves as a model species to study many basic biological processes such as recombination, transposition, meiosis, paramutation, imprinting, gene expression, and plant development. Owing to its economic and biological importance (Messing 2005 2400 Mb (Rayburn et al. 1993 500 Mb each (Cannon et al. 2005
Within the cereals, conserved genetic markers can readily be found, and are largely collinear (Gale and Devos 1998
Sequence analysis of collinear segments, whole-genome shotgun sequences, BAC end sequences (BES), and fully sequenced random BAC clones have also been important for our understanding of the drastic differences in genome sizes (Tikhonov et al. 1999
Comparative analysis of orthologous regions between cereals has also allowed a higher resolution view of synteny, and its preservation or lack thereof. Although alignments of genetic markers indicate extensive synteny between grass genomes (Gale and Devos 1998
All studies of orthologous regions described to date have investigated relatively small chromosomal intervals containing few maize genes, owing in large part to the resources available at the time of the studies. Here, we use the completed rice genome sequence and well-characterized maize BAC libraries to study much longer chromosomal regions. DNA fingerprinting was used to select BAC clones with a minimal overlap from highly redundant maize B73 BAC libraries (Yim et al. 2002
Physical and genetic maps Two homoeologous regions of maize, Zm1S and Zm9L, were selected for this study (Supplemental Fig. 1). Zm1S is located on the short arm of chromosome 1, spans the markers bnlg1124 at position 29.0 and bnlg1112 at position 46.4 of the BNL 2002 map, and is contained within a tiling path that contains 60 BAC clones (Supplemental Table A). Zm9L is located on the long arm of chromosome 9, spans the markers bnlg619 at position 129.4 and bnlg1129 at position 155.0 of the BNL 2002 map, and is contained within a tiling path that contains 56 BAC clones (Supplemental Table B). Based on these markers, the two homoeologous regions of the maize genome comprise a genetic distance of 17.4 and 25.6 cM, respectively, or a total of 43 cM. If the same regions are compared to maps derived from recombinant inbred lines, the genetic distances expand dramatically as expected (Lee et al. 2002
Clones were subjected to shotgun sequencing, and sequences were assembled with ARACHNE and then were curated as described in Methods. All clones are listed in Supplemental Tables D (Zm1S) and E (Zm9L), with their sizes and accessions. The tiles yield a sequence of 7,822,695 and 6,560,930 bp, respectively, from these contiguous BACs. Both maize regions have dramatically higher recombination frequencies per kilobase compared to the average for the entire maize genome. The B73 genome has an estimated total physical length of
Gene content
The number of exons per gene is similar between the two maize regions (5.56.8) and the rice regions (5.7). However, many genes in maize appear to be expanded in length compared to rice (average gene length: 4.2 kb for maize, 3.3 kb for rice). While the average exon length is slightly longer in rice than in maize, the reverse is true for the length of introns (Table 1). We also find a number of very large predicted genes in maize, and these large-intron genes are responsible for the larger average intron size compared to rice. The largest predicted maize gene on the chromosome 1 segment is 89.1 kb and on the chromosome 9 segment it is 64.7 kb, whereas the largest predicted gene in the comparable rice regions is only 30.4 kb. It is possible that some of these very large predicted maize genes are annotation artifacts, or mutant alleles, but we have found some cases in which maize B73 ESTs seamlessly cover the exons flanking a very large intron, suggesting that these large candidate genes are expressed and properly processed (Supplemental Fig. 2). Therefore, it appears that maize has expanded both intergenic regions and the genes themselves relative to rice.
Intergenic regions
For all regions, the observed repeat content is lower than the average genome values. This is not unexpected, given that the gene density is higher than the genome-wide average. Compared to the average for the rice genome (33% repetitive DNA), the studied Os3S region contains only 19% repetitive DNA, while the maize regions each contain <66% repetitive DNA predicted for the entire genome (63% for Zm1S and 59% for Zm9L). Conversely, the amount of coding space in Os3S (44%) differs from the overall rice average of 33%. Interestingly, among all 12 rice chromosomes (35%), chromosome 3 (29%) has the least amount of repeats and the highest average gene density, 1 gene per 8.7 kb, as opposed to the rice genome, 1 gene per 9.9 kb (International Rice Genome Sequencing Project 2005 37% of the space (compared to a genomic average of 34%), while both Zm1S and Zm9L have 25% nonassigned intergenic sequences, compared to a 22% genome-wide average.
One of the striking differences observed for Os3S is that the region contains a higher than average quantity of DNA transposons (by a factor of 1.44). This is largely due to the insertion of miniature inverted-repeat-transposable elements (MITES) (Wessler et al. 1995
The distribution of repeat elements correlates well with the higher than average gene density in the maize chromosome regions studied. These features are preserved even in the segment that underwent the largest expansion (Zm1S). In this respect, it is interesting to note that these regions are located at the distal ends of their respective chromosomes (Fig. 1). It has previously been proposed that regions close to the telomeric regions of the wheat genome have increased gene density (Akhunov et al. 2003
Alignment of syntenic regions When all three regions are aligned based on conserved sequences, the Zm1S region becomes the limiting factor despite being the longest sequence of 7.8 Mb (Fig. 1). This region has experienced the highest degree of relative expansion, which can be expressed as the size of the maize region compared to the rice reference region. Based on distinct junctions of nonhomoeologous sequences separating sequence segments of collinear genes, one can divide the aligned sequences into segments of homoeologous sequences. Individual segments of Zm1S have expansion factors between 1.9 for segment A and 4.2 for segments B1 and B2 (Fig. 1). The expansion rates of Zm9L vary from 0.8 (segments B1 and B2) to almost no expansion (segments A and C) to 1.8 (segment D). Consequently, degrees of expansion between Zm1S and Zm9L differ by as much as a factor of 5.25 (segments B1 and B2). If, after the WGD event, maize had retained all of its duplicated regions, the relative sizes of the present-day maize and rice genomes would suggest a genome-wide threefold expansion rate for all homoeologous regions in maize. While segments B1 and B2 of Zm1S expanded beyond the average, all others are far below the expected expansion rates. It is not clear whether this was caused by a lower-than-average frequency of transposon amplification in all regions but B1 or B2, or by a higher frequency of the small deletions (most caused by illegitimate recombination) that have been found to be responsible for genome shrinkage in rice and Arabidopsis (Devos et al. 2002
Another marked feature of Zm1S is an insertion of
Contraction and expansion of synteny blocks
A different situation occurs for the Zm1S region. Although segment A experienced an overall expansion, the genic space is considerably reduced by a factor of 0.4. In addition, the nonassigned space has a moderate expansion factor of 1.3. For the drastically expanded segments B1 and B2, we observed relative expansion factors of 1.2 and 1.4 for the genic space, and 2.3 and 2.7 for the unassigned space (Table 3). Interestingly the expansion factor of genes between random maize BACs versus random rice regions is 1.4 (Table 3). Based on 37,544 gene models in rice, the number of gene models in maize would be 37,544 x 1.4 = 52,562, which falls within the range proposed earlier (Haberer et al. 2005 Further analysis of repeat elements in different segments illustrates the uneven target site preference of retrotransposition (Table 3). The Zm1S and Zm9L regions have very different distributions of repeat elements. Segments of Zm9L are quite variable and range between 34.6% and 61.2% in repeat content (overall, 58.8%). In comparison, Zm1S shows less variability and has a local repeat concentration that ranges between 62% and 65.5% (overall, 63.1%). The B1 segments of both maize regions have the largest increase in retroelement content relative to rice, with Ty1/copia-like elements increasing substantially more than the Ty3/gypsy-like elements. The increase in Zm1S is nearly a magnitude higher than in Zm9L. B1 is marked on one side by a rather large junction of nonhomoeologous sequences that occurs in both Zm1S and Zm9L. On the other side, an inversion event is specific to Zm9L, and perhaps is involved in the halted expansion of Zm9L relative to Zm1S. Once the entire maize genome is sequenced, it will be possible to assess whether there is a general correlation between local genome expansion and the presence of inversions or other chromosomal rearrangements.
Analysis of these two large regions does not reveal evidence of large gene islands separated by retrotransposon blocks. As previously reported, most gene islands are small (one to two genes) (Bennetzen et al. 2005
Gene loss after WGD to form maize chromosomes. The number increases only slightly to 306 (48%) when less stringent microsynteny criteria are used. Because a large number of BAC clones are still at phase 1 (unordered contigs due to physical gaps) stage and not finished (Supplemental Tables D and E), the order of contigs within BAC clones is not always resolved (see also Methods). This relaxed stringency would also allow for small-scale rearrangements within syntenic blocks.
When comparing both maize regions with rice, the number of genes collinear in all three regions is <50%. This is not unexpected because, as has been noted previously, one copy of a duplicated gene has been frequently lost after the WGD event. Of the 270 genes in rice that are present in either Zm1S or Zm9L, only 298 genes (55%) of the possible 540 (one copy in each maize region) were detected. This number increases to 346 (64%) with the relaxed stringency described above. This is a lower percentage than previously observed for the analysis of five different duplicated regions of the maize genome (Lai et al. 2004b
An even greater loss of the second gene copies of the genes in the duplicated genome (90%) occurred after the hybridization of the two fungal species that formed baker's yeast (Kellis et al. 2004
Significant gene movement in rice and maize
Conversely, we addressed the same question for the 338 genes present in rice Os3S, but lacking orthologous counterparts on both Zm1S and Zm9L. In the absence of the entire maize genome sequence, we took advantage of the complete collection of maize genomic survey sequences (GSS containing high C0t [HC], MF, BES, and whole-genome shotgun sequences) and expressed sequences (EST and TCs) and asked whether the extra Os3S genes are homologous to any of the maize-derived GSSs or ESTs/TCs. Out of the 338 extra rice genes, 220 (65%) were assigned to maize GSSs or ESTs/TCs. Because the current maize GSSs and ESTs/TCs do not cover the complete gene set of maize, this can be considered an underestimate of the number of rice genes with a homolog elsewhere in the maize genome. In fact, for 51 out of the 118 rice genes without identified maize counterparts, homologs could be detected in ESTs/TCs collections from other plants. Nevertheless, our results indicate that maize and rice contain additional (or different) genes without homologous counterparts in the other species at levels of 9%10% (Zm1S + Zm9L vs. rice) and 18% (Os3S vs. maize). In support of this finding, a recent comparison of maize unigenes or cDNAs with the rice genome indicates that 22% of the maize genes are not present in rice (Lai et al. 2004a
Duplicated genes in maize
In summary, 150 genes of both maize regions are related to each other, while 153 and 176 genes from Zm1S and Zm9L, respectively, represent either a single copy of a duplicated gene or a noncollinear gene as described above. Even with the inclusion of genes conserved on both maize chromosomes, an estimate of the number of genes lost and gained relative to rice is uncertain. Recently an additional comparison against a syntenic region from sorghum has shown that both copies of a gene derived from an ancestral chromosome can be lost relative to a copy conserved in its position in sorghum and rice (Lai et al. 2004b
DNA methylation of the maize regions
To gain insights into the distribution of methylation within the large contiguous sequenced regions of the maize genome, we mapped genomic survey sequences (GSSs) derived from a methylation-filtered (MF) library (Rabinowicz et al. 1999
Figure 3 gives a graphical overview of the density distribution for MF and HC clones in the two maize Zm1S and Zm9L regions. Both filtered libraries cover a similar percentage of the genic (CF: 32%; MF: 30%) as well as of the complete genomic space (CF: 12.4%; MF: 16.6%). The cumulative coverage of CF and MF for the genic space was 49%, indicating overlapping and complementary specificities for the two methods. About 91.5% of all genes were detected by at least one GSS tag. A previous comparison of CF and MF with a set of 78 full-length cDNAs also showed that 95% of the cDNAs are tagged by at least one sequence read (Springer et al. 2004
To analyze the potential epigenetic influences on the regulation of duplicated genes, we were particularly interested in similarities and differences of the methylation patterns for both tandemly repeated genes and conserved syntenic genes between the Zm1S and Zm9L regions. Coverage of tandemly repeated genes by MF clones was quite similar to the average coverage of all 479 genes. In particular, 78% of all genes and 75% of tandem genes had at least one matching MF clone.
Genes conserved in both syntenic maize segments revealed a significantly higher coverage by MF clones (0.42x of the genic space vs. 0.30x for all genes), suggesting a higher degree of hypomethylation for this class of genes. Interestingly, the fraction of these conserved genes with no MF coverage (3.8%; 3 out of 78) is significantly lower compared to the expectation for all genes [P < 0.0001; P(X Interestingly, the expansion that occurred in different intervals of Zm1S and Zm9L relative to Os3S did not create a differential methylation pattern. Hypomethylation seems to be present even in regions where an increased expansion by retrotrans-position occurred. Future analysis of other duplicated regions of the maize genome will help to refine the correlation of methylation and chromosome architecture.
We have obtained the first large contiguous sequences of maize, spanning >43 cM of genetic distance, by an economical sequencing strategy. The sequence data enabled an analysis of the changes that occurred after the progenitor of maize was formed by hybridization of two closely related species. By using the known genome sequence of rice as a reference, important insights into genome evolution have been obtained. An unforeseen result is the enormous variability of chromosomal expansion in different syntenic blocks. Our data suggest that the ancestral chromosomes contained seeding sites for inversions and insertions within segmental duplicated regions, including sites for differential growth by insertions and deletions. It appears that the C-value enigma results from a composite of chromosome contraction and expansion. In addition, the death and birth of new genes is pronounced. Approximately a tenth of the predicted maize genes lack a homologous counterpart in rice, a phenomenon unseen in closely related mammalian genomes (International Human Genome Sequencing Consortium 2004
DNA sequencing, sequence assembly, and data deposition The tiling path for both maize chromosomes was selected at the Arizona Genomics Institute. All BACs were validated by cross-checking their HindIII profiles with pre-existing agarose fingerprints. Overlap between clones is quite variable because they were not yet optimized with high-resolution fingerprinting (Nelson et al. 2005 17,200 clones for all FPCs, currently 90% of the B73 genome (http://www.genome.arizona.edu/fpc/maize/).
In total, 116 BAC clones were chosen for sequencing. Sixty clones for maize chromosome 1 were sequenced at The Institute of Genomic Research, while Zm9L was sequenced with 40 clones at the Broad Institute, and 16 at the Plant Genome Initiative at Rutgers (Supplemental Tables D and E). Randomly sheared BAC DNA in a very narrow size range of 4 kb was used to construct shotgun libraries in the pOTWI vector for Zm9L, while the pCR4TOPO vector (Invitrogen) and 68-kb inserts were used for Zm1S BACs. Inserts were sequenced from both ends using universal primers (Vieira and Messing 1982
In order to confirm that the entire pseudomolecule mapped to the same location on the physical and genetic maps, we mapped several new SSRs (Supplemental Table H). Four BACs, distributed across the 56 BAC Zm9L megacontig, were chosen for detection of SSRs (Castelo et al. 2002 Sequences of genetic markers from maize, rice, and sorghum were downloaded from the community Web sites, and used to determine their exact positions in the maize and rice sequences at the PGIR. After extensive curation and editing, files of the pseudomolecules were sent to the Munich Information Center for Protein Sequences (MIPS) for analysis.
Sequence analysis The gene annotation of the syntenic part of rice was based on the TIGR rice assembly (version 3). Sequences were masked for repetitive elements applying identical methods as used for both maize regions. For gene prediction, TIGR (v3) gene models as well as complementary monocot ESTs/TCs collections were mapped to the genomic sequences. Potential transposable elements were identified by BLASTN comparisons against the MIPS plant repeat library (for cutoff parameters, see above). Identified repetitive elements and transposable elements within rice were excluded from the analysis of syntenic relationships. In order to determine the syntenic relation between predicted genes/proteins, the bidirectional best BLASTP hits (BBHs; e-value < 1e-20) of the proteins of each chromosomal region was determined. The syntenic pairs were determined manually once with rice as central reference, where genes from both maize chromosomes were compared to rice, and between the two maize chromosomes. The micro-collinear gene clusters were merged into macro-collinear blocks. The criterion for homologous genes considered as tandem duplicated genes was a BLASTP e-value < 1e-20 and a maximal distance of one nonhomologous gene inserted between the homologous genes.
Coverage of filtered clones
This work was supported by the National Science Foundation Plant Genome grants 0211851 (PI: J.M., The Plant Genome Initiative at Rutgers) and 0221536 (PI: K. Schubert, Donald Dan-forth Plant Science Center). Work at MIPS was, in part, supported by the GABI program of the German Ministry for Education and Research (BMBF).
8 These authors contributed equally to this work.
9 Corresponding author. E-mail messing{at}mbcl.rutgers.edu; fax (732) 445-0072. Supplemental material is available online at www.genome.org. The list of the accessions deposited into GenBank can be found in Supplemental Tables D (chromosome 1S) and E (chromosome 9L). Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5338906.
Akhunov, E.D., Goodyear, A.W., Geng, S., Qi, L.L., Echalier, B., Gill, B.S., MiftahudinGustafson, J.P., Lazo, G., Chao, S. et al. 2003. The organization and rate of evolution of wheat genomes are correlated with recombination rates along chromosome arms. Genome Res. 13: 753763. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403410.[CrossRef][Medline] Anderson, L.K., Lai, A., Stack, S.M., Rizzon, C., Gaut, B.S. 2006. Uneven distribution of expressed sequence tag loci on maize pachytene chromosomes. Genome Res. 16: 115122. Arabidopsis Genome Initiative. 2000. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796815.[CrossRef][Medline] Batzoglou, S., Jaffe, D.B., Stanley, K., Butler, J., Gnerre, S., Mauceli, E., Berger, B., Mesirov, J.P., Lander, E.S. 2002. ARACHNE: A whole-genome shotgun assembler. Genome Res. 12: 177189. Bennett, M.D., Leitch, I.J., Price, H.J., Johnston, J.S. 2003. Comparisons with Caenorhabditis (approximately 100 Mb) and Drosophila (approximately 175 Mb) using flow cytometry show genome size in Arabidopsis to be approximately 157 Mb and thus approximately 25% larger than the Arabidopsis Genome Initiative estimate of approximately 125 Mb. Ann. Bot. (Lond.) 91: 547557. Bennetzen, J.L., Schrick, K., Springer, P.S., Brown, W.E., SanMiguel, P. 1994. Active maize genes are unmodified and flanked by diverse classes of modified, highly repetitive DNA. Genome 37: 565576.[Medline] Bennetzen, J.L., Coleman, C., Liu, R., Ma, J., Ramakrishna, W. 2004. Consistent over-estimation of gene number in complex plant genomes. Curr. Opin. Plant Biol. 7: 732736.[CrossRef][Medline] Bennetzen, J.L., Ma, J., Devos, K.M. 2005. Mechanisms of recent genome size variation in flowering plants. Ann. Bot. (Lond.) 95: 127132. Brunner, S., Fengler, K., Morgante, M., Tingey, S., Rafalski, A. 2005. Evolution of DNA sequence nonhomologies among maize inbreds. Plant Cell 17: 343360. Buell, C.R., Yuan, Q., Ouyang, S., Liu, J., Zhu, W., Wang, A., Maiti, R., Haas, B., Wortman, J., Pertea, M. et al. 2005. Sequence, annotation, and analysis of synteny between rice chromosome 3 and diverged grass species. Genome Res. 15: 12841291. Burr, B., Burr, F.A., Thompson, K.H., Albertson, M.C., Stuber, C.W. 1988. Gene mapping with recombinant inbreds in maize. Genetics 118: 519526. Cannon, S.B., Crow, J.A., Heuer, M.L., Wang, X., Cannon, E.K., Dwan, C., Lamblin, A.F., Vasdewani, J., Mudge, J., Cook, A. et al. 2005. Databases and information integration for the Medicago truncatula genome and transcriptome. Plant Physiol 138: 3846. Castelo, A.T., Martins, W., Gao, G.R. 2002. TROLLTandem Repeat Occurrence Locator. Bioinformatics 18: 634636. Chomet, P.S., Wessler, S., Dellaporta, S.L. 1987. Inactivation of the maize transposable element Activator (Ac) is associated with its DNA modification. EMBO J 6: 295302.[Medline] Clark, R.M., Linton, E., Messing, J., Doebley, J.F. 2004. Pattern of diversity in the genomic region near the maize domestication gene tb1. Proc. Natl. Acad. Sci. 101: 700707. Das, O.P. and Messing, J. 1994. Variegated phenotype and developmental methylation changes of a maize allele originating from epimutation. Genetics 136: 11211141.[Abstract] Devos, K.M., Brown, J.K., Bennetzen, J.L. 2002. Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res. 12: 10751079. Fu, H. and Dooner, H.K. 2002. Intraspecific violation of genetic colinearity and its implications in maize. Proc. Natl. Acad. Sci. 99: 95739578. Fu, Y., Hsia, A.P., Guo, L., Schnable, P.S. 2004. Types and frequencies of sequencing errors in methyl-filtered and high c0t maize genome survey sequences. Plant Physiol 135: 20402045. Gale, M.D. and Devos, K.M. 1998. Comparative genetics in the grasses. Proc. Natl. Acad. Sci. 95: 19711974. Gardiner, J., Schroeder, S., Polacco, M.L., Sanchez-Villeda, H., Fang, Z., Morgante, M., Landewe, T., Fengler, K., Useche, F., Hanafey, M. et al. 2004. Anchoring 9,371 maize expressed sequence tagged unigenes to the bacterial artificial chromosome contig map by two-dimensional overgo hybridization. Plant Physiol 134: 13171326. Gremme, G., Brendel, V., Sparks, M.E., Kurtz, S. 2005. Engineering a software tool for gene structure prediction in higher organisms. Inf. Softw. Technol 47: 965978.[CrossRef] Haberer, G., Young, S., Bharti, A.K., Gundlach, H., Raymond, C., Fuks, G., Butler, E., Wing, R.A., Rounsley, S., Birren, B. et al. 2005. Structure and architecture of the maize genome. Plant Physiol 139: 16121624. Hosouchi, T., Kumekawa, N., Tsuruoka, H., Kotani, H. 2002. Physical map-based sizes of the centromeric regions of Arabidopsis thaliana chromosomes 1, 2, and 3. DNA Res. 9: 117121.[Abstract] Ilic, K., SanMiguel, P.J., Bennetzen, J.L. 2003. A complex history of rearrangement in an orthologous region of the maize, sorghum, and rice genomes. Proc. Natl. Acad. Sci. 100: 1226512270. International Human Genome Sequencing Consortium. 2004. Finishing the euchromatic sequence of the human genome. Nature 431: 931945.[CrossRef][Medline] International Rice Genome Sequencing Project. 2005. The map-based sequence of the rice genome. Nature 436: 793800.[CrossRef][Medline] Jaffe, D.B., Butler, J., Gnerre, S., Mauceli, E., Lindblad-Toh, K., Mesirov, J.P., Zody, M.C., Lander, E.S. 2003. Whole-genome sequence assembly for mammalian genomes: Arachne 2. Genome Res. 13: 9196. Kapitonov, V.V. and Jurka, J. 2001. Rolling-circle transposons in eukaryotes. Proc. Natl. Acad. Sci. 98: 87148719. Kellis, M., Birren, B.W., Lander, E.S. 2004. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617624.[CrossRef][Medline] Lai, J., Dey, N., Kim, C.S., Bharti, A.K., Rudd, S., Mayer, K.F., Larkins, B.A., Becraft, P., Messing, J. 2004a. Characterization of the maize endosperm transcriptome and its comparison to the rice genome. Genome Res. 14: 19321937. Lai, J., Ma, J., Swigonova, Z., Ramakrishna, W., Linton, E., Llaca, V., Tanyolac, B., Park, Y.J., Jeong, O.Y., Bennetzen, J.L. et al. 2004b. Gene loss and movement in the maize genome. Genome Res. 14: 19241931. Lai, J., Li, Y., Messing, J., Dooner, H.K. 2005. Gene movement by Helitron transposons contributes to the haplotype variability of maize. Proc. Natl. Acad. Sci. 102: 90689073. Lal, S.K., Giroux, M.J., Brendel, V., Vallejos, C.E., Hannah, L.C. 2003. The maize genome contains a helitron insertion. Plant Cell 15: 381391. Lee, M., Sharopova, N., Beavis, W.D., Grant, D., Katt, M., Blair, D., Hallauer, A. 2002. Expanding the genetic map of maize with the intermated B73 x Mo17 (IBM) population. Plant Mol. Biol. 48: 453461.[CrossRef][Medline] Lukashin, A.V. and Borodovsky, M. 1998. GeneMark.hmm: New solutions for gene finding. Nucleic Acids Res. 26: 11071115. Lund, G., Messing, J., Viotti, A. 1995a. Endosperm-specific demethylation and activation of specific alleles of Lund, G., Prem Das, O., Messing, J. 1995b. Tissue-specific DNase I-sensitive sites of the maize P gene and their changes upon epimutation. Plant J 7: 797807.[CrossRef] Ma, J. and Bennetzen, J.L. 2004. Rapid recent growth and divergence of rice nuclear genomes. Proc. Natl. Acad. Sci. 101: 1240412410. Ma, J., Devos, K.M., Bennetzen, J.L. 2004. Analyses of LTR-retrotransposon structures reveal recent and rapid genomic DNA loss in rice. Genome Res. 14: 860869. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||