|
|
|
|
Published online before print
August 18, 2005, 10.1101/gr.3869505 Genome Res. 15:1284-1291, 2005 ©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00
Letter Sequence, annotation, and analysis of synteny between rice chromosome 3 and diverged grass speciesThe Rice Chromosome 3 Sequencing Consortium1,2
Rice (Oryza sativa L.) chromosome 3 is evolutionarily conserved across the cultivated cereals and shares large blocks of synteny with maize and sorghum, which diverged from rice more than 50 million years ago. To begin to completely understand this chromosome, we sequenced, finished, and annotated 36.1 Mb ( 97%) from O. sativa subsp. japonica cv Nipponbare. Annotation features of the chromosome include 5915 genes, of which 913 are related to transposable elements. A putative function could be assigned to 3064 genes, with another 757 genes annotated as expressed, leaving 2094 that encode hypothetical proteins. Similarity searches against the proteome of Arabidopsis thaliana revealed putative homologs for 67% of the chromosome 3 proteins. Further searches of a nonredundant amino acid database, the Pfam domain database, plant Expressed Sequence Tags, and genomic assemblies from sorghum and maize revealed only 853 nontransposable element related proteins from chromosome 3 that lacked similarity to other known sequences. Interestingly, 426 of these have a paralog within the rice genome. A comparative physical map of the wild progenitor species, Oryza nivara, with japonica chromosome 3 revealed a high degree of sequence identity and synteny between these two species, which diverged 10,000 years ago. Although no major rearrangements were detected, the deduced size of the O. nivara chromosome 3 was 21% smaller than that of japonica. Synteny between rice and other cereals using an integrated maize physical map and wheat genetic map was strikingly high, further supporting the use of rice and, in particular, chromosome 3, as a model for comparative studies among the cereals.
As one of the world's most important food crops, rice has emerged as the leading experimental model for functional and evolutionary genomics of cereals. Rice belongs to the genus Oryza, which is composed of 23 species divided into 10 genome types. Cultivated rice is classified as an AA diploid genome (2n = 24) and has two cultivated species, Oryza sativa and Oryza glaberimma, and six wild relatives. The wild AA genome relatives of cultivated rice are rich sources for new alleles for crop improvement, but their usage has been limited because of sterility of the F1 hybrids. O. sativa is thought to have originated from the annual plant type Oryza nivara, which was derived from the perennial Oryza rufipogon. Because of its agronomic importance, compact 430-Mb genome, 50 million years of shared evolutionary history with other larger genome cereals (e.g., maize, wheat, and barley), transformation competency, and abundance of robust molecular genetic resources, the scientific community selected rice (Oryza sativa subsp. japonica cv Nipponbare; hereon referred to as japonica) to be completely sequenced. Currently, whole genome draft sequences of rice subspecies japonica and indica (Goff et al. 2002
As part of the International Rice Genome Sequencing Project (IRGSP), our consortium sequenced japonica chromosome 3 using a clone-by-clone approach. Cytologically, chromosome 3 is the second largest rice chromosome and is one of the most euchromatic chromosomes (Cheng et al. 2001
Although Oryza separated from maize and sorghum
Structural features of japonica chromosome 3 A total of 323 BAC/P1 artificial chromosome (PAC) clones, 44.6 Mb in total, were used to construct a 36.1-Mb pseudomolecule (virtual contig) (Table 1; Fig. 1). The pseudomolecule spans the majority of the chromosome, with five physical gaps present in the two arms, gaps at each telomere, and one gap in the centromere. The centromere of chromosome 3 (CEN3) contains 180 kb of the CentO satellite repeats (Cheng et al. 2002 1.26 Mb. Clones to close one gap in the tiling path have been identified and are currently being sequenced. However, all attempts to identify additional clones from four BAC/PAC libraries failed, suggesting an under-representation of these sequences in existing libraries. In total, the short arm (3S) is 19.4 Mb, whereas the long arm (3L) is 16.7 Mb. The arm lengths are consistent with pachytene chromosomal measurements of chromosome 3 (Cheng et al. 2001
The total amount of repetitive sequence present on rice chromosome 3 is 21.4%, with transposable elements (TE) accounting for >90% of the repetitive sequences (Supplemental Table 1). The TEs are predominated with retrotransposons and miniature inverted repeat transposable elements (MITEs). Segments of retrotransposons (2745 segments in total) are present on the chromosome, totaling 3.38 Mb. In comparison, 11,865 MITEs totaling 2.29 Mb are present on chromosome 3. Another feature of chromosome 3 includes the insertions of organellar sequences. In total, 57 (37.8 kb) and 70 (27.6 kb) sequences with significant similarity to the rice chloroplast and mitochondrion, respectively, are present.
Functional features of japonica chromosome 3
Using EST frequency to assess transcription levels, a clear reduction of expression was apparent near the centromere (Fig. 1). A detailed examination of expression in six tissues did not reveal a tissue-specific pattern of expression along the chromosome (Supplemental Fig. 1). In contrast to the reduced frequency of expressed genes at the centromere, there was a clear enrichment of transposable elements, with the exception of MITEs, present near the centromere (Fig. 1). This general pattern has also been observed in Arabidopsis (CSHL/WUGSC/PE Biosystems Arabidopsis Sequencing Consortium 2000 The predicted proteome of japonica chromosome 3 is similar to that of other published rice chromosomes. Searches against the predicted proteome of Arabidopsis thaliana with the deduced protein sequences from the 6232 gene models revealed 4201 rice proteins (67%) with a putative ortholog/paralog in the Arabidopsis proteome as defined by a BLASTP E-value of <10-5. A reduced frequency of homology, 12%-41% of the rice proteins with a putative ortholog, was apparent between japonica chromosome 3 and other model organisms such as Escherichia coli, Synechocystis, yeast, fly, worm, and human (Supplemental Table 2). In addition to searches against model organisms, we ascertained the presence of Pfam domains in the predicted rice chromosome 3 proteome. Excluding TE-related proteins, a total of 2462 proteins contained at least one Pfam domain, with 929 Pfam domains represented within the rice chromosome 3 proteome. The most prevalent non-TE related Pfam domain is protein kinase with 127 members. The most prevalent Pfam domains on rice chromosome 3 are presented in Supplemental Table 3. Construction of paralogous families using a combination of Pfam domains and BLASTP similarity revealed 662 paralogous families representing 2807 proteins. The distribution of proteins in paralogous families is shown in Supplemental Figure 2 with a majority of the proteins belonging to a paralogous family that contained only two members. The largest paralogous family, with 85 members, encodes protein kinase.
To ascertain the number of proteins that could be potentially novel to rice, we assessed the predicted japonica chromosome 3 proteome for similarity to predicted proteins from other genomes and/or the presence of a Pfam domain (Supplemental Fig. 3). We first excluded all TE-related proteins from our analysis. Of the 5317 non-TE-related proteins, only 1377 did not exhibit sequence similarity to an Arabidopsis protein or a non-rice sequence present in GenBank (BLASTP E-value cutoff of 10-5) or contain a Pfam domain above the trusted cutoff. Of these remaining 1377 proteins, 67 had an EST match to rice (>97% identity), suggesting these are expressed genes. Another 42 and 38 proteins matched a monocot or dicot EST, respectively. A search of assembled gene-enriched sequences from maize (Whitelaw et al. 2003 Further annotation of the predicted proteome was obtained through assignment of Plant GOSlim ontologies. A total of 2250 of the 6232 predicted proteins (36.1%) could be assigned a minimum of one GO term with 8897 GO terms assigned in total, of which 3982 were biological process, 3737 were molecular function, and 1178 were cellular component terms. Molecular function Plant GOSlim terms with at least 20 assignments are shown in Supplemental Figure 4. A search of databases that contain genomic DNA sequences flanking tagged insertion sites (FSTs; Tos17, Ac/Ds, T-DNA) revealed a total of 3977 insertions in chromosome 3, of which 2798 could be aligned only with rice chromosome 3. Of the 3977 FST insertions, 2787 were either in a gene or within 500 bp of a gene providing a candidate insertion line for 1146 genes (19.4 % of the 5915 genes) on rice chromosome 3.
Comparison with O. nivara
Reconstructed contigs and coverage
Insertion/deletion analysis
Polymorphic SSRs Among molecular marker systems, simple sequence repeats (SSRs) have been used successfully not only to detect genetic variation within or between species but also to develop molecular markers tightly linked to agronomically important traits in breeding programs. Since O. nivara is an important source of new genes and allelic variation for the improvement of cultivated rice, we can use the O. nivara chromosome 3 contig/BES map described above to construct a virtual SSR map between japonica and O. nivara that can be used for high-resolution marker-assisted selection in breeding populations.
Our analysis of japonica chromosome 3 revealed a total of 6489 SSRs using RepeatMasker with default parameters. Among those, the trinucleotide repeat was the most abundant repeat class and accounts for
The O. nivara BES showed similar SSR composition with that of japonica chromosome 3 except for a 2% increase in dinucleotide repeats and a 2% decrease in tri- and pentanucleotide repeats (Supplemental Table 6). We detected an To detect polymorphic SSRs on chromosome 3 between japonica and O. nivara, we compared the SSR content of 326 mapped O. nivara BES containing SSRs to their corresponding regions in japonica. Of those, 206 SSRs showed the same motif and length (length difference ± 2 bp considered the same length), 118 SSRs representing 87 unique loci showed different SSR length, and two SSRs were present only in the O. nivara sequence. Trinucleotide and dinucleotide repeats were the predominant polymorphic SSRs with 49 and 32 SSRs, respectively. CGG/CCG was the most abundant polymorphic SSR type (25 loci), and GA/TC and TA motifs represented 15 loci each in the chromosome. The base pair positions and SSR compositions of these 118 polymorphic SSRs are shown in detail in Supplemental Table 7.
Synteny between rice and other major cereals
Although Goff et al. (2002
The hexaploid nature of bread wheat can tolerate the large deletion of a chromosome fragment. A collection of deletion lines has allowed for the establishment of an overlapping cytogenetically based physical map of wheat (Werner et al. 1992
Conclusions
Chromosome 3 was sequenced using a clone-by-clone approach in which large insert BAC and PAC clones were selected for sequencing from multiple libraries (Baba et al. 2000 8-10-fold sequence redundancy, and gaps were closed using a combination of resequencing, alternative chemistries, microlibraries, and/or transposon-mediated sequencing. The pseudomolecule, or virtual contig for the chromosome, was constructed by aligning the BAC/PAC clones and trimming overlapping regions. The complete sequence of the pseudomolecule is available in Gen-Bank (accession no. DP000009
[GenBank]
).
Genes and gene models were identified through computational methods using the ab initio gene prediction program FGENESH (Salamov and Solovyev 2000
The pseudomolecule was further annotated for features such as repetitive sequences, transcription activity, and distribution of FSTs. Repetitive sequences were identified on japonica chromosome 3 using RepeatMasker (http://www.repeatmasker.org) with the TIGR Oryza Repeat Database and quantitated using a 100-kb window. To identify transcriptional activity, the pseudomolecule was searched against the TIGR Rice Gene Index (Quackenbush et al. 2001
For synteny analysis between maize and rice, we used the integrated physical and genetic maps of maize. The maize FPC physical map covers
For reconstruction of O. nivara chromosome 3, we used the O. nivara FPC map and BES (Wing et al. 2005
The Institute for Genomic Research (TIGR) C. Robin Buell,2,3 Qiaoping Yuan,3 Shu Ouyang,3 Jia Liu,3 Wei Zhu,3 Aihui Wang,3 Rama Maiti,3 Brian Haas,3 Jennifer Wortman,3 Mihaela Pertea,3 Kristine M. Jones,3 Mary Kim,3 Larry Overton,3 Tamara Tsitrin,3 Douglas Fadrosh,3 Jayati Bera,3 Bruce Weaver,3 Shaohua Jin,3 Shivani Johri,3 Matt Reardon,3 Kristen Webb,3 Jessica Hill,3 Kelly Moffat,3 Luke Tallon,3 Susan Van Aken,3 Matthew Lewis,3 Teresa Utterback,3 Tamara Feldblyum,3 Victoria Zismann,3 Stacey Iobst,3 Joseph Hsiao,3 Aymeric R. de Vazeille,3 Steven L. Salzberg,3 Owen White,3 and Claire Fraser3
Arizona Genomics Institute (AGI)
Clemson University Genomics Institute (CUGI)
Arizona Genomics Computational Laboratory (AGCoL)
Cold Spring Harbor Laboratory (CSHL)
Washington University School of Medicine Genome Sequencing Center (WUGSC)
University of Wisconsin
Purdue University
Funding for work on rice chromosome 3 was provided by grants from the U.S. Department of Agriculture Cooperative State Research, Education, and Extension Service (Grants 99-35317-8275, 2003-35317-13173 to C.R.B.; Grants 99-35317-8505, 2002-35317-12414 to R.A.W.); the National Science Foundation (Grants DBI998282, DBI0321538 to C.R.B.; Grant DBI0241181 to R.A.W.); and the U.S. Department of Energy (Grant DE-FG02-99ER20357 to C.R.B.; Grant DE-FG03-02ER15363 to R.A.W.). C.R.B. acknowledges the assistance of the TIGR Sequencing Facility, the TIGR Informatics Department, the TIGR IT Group, and the J. Craig Venter Joint Technology Center. R.A.W. acknowledges the assistance of the AGI and CUGI Sequencing, Physical Mapping, BAC/EST Resource Centers, and the Arizona Genomics Computational Laboratory. W.R.M. acknowledges the assistance of the CSHL sequencing and informatics groups. We thank the Japanese Ministry of Agriculture, Forestry and Fishery (MAFF) for genetic markers used to develop initial seed BACs for sequencing.
1 A complete list of authors appears at the end of this manuscript.
2 Corresponding authors. [Supplemental material is available online at www.genome.org. The chromosome 3 pseudomolecule sequence data from this study has been submitted to GenBank under accession no. DP000009 [GenBank] .] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3869505. Article published online before print in August 2005.
3 The Institute for Genomic Research, Rockville, Maryland 20850, USA.
4 Arizona Genomics Institute (AGI), Department of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, Arizona 85721, USA.
5 Clemson University Genomics Institute (CUGI), Clemson University, Clemson, South Carolina 29634, USA.
6 Arizona Genomics Computational Laboratory (AGCoL), Department of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, Arizona 85721, USA.
7 Cold Spring Harbor Laboratory (CSHL), Cold Spring Harbor, New York 11723, USA.
8 Washington University School of Medicine Genome Sequencing Center (WUGSC), St. Louis, Missouri 63108, USA.
9 University of Wisconsin, Department of Horticulture, Madison, Wisconsin 53706, USA.
10 Purdue University, Department of Agronomy, West Lafayette, Indiana 47907, USA.
Ahn, S., Anderson, J.A., Sorrells, M.E., and Tanksley, S.D. 1993. Homoeologous relationships of rice, wheat and maize chromosomes. Mol. Gen. Genet. 241: 483-490.[CrossRef][Medline] Altschul, S.F., Gish, W., Miller, W., Meyers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410.[CrossRef][Medline] Baba, T., Katagiri, S., Tanoue, H., Tanaka, R., Chiden, Y., Saji, S., Hamada, M., Nakashima, M., Okamoto, M., Hayashi, M., et al. 2000. Construction and characterization of rice genomic libraries: PAC library of japonica variety, Nipponbare and BAC library of indica variety, Kasalath. Bull. NIAR 14: 41-49.
Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., et al. 2004. The Pfam protein families database. Nucleic Acids Res. 32: D138-D141.
Chen, M., Presting, G., Barbazuk, W.B., Goicoechea, J.L., Blackmon, B., Fang, G., Kim, H., Frisch, D., Yu, Y., Sun, S., et al. 2002. An integrated physical and genetic map of the rice genome. Plant Cell 14: 537-545.
Cheng, Z., Buell, C.R., Wing, R.A., Gu, M., and Jiang, J. 2001. Toward a cytological characterization of the rice genome. Genome Res. 11: 2133-2141.
Cheng, Z., Dong, F., Langdon, T., Ouyang, S., Buell, C.R., Gu, M., Blattner, F.R., and Jiang, J. 2002. Functional rice centromeres are marked by a satellite repeat and a centromere-specific retrotransposon. Plant Cell 14: 1691-1704.
Coe, E., Cone, K., McMullen, M., Chen, S.S., Davis, G., Gardiner, J., Liscum, E., Polacco, M., Paterson, A., Sanchez-Villeda, H., et al. 2002. Access to the maize genome: An integrated physical and genetic map. Plant Physiol. 128: 9-12. The Cold Spring Harbor Laboratory, Washington University Genome Sequencing Center, and PE Biosystems Arabidopsis Sequencing Consortium. 2000. The complete sequence of a heterochromatic island from a higher eukaryote. Cell 100: 377-386.[CrossRef][Medline] Feng, Q., Zhang, Y., Hao, P., Wang, S., Fu, G., Huang, Y., Li, Y., Zhu, J., Liu, Y., Hu, X., et al. 2002. Sequence and analysis of rice chromosome 4. Nature 420: 316-320.[CrossRef][Medline]
Feuillet, C. and Keller, B. 1999. High gene density is conserved at syntenic loci of small and large grass genomes. Proc. Natl. Acad. Sci. 96: 8265-8270.
Gale, M.D. and Devos, K.M. 1998. Comparative genetics in the grasses. Proc. Natl. Acad. Sci. 95: 1971-1974.
Goff, S.A., Ricke, D., Lan, T.H., Presting, G., Wang, R., Dunn, M., Glazebrook, J., Sessions, A., Oeller, P., Varma, H., et al. 2002. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296: 92-100. Greco, R., Ouwerkerk, P.B., Taal, A.J., Favalli, C., Beguiristain, T., Puigdomenech, P., Colombo, L., Hoge, J.H., and Pereira, A. 2001. Early and multiple Ac transpositions in rice suitable for efficient insertional mutagenesis. Plant Mol. Biol. 46: 215-227.[CrossRef][Medline]
Haas, B.J., Delcher, A.L., Mount, S.M., Wortman, J.R., Smith Jr., R.K., Hannick, L.I., Maiti, R., Ronning, C.M., Rusch, D.B., Town, C.D., et al. 2003. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31: 5654-5666.
Harushima, Y., Yano, M., Shomura, A., Sato, M., Shimano, T., Kuboki, Y., Yamamoto, T., Lin, S.Y., Antonio, B.A., Parco, A., et al. 1998. A high-density rice genetic linkage map with 2275 markers using a single F2 population. Genetics 148: 479-494. Kantety, R.V., La Rota, M., Matthews, D.E., and Sorrells, M.E. 2002. Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Mol. Biol. 48: 501-510.[CrossRef][Medline] Kim, C.M., Piao, H.L., Park, S.J., Chon, N.S., Je, B.I., Sun, B., Park, S.H., Park, J.Y., Lee, E.J., Kim, M.J., et al. 2004. Rapid, large-scale generation of Ds transposant lines and analysis of the Ds insertion sites in rice. Plant J. 39: 252-263.[CrossRef][Medline] Lippman, Z., Gendrel, A.V., Black, M., Vaughn, M.W., Dedhia, N., McCombie, W.R., Lavine, K., Mittal, V., May, B., Kasschau, K.D., et al. 2004. Role of transposable elements in heterochromatin and epigenetic control. Nature 430: 471-476.[CrossRef][Medline]
Lowe, T.M. and Eddy, S.R. 1997. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25: 955-964. Luo, M.C., Thomas, C., You, F.M., Hsiao, J., Ouyang, S., Buell, C.R., Malandro, M., McGuire, P.E., Anderson, O.D., and Dvorak, J. 2003. High-throughput fingerprinting of bacterial artificial chromosomes using the snapshot labeling kit and sizing of restriction fragments by capillary electrophoresis. Genomics 82: 378-389.[CrossRef][Medline]
Ma, J. and Bennetzen, J.L. 2004. Rapid recent growth and divergence of rice nuclear genomes. Proc. Natl. Acad. Sci. 101: 12404-12410.
Ma, J., Devos, K.M., and Bennetzen, J.L. 2004. Analyses of LTR-retrotransposon structures reveal recent and rapid genomic DNA loss in rice. Genome Res. 14: 860-869.
Miyao, A., Tanaka, K., Murata, K., Sawaki, H., Takeda, S., Abe, K., Shinozuka, Y., Onosato, K., and Hirochika, H. 2003. Target site specificity of the Tos17 retrotransposon shows a preference for insertion within genes and against insertion in retrotransposon-rich regions of the genome. Plant Cell 15: 1771-1780. Moore, G., Devos, K.M., Wang, Z., and Gale, M.D. 1995. Cereal genome evolution: Grasses, line up and form a circle. Curr. Biol. 5: 737-739.[CrossRef][Medline] Morgante, M., Hanafey, M., and Powell, W. 2002. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat. Genet. 30: 194-200.[CrossRef][Medline]
Ouyang, S. and Buell, C.R. 2004 The TIGR Plant Repeat Databases: A collective resource for the identification of repetitive sequences in plants. Nucleic Acids Res. 32: D360-D363.
Qi, L.L., Echalier, B., Chao, S., Lazo, G.R., Butler, G.E., Anderson, O.D., Akhunov, E.D., Dvorak, J., Linkiewicz, A.M., Ratnasiri, A., et al. 2004. A chromosome bin map of 16,000 expressed sequence tag loci and distribution of genes among the three genomes of polyploid wheat. Genetics 168: 701-712.
Quackenbush, J., Cho, J., Lee, D., Liang, F., Holt, I., Karamycheva, S., Parvizi, B., Pertea, G., Sultana, R., and White, J. 2001. The TIGR Gene Indices: Analysis of gene transcript sequences in highly sampled eukaryotic species. Nucleic Acids Res. 29: 159-164.
The Rice Chromosome 10 Sequencing Consortium. 2003. In-depth view of structure, activity, and evolution of rice Chromosome 10. Science 300: 1566-1569.
Salamov, A.A. and Solovyev, V.V. 2000. Ab initio gene finding in Drosophila genomic DNA. Genome Res. 10: 516-522. Sallaud, C., Gay, C., Larmande, P., Bes, M., Piffanelli, P., Piegu, B., Droc, G., Regad, F., Bourgeois, E., Meynard, D., et al. 2004. High throughput T-DNA insertion mutagenesis in rice: A first step towards in silico reverse genetics. Plant J. 39: 450-464.[CrossRef][Medline] Sasaki, T., Matsumoto, T., Yamamoto, K., Sakata, K., Baba, T., Katayose, Y., Wu, J., Niimura, Y., Cheng, Z., Nagamura, Y., et al. 2002. The genome sequence and structure of rice chromosome 1. Nature 420: 312-316.[CrossRef][Medline] Soderlund, C., Humphray, S., Dunham, A., and French, L. 2002 Contigs built with fingerprints, markers, and FPC V4.7. Genome Res. 10: 1772-1787.
Sorrells, M.E., La Rota, M., Bermudez-Kandianis, C.E., Greene, R.A., Kantety, R., Munkvold, J.D., Miftahudin, M.A., Ma, X., Gustafson, P.J., Qi, L.L., et al. 2003. Comparative DNA sequence analysis of wheat and rice genomes. Genome Res. 13: 1818-1827.
Werner, J.E., Endo, T.R., and Gill, B.S. 1992. Toward a cytogenetically based physical map of the wheat genome. Proc. Natl. Acad. Sci. 89: 11307-11311.
Whitelaw, C.A., Barbazuk, W.B., Pertea, G., Chan, A.P., Cheung, F., Lee, Y., Zheng, L., van Heeringen, S., Karamycheva, S., Bennetzen, J.L., et al. 2003. Enrichment of gene-coding sequences in maize by genome filtration. Science 302: 2118-2120. Wing, R.A., Ammiraju, J.S.S., Luo, M., Kim, H.R., Yu, Y., Kudrna, D., Goicoechea, J.L., Wang, W., Nelson, W., Rao, K., et al. 2005. The Oryza Map Alignment Project: The golden path to unlocking the genetic potential of wild rice species. Plant Mol. Biol. (in press).
Wortman, J.R., Haas, B.J., Hannick, L.I., Smith Jr., R.K., Maiti, R., Ronning, C.M., Chan, A.P., Yu, C., Ayele, M., Whitelaw, C.A., et al. 2003. Annotation of the Arabidopsis genome. Plant Physiol. 132: 461-468.
Wu, J., Maehara, T., Shimokawa, T., Yamamoto, S., Harada, C., Takazaki, Y., Ono, N., Mukai, Y., Koike, K., Yazaki, J., et al. 2002. A comprehensive rice transcript map containing 6591 expressed sequence tag sites. Plant Cell 14: 525-535. Yu, J., Wang, J., Lin, W., Li, S., Li, H., Zhou, J., Ni, P., Dong, W., Hu, S., Zeng, C., et al. 2005. The genomes of Oryza sativa: A history of duplications. PLoS Biol. 3: e38.[CrossRef][Medline] Yuan, Q., Hill, J., Hsiao, J., Moffat, K., Ouyang, S., Cheng, Z., Jiang, J., and Buell, C.R. 2002. Genome sequencing of a 239 kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast insertion. Mol. Genet. Genomics 267: 713-720.[CrossRef][Medline]
ftp://ftp.tigr.org/pub/data/rice/rice.chr03.v2.1/; annotation data for chromosome 3. http://www.agcol.arizona.edu; Arizona Computational Genomics Lab. http://www.agcol.arizona.edu/symap; SyMAP. http://www.cbs.dtu.dk/services/SignalP-2.0/; SignalP Server. http://www.cbs.dtu.dk/services/TMHMM/; TMHMM Server v 2.0. http://www.genome.arizona.edu; AGI/AGCoL. http://www.omap.org; Oryza Map Alignment Project. http://www.repeatmasker.org;: RepeatMasker. http://www.shigen.nig.ac.jp/rice/oryzabase/top/top.jsp; Oryzabase Database. http://www.tigr.org/tdb/e2k1/ath1; TIGR Arabidopsis. http://www.tigr.org/tigr-scripts/tgi/T_index.cgi?species=rice; TIGR Rice Gene Index.
Received March 3, 2005; accepted in revised format May 17, 2005. This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||