|
|
|
|
Vol. 9, Issue 9, 825-829, September 1999
LETTER
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
With the genomic sequencing of Arabidopsis nearing completion and rice sequencing very much in its infancy, a key question is whether we can exploit the Arabidopsis sequence to identify candidate genes for traits in cereal crops using a map-based approach. This requires the existence of colinearity between the Arabidopsis and cereal genomes, represented by rice, which is readily detectable using currently available resources, that is, Arabidopsis genomic sequence, rice ESTs, and genetic and physical maps. A detailed study of the colinearity remaining between two small regions of Arabidopsis chromosome 1 and rice suggests that at least in these regions of the Arabidopsis genome, conservation of gene orders with rice has been eroded to the point that it is no longer identifiable using comparative mapping. Although our analysis does not preclude that tracts of colinear gene orders may be identified using sequence comparisons or may exist in other regions of the rice and Arabidopsis genomes, it is unlikely that the extent of colinearity will be sufficient to allow map-based cross-species gene prediction and isolation. Our research also highlights the difficulties encountered in identifying orthologs using BLAST searches in incomplete sequence databases. This complicates the interpretation of comparative data among highly divergent species and limits the exploitation of Arabidopsis sequence in monocot studies.
| |
INTRODUCTION |
|---|
|
|
|---|
Comparative genome analyses have shown the existence of conserved
gene orders (colinearity) in the genomes of different
plant and mammal species. In plants, this is best documented in the grass family, where colinearity has been maintained over evolutionary periods as long as 60 million years (Devos and Gale 1997
; Gale and
Devos 1998
). In mammals, the most comprehensive comparative maps are
available for human and mouse, which diverged ~70 million years ago
(Carver and Stubbs 1997
). Although short-range conserved synteny has
been demonstrated between the genomes of human and chicken (Klein et
al. 1996
) and human and pufferfish (Elgar et al. 1996
), which diverged
some 300 and 400 million years ago, respectively, conserved synteny
does not imply conservation of gene orders. Paterson et al. (1996)
predicted that 43%-58% of chromosomal tracts
3 cM should have
remained colinear over the evolutionary time period [130-240 million
years (Wolfe et al. 1989
; Crane et al. 1995
)] separating the monocots
and eudicots and provided some empirical mapping data to support this
hypothesis. With a large part of the Arabidopsis sequence
available, we aimed to investigate whether Arabidopsis-rice
colinearity can be identified and thus exploited using currently
available data and tools. The key issue is not the existence of
colinearity at the sequence level. It is clear that any colinearity
that can be detected only when the genomic sequence is available for
both rice and Arabidopsis will have limited applications. Once
the rice sequence is available, the exploitation of rice, and not
Arabidopsis, sequence will be the priority in cereal research.
In the absence of rice genomic sequence, a comparative genetic study of
the location in the rice genome of expressed sequence tags (ESTs) with
homology to genes from the top of Arabidopsis chromosome 1 failed to demonstrate that gene orders had remained conserved over the
monocot-eudicot divide to the extent that Arabidopsis sequence
could be exploited for the map-based identification and isolation of
genes underlying cereal traits. The study also highlights the practical
problems involved in establishing relationships between highly
divergent genomes.
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
To establish whether gene orders remained conserved over smaller
genetic distances, genes belonging to the same BAC and to BACs spaced
over two regions of maximum 3 cM of Arabidopsis chromosome 1 were selected for an Arabidopsis-rice comparative study (Fig. 1a). BLAST searches identified one or more rice ESTs
(n
200) for 53 of 128 annotated Arabidopsis
genes from five BAC clones. A further 47 hits were obtained when
querying the rice EST database with the complete sequence of five
nonannotated BACs (data available at
http://pgec-genome.pw.usda.gov/sequencing.html in January
1998). Initially, Arabidopsis genes for which no putative rice
homolog could be identified at the nucleotide level, were rescreened at the amino acid level. However, levels of homology between
Arabidopsis genes and rice ESTs that were identified at the
amino acid level only were generally too low for the hits to be
considered orthologs. BLAST searches at the amino acid level were
therefore not pursued further. Because it was expected that a number of
the rice hits obtained at the nucleotide level were also not
orthologous to the Arabidopsis query sequence, the rice ESTs
were rescreened against the Arabidopsis database. About 30%
of the identified rice ESTs displayed a higher degree of homology to
Arabidopsis BACs other than those originally used to select
them, indicating that they do not correspond to the
Arabidopsis genes in the target region.
|
Of the remaining rice ESTs, 33 were mapped to 33 loci on 10 of the 12 rice chromosomes (Fig. 1b). Their estimated copy number in rice, based on the number of hybridizing fragments, is given in Figure 1a. For probes that detected weak fragments in addition to a strongly hybridizing fragment, only the map position of the latter was included in the analysis. Sixty-seven percent of these loci mapped to rice chromosomes 1, 2, 3, and 5. This could be defined as synteny strictu sensu, that is, genes lying on the same chromosomes, without making assumptions about genetic linkage. Potential regions of conserved synteny could be observed, for example, between contigs 1 and 2 of Arabidopsis chromosome 1 and rice chromosome 5 (Fig. 1b). Nevertheless, within these regions, little evidence was found for conserved gene orders.
Elsewhere on rice chromosome arm 8S, two markers from contigs 2 (E60275) and 3 (S20913), which spanned a genetic distance of 1.2 cM in
Arabidopsis, detected closely linked (0.3 cM) loci (Fig. 1).
No conserved positions in rice were found, however, for the other
Arabidopsis genes located in this apparently conserved region.
Assuming that this linkage is a remnant of ancestral gene associations,
our data suggest that the conservation between monocot and eudicot
species of ~50 % of 3-cM intervals, as suggested by Paterson et al.
(1996)
, may be an overestimate. A level of conserved synteny, as
identified between Arabidopsis chromosome 1 and rice, has
limited applications in map-based gene prediction.
It thus appears that the 130-240 million years that separate Arabidopsis and rice have largely eroded close linkages, at least in the area under investigation. One can argue that although a number of nonorthologs were discarded based on the results of a reciprocal BLAST search, no evidence is available to suggest that the remaining rice ESTs are orthologous to the Arabidopsis target genes on chromosome 1. Rice ESTs were initially selected mainly on the basis of 5' homology. To estimate the extent of homology between the rice and the corresponding Arabidopsis genes more accurately, 17 of the mapped rice ESTs were also sequenced from the 3' end, and these 3' ends were subjected to BLAST searches against the Arabidopsis database. Nine clones showed a level of homology with both the 5' and 3' ends to the BAC originally used to select the rice ESTs (Fig. 1a). For one rice EST, different BACs were identified with the 3' and 5' end sequences, whereas the 3' ends of the remaining clones displayed no homology to Arabidopsis BAC sequences. Lack of homology, especially when the 3' end sequence consists of <400 bp, may be explained by the presence of a 3'-untranslated region, which is unlikely to have remained conserved between Arabidopsis and rice. However, it is possible that a number of these BLAST hits represent domain homologies rather than gene homologies.
To determine the level of error introduced in BLAST-based comparative analyses, correspondence of the hybridization patterns of 11 Arabidopsis exons and their putative rice homologs were used to evaluate orthology between the identified Arabidopsis and rice sequences. From the seven genes that cross-hybridized, two produced a pattern different than that of the corresponding rice ESTs. Good correspondence between the hybridization patterns was obtained for the remaining five Arabidopsis sequences and rice ESTs, all of which displayed homology with Arabidopsis at both the 5' and 3' ends. However, for genes belonging to multigene families or displaying different copy numbers in rice and Arabidopsis, paralogous loci could have been mapped in the two species. From the nine mapped rice ESTs that displayed good homology at both the 5' and 3' ends with Arabidopsis, only three produced single copy patterns in both rice and Arabidopsis. For two further ESTs, a comparison of the relative signal strengths of the hybridizing fragments in Columbia (the ecotype used to construct the Arabidopsis BAC library), in the target BAC, and in rice confirmed that orthologous loci had been mapped in Arabidopsis and rice (Fig. 1a). No conclusions could be drawn for the other four genes. Although this stringent selection retained the two loci that were linked in both Arabidopsis and rice, colinearity was not maintained for an additional locus from the same BAC (Fig. 1a). Two loci from two other BACs that mapped <3 cM apart were also not linked in rice (Fig. 1a).
In addition to demonstrating a lack of identifiable genome conservation between the top of Arabidopsis chromosome 1 and rice using currently available comparative mapping tools, our study highlighted the need for careful interpretation of comparative data between species as divergent as monocots and eudicots. The alignment of conserved domains in nonorthologous genes in BLAST queries and the identification of different members of multigene families may confound relationships. Although this would most likely lead to an underestimation of the extent of synteny, the alignment of nonorthologous genes following a BLAST search in a database biased toward genes with nonrandom genome distribution could provide apparent support for conserved relationships. Based on the stringency of the parameters used in defining genome conservation, different investigators may therefore come to varying conclusions.
A fuller picture of the precise relationship between the Arabidopsis and rice genomes will emerge as more sequence data become available. However, even if tracts of conserved gene orders do exist between the two model species at the DNA sequence level, the fact that they are not readily identifiable using currently available comparative mapping tools will greatly diminish the impact of comparative knowledge between Arabidopsis and rice on grass genome analyses. The exploitation of rice rather than Arabidopsis genomic sequence will then be the priority in cereal research. Nevertheless, for certain applications such as the identification of orthologous relationships between different members of a gene family and the study of positional effects on gene function, the relative position of genes in the Arabidopsis and rice genomes will continue to be important.
| |
METHODS |
|---|
|
|
|---|
Homology Searches
A BLAST search was conducted at the nucleotide level with the Arabidopsis gene sequences from five annotated BAC clones from the top of Arabidopsis chromosome 1 (F21M12, F7G19, F19G10, T26J12, and F21J9; Fig. 1a) against the rice EST database of the Japanese Rice Genome Programme (RGP), which contains mainly 5' end sequences. For the remaining BACs (Fig. 1a), the complete sequence was used in BLAST searches. Reciprocal BLAST searches using selected rice ESTs as query against the Arabidopsis thaliana Database (http://genome-www.stanford.edu/Arabidopsis/) were also carried out at the nucleotide level.
DNA Sequencing
Rice ESTs were sequenced from the 3' end by the dideoxy termination method using fluorescent primers on an Applied Biosystems ABI 377 automated DNA sequencer.
Plant Material
An F2 population of 186 plants from the Oryza
sativa cross Nipponbare (japonica) × Kasalath
(indica) was used for mapping in rice (Kurata et al. 1994
;
Harushima et al. 1998
). Additional mapping was carried out in a
population of 155 F2 progeny or their F3 families
from the cross IR20 (indica) × 6383 (japonica)
(Quarrie et al. 1997
). The A. thaliana ecotype Columbia was
used for cross-hybridization experiments.
Primers and Probes
Primers to rice and Arabidopsis sequences were designed using the program "Primer" (Whitehead Institute for Biomedical Research, Cambridge, MA). Rice ESTs were obtained from RGP, and Arabidopsis BACs were from The Arabidopsis Biological Resource Center (Columbus, OH). Arabidopsis probes were prepared by PCR amplification of exon sequences from BAC F21M12 using standard conditions.
Marker Analyses
DNA isolation and digestion, electrophoresis, and Southern blot
transfers were performed as described by Kurata et al. (1994)
and Sharp
et al. (1988)
, for use with chemiluminescent and radioisotope labeling
and detection systems, respectively. For chemiluminescent labeling and
detection of probes, the ECL-Direct kit (Amersham) was used according
to the supplier's instructions. Hybridization conditions following
radioactive labeling with 32P were performed as described by
Laurie et al. (1993)
. Restriction fragment length polymorphism (RFLP)
markers were added to the existing Nipponbare × Kasalath and
IR20 × 6383 genetic maps using the "try" command of the
program Mapmaker version 3.0 (Whitehead Institute for Biomedical Research).
Physical mapping of rice ESTs to YACs by PCR was carried out as
described by Umehara et al. (1996)
.
| |
ACKNOWLEDGMENTS |
|---|
K.M.D. was funded by the Biotechnology and Biological Sciences Research Council (BBSRC) through a David Phillips Research Fellowship. Part of the work was carried out at RGP under a STAFF Visiting Research Fellowship Program.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Corresponding author.
E-MAIL katrien.devos{at}bbsrc.ac.uk; FAX 44 1603 502 241.
| |
REFERENCES |
|---|
|
|
|---|
Received June 2, 1999; accepted in revised form July 21, 1999.
This article has been cited by other articles:
![]() |
R. Liu, C. Vitte, J. Ma, A. A. Mahama, T. Dhliwayo, M. Lee, and J. L. Bennetzen A GeneTrek analysis of the maize genome PNAS, July 10, 2007; 104(28): 11844 - 11849. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Nelson, R. L. Naylor, and M. M. Jahn The Role of Genomics Research in Improvement of "Orphan" Crops Crop Sci., November 1, 2004; 44(6): 1901 - 1904. [Full Text] [PDF] |
||||
![]() |
H. Kuittinen, A. A. de Haan, C. Vogl, S. Oikarinen, J. Leppala, M. Koch, T. Mitchell-Olds, C. H. Langley, and O. Savolainen Comparing the Linkage Maps of the Close Relatives Arabidopsis lyrata and A. thaliana Genetics, November 1, 2004; 168(3): 1575 - 1584. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Kellogg and J. L. Bennetzen The evolution of nuclear genome structure in seed plants Am. J. Botany, October 1, 2004; 91(10): 1709 - 1725. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Peng, H. Zadeh, G. R. Lazo, J. P. Gustafson, S. Chao, O. D. Anderson, L. L. Qi, B. Echalier, B. S. Gill, M. Dilbirligi, et al. Chromosome Bin Map of Expressed Sequence Tags in Homoeologous Group 1 of Hexaploid Wheat and Homoeology With Rice and Arabidopsis Genetics, October 1, 2004; 168(2): 609 - 623. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Munkvold, R. A. Greene, C. E. Bermudez-Kandianis, C. M. La Rota, H. Edwards, S. F. Sorrells, T. Dake, D. Benscher, R. Kantety, A. M. Linkiewicz, et al. Group 3 Chromosome Bin Maps of Wheat and Their Relationship to Rice Chromosome 1 Genetics, October 1, 2004; 168(2): 639 - 650. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. J. M. Koopman and G. Gort Significance Tests and Weighted Values for AFLP Similarities, Based on Arabidopsis in Silico AFLP Fragment Length Distributions Genetics, August 1, 2004; 167(4): 1915 - 1928. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhu, D.-J. Kim, J.-M. Baek, H.-K. Choi, L. C. Ellis, H. Kuester, W. R. McCombie, H.-M. Peng, and D. R. Cook Syntenic Relationships between Medicago truncatula and Arabidopsis Reveal Extensive Divergence of Genome Organization Plant Physiology, March 1, 2003; 131(3): 1018 - 1026. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Vandepoele, Y. Saeys, C. Simillion, J. Raes, and Y. Van de Peer The Automatic Detection of Homologous Regions (ADHoRe) and Its Application to Microcolinearity Between Arabidopsis and Rice Genome Res., November 1, 2002; 12(11): 1792 - 1801. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. TUBEROSA, S. SALVI, M. C. SANGUINETI, P. LANDI, M. MACCAFERRI, and S. CONTI Mapping QTLs Regulating Morpho-physiological Traits and Yield: Case Studies, Shortcomings and Perspectives in Drought-stressed Maize Ann. Bot., June 15, 2002; 89(7): 941 - 963. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Salse, B. Piegu, R. Cooke, and M. Delseny Synteny between Arabidopsis thaliana and rice at the genome level: a tool to identify conservation in the ongoing rice genome sequencing project Nucleic Acids Res., June 1, 2002; 30(11): 2316 - 2328. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. P. Dunford, M. Yano, N. Kurata, T. Sasaki, G. Huestis, T. Rocheford, and D. A. Laurie Comparative Mapping of the Barley Ppd-H1 Photoperiod Response Gene Region, Which Lies Close to a Junction Between Two Rice Linkage Segments Genetics, June 1, 2002; 161(2): 825 - 834. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yu, S. Hu, J. Wang, G. K.-S. Wong, S. Li, B. Liu, Y. Deng, L. Dai, Y. Zhou, X. Zhang, et al. A Draft Sequence of the Rice Genome (Oryza sativa L. ssp. indica) Science, April 5, 2002; 296(5565): 79 - 92. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. FEUILLET and B. KELLER Comparative Genomics in the Grass Family: Molecular Characterization of Grass Genome Structure and Evolution Ann. Bot., January 1, 2002; 89(1): 3 - 10. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Liu, R. Sachidanandam, and L. Stein Comparative Genomics Between Rice and Arabidopsis Shows Scant Collinearity in Gene Order Genome Res., December 1, 2001; 11(12): 2020 - 2026. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Draper, L. A.J. Mur, G. Jenkins, G. C. Ghosh-Biswas, P. Bablak, R. Hasterok, and A. P.M. Routledge Brachypodium distachyon. A New Model System for Functional Genomics in Grasses Plant Physiology, December 1, 2001; 127(4): 1539 - 1555. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Bennetzen, V. L. Chandler, and P. Schnable National Science Foundation-Sponsored Workshop Report. Maize Genome Sequencing Project Plant Physiology, December 1, 2001; 127(4): 1572 - 1578. [Full Text] [PDF] |
||||
![]() |
N. A. Eckardt Everything in Its Place: Conservation of Gene Order among Distantly Related Plant Species PLANT CELL, April 1, 2001; 13(4): 723 - 725. [Full Text] |
||||
![]() |
M. Rossberg, K. Theres, A. Acarkan, R. Herrero, T. Schmitt, K. Schumacher, G. Schmitz, and R. Schmidt Comparative Sequence Analysis Reveals Extensive Microcolinearity in the Lateral Suppressor Regions of the Tomato, Arabidopsis, and Capsella Genomes PLANT CELL, April 1, 2001; 13(4): 979 - 988. [Abstract] [Full Text] |
||||
![]() |
M. Freeling Grasses as a Single Genetic System. Reassessment 2001 Plant Physiology, March 1, 2001; 125(3): 1191 - 1197. [Full Text] |
||||
![]() |
J. Dubcovsky, W. Ramakrishna, P. J. SanMiguel, C. S. Busso, L. Yan, B. A. Shiloff, and J. L. Bennetzen Comparative Sequence Analysis of Colinear Barley and Rice Bacterial Artificial Chromosomes Plant Physiology, March 1, 2001; 125(3): 1342 - 1353. [Abstract] [Full Text] |
||||
![]() |
T. J. Vision, D. G. Brown, and S. D. Tanksley The Origins of Genomic Duplications in Arabidopsis Science, December 15, 2000; 290(5499): 2114 - 2117. [Abstract] [Full Text] |
||||
![]() |
A. H. Paterson, J. E. Bowers, M. D. Burow, X. Draye, C. G. Elsik, C.-X. Jiang, C. S. Katsar, T.-H. Lan, Y.-R. Lin, R. Ming, et al. Comparative Genomics of Plant Chromosomes PLANT CELL, September 1, 2000; 12(9): 1523 - 1540. [Abstract] [Full Text] |
||||
![]() |
H.-M. Ku, T. Vision, J. Liu, and S. D. Tanksley Comparing sequenced segments of the tomato and Arabidopsis genomes: Large-scale duplication followed by selective gene loss creates a network of synteny PNAS, July 19, 2000; (2000) 160271297. [Abstract] [Full Text] |
||||
![]() |
J. L. Bennetzen Comparative Sequence Analysis of Plant Nuclear Genomes: Microcolinearity and Its Many Exceptions PLANT CELL, July 1, 2000; 12(7): 1021 - 1030. [Abstract] [Full Text] |
||||
![]() |
K. M. Devos and M. D. Gale Genome Relationships: The Grass Model in Current Research PLANT CELL, May 1, 2000; 12(5): 637 - 646. [Abstract] [Full Text] |
||||
![]() |
H.-M. Ku, T. Vision, J. Liu, and S. D. Tanksley Comparing sequenced segments of the tomato and Arabidopsis genomes: Large-scale duplication followed by selective gene loss creates a network of synteny PNAS, August 1, 2000; 97(16): 9121 - 9126. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Mayer, G. Murphy, R. Tarchini, R. Wambutt, G. Volckaert, T. Pohl, A. Dusterhoft, W. Stiekema, K.-D. Entian, N. Terryn, et al. Conservation of Microstructure between a Sequenced Region of the Genome of Rice and Multiple Segments of the Genome of Arabidopsis thaliana Genome Res., July 1, 2001; 11(7): 1167 - 1174. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||