|
|
|
|
Vol. 10, Issue 7, 908-915, July 2000
REPORTS
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Organisms with large genomes contain vast amounts of repetitive DNA sequences, much of which is composed of retrotransposons. Amplification of retrotransposons has been postulated to be a major mechanism increasing genome size and leading to "genomic obesity." To gain insights into the relation between retrotransposons and genome expansion in a large genome, we have studied a 66-kb contiguous sequence at the Rar1 locus of barley in detail. Three genes were identified in the 66-kb contig, clustered within an interval of 18 kb. Inspection of sequences flanking the gene space unveiled four novel retroelements, designated Nikita, Sukkula, Sabrina, and BAGY-2 and several units of the known BARE-1 element. The retroelements identified are responsible for at least 15 integration events, predominantly arranged as multiple nested insertions. Strikingly, most of the retroelements exist as solo LTRs (Long Terminal Repeats), indicating that unequal crossing over and/or intrachromosomal recombination between LTRs is a common feature in barley. Our data suggest that intraelement recombination events deleted most of the original retrotransposon sequences, thereby providing a possible mechanism to counteract retroelement-driven genome expansion.
[The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF254799.]
| |
INTRODUCTION |
|---|
|
|
|---|
Genome size in eukaryotes differs considerably both
across and within phyla whereas the number of genes
in species of the same phyla is generally similar. This phenomenon has
been referred to as C value paradox, i.e., variation in genome
size (108 to 1011 bp) is not correlated with the
complexity of the organism (Thomas 1971
). For example, the size of the
barley (Hordeum vulgare) genome is 11- and 35-fold larger than
that of rice and Arabidopsis, respectively (Bennett and Leitch 1997
).
The paradox has been resolved by the realization that the share of the
genome dedicated to genes is relatively constant in the eukaryotes,
whereas the amount of repetitive DNA varies widely even within families
of organisms and can represent >70% of some plant genomes (Flavell
et al. 1977
; Barakat et al. 1997
). Much of the repetitive DNA is, in
turn, retroelements, which represent 30% of the human and 50% of the
maize genome (SanMiguel et al. 1996
). To understand the relation of
genome structure and repetitive DNAs to genome expansion/contraction,
analysis of large contiguous genomic sequences is essential.
In maize, a plant with a large genome, "diagnostic sequencing" was
taken to study the intergenic region of a 280-kb interval containing
the Adh1-F locus (SanMiguel et al. 1996
). Ten retroelement families compose >60% of this region. In contrast, retroelements including Ta11, Ty3-gypsy, Ty1-copia, and
Athila families are mainly found at higher density toward the
regions flanking centromeres in the small genome plant Arabidopsis
(Copenhaver et al. 1999
; Lin et al. 1999
; Mayer et al. 1999
). A
comparison of the Adh regions of maize (Zea maize)
and sorghum (Sorghum bicolor), having a 3.5-fold smaller
genome than that of maize, revealed that an equivalent 78-kb sorghum
sequence contains the same nine genes and five additional ones, as the
225-kb maize sequence (Tikhonov et al. 1999
). The main difference is
the presence in maize of 166 kb (74%) of retrotransposons and 33 miniature inverted-repeat transposable elements (MITEs) (6% of the
sequence). These observations provide direct evidence that retroelement
proliferation can account for an increase in genome size.
The replicative potential of retrotransposons and their possible role
as factors in genome expansion raises the question of what mechanisms
might counteract retrotransposon activity or even remove inserted
copies to maintain or reduce genome size. Plants may resist
retroelement amplification through such mechanisms as DNA methylation
in repetitive sequences (Bennetzen et al. 1994
) or gene silencing
(Ketting et al. 1999
). However, mechanisms for removing substantial
parts of repetitive elements have not to date been shown in plants to
play that role. In Drosophila, however, a high rate of DNA
deletions inside retroelements was observed, and this process appears
not to be restricted to these elements (Petrov et al. 1996
). Recently,
loss of retroelements was found to be greater than 40 times faster in
Drosophila than Laupala crickets that have a genome 11 times
larger than that of Drosophila (Petrov et al. 2000
). This
indicates that rate of DNA loss can contribute to large differences in
genome size in closely related species. Unequal crossing over and/or
intrachromosomal recombination between the LTRs (Long Terminal repeats)
can remove the internal domain of LTR-containing retrotransposons
(Bennetzen and Kellog 1997
), creating a single, solo LTR at the
excision site. However, a detailed analyses in the above mentioned
maize Adh1-F region revealed very few solo LTRs compared to
intact elements, suggesting that deletions inside retroelements might
not contribute to genome size differences, at least in maize (SanMiguel
et al. 1996
).
In the barley genome, the BARE-1 retrotransposon is present in
16.6 ± .6 × 103 copies and composes ~3% of the genome
(Vicient et al. 1999
). It is transcribed in somatic tissues (Suoniemi
et al. 1996
) and is translated, processed, and assembled into
virus-like particles (Jääskeläinen et al. 1999
). Dot
blot data and combined restriction mapping and hybridization analyses
of BAC (Bacterial Artificial Chromosome) clones indicated that, in
barley and other Hordeum species, BARE-1 LTRs vastly
outnumber (16 ± 2 ×) BARE-1 elements containing internal
domains. (Jääskeläinen et al. 1999
; Vicient et al.
1999
). These data would be consistent with intraelement LTR
recombination generating solo LTRs, at least of BARE-1.
However, there are no sequence data yet available to determine the
exact nature of these truncated copies nor the means by which they arose.
Here, we report the analysis of a 65,979-bp contiguous sequence from barley chromosome 2HL. Three genes were found to be clustered whereas the majority of the flanking regions consists of highly complex nested retroelement configurations. Most of these configurations represent derivatives of the original elements that are likely to be products of intra- and interelement recombination events. This suggests that recombination between the LTRs of a single or multiple elements occurs frequently in many families of retrotransposons and may contribute to reducing genome size.
| |
RESULTS |
|---|
|
|
|---|
Gene Density
Three criteria were used to search for genes in the 66-kb contiguous
sequence: homology with characterized genes or ESTs (Expressed Sequence
Tags) in the public databases, occurrence of extended high coding
probabilities, and application of gene finder programs. We identified
three genes in the genomic interval that each match at least two of the
three criteria. These were Rar1 (Shirasu et al. 1999
) and two
homologs of aquaporin genes at positions 56,694, 44,020, and 46,345, respectively (Fig. 1). The deduced amino acid sequences of the latter two proteins reveals a similarity of 71% and
identity of 45% to each other. Both proteins are more similar to the
-type of tonoplast intrinsic proteins from Arabidopsis (similarity
90% and 71%) than to the
-type (79% and 65%) (Weig et al. 1997
).
Therefore, we designated these genes Hv-TIP1 (44,020-44,976) and Hv-TIP2 (45,365-46,345), respectively. Hv-TIP1 mRNA was
detected in leaf and root tissues by reverse transcription-PCR
(Polymerase Chain Reaction) analysis, whereas Hv-TIP2 was undetectable
in these tissues (data not shown). Barley genes within the sequenced region were clustered because the Rar1 and aquaporin genes
span only 18 kb of the 66-kb region.
|
The region next to the two TIP genes (40,261-41,734) shows
high sequence similarity to the tnp2 gene that is part of the
En/Spm-type transposable element Tam1 from
Antirrhinum (Fig. 1) (Nacken et al. 1991
). A sequence similar
to the reverse transcriptase of the non-LTR retrotransposon
Ta11 was found between the TIPs and Rar1 (nt
positions 50,003-53,152). These sequences contain multiple stop codons
and reveal no other parts of En/Spm or Ta11-like
elements, suggesting that they are nonfunctional transposable element genes.
BARE-1 Elements
The contiguous sequence presented here contains five BARE-1
units, four in inverted orientation regarding the sense direction of
BARE-1 and one in direct orientation (Figs. 1 and
2B). Complete BARE-1 retrotransposons
consist of 1.8-kb LTRs at either end, which contain the elements for
transcription and mRNA processing, and an internal domain encoding a
polyprotein that is processed into functional units. The organization
of BARE-1 and other copia-like retrotransposons is:
5' LTR
UTL
GAG
AP
IN
RT-RH
UTR (Untranslated
Region)
LTR 3', where UTL is the 5' untranslated leader, GAG
encodes the capsid protein of the virus-like particle, AP encodes an
aspartic proteinase, IN encodes the integrase, RT-RH encodes both the
reverse transcriptase and RNaseH, and UTR is the 3' untranslated
region (Fig. 2A). Retrotransposon insertion generates direct repeats at
the integration site, due to repair of the staggered cuts made by the
integrase. These direct repeats therefore are found at the outer edges
of the inserted retrotransposon, on the left flank of the left LTR and
the right flank of the right LTR.
|
In the sequenced region, the first two BARE-1 units are solo LTRs, LTR-1 (nt 14,967-16,796) and LTR-2 (nt 24,049-25,865), and are flanked by the direct repeats TTCTT and GGTAG, respectively (Fig. 3A). Immediately internal to the direct repeats are the conserved, terminal inverted repeats characteristic of BARE-1 LTRs. The units are complete LTRs showing >90% identity to previously sequenced BARE-1 LTRs, a level of conservation typical for BARE-1 LTRs in the barley genome.
|
The third and fourth BARE-1 LTRs, LTR-3 and LTR-4, directly
flank BARE-1 sequence that consists of the 3' UTR and part
of the RH region (position 34,586-36,407) (Fig. 2B). All of the
components of the unit LTR-3
UTR
RH
LTR-4 are arranged in the
same order and orientation as in a full-length BARE-1 element.
However, because the junction between the interrupted RH region and
LTR-4 occurs precisely at the end of the LTR sequence, it is unlikely that the structure results from an internal deletion of a full-length BARE-1 element. Instead, this observation points to an origin involving an insertion of one BARE-1 element into another,
followed by a deletion resulting from recombination between two LTRs,
as depicted in Figure 2C. Consistent with this origin is the occurrence of 5-bp direct repeats flanking the entire structure (CCAGG, Fig. 3A),
generated by the proposed first insertion event.
The fifth BARE-1 unit (position 63,185-65,979) consists of an LTR (LTR-5) followed by a UTL, highly conserved with respect to BARE-1a (GenBank accession no. Z17327), which is interrupted by the edge of the cloned genomic region.
Gypsy-like Retrotransposons
The region between nt positions 405 and 5501 encodes a protein with
similarity to polyproteins from Ty3/Gypsy-like plant
retrotransposons, which have the domain order GAG
AP
RT
RH
IN (Fig. 1) (Suoniemi et al. 1998
). Because the deduced protein
sequence of the polyprotein is 47% similar to the internal domain of
the previously described barley BAGY-1 (Panstruga et al.
1998
), we designated this element BAGY-2. To make the
polyprotein domains into a single contiguous open reading frame, we had
to introduce three frame shifts (positions 3071, 3193, and 3648) and a
nonsense mutation (position 4160), suggesting that this retroelement
does not encode functional enzymes. Sequences in the 3' direction
from the BAGY-2 polyprotein are interrupted by the edge of the
66-kb sequence contig. However, the likely 5' LTR of
BAGY-2 was located between nt 5608 and 13,627 (Fig. 3B).
Identical end sequences were found in a number of other deposited
barley and wheat sequences, supporting the idea that this is indeed one
LTR of BAGY-2 (Fig. 3B). The 5 LTR of BAGY-2 is
interrupted by a Sukkula element (see below, between nt 6245 and 11,204) and by the LTR of a second copy of BAGY-2 (between nt 11,758 and 13,280). This second copy seems to represent a solo LTR
because its near-identical flanking sequences, GCC(A)C and GCCAC, are
likely to represent the 5-bp direct repeats of the original insertion
event (Figs. 1 and 3B).
Sukkula Elements
The interval between nt positions 6245 and 11,204 shows sequence
similarity to intergenic sequences at the barley Mlo locus and
to an insertion sequence present in the 3' LTR of one
BARE-1 element (GenBank accession nos. Y14573 and Z17327,
respectively; Fig. 1) (Manninen and Schulman 1993
). A compilation of
these three ~5-kb long sequences revealed stretches with varying
degrees of sequence similarity (data not shown). A segment of 1788 bp
appears to be internally deleted from the copy that resides in the
BARE-1 LTR. The insertion in the BARE-1 LTR is
flanked by a 5-bp direct repeat, CCTAG, typical of retroelement
integration sites (Fig. 3C). Likewise, the 5-kb sequence at the
Rar1 locus is also flanked by 5-bp direct repeats, in this
case CACCA (Fig. 3C), suggesting that the related 5-kb sequences
represent retrotransposon derivatives. Here, we name the insertion
sequences Sukkula (pronounced sook-koo-la), which means
"shuttle" in Finnish. The terminal sequences of Sukkula are strikingly similar to LTR terminal regions of gypsy-like
retrotransposons (RIRE) described recently in rice (Kemekawa
et al. 1999
). Although none of the Sukkula copies in the
contig contains sequences encoding corresponding gypsy-like
polyproteins adjacent to the LTR sequences, we have identified
Sukkula internal domains elsewhere in the barley genome (data
not shown). Thus, it is likely that the Sukkula sequences in
the contig represent solo LTRs of a novel barley retrotransposon that
were generated by recombination between two LTRs of the original element.
More evidence for the insertional activity of Sukkula was found between nt 20,145 and 27,570, where another remnant of this retroelement interrupts a Sabrina element (see below, Fig. 1). A 5-bp direct repeat, ATAGA, was identified at positions 20,145 and 27,570 suggesting that this Sukkula copy represents a further solo LTR (Fig. 3C). This Sukkula copy was, in turn, interrupted by a BARE-1 solo LTR and by a large region between nt 20,145 and 24,049 with no sequence similarities to any other elements, suggesting a set of nested solo-LTRs and possibly other insertions in this region.
Sabrina Elements
Sequences in a number of regions (nt 14,378-17,590, 19,160-28,211, and 31,781-33,400) are highly similar to sequences flanking the insertion site of a Cerebra retroelement in barley (AF078801) (Figs. 1 and 3D). Several lines of evidence suggest that these are fragments of yet another retroelement that we designate Sabrina. Imperfect 5-bp direct repeats (AGGCG/AGGCA) directly flank Sabrina sequences present between positions 19,160 and 33,401, thereby possibly defining a Sabrina integration site. Indeed, sequences next to the 5-bp direct repeats are related but in inverse orientation, appropriately positioned to be the diverged terminal LTR sequences of this element (Fig. 3D). Two further putative Sabrina terminal LTR sequences (positions 28,211 and 31,781) were found within the ~14-kb interval bordered by the imperfect 5-bp direct repeats (Fig. 3D). The orientation of these terminal sequences relative to those next to the 5-bp direct repeats indicates the presence of two Sabrina LTRs, designated LTR-2 and LTR-3 (Fig. 1). The Sabrina LTR-2 is disrupted by Sukkula LTR-2 and by BARE-1 LTR-2. A third potential Sabrina LTR, LTR-1, was identified at positions 14,378-17,590 although the 5' end of the LTR appears to be truncated (Figs. 1 and 3D). This Sabrina LTR was found to be interrupted by the BARE-1 LTR-1 (Fig. 1). Finally, almost identical Sabrina terminal LTR sequences were found in the deposited genomic sequence adjacent to the insertion of a Cerebra retroelement (AF078801; Fig. 3D).
Nikita Elements
Another likely solo LTR element was located between nt positions 33,897 and 40,171. These sequences are highly related to an ~3-kb stretch at the Mlo locus (nt positions 30,047 and 27,118 in Y14573) (Fig. 3E). We designated the element Nikita. In both cases, 5-bp direct repeats were found at the insertion sites (CCTAT and ATAAT, respectively; Fig. 3E). The Nikita element at the Rar1 locus is interrupted by two BARE-1 LTRs (LTR-3 and LTR-4) at nt positions 34,586 and 38,442 (Fig. 1), whereas it is contiguous at the Mlo locus.
Stowaway Elements
In addition to retrotransposon-like sequences, a 160-bp inverted
repeat element was found flanking the 3' region of TIP2
(nt 47,279-47,443). On the basis of sequence similarity, this element belongs to the Stowaway family (Bureau and Wessler 1994
). Like Tourist elements, Stowaway family members are small
elements (ranging from ~50 to 300 bp) that lack coding potential but
share conserved terminal inverted repeats and the potential to form DNA
secondary structures (Bureau and Wessler 1994
). They are thought to be
deletion derivatives of autonomous type II (DNA) transposable elements. A further Stowaway element (112 bp) was found directly beyond the 3' end of the Rar1 gene (nt 61,060-61,170). The
location of the two Stowaway elements in the sequence contig
is consistent with findings in other plant species indicating a tight
association with genes (Bureau and Wessler 1994
). Indeed, analysis of
two other large genomic barley sequence contigs, the Mlo
(Y14573) and the Mla locus (Wei et al. 1999
), identified
Stowaway elements only in tight association with genes (data
not shown).
| |
DISCUSSION |
|---|
|
|
|---|
In-depth analysis of a 66-kb contiguous stretch of barley chromosome 2HL has produced insights into gene density, gene organization, and a complex structure of intergenic sequences. These data allow us to infer models for the evolution of complex grass genomes.
Our analysis has identified at least three genes within the 66-kb
contig, yielding a density of approximately one gene per 20 kb. This is
almost identical to the density described in the only other published
large contiguous barley genomic interval, a 60-kb stretch at
Mlo on chromosome 4HL (Panstruga et al. 1998
). This 60-kb
stretch harbors three genes that are clustered within an interval of 18 kb. Despite the limitations of computer-aided gene identification
(Fickett 1996
) and the uncertainties of whether the two genomic
intervals are representative for the barley genome, patterns are
recognizable from the sequence contigs. First, the observed gene
density at both loci is 6-10-fold higher than expected from an
equidistant gene distribution in the 5300-Mb barley genome (Bennett and
Leitch 1997
; Bevan et al. 1998
; Panstruga et al. 1998
). Second, if one
considers only the intervals of the sequence contigs harboring genes at
Rar1 and Mlo, the density is approximately one gene
per 6 kb. This is only marginally different from the 4.8 kb calculated
as the mean of a 1.9-Mb contiguous sequence of the small-size genome of
Arabidopsis (Bevan et al. 1998
) and similar to the density seen at the
orthologous Lrk/Tak loci in wheat, barley, and rice (Feuillet
and Keller 1999
). Third, with the exception of a Ta11-like
reverse trascriptase sequence and a Nikita element that were
located between genes at the Rar1 and Mlo loci,
respectively, it appears that the gene space is largely void of
transposable element sequences. Our data therefore imply a clustering
of genes and support the idea of `gene islands' in the barley genome,
given the fact that the Arabidopsis genome is ~35-fold less complex
and estimates of gene number in higher plants vary only between 25,000 and 43,000 (Miklos and Rubin 1996
).
Analysis of the 66-kb contiguous stretch on barley chromosome 2HL
reveals an extraordinarily complex structure of sequences flanking the
gene island. At least four of the five identified BARE-1
elements have undergone recombination events, leaving only 1.8-kb
remnants of the large 8.9-kb retroelement at the Rar1 locus. Solo LTRs may result from unequal crossing over or from
intrachromosomal recombination between nearby LTRs. In the latter,
recombination between the LTRs would result in circularization and
deletion of the internal region and simultaneous production of a hybrid LTR containing the left end of the left LTR and the right end of the
right LTR (Fig. 2D). The sequence analysis supports the large LTR
excesses recently reported for the barley and other genus
Hordeum genomes (Vicient et al. 1999
). The structures of the
BARE-1 retroelements found on the LTR-3
LTR-4 stretch of chromosome 2H reflect both the phenomena of nested retrotransposon insertion (SanMiguel et al. 1996
; Suoniemi et al. 1997
).
However, analysis of the contig indicates that not only BARE-1
but also the novel retroelements Sukkula, Sabrina,
Nikita, and BAGY-2 all have undergone recombination
events at the Rar1 locus, leaving in each case only remains of
the original elements as solo LTRs and the products of nested
insertions. In contrast to BARE-1 elements, the newly
identified elements are generally interrupted by nonrelated
retrotransposons. For example, the arrangement of the unit
Sabrina LTR-2
LTR-3 can be interpreted as an internal deletion of a full-length Sabrina element, as evidenced by the flanking imperfect 5-bp direct repeats (AGGCG/A) (Fig. 3D).
Alternatively, the unit may reflect a nested Sabrina insertion
that was followed by an inter-/intrachromosomal recombination
between the LTR of the original element and the LTR of the newly
inserted Sabrina element (Fig. 4). Thus, the
unit Sabrina LTR-2
LTR-3 may represent a structure
equivalent to the one described above for the BARE-1 unit
LTR-3
LTR-4. If this is the case, then we predict that the
unaccounted sequence space between nt positions 28,212 and 31,780 is
likely to represent non-LTR Sabrina DNA.
|
Unlike the BARE-1 unit LTR-3
LTR-4, a Sukkula
insertion has interrupted the Sabrina LTR-2, which, in turn,
is interrupted by a BARE-1 insertion (BARE-1 LTR-2)
(Fig. 1). Both Sukkula LTR-2 and BARE-1 LTR-2
insertions have flanking 5-bp direct repeats, strongly suggesting that
both LTR structures are the result of intraelement recombination events
that deleted most of the original retrotransposon sequences. However,
we were unable to annotate sequences between nt positions 20,200 and
24,048, but it is possible that they represent the insertion of a
further, as yet unknown, element into Sukkula LTR-2.
Taken together, our data identified at least 15 insertion events from
retrotransposons and transposon-like elements within the 66-kb contig
(Fig. 5). As many as four elements were found to be
nested into each other, comparable to nested insertions seen in the
maize genome (SanMiguel et al. 1996
). It appears that the frequency of
nested insertions of distinct retroelements is similar to that of
nested insertions of sequence-homologous partners into each other (five
versus three, respectively). The insertions of unrelated elements into
each other allows, by the nesting order depicted in the simplified
diagram in Figure 5, a consideration of the relative insertional
activity and abundance of certain elements at various times. None of
the five BARE-1 insertion sequences in the contig is
interrupted by heterologous retroelement sequences. The BARE-1
retroelement family is abundant throughout Hordeum (Vicient et
al. 1999
), is present and insertionally polymorphic in other
Triticeae genera (Gribbon et al. 1999
) and is closely related
to RIRE-1 in the phylogenetically distant rice (Noma et al.
1997
). Hence, BARE-1 appears to be both an ancient and an active retrotransposon. Its recent activity is supported by the observation that all available flanking 5-bp direct repeats are perfectly conserved (Fig. 3B). In contrast, we infer that
Sabrina elements represent ancient but recently inactive
elements because they were found to be disrupted by both
Sukkula and BARE-1 elements (Fig. 5).
|
Although nested retrotransposon insertion appears also characteristic
of the maize genome (SanMiguel et al. 1996
), the findings here are
remarkable for the prevalence of solo LTRs. This implies that in barley
the conversion to solo LTRs occurs more rapidly than integration.
Regional recombinational differences are known to occur in complex
plant genomes. However, at least for BARE-1, amounting to
2.9% of the barley genome, there is direct evidence that the rapid
conversion to solo LTRs is not restricted to the Rar1 locus
but operates throughout the whole genome (Vicient et al. 1999
). Work
with Nicotiana with repeats of varying length (Puchta and Hohn
1991
) indicates that recombination between elements as long as
BARE-1 LTRs (1.8 kb) may be quite efficient. A detailed analysis of a 280-kb interval in maize flanking the Adh1 and
u22 genes indicates very few solo LTRs relative to intact
elements (SanMiguel et al. 1996
). Of the 16 named families of maize
retrotranspsosons in the current GenBank database, only five have LTRs
longer than 750 bases, the others averaging 450 ± 65 bp
(S.E.M.). This contrasts markedly with the LTRs on the
barley contig (BARE-1 1.8 kb; BAGY-2 1.5 kb;
Sukkula 4.9 kb; Sabrina 1.6 kb; Nikita 2.9 kb). The two solo LTRs reported for maize (SanMiguel et al. 1996
) both
are of Ji elements, having LTRs of 1156 bp (GenBank accession
no. U68405). Hence, one explanation for the low frequency of solo LTRs
in the maize genome may be a lower recombinational efficiency between
the comparatively short LTRs of maize retroelements.
Recombination between nearby LTRs offers a means to reverse the
increase in genome size caused by successive integrations of large
retrotransposons. Were recombination to occur between homologous LTRs
of different retroelement copies, the intervening genomic DNA would
also be removed irrespective of whether the recombination is inter- or
intrachromosomal. Indeed, the units BARE-1 LTR-3
LTR-4 and
Sabrina LTR-2
LTR-3 provide strong evidence that entire
elements have been eliminated at Rar1. Although this might be
deleterious if a gene was between the copies, nested retrotransposon
insertion or insertion into repetitive DNA would not present this
hindrance. The organization of the contig suggests that extensive
intra-/interchromosomal recombination has acted to delete integrated
retroelements. These data may help explain the observation that, in the
phylogeny of diploid grasses, decreases in genome size have possibly
occurred with the same frequency as size increases (Bennetzen and
Kellog 1997
).
| |
METHODS |
|---|
|
|
|---|
DNA Manipulations and Computer Analysis
Physical delimitation of the Rar1 locus has been performed
previously (Lahaye et al. 1998
). A fivefold redundancy of independent YAC (Yeast Artificial Chromosome) clones and a greater than fourfold redundancy of BAC clones cover the relevant genomic region (Shirasu et
al. 1999
). DNA fingerprinting of multiple overlapping and independent YAC and BAC clones revealed absence of any rearrangements in each tested clone in the sequenced area. The contiguous DNA sequence of
65,979 bp from barley chromosome 2HL has been deposited at GenBank
under accession number AF254799. Briefly, BAC 1B2 and BAC12 covering
this interval were completely sequenced by means of random shotgun
cloning in the sequencing vector, pBluescript II KS+. A 49-bp gap
between BAC 1B2 and BAC12 was closed by using PCR amplification with
end sequences of these two BACs, on template DNA of BAC 3H6, and
subsequent direct sequencing of the amplicon. Sequence homology
analyses were performed using BLAST2 software available from the
National Center for Biotechnology Information (http://www.ncbi.nim.nih.gov).
Putative exons were predicted by the programs GENSCAN 1.0 (http://gnomic.stanford.edu/~chris/GENSCANW.html), NetPlantGene V2.0
(http://www.cbs.dtu.dk/NetPlantGene.html), and Grail
(http://compbio.ornl.gov/Grail-1.3).
| |
ACKNOWLEDGMENTS |
|---|
We thank Nicholas Collins and Louise Jones for constructive criticism of this manuscript and Margaret Shailer for technical assistance. This work was supported by grants from the GATSBY Charitable Foundation and the Biotechnology and Biological Sciences Research Council to P. S.-L.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Present address: Institute of Genetics, Martin-Luther University, Weinbergstrasse 22, D-06120 Halle, Germany.
4 Corresponding author.
E-MAIL schlef{at}bbsrc.ac.uk; FAX 44 1603 450011.
| |
REFERENCES |
|---|
|
|
|---|
583 new estimates.
Ann. Bot.
80:
169-196
a new family of inverted repeat elements associated with the genes of both monocotyledonous and dicotyledonous plants.
Plant Cell
6:
907-916[Abstract].Received February 4, 2000; accepted in revised form May 5, 2000.
This article has been cited by other articles:
![]() |
R. Kalendar, J. Tanskanen, W. Chang, K. Antonius, H. Sela, O. Peleg, and A. H. Schulman Cassandra retrotransposons carry independently transcribed 5S RNA PNAS, April 15, 2008; 105(15): 5833 - 5838. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tomita, K. Shinohara, and M. Morimoto Revolver is a New Class of Transposon-like Gene Composing the Triticeae Genome DNA Res, February 1, 2008; 15(1): 49 - 62. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Hollister and B. S. Gaut Population and Evolutionary Dynamics of Helitron Transposable Elements in Arabidopsis thaliana Mol. Biol. Evol., November 1, 2007; 24(11): 2515 - 2524. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ner-Gaon, N. Leviatan, E. Rubin, and R. Fluhr Comparative Cross-Species Alternative Splicing in Plants Plant Physiology, July 1, 2007; 144(3): 1632 - 1641. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Wang and H. K. Dooner Eukaryotic Transposable Elements and Genome Evolution Special Feature: Remarkable variation in maize genome structure inferred from haplotype diversity at the bz locus PNAS, November 21, 2006; 103(47): 17644 - 17649. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Hawkins, H. Kim, J. D. Nason, R. A. Wing, and J. F. Wendel Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium Genome Res., October 1, 2006; 16(10): 1252 - 1261. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Piegu, R. Guyot, N. Picault, A. Roulin, A. Saniyal, H. Kim, K. Collura, D. S. Brar, S. Jackson, R. A. Wing, et al. Doubling genome size without polyploidization: Dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice Genome Res., October 1, 2006; 16(10): 1262 - 1269. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. H. Oakley, B. Ostman, and A. C. V. Wilson Repression and loss of gene expression outpaces activation and gain in recently duplicated fly genes PNAS, August 1, 2006; 103(31): 11637 - 11641. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Neumann, A. Koblizkova, A. Navratilova, and J. Macas Significant Expansion of Vicia pannonica Genome Size Mediated by Amplification of a Single Type of Giant Retroelement Genetics, June 1, 2006; 173(2): 1047 - 1056. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Boulesteix, M. Weiss, and C. Biemont Differences in Genome Size Between Closely Related Species: The Drosophila melanogaster Species Subgroup Mol. Biol. Evol., January 1, 2006; 23(1): 162 - 167. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-S. Kim, M. N. Islam-Faridi, P. E. Klein, D. M. Stelly, H. J. Price, R. R. Klein, and J. E. Mullet Comprehensive Molecular Cytogenetic Analysis of Sorghum Genome Architecture: Distribution of Euchromatin, Heterochromatin, Genes and Recombination in Comparison to Rice Genetics, December 1, 2005; 171(4): 1963 - 1976. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Bouck, R. Peeler, M. L. Arnold, and S. R. Wessler Genetic Mapping of Species Boundaries in Louisiana Irises Using IRRE Retrotransposon Display Markers Genetics, November 1, 2005; 171(3): 1289 - 1303. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Jing, M. R. Knox, J. M. Lee, A. V. Vershinin, M. Ambrose, T. H. N. Ellis, and A. J. Flavell Insertional Polymorphism and Antiquity of PDR1 Retrotransposon Insertions in Pisum Species Genetics, October 1, 2005; 171(2): 741 - 752. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Caetano-Anolles Evolution of Genome Size in the Grasses Crop Sci., August 1, 2005; 45(5): 1809 - 1816. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Matsuo, Y. Ito, R. Yamauchi, and J. Obokata The Rice Nuclear Genome Continuously Integrates, Shuffles, and Eliminates the Chloroplast Genome to Cause Chloroplast-Nuclear DNA Flux PLANT CELL, March 1, 2005; 17(3): 665 - 675. [Abstract] [Full Text] [PDF] |
||||