|
|
|
|
Vol. 10, Issue 1, 129-136, January 2000
RESOURCE
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Recently a new strategy using BAC end sequences as sequence-tagged
connectors (STCs) was proposed for whole-genome sequencing projects. In
this study, we present the construction and detailed characterization
of a 15.0 haploid genome equivalent BAC library for the cultivated
tomato, Lycopersicon esculentum cv. Heinz 1706. The library
contains 129,024 clones with an average insert size of 117.5 kb and a
chloroplast content of 1.11%. BAC end sequences from 1490 ends were
generated and analyzed as a preliminary evaluation for using this
library to develop an STC framework to sequence the tomato genome. A
total of 1205 BAC end sequences (80.9%) were obtained, with an average
length of 360 high-quality bases, and were searched against the GenBank
database. Using a cutoff expectation value of <10
6,
and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of
which almost half (48.7%) share sequence similarities to
retrotransposons and 7% to known genes. Some of the transposable
element sequences were the first reported in tomato, such as sequences
similar to maize transposon Activator (Ac) ORF and
tobacco pararetrovirus-like sequences. Interestingly, there were no BAC
end sequences similar to the highly repeated TGRI and TGRII elements.
However, the majority (70.3%) of STCs did not share significant
sequence similarities to any sequences in GenBank at either the DNA or
predicted protein levels, indicating that a large portion of the tomato
genome is still unknown. Our data demonstrate that this BAC library is
suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed.
[The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111-AQ368361.]
| |
INTRODUCTION |
|---|
|
|
|---|
Tomato (Lycopersicon esculentum cv. Heinz
1706) is a vegetable crop that ranks second only to potatoes in
economic importance (Gould 1992
). Tomato is a member of the dicot
family Solanaceae, which contains well-known plant species such as
potato, tobacco, eggplant, and pepper. Despite differences in genome
sizes among Solanaceae members, most, if not all, share the same basic
chromosome number of 12 with tomato having the smallest haploid genome
size of 953 Mb (Arumuganathan and Earle 1991
). Comparative genetic studies revealed that tomato and potato share very conserved
colinearity between their genomes (Bonierbale et al. 1988
). Tomato has
conserved gene repertoires with pepper even though their gene order is
different (Tanksley et al. 1988
). Among geneticists, tomato is
considered an ideal model crop plant. Tomato has an excellent classical
morphological map, a high-density molecular map containing >1000
markers (Rick and Yoder 1988
; Tanksley et al. 1992
; Broun and Tanksley
1996
), and a large collection of well-characterized mutants and
near-isogenic lines (NILs). In addition, tomato has a relatively small
genome size, several large-insert YAC libraries (Martin et al. 1992
; Bonnema et al. 1996
), a transposon tagging system (Briza et al. 1995
),
as well as a routine Agrobacterium-mediated transformation system (McCormick et al. 1986
). Furthermore, in 1998 several leading researchers in the tomato genomics community began to pursue the discovery and analysis of the majority of genes in tomato using primarily an EST-based approach (S.D. Tanksley, J.J. Giovannoni, G.B.
Martin, J.C. Venter, unpubl.; http://www.nsf.gov; http://www.tigr.org). These publicly available resources and enabling technologies will undoubtedly provide an invaluable foundation for plant research, with
an ultimate goal of sequencing and understanding the entire tomato genome.
A critical tool for genomic studies in tomato is the availability of
deep-coverage large-insert genomic libraries, such as yeast artificial
chromosomes (YACs) and bacterial artificial chromosomes (BACs), that
can be used for physical mapping, positional cloning, and genome
sequencing. The publicly available tomato YAC libraries have served as
valuable research tools for the isolation of several agriculturally
important genes by positional cloning (e.g., Martin et al. 1993
; Alpert
and Tanksley 1996
). Unfortunately, YAC libraries and clones are
cumbersome to construct, screen, and analyze, and DNA inserts are often
chimeric (Green et al. 1991
), rearranged, or have internal deletions.
To date, no comprehensive L. esculentum BAC libraries suitable
for whole-genome sequencing and physical mapping have been published.
Except for DNA insert-size capacity, BAC libraries have several
advantages over YACs in that they can be easily constructed and
screened. Moreover, genomic inserts in BACs have been shown to be very
stable in Escherichia coli and thus serve as ideal templates
in generating whole-genome physical maps by DNA fingerprinting (Marra
et al. 1997
), developing sequence-tagged connectors (STCs) (Venter et
al. 1996
; Boysen et al. 1997
), and shotgun sequencing (Boysen et al.
1997
). These features make the BAC cloning system a popular choice for
high-throughput genomics studies.
Because of recent technological advances and reduced costs in
high-throughput genomic DNA sequencing, it may soon become practical and cost effective to sequence the genomes of a number of important crop plants with moderate genome sizes of ~1000 Mb (e.g., tomato, sorghum, and soybean). The proposed STC strategy in combination with
DNA fingerprinting can be used to establish sequence-ready genome
frameworks using deep-coverage BAC libraries (Venter et al. 1996
;
Boysen et al. 1997
). Briefly, each genomic insert from a deep-coverage
BAC library is sequenced from both ends and fingerprinted to develop an
STC and fingerprint databases, respectively. Simultaneously, a
genetically anchored "seed" BAC is selected and shotgun sequenced. The complete sequence of the seed BAC is used to screen the STC database to obtain a set of BACs that overlap with the seed BAC based
on DNA sequence. The fingerprint database is then consulted to confirm
these overlaps and to select a BAC that overlaps minimally with the
seed BAC. Incorporating a fingerprinting strategy has proven useful in
selecting subsequent BACs for shotgun sequencing in Arabidopsis
thaliana (Marra et al. 1999
) and to help resolve conflicting data
of library hybridization with anchored markers (Mozo et al. 1999
). The
STC and fingerprinting strategy has been widely adopted and is
presently being used for the human, Arabidopsis, and rice
genome sequencing projects.
In this study we report the construction and characterization of a comprehensive BAC library for the cultivated tomato L. esculentum cv. Heinz 1706 and analysis of 1205 BAC end sequences as an preliminary evaluation of the STC strategy for establishing a framework to sequence the tomato genome.
| |
RESULTS |
|---|
|
|
|---|
Tomato BAC Library Construction
A tomato BAC library was constructed from L. esculentum cv.
Heinz 1706 using a HindIII partial digestion of megabase-size DNA embedded in agarose plugs. Several ligation reactions were performed using size-selected DNA fragments from three regions of CHEF
gels of 100-150, 150-200, and 200-300 kb. Optimal ligations were
obtained from partially restricted DNA in the region of 150-200 kb
after a single size selection with a 20-sec constant pulse, 6 V/cm at
14°C for 18 hr. Two ligations yielded a significant number of white
clones, averaging 1250 (80.6%) white clones, and 300 (19.4%) blue
clones per microliter of ligation mixture as assayed on X-gal/IPTG
Luria broth (LB) agar plates. Although the region of the size-selected
DNA used for these ligations comigrated between 150 and 200 kb (based
on the
concatemer size markers run side by side on the same gel),
the preliminary analysis revealed that the average insert size was
between 115 kb and 120 kb. About 100 transformations were performed
using the two ligations to produce a total of 129,024 clones. White
clones were picked and arrayed into 336 microliter plates with 384 wells using the Genetix Q-Bot.
Insert Size Distribution and Genome Coverage
To estimate the genome coverage of the L. esculentum BAC
library the average insert size of the clones, the number of
nonrecombinant clones, and the number of chloroplast DNA-containing
clones were determined. Inserts from 498 random BAC clones were sized
by pulsed field gel electorphoresis (PFGE) on 1% agarose CHEF gels.
Figure 1 shows an example of the analysis of 27 clones digested with NotI. Inserts of most of the clones did
not contain internal NotI sites, which is a typical feature
for dicot genomic DNA (Choi et al. 1995
; Marek et al. 1997
; Danesh et
al. 1998
; Tomkins et al. 1999
). The size distribution of 498 random
clones is depicted in Figure 2, which was used to
calculate an average insert size of 117.5 kb. Of the clones, 78% have
inserts >100 kb; ~11% contained inserts <70 kb, indicating
that there were a significant number of small inserts (<100 kb)
trapped in the size-selected fraction used to construct the BAC
library. In addition, ~2.41% of clones did not contain inserts,
suggesting the recircularization of vectors with damaged termini. To
determine the percentage of BAC clones containing chloroplast DNA in
the library, high-density membranes containing the entire library were
probed with three chloroplast specific genes from barley (Hordeum
vulgare), ndhA, rbcL, and psbA, which
are evenly spaced 40 kb apart across the chloroplast genome. The
results showed that 1432 positive clones, representing only 1.11% of
the entire library, contain chloroplast DNA sequences (data not shown).
Of the clones picked robotically, ~2.11% were later found to be
nonrecombinant (blue) by inoculating clones from the first forty
384-well plates of the library onto Q-plates containing X-gal and IPTG
and counting the number of blue colonies (325 blue clones/15,360
clones). Taking 1.11% of chloroplast DNA, 2.11% of nonrecombinant
clones, and 2.41% of white empty clones into consideration and
assuming similar content of mitochondria to the tomato BAC libraries of
Hamilton et al. (1999)
(0.012%), with an average insert size of 117.5 kb and a haploid genome size of 953 Mb, the library is estimated to
contain ~15.0 haploid genome equivalents.
|
|
To further examine the predicted genomic coverage of the BAC library
constructed, 19 one- to four-copy restriction fragment length
polymorphism (RFLP) markers from tomato chromosome 12 (Table 1) were used to screen 3 of the total 7 filters,
which represent a calculated genome coverage of 6.43×. As summarized
in Table 1 at least one clone was identified for each marker tested.
Considering the approximate copy number of the RFLP markers in the
genome (Table 1), an average of 3.33 positive hybridization signals was
obtained per probe, which is lower than expected. This result suggests
that either our estimation of genome coverage is an overestimate, the
library is not entirely random, or our estimation of RFLP copy number
is too high.
|
BAC End Sequencing
To examine the feasibility of using a STC strategy to establish a framework to sequence the tomato genome, we sequenced and analyzed the ends of the first 745 clones of the BAC library. A total of 1490 sequencing reactions were performed resulting in a set of 1205 high-quality sequences (a success rate of 80.9%). High-quality sequences were defined as those having >75 high-quality bases other than vector and E. coli sequences. The average length of the end sequence, after removal of the vector sequence using CROSS-MATCH software, was 360 bases with a standard deviation of 142 bases.
To determine the sequence composition of the BAC end sequences, they
were searched against Genbank using BLASTN, BLASTX, and TBLASTX
(Altschul et al. 1990
). A probability cutoff value (E value)
of at least 10
6 was used to assign putative identities to
the STCs. The BLASTN search resulted in 194 sequences with
E < 10
6, whereas BLASTX and TBLASTX
resulted in 90 and 37 sequences, respectively. Fifteen STCs were found
to contain chloroplast DNA making up 1.2% of the total high-quality
BESs. This value is very close to the estimation made by hybridizing
the library with probes from the chloroplast genome (1.11%, as
described above). For the remaining 306 STCs sharing significant
sequence similarities in Genbank, sequences similar to retrotransposons
constitute the major component of each category. A total of 149 out of
306 STCs (48.7%) were found to be similar to various retrotransposons. The BLASTN search resulted in 78 out of 194 STCs that were
retrotransposon-like sequences including both copia-like (51 STCs) and gypsy-like (27 STCs) retrotransposons. However, it
appears that the sequences of gypsy-like retrotransposons are
more conserved than copia-like retrotransposons as 51 out of
78 STCs similar to retrotransposons picked up by BLASTN search turned
out to be gypsy-like retrotransposons. It is interesting that
some of the previously determined repetitive sequences were found to be
part of the retrotransposons. For example, 20 STCs were similar to a
tomato random amplified polymorphic DNA sequence. A BLASTX search of
this sequence (GenBank accession no. AJ223850) showed that it is to
some extent similar to a Vicia faba retrotransposon (accession
no. AB007467). The BLASTX database search resulted in 58 out of 59 sequences that are similar to the protein domains of
copia-like retrotransposons, suggesting that tomato
copia-like retrotransposons are more degenerated on the DNA
level but still very conserved at the protein level. We were able to
find 12 STCs that are similar to retrotransposon-like sequences in 37 STCs obtained from TBLASTX with E < 10
6.
Four STCs were similar to a tobacco pararetrovirus sequences and
another three STCs were similar to maize transposon Activator (Ac) open reading frame.
The number of putative genes detected by BLASTN, BLASTX, and TBLASTX is
85. This represents only one-quarter of those STCs with known sequence
matches in GenBank (27.8%) and 7% of the high-quality tomato
sequences. Some of the high-scoring sequences that share significant
similarity with genes are summarized in Table 2.
|
Twenty-five STCs were similar to repetitive sequences from various plants, including 21 STCs that are similar to a tomato microsatellite repeat, a tomato sequence containing a GATA microsatellite, and a high-copy repeat (GenBank accession no. X90770).
Finally, 41 (8%) sequences are associated with genes and have
E < 10
6 but are neither part of gene coding
sequences nor can be classified into any categories described above.
The results described above for the 306 STCs sharing significant similarity with sequences deposited in GenBank are summarized in Figure 3 , and the complete BLAST search statistics for the 1490 ends can be viewed in detail at the Clemson University Genomics Institute (CUGI) web site (http://www.genome.clemson.edu).
|
| |
DISCUSSION |
|---|
|
|
|---|
The study of tomato has a rich breeding history over the centuries
and recently has fostered pioneering research in genomics including the
development of high-density molecular genetic maps (Tanksley et al.
1992
), comparative molecular genetic maps between closely related
species (e.g., potato, pepper, and tobacco; Bonierbale et al. 1988
;
Tanksley et al. 1988
), plant transformation (McCormick et al. 1986
),
the molecular dissection of quantitative inheritance (Paterson et al.
1988
), and the positional cloning of disease resistance (Martin et al.
1993
) and fruit-ripening genes (Wilkinson et al. 1995
). Although the
soon-to-be sequenced A. thaliana (120-Mb genome) is touted as
the model flowering plant to study plant biology, tomato continues to
play a pivotal and leading role as a model crop plant. Because of this
foundation and popularity and a haploid genome size of 953 Mb
by far
the smallest among the Solanacae family (e.g., potato, pepper, and
tobacco)
it can be argued that tomato is an ideal candidate for
generation of a complete genome sequence.
As a prelude to establish a STC framework of the tomato genome we constructed and characterized a 15.0 haploid genome equivalent BAC library and sequenced and analyzed the insert ends of the first 745 clones in the library.
The BAC library was constructed from the cultivated tomato L. esculentum cv. Heinz 1706 and contains 129,024 clones with an average insert size of 117.5 kb with a low chloroplast DNA
contamination (1.11%). Heinz 1706 was chosen because it has been used
as the recurrent parent for generating many near-isogenic lines
(Philouze 1991
) and is the L. esculentum parent of an
interspecific cross that is being used to map the jointless-2
locus (Zhang et al. 1999
) for positional cloning in our laboratory. In
comparison to the previously described tomato YAC libraries (Martin et
al. 1992
; Bonnema et al. 1996
) and BAC libraries (Hamilton et al. 1999
), the genomic coverage of this BAC library far exceeds both tomato
YAC and BAC libraries (5.5×, Martin et al. 1992
; Bonnema et al.
1996
; 4.6×, Hamilton et al. 1999
; 3×). It should be noted that
the genome coverage estimates for all of these libraries, including the
library described here, were calculated similarly. This result makes
the Heinz 1706 library the deepest coverage large-insert tomato library
available to date. Although YAC libraries usually have larger average
insert sizes (130 kb, Martin et al. 1992
; 250 kb, Bonnema et al. 1996
),
the depth of the BAC library can compensate for its relatively smaller
average insert size. Analysis of chloroplast DNA content showed that
high-molecular-weight DNAs prepared from tomato nuclei contained less
chloroplast DNA than those derived from protoplasts, such as has been
shown for other BAC libraries using similar nuclei extraction methods
(Choi et al. 1995
; Zhang et al. 1995
; Hamilton et al. 1999
; M. Budiman and R. Wing, unpubl.). For example, BAC and YAC libraries produced from
megabase-size DNA from rice (Nakamura et al. 1997
), sorghum (Woo et al.
1994
), and tomato (Martin et al. 1992
) protoplasts had chloroplast DNA
contamination of 7%, 14%, and 10%, respectively. Thus, the
development of a 15-fold BAC library for tomato should facilitate
physical mapping, positional cloning of many agronomically important
genes and genomic regions, and whole-genome sequencing.
The primary purpose of the STC strategy is to generate frameworks for
genome assembly and sequencing by providing informative DNA sequence
for the precise selection of minimally overlapping BAC clones for
subsequent sequencing substrates. This procedure was performed
previously through end-sequence walking or PCR-based screening using
sequence-tagged sites (STSs) on cosmid clones (Venter et al. 1996
). The
STC approach and derivatives incorporating BAC fingerprinting (Boysen
et al. 1997
) are now practiced widely and are being used to assemble
the genomes of human, mouse, Drosophila, Arabidopsis,
and rice. In our effort to develop a similar STC database for tomato,
BAC insert ends from 745 clones were sequenced as a preliminary study
for the feasibility of using this tomato library for whole-genome
sequencing. Among 1205 high-quality BAC end sequences, 94.6% are from
tomato nuclear DNA. Analysis of these STCs using the statistically
significant E
10
6 revealed that 26.6%
of the STCs have significant matches in GenBank, of which nearly half
were transposable elements and other repetitive sequence. STCs with
putative functions or similar to ESTs with unknown functions make up
24.3% of all of the STCs that have a match in GenBank. Of the total
STCs analyzed 70.3% appear to represent previously uncharacterized or
unique sequences. This fraction demonstrates that a significant portion
of the tomato genome remains unknown. The major repetitive sequences
are retrotransposons and may be distributed throughout the genome.
Surprisingly, we did not identify any BAC end sequences sharing
significant similarities with the well-characterized and highly
repetitive sequences TGRI and TGRII, which comprise ~1.85% of the
tomato genome. This result is likely due to the fact that both TGRI and
TGRII are mainly clustered tandemly in telomeric and centromeric
regions (Zamir and Tanksley 1988
), which might be difficult to
clone, though there are four STCs (e.g., toxb0001cE11f)
that are similar to Lycopersicon pennellii
paracentromeric sequence (GenBank accession no. AF07252).
By extrapolating the sequence and library characterization analysis above, if the insert ends of the entire BAC library of 121,744 clones (243,488 ends) were sequenced, there would be, on average, one STC every 3.73 kb DNA across the tomato genome. With a moderate DNA sequencing success rate (~80%) and an average of 360 high-quality bases per sequence, the STC database would represent 7.4% of the tomato genome. Such an STC database will no doubt provide an indispensable tool for the generation of a complete tomato genome sequence in the future. In addition, even during the early stages of developing such a database, end sequences and contig information from fingerprinting together with integrated genetic markers will facilitate local physical mapping, gene isolation, and gene discovery. Finally, an STC database will provide a vision of the organization of the tomato genome before the entire genome sequence becomes known. For example, assuming that 7% of the STCs represent putative genes, based on BLASTX and TBLASTX similarity searching, sequencing the entire tomato BAC library could produce the partial sequence of 17,044 genes.
| |
METHODS |
|---|
|
|
|---|
BAC Vector Isolation
pBeloBAC 11 DNA was isolated using an alkaline lysis method
(Sambrook et al. 1989
). Vector DNA was purified by two rounds of CsCl
density gradient centrifugation, completely linearized with
HindIII (New England Biolabs), and dephosphorylated with heat-killable thermolabile phosphatase (Epicenter Technologies). The
extent of digestion, dephosphorylation, and integrity of vector DNA
were assayed by comparing several ligation reactions with HindIII-cut
DNA (New England Biolabs) on agarose gels
and tested by transformation into E. coli DH10B (Research Genetics).
Preparation of Partially Digested DNA of Tomato
High-molecular-weight DNA was prepared from young leaf nuclei of
L. esculentum cv. Heinz 1706 grown under greenhouse
conditions. Leaf tissue (30 gm) was harvested and kept at
80°C
or used directly. Nuclei were extracted by grinding the tissue to a
smooth powder in liquid nitrogen and processed (De Scenzo and Wise
1996
). The nuclei were embedded in 1% low-melting agarose plugs (Ganal
and Tanksley 1989
). The use of nuclei preparation, instead of the protoplast preparation, should reduce the contribution of organellar DNA such as chloroplast and mitochondria genomes. Chopped plugs were
serially digested with HindIII (0, 0.5, 1.0, 2.5, 5.0, and 50 units) in a total volume of 70 µl at 37°C for 20 min. After inactivating the restriction enzyme, partially digested fragments were
separated by PFGE (CHEF DR II, Bio-Rad) and three DNA fractions of
100-150, 150-200, and 200-250 kb were excised. Gel pieces were washed three times with 1 ml of cold TE buffer on ice for 10 min each
and stored at 4°C.
BAC Library Construction
DNA fragments in 100 mg of gel slices were electroeluted (Strong et
al. 1997
) and the concentrations of the eluant were assayed on an
agarose gel. Ligation was performed in a final volume of 100 µl
with 50-150 ng of eluted DNA and 20 ng of dephosphorylated linearized
pBeloBAC11 vector. Before adding the T4 ligase (Promega), the ligation
mixture was incubated at 55°C for 10 min and then cooled down to
room temperature. Ligation was performed at 16°C overnight. One
microliter of desalted ligation mixture was transformed with 20 µl
of E. coli DH10B (Research Genetics) using a BRL
electroporator with the following settings: 320 V, 330 µF
capacitance, low ohms impedance, fast charge rate, and 4000
voltage booster resistance. Transformed cells were spread onto LB media
containing 12.5 µg of chloramphenicol, X-gal, and IPTG and grown at
37°C for 18 hr. The plates were then transferred to the dark and
stored at room temperature for an additional 1-2 days to allow the
nonrecombinant colonies to turn dark blue, thus making automated colony
picking with the Genetix Q-Bot (Genetix LTD) more efficient
(~1500-2000 colonies per 500-cm2 plate). White clones
were picked and stored in 384-well microtiter master plates containing
LB freezing media [36 mM K2HPO4, 13.2 mM KH2PO4, 1.7 mM sodium
citrate, 0.4 mM MgSO4, 6.8 mM
(NH4)2SO4, 4.4% glycerol (vol/vol), 12.5 µg/ml chloramphenicol].
BAC Insert Size Analysis
BAC DNA was prepared by a standard alkaline lysis method (Sambrook
et al. 1989
) from a 3-ml overnight culture using the Autogen 740 automated DNA isolation system (Integrated Separation System). DNA was
digested with NotI (New England Biolabs) to completion and
separated by PFGE (CHEF DR III, Bio-Rad) on a 1% agarose gel in
0.5× TBE with a linear pulse from 5 to 15 sec for 14 hr at 14°C
along with a mid-range PFGE marker I (New England Biolabs).
High Density Filter Production and Hybridization
The entire library containing 129,024 clones was gridded onto seven
22.5 × 22.5-cm nylon filters (Amersham N+) using the Genetix Q-Bot
(Genetix Ltd.). Each filter contained 18,432 individual clones that
were doubly spotted. The high-density hybridization filters were
hybridized as described (Church and Gilbert 1984
) except that BSA was
omitted. Gel-purified DNA fragments were labeled by random priming
(Feinberg and Vogelstein 1984
) with [32P] dCTP (NEN). After
hybridization, the filters were washed at 65°C twice in 1.0× SSC,
0.1% SDS, and twice in 0.5× SSC, 0.1% SDS, for 20 min each time
and exposed to X-ray film (Kodak X-Omat). Three barley
chloroplast-specific probes were obtained from Dr. J. Mullet (Texas A&M
University, College Station): ndhA of plasmid pBHP20,
rbcL of pBPH134, and psbA of pBHE319. Tomato
chromosome 12 RFLP markers were obtained from S. Tanksley (Cornell
University, Ithaca, NY).
BAC End Sequencing and Bioinformatics
Four microliters of BAC culture in LB freezing media was inoculated
into 4 ml of LB media containing chloramphenicol and incubated for 20 hr at 37°C. BAC DNA was isolated using the Autogen 740. DNA pellets
were resuspended in 25 µl of 1 mM Tris-HCl (pH 7.5), and
20 µl was used as the template for sequencing reactions in a total
volume of 30 µl [5 µl of ABI Big Dye (Perkin Elmer), 50 pmoles
of primer, 1.75 µl of sequencing buffer containing 800 mM
Tris-HCl (pH 9.0), 20 mM MgCl2, 2.25 µl
dH2O]. PCR reactions were performed as follows: one cycle
for 4 min at 95°C; 2) 70 cycles of 15 sec at 95°C, followed by 4 min at 60°C and 10 sec at 51°C. PCR products were precipitated
with ethanol containing one-third volume of 7.5 M
NH4OAc and run on ABI377 automatic sequencers. Base-calling
was performed automatically by PHRED (Ewing and Green 1998
), and vector
sequences were removed by CROSS-MATCH (http://www. genome.washington.edu). High-quality BAC end sequences (defined as
those having >75 nonvector bases with PHRED-quality value >20) were used as queries in BLASTX and BLASTN (Altschul et al. 1990
) searches of 77,977 protein sequences of SwissProt version 37 (Bairoch and Apweiler 1999
) and a subset of 46,221 plant DNA sequences from
GenBank version 111 (Benson et al. 1999
). All software was run locally
on a Sun Ultra30 workstation running Solaris 2.6.
| |
ACKNOWLEDGMENTS |
|---|
We thank Dr. David Frisch, Edward Bishop, and Michael Atkins of the CUGI BAC Resource Center for their help in using the Q-bot for robotic picking and filter gridding. We also thank Yeisoo Yu for his BAC end sequencing protocol and Maciek Sasinowski and Robert Kingsbury III for their bioinformatics assistance. This work was supported by the U.S. Department of Agriculture (NRICGP grant no. 9701388), the National Science Foundation (NSF) MRI grant no. 9724557 and NSF Plant Genome grant no. DBI-987276), and the Coker Endowed Chair to R.A.W.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Corresponding author.
E-MAIL rwing{at}clemson.edu: FAX (864) 656-4293.
| |
REFERENCES |
|---|
|
|
|---|
Received May 26, 1999; accepted in revised form November 9, 1999.
This article has been cited by other articles:
![]() |
T. Hain, S. Otten, U. von Both, S. S. Chatterjee, U. Technow, A. Billion, R. Ghai, W. Mohamed, E. Domann, and T. Chakraborty Novel Bacterial Artificial Chromosome Vector pUvBBAC for Use in Studies of the Functional Genomics of Listeria spp. Appl. Envir. Microbiol., March 15, 2008; 74(6): 1892 - 1901. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Galpaz, G. Ronen, Z. Khalfa, D. Zamir, and J. Hirschberg A Chromoplast-Specific Carotenoid Biosynthesis Pathway Is Revealed by Cloning of the Tomato white-flower Locus PLANT CELL, August 1, 2006; 18(8): 1947 - 1960. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Barry and J. J. Giovannoni From The Cover: Ripening in the tomato Green-ripe mutant is inhibited by ectopic expression of a protein that disrupts ethylene signaling PNAS, May 16, 2006; 103(20): 7923 - 7928. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, X. Tang, Z. Cheng, L. Mueller, J. Giovannoni, and S. D. Tanksley Euchromatin and Pericentromeric Heterochromatin: Comparative Composition in the Tomato Genome Genetics, April 1, 2006; 172(4): 2529 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Peters, J. C. van Haarst, T. P. Jesse, D. Woltinge, K. Jansen, T. Hesselink, M. J. van Staveren, M. H.C. Abma-Henkens, and R. M. Klein-Lankhorst TOPAAS, a Tomato and Potato Assembly Assistance System for Selection and Finishing of Bacterial Artificial Chromosomes Plant Physiology, March 1, 2006; 140(3): 805 - 817. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Tapia, I. Verdugo, M. Yanez, I. Ahumada, C. Theoduloz, C. Cordero, F. Poblete, E. Gonzalez, and S. Ruiz-Lara Involvement of Ethylene in Stress-Induced Expression of the TLC1.1 Retrotransposon from Lycopersicon chilense Dun. Plant Physiology, August 1, 2005; 138(4): 2075 - 2086. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Guyot, X. Cheng, Y. Su, Z. Cheng, E. Schlagenhauf, B. Keller, and H.-Q. Ling Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus Plant Physiology, July 1, 2005; 138(3): 1205 - 1215. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Mueller, T. H. Solow, N. Taylor, B. Skwarecki, R. Buels, J. Binns, C. Lin, M. H. Wright, R. Ahrens, Y. Wang, et al. The SOL Genomics Network. A Comparative Resource for Solanaceae Biology and Beyond Plant Physiology, July 1, 2005; 138(3): 1310 - 1317. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Ratnayaka, M. Baga, D. B. Fowler, and R. N. Chibbar Construction and Characterization of a BAC Library of a Cold-Tolerant Hexaploid Wheat Cultivar Crop Sci., June 24, 2005; 45(4): 1571 - 1577. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Li, A. L. Schilmiller, G. Liu, G. I. Lee, S. Jayanty, C. Sageman, J. Vrebalov, J. J. Giovannoni, K. Yagi, Y. Kobayashi, et al. Role of {beta}-Oxidation in Jasmonate Biosynthesis and Systemic Wound Signaling in Tomato PLANT CELL, March 1, 2005; 17(3): 971 - 986. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van der Knaap, A. Sanyal, S. A. Jackson, and S. D. Tanksley High-Resolution Fine Mapping and Fluorescence in Situ Hybridization Analysis of sun, a Locus Controlling Tomato Fruit Shape, Reveals a Region of the Tomato Genome Prone to DNA Rearrangements Genetics, December 1, 2004; 168(4): 2127 - 2140. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-Y. Chen and S. D. Tanksley High-Resolution Mapping and Functional Analysis of se2.1: A Major Stigma Exsertion Quantitative Trait Locus Associated With the Evolution From Allogamy to Autogamy in the Genus Lycopersicon Genetics, November 1, 2004; 168(3): 1563 - 1573. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Li, Y. Zhao, B. C. McCaig, B. A. Wingerd, J. Wang, M. E. Whalon, E. Pichersky, and G. A. Howe The Tomato Homolog of CORONATINE-INSENSITIVE1 Is Required for the Maternal Control of Seed Maturation, Jasmonate-Signaled Defense Responses, and Glandular Trichome Development PLANT CELL, January 1, 2004; 16(1): 126 - 143. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Li, G. Liu, C. Xu, G. I. Lee, P. Bauer, H.-Q. Ling, M. W. Ganal, and G. A. Howe The Tomato Suppressor of prosystemin-mediated responses2 Gene Encodes a Fatty Acid Desaturase Required for the Biosynthesis of Jasmonic Acid and the Production of a Systemic Wound Signal for Defense Gene Expression PLANT CELL, July 1, 2003; 15(7): 1646 - 1661. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Fridman and D. Zamir Functional Divergence of a Syntenic Invertase Gene Family in Tomato, Potato, and Arabidopsis Plant Physiology, February 1, 2003; 131(2): 603 - 609. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-S. Yim, G. L. Davis, N. A. Duru, T. A. Musket, E. W. Linton, J. W. Messing, M. D. McMullen, C. A. Soderlund, M. L. Polacco, J. M. Gardiner, et al. Characterization of Three Maize Bacterial Artificial Chromosome Libraries toward Anchoring of the Physical Map to the Genetic Map Using High-Den |