|
|
|
|
Vol. 8, Issue 5, 557-561, May 1998
GENOME METHODS
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Using the recently introduced BigDyeTM terminators, large-template DNA can be directly sequenced with custom primers on automated instruments. Cycle sequencing conditions are presented to sequence DNA samples isolated from a number of microbial genomes including 750-kb Ureaplasma urealyticum, 1.2-Mb Mycoplasma fermentans, 2.3-Mb Streptococcus pneumoniae, and 4.6-Mb Escherichia coli. Average read lengths of >700 bp from unique primer annealing sites are often sufficient to fill final gaps in microbial genome sequencing projects without additional manipulations of template DNA. The technique can also be applied to sequence-targeted regions, thereby bypassing tedious subcloning steps.
| |
INTRODUCTION |
|---|
|
|
|---|
In microbial genome or large-insert clone sequencing projects that
use the predominant random subclone sequencing strategy, progress tends
to decrease dramatically at late stages as one confronts
gaps. At these points, DNA is under-represented or
unstable in subclones (E.Y. Chen et al. 1996
; Chissoe et al. 1997
).
Further sequencing with additional random subclones is then inefficient at best, and one must frequently employ alternative cloning systems or
additional methods like long-range PCR to recover missing DNA (C.N.
Chen et al. 1996
). The variability of performance of these methods and
the necessity for custom-tailored work tend to hamper the late stages
of sequencing efforts. In contrast, if one can sequence directly from
genomic DNA (or large-insert clones such as BACs or PACs) with walking
primers, cumbersome work to fill gaps could be completed in a much
shorter time.
As an example, in a recent project to sequence the 750-kb genome of
Ureaplasma urealyticum (J. Glass, in prep.) assemblage of
~13,000 sequence reads and combinatorial PCR reactions to join contigs left two gaps. No
pUC, or M13 subclones were recovered that spanned the gaps, nor were PCR products derived with any of
several sets of flanking primers. The difficulty of cloning these
segments is probably attributable to repeated sequences in and near the
two gaps, but the high sensitivity of the recently introduced BigDye
terminator (Rosenblum et al. 1997
) permitted direct sequencing of the
gap regions on genomic U. urealyticum DNA templates. Using the
conditions described in this report, two gaps of 259 and 121 bp were
sequenced from both strands with walking primers to complete the
project of 751,723 bp.
Direct sequencing was further tested for larger templates, and good results were reproducibly obtained with 1.2-Mb Mycoplasma fermentans, 2.3-Mb Streptococcus pneumoniae, and 4.6-Mb Escherichia coli genomic DNA (see example in Fig. 1). In addition, several difficult gaps in sequencing projects with BAC clones, ranging in size from 140 to 250 kb, have also been filled in this manner. Essentially the method is applicable whenever 2-3 µg of high-quality large-template DNA is available.
|
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
Figure 1 shows an example of the results from these experiments.
Although the signal intensity tends to be low
only ~10%-20% compared to the data from regular M13 or pUC templates
base-calling quality remains high, because the baseline noise is sharply reduced by
the increased brightness and improved spectral resolution of the BigDye
terminators (Rosenblum et al. 1997
). Lower signal strength is expected
considering the molarity of microbial template DNA, which is several
hundred to a thousand times less than that of the regular plasmid
templates. Higher level of primers (2×-5×) and greater number of
cycles (from 45 to 60, more cycles for larger templates) as described
in Methods helped to boost the signal intensities. The addition of
cycles (up to 99) has been found to increase the signal strength and
decrease the readable range (see Table 1). Accurate
quantitation of template DNA to within 2-4 µg is essential. Too
much template (>5 µg) produced much lower quality results (see
Fig. 2 for an example), whereas too little DNA also gave rise to weak
signal and low-quality results (data not shown).
|
|
There are several other factors that are also important for optimal results. First, template DNA should be free of salts, detergents, proteins, and cell debris that might interfere with the primer annealing. Second, as may be expected, having a good primer is critical. Successful primers have been typical 21-25-mers with appropriate GC contents from a unique site. Only 1 of the 10 primers tested failed to generate a useful sequence because that primer annealed at two different locations on the S. pneumoniae genome. Third, special care should be taken in the elimination of excess dye terminators before loading samples on gels (note that carryover of dyes can be seen in Fig. 1 in the region of bases 45-55; apparently the system is very sensitive to residual dye when signals are so low). Fourth, to get high-quality, low-signal data, it is important to have a well-tuned sequencing instrument equipped with a good multicomponent matrix and base-calling software capable of analyzing data with weak signals. Here we used version 3.0.1b3 software, which requires no minimal signals.
There appears to be no correlation between the quality of the sequence
data and the size of the template, as shown in Table 2, in which nine
different sequences obtained from bacterial genomic templates are
compared with corresponding GenBank sequences that were presumably
derived from subcloned templates. Using the unedited
sequence files of up to 600 bases beyond the first legible base,
genome-derived sequences have an average fidelity of 98.2% (1.1%
N and 0.7% discordant base-calls). After minimal manual editing of the sequences, the average useable read length was 712 ± 18 bases. It is interesting that the signal strength remains relatively constant among these nine genomic templates ranging from
0.75 to 4.6 Mb. This suggests that the present protocol, after some
modifications, may be applicable to even larger templates. The method,
nevertheless, does require 2-3 µg of high-quality template DNA for
each sequencing reaction
an amount that could be difficult to obtain
from certain microorganisms.
|
In summary, we have demonstrated that template DNAs up to 4.6 Mb in size can be directly sequenced with automated sequencers using custom primers and BigDye terminators. This protocol could expedite large-scale sequencing projects by facilitating gap closure, especially for difficult areas that are refractory to cloning and PCR methods. The technique can also be applied to bypass tedious subcloning steps in bacterial sequencing projects that focus on individual genes.
| |
METHODS |
|---|
|
|
|---|
Preparation of DNA Samples
Multiple methods, including Easy-DNA kit (Invitrogen, USA),
SDS-proteinase K lysis procedure, and isopycnic banding in CsCl gradients (Wilson 1994
), all worked well to prepare microbial genomic
DNA samples for sequencing. BAC DNA was purified as described by
Rosenblum et al (1997)
. Sequencing primers were designed using Oligo
5.0 (National Biosciences, USA). Sizes varied from 21 to 25 bases and
Tm from 60°C to 74°C (GC% method).
Sequencing Conditions
Each cycle sequencing reaction contained 16 µl of BigDye Terminator mix (PE-Applied Biosystems, USA), 13 pmoles of primer, and 2-3 µg of microbial DNA (or 6 pmoles of primer and 0.4 µg of BAC DNA) in a total of 40-µl volume. In some experiments 1 µl of ThermoFidelase I (Fidelity Systems, USA) was included to improve data quality, but the results were inconsistent, so it is not a standard component of current reaction mixtures. The cycle conditions were initial denaturation at 95°C for 5 min followed by 30-60 cycles (30 for BACs, 45 for all microbial DNAs, except E. coli samples, which get 60 cycles; see Results/Discussion) at 95°C for 30 sec, 55°C for 20 sec, and 60°C for 4 min. Excess dye terminators were removed with a spin column (PE-Applied Biosystems User Manual) and reaction mixtures were dried in a SpeedVac system. Each sample was resuspended in 2 µl of formamide solution and denatured by heat, and the entire volume loaded on an ABI 377 automated DNA sequencing instrument with a 36-well comb and a 48-cm WTR (well-to-read) gel. Electrophoresis was at 2400 V for 10-11 hours. Sequence data were analyzed by ABI version 3.0.1b3 software modified for weak signal base-calling (available through PE-ABI ftp site).
| |
ACKNOWLEDGMENTS |
|---|
We thank Dan Allison for modifying the base-calling software, Jennifer Glass for help in preparing the microbial DNAs, and Chun-Nan Chen for valuable discussions. This work is supported, in part, by the National Institutes of Health (grants HG00201 and RO1 AI28279).
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
4 Corresponding author.
E-MAIL cheney{at}perkin-elmer.com; FAX (650) 638-6177.
| |
REFERENCES |
|---|
|
|
|---|
Received November 26, 1997; accepted in revised form March 18, 1998.
This article has been cited by other articles:
![]() |
M. Iovene, E. Grzebelus, D. Carputo, J. Jiang, and P. W. Simon Major cytogenetic landmarks and karyotype analysis inDaucus carota and other Apiaceae Am. J. Botany, July 1, 2008; 95(7): 793 - 804. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-H. Klusmann, U. Creutzig, M. Zimmermann, M. Dworzak, N. Jorch, C. Langebrake, A. Pekrun, K. Macakova-Reinhardt, and D. Reinhardt Treatment and prognostic impact of transient leukemia in neonates with Down syndrome Blood, March 15, 2008; 111(6): 2991 - 2998. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Al-Saady, A. J. Khan, A. Calari, A. M. Al-Subhi, and A. Bertaccini 'Candidatus Phytoplasma omanense', associated with witches'-broom of Cassia italica (Mill.) Spreng. in Oman Int J Syst Evol Microbiol, February 1, 2008; 58(2): 461 - 466. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pilhofer, A. P. Bauer, M. Schrallhammer, L. Richter, W. Ludwig, K.-H. Schleifer, and G. Petroni Characterization of bacterial operons consisting of two tubulins and a kinesin-like gene by the novel Two-Step Gene Walking method Nucleic Acids Res., November 29, 2007; 35(20): e135 - e135. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Alhapel, D. J. Darley, N. Wagener, E. Eckel, N. Elsner, and A. J. Pierik Molecular and functional analysis of nicotinate catabolism in Eubacterium barkeri PNAS, August 15, 2006; 103(33): 12341 - 12346. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Hudson, T. S. Gorton, L. Papazisi, K. Cecchini, S. Frasca Jr., and S. J. Geary Identification of a Virulence-Associated Determinant, Dihydrolipoamide Dehydrogenase (lpd), in Mycoplasma gallisepticum through In Vivo Screening of Transposon Mutants Infect. Immun., February 1, 2006; 74(2): 931 - 939. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Clapper, A.-H. T. Tu, W. L. Simmons, and K. Dybvig Bacteriophage MAV1 Is Not Associated with Virulence of Mycoplasma arthritidis Infect. Immun., December 1, 2004; 72(12): 7322 - 7325. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ekins, F. Bahrami, A. Sijercic, D. Maret, and D. F. Niven Haemophilus somnus Possesses Two Systems for Acquisition of Transferrin-Bound Iron J. Bacteriol., July 1, 2004; 186(13): 4407 - 4411. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Westberg, A. Persson, A. Holmberg, A. Goesmann, J. Lundeberg, K.-E. Johansson, B. Pettersson, and M. Uhlen The Genome Sequence of Mycoplasma mycoides subsp. mycoides SC Type Strain PG1T, the Causative Agent of Contagious Bovine Pleuropneumonia (CBPP) Genome Res., February 1, 2004; 14(2): 221 - 227. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Wall, S. Roos, K. Jacobsson, A. Rosander, and H. Jonsson Phage display reveals 52 novel extracellular and transmembrane proteins from Lactobacillus reuteri DSM 20016T Microbiology, December 1, 2003; 149(12): 3493 - 3505. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Serror, G. Ilami, H. Chouayekh, S. D. Ehrlich, and E. Maguin Transposition in Lactobacillus delbrueckii subsp. bulgaricus: identification of two thermosensitive replicons and two functional insertion sequences Microbiology, June 1, 2003; 149(6): 1503 - 1511. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ekins and D. F. Niven Identification of fur and fldA Homologs and a Pasteurella multocida tbpA Homolog in Histophilus ovis and Effects of Iron Availability on Their Transcription J. Bacteriol., May 1, 2002; 184(9): 2539 - 2542. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Nagai, J. C. Kagan, X. Zhu, R. A. Kahn, and C. R. Roy A Bacterial Guanine Nucleotide Exchange Factor Activates ARF on Legionella Phagosomes Science, January 25, 2002; 295(5555): 679 - 682. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Melville, K. P. Scott, D. K. Mercer, and H. J. Flint Novel Tetracycline Resistance Gene, tet(32), in the Clostridium-Related Human Colonic Anaerobe K10 and Its Transmission In Vitro to the Rumen Anaerobe Butyrivibrio fibrisolvens Antimicrob. Agents Chemother., November 1, 2001; 45(11): 3246 - 3249. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hoskins, W. E. Alborn Jr., J. Arnold, L. C. Blaszczak, S. Burgett, B. S. DeHoff, S. T. Estrem, L. Fritz, D.-J. Fu, W. Fuller, et al. Genome of the Bacterium Streptococcus pneumoniae Strain R6 J. Bacteriol., October 1, 2001; 183(19): 5709 - 5717. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Krappmann, W. N. Lipscomb, and G. H. Braus Coevolution of transcriptional and allosteric regulation at the chorismate metabolic branch point of Saccharomyces cerevisiae PNAS, November 22, 2000; (2000) 240469697. [Abstract] [Full Text] |
||||
![]() |
S. Krappmann, R. Pries, G. Gellissen, M. Hiller, and G. H. Braus HARO7 Encodes Chorismate Mutase of the Methylotrophic Yeast Hansenula polymorpha and Is Derepressed upon Methanol Utilization J. Bacteriol., August 1, 2000; 182(15): 4188 - 4197. [Abstract] [Full Text] |
||||
![]() |
S. Palacios and J. C. Escalante-Semerena prpR, ntrA, and ihf Functions Are Required for Expression of the prpBCDE Operon, Encoding Enzymes That Catabolize Propionate in Salmonella enterica Serovar Typhimurium LT2 J. Bacteriol., February 15, 2000; 182(4): 905 - 910. [Abstract] [Full Text] |
||||
![]() |
The C. elegans Sequencing Consortium Genome Sequence of the Nematode C. elegans: A Platform for Investigating Biology Science, December 11, 1998; 282(5396): 2012 - 2018. [Abstract] [Full Text] |
||||
![]() |
M. Lu, L. S. Holliday, L. Zhang, W. A. Dunn Jr., and S. L. Gluck Interaction between Aldolase and Vacuolar H+-ATPase. EVIDENCE FOR DIRECT COUPLING OF GLYCOLYSIS TO THE ATP-HYDROLYZING PROTON PUMP J. Biol. Chem., August 3, 2001; 276(32): 30407 - 30413. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Krappmann, W. N. Lipscomb, and G. H. Braus Coevolution of transcriptional and allosteric regulation at the chorismate metabolic branch point of Saccharomyces cerevisiae PNAS, December 5, 2000; 97(25): 13585 - 13590. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||