|
|
|
|
Vol. 10, Issue 1, 62-71, January 2000
LETTER
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.[The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AF209020-209030 and AF209508-209518.]
| |
INTRODUCTION |
|---|
|
|
|---|
Escherichia coli is a species of
Gram-negative bacterium composed of numerous strains and serotypes
(Ochman and Selander 1984
; Ahmed et al. 1987
; Jay 1996
). Although
certain strains comprise an important element of the normal intestinal
microflora (Johnson 1991
; Hays 1992
), other strains produce toxins and
are pathogenic (Johnson 1991
; Olsivik et al. 1992
; Yu and Kaper 1992
).
In environmental monitoring studies, coliform bacteria provide a
presumptive indictor of fecal contamination of surface waters or food
(European Economic Community 1980
; American Public Health Association
et al. 1985
; Hays 1992
). Food-safety studies routinely include
monitoring for contamination by pathogenic E. coli (Vanderzant
and Splittstoesser 1992
), particularly in meat processing (Padhye and
Doyle 1992
; Witham et al. 1996
). E. coli is an important model
organism for study of gene expression in prokaryotes (Niedhardt 1996
).
Rapid detection and characterization of E. coli strains poses
important scientific and practical applications.
Simple sequence repeats (SSRs, or microsatellites) are a class of DNA
sequences consisting of simple motifs of 1-6 nucleotides that are
tandemly repeated from two or three up to a few dozen times at a locus
(Vogt 1990
). SSRs long have been known to be distributed throughout the
genomes of eukaryotes and to be highly polymorphic (Tautz 1989
; Weber
1990
). There is accumulating evidence that SSRs serve a functional
role, affecting gene expression, and that polymorphism of SSR tracts
may be important in the evolution of gene regulation (Rosenberg et al.
1994
; Kunzler et al. 1995
; Kashi et al. 1997
; King et al. 1997
; Kashi
and Soller 1998
; Tonjum et al. 1998
; Moxon and Wills 1999
; van Belkum
1999
). The sequencing of prokaryotic genomes allows screening of entire
genomes for the existence of SSRs (Field and Wills 1996
, 1998
),
revealing large numbers of SSR tracts not detected in earlier studies
that focused on particular loci. Recent publication of the complete genome sequence for E. coli (Blattner et al. 1997
) provides
the basis for characterizing SSR tracts in this organism, both
genome-wide and at particular loci.
In this study we screen the entire E. coli genome for the presence, locations, and composition of SSR tracts. We test our observations against the null hypotheses that SSRs are randomly distributed among coding and noncoding regions and that they collectively have the same composition as the genome. We show that SSRs are differentially distributed among coding and noncoding regions. We also show that SSRs are polymorphic among E. coli strains, providing potential marker loci for rapid detection and characterization. To our knowledge, this is a first analysis of the E. coli genome for such purposes and represents a general approach for analysis of other prokaryotic genomes.
| |
RESULTS |
|---|
|
|
|---|
Genomic Content, Distribution, and Composition of SSRs
Although the existence and abundance of SSRs in eukaryotes are well
documented, SSRs are not well studied in prokaryotes. Using computer
software that we developed, we conducted a genome-wide scan of the DNA
sequence of E. coli. A total of 235,495 SSR tracts were found
(Table 1). These tracts were distributed rather
evenly throughout the genome (Fig. 1). Total lengths
of particular SSR tracts in E. coli were small (Table 1; Fig.
1). Those with mononucleotide repeats seldom exceeded 9 bp in length,
and higher-order SSRs (i.e., those with di-, tri-, or tetranucleotide
repeats) rarely exceeded 12 bp. SSR tracts of 6 or more bp in length
comprised 2.4% of the E. coli genome (a total of 109 kb).
|
|
Analysis of genome-wide frequencies of SSR arrays of given motif length and repeat number showed a significant (P < 0.001) excess of mono- and trinucleotide SSRs relative to expectations (Table 1). Expected frequencies of SSRs of given motif length and repeat number were determined by observing those in 10 computer-generated genomes constructed by random ordering of nucleotides according to their overall frequencies in the genome, with departures tested using parametric statistics. A few significant test results for tract lengths larger than 10 bp (data not shown) likely were attributable to the small numbers expected.
In eukaryotes, SSRs are most abundant in noncoding areas that have
little or no effect on gene expression. To determine whether this also
was the case for E. coli, its complete DNA sequence first was
subjected to a computerized screening for the gross locations of SSRs
relative to open reading frames (ORFs). The E. coli K-12
genome of 4.64 × 106 nucleotides includes 79.5% of the
genome in ORFs and 20.5% of the genome in noncoding regions (Table
2). Mononucleotide SSRs 3 bp in length were
distributed among coding and noncoding regions at very nearly the same
proportions, 78.0% and 22.0%, respectively. However, as
mononucleotide repeat number became higher, the tracts became more and
more under-represented in ORFs. The regression of proportion of
mononucleotide SSRs in noncoding regions on tract length was positive
and significant (see Table 2). In contrast, the distribution of SSR
tracts with higher-order motifs among ORFs and noncoding regions
approximated the overall proportion of these regions in the genome.
|
The nucleotide composition of the E. coli genome and its SSR
tracts, including breakdowns for coding and noncoding regions, is
presented in Table 3. The composition
of mono- and dinucleotide SSRs differed from that of the genomic
regions in which they occurred. The composition of mononucleotide SSRs
exhibited a strong over-representation of A and T, 93% overall (Table
3b). Of the six possible dinucleotide motifs, the 49.1% frequency of
CG/GC in ORFs clearly exceeded the 17.3% expected. In noncoding
regions, AT/TA was over-represented relative to expectation (24.4% vs.
17.9%), as was CG/GC (23.1% vs. 15.4%). The frequencies of SSRs with
particular motifs of 3 or 4 bp did not represent all possible
combinations equally. Most notably, of 52 tetranucleotide SSRs, TGGC
occurred 12 times and its complement, GCCA, 9 times in coding
sequences. The finding that the E. coli genome is rich in TGGC
has been attributed to the activity of VSP (very
short patch) repair that corrects T:G mismatches to C:G (Bhagwat and McClelland 1992
; Gutierrez et al. 1994
).
The occurrence of three repeats of the tetranucleotide TGGC has been
identified as a mutation hot spot in the promoter of the lacI
gene (Sedgwick et al. 1986
; Murata-Kamiya et al. 1997
).
|
|
Fine Positions of SSRs Relative to ORFs
The presence and variation of SSRs in upstream regulatory elements might affect the expression of ORFs in either an on/off or a quantitative manner. We surveyed the fine positions of all SSRs in noncoding regions genome-wide relative to the ATG codon marking the start of translation of the adjacent gene (Fig. 2). There are 2178 mononucleotide SSR tracts >6 bp in length within 200 bp upstream of such ATG codons (Fig. 2A). The number of such SSRs in noncoding areas decreases with distance from the start of translation because the number of inter-ORF sequences of given length also decreases. Similar distributions, with decreasing numbers of SSRs at greater distances from the start of translation, were observed for di- and trinucleotide SSRs (Fig. 2B,C). Because this is a compact, prokaryotic genome, intergenic regions are usually short, and a subset of SSRs are in upstream areas where variation could affect gene expression (see Discussion below).
|
Polymorphism of SSRs
Screening for polymorphisms among strains of E. coli was
conducted at 14 arbitrarily chosen loci containing SSR repeats (Table 4). DNA at chosen loci was amplified by PCR using primers flanking the
particular SSR locus. Repeat number polymorphism at the ycgW locus was observed as differential mobility of radioactively labeled amplification products through polyacrylamide gels (Fig.
3), demonstrating hypervariable single-locus DNA
fingerprint bands distinguishing among E. coli strains. DNA
sequence alignments (Fig. 4) showed that a number of
mononucleotide SSR arrays in noncoding regions were polymorphic,
exhibiting two to four alleles for SSR repeat number. At three loci,
additional polymorphisms observed in sequences flanking the targeted
SSR tract proved to be due to different numbers of mononucleotides
(Fig. 4A-C). Two SSR polymorphisms at the ycgW gene (Fig. 4A)
were located upstream of the ORF at the
77 position and, depending
on the strain, at the
84 to
89 position relative to the ATG
codon at the start of translation. DNA from some of the pathogenic
strains did not exhibit PCR amplification; presumably, one or both
primers did not anneal because of sequence variation at the site.
|
|
Overall, four SSR tracts exhibited length polymorphism among strains of E. coli (Table 4). All four polymorphic sites shared three characteristics. Namely, they involved mononucleotide SSRs in noncoding regions. This is particularly striking because, in all, only five sites meeting these criteria were examined. In contrast, length polymorphism was not shown by two mononucleotide SSRs in coding regions or by seven higher-order SSRs in either coding or noncoding regions. The numbers examined in these categories, however, were too small to determine which of the two defining characteristics (mononucleotide motif or noncoding location) was more important for the presence of polymorphism. All SSRs examined had a tract length of at least 8 nucleotides in the sequenced E. coli K12 genome. Thus, mononucleotide SSRs of this length appear to have a high likelihood of being polymorphic among E. coli strains. In all, there are 240 mononucleotide tracts of this length in the E. coli K12 genome. Polymorphism among tracts of lesser length was not examined.
| |
DISCUSSION |
|---|
|
|
|---|
Until recently, SSR regions in E. coli were thought to be
rare and limited to dinucleotide SSRs with a maximum of five repeat units per locus (van Belkum et al. 1998
). However, Field and Wills (1998)
presented data reporting tens of thousands of mononucleotide SSRs in E. coli and showing the existence of SSRs with longer motifs. Our results confirm that SSR tracts in E. coli are
numerous and diverse in terms of motif and repeat number and show that they are widely distributed throughout the genome. We show that mononucleotide SSRs occur more frequently than expected in noncoding areas. SSRs of many motif lengths differ in composition from the genomic regions in which they occur, with mononucleotide SSRs with
poly(A) or poly(T) strongly over-represented in both coding and noncoding
regions. We show polymorphism of mononucleotide SSRs in noncoding regions.
Distribution of SSR Tract Length and Structure
Mutation at SSR loci is believed to be the consequence of
slipped-strand mispairing during DNA replication (Strand et al. 1993
).
This is because the tertiary structure of repetitive DNA allows
mismatching of neighboring repeats, and depending on the strand
orientation, repeats can be inserted or deleted during DNA
polymerase-mediated DNA duplication (Coggins and O'Prey 1989
; Hauge
and Litt 1993
; Chiurrazzi et al. 1994
). The resulting mutations are not
always repaired by DNA mismatch-repair mechanisms (Strand et al. 1993
;
Modrich and Lahue 1996
). Our observation of upper limits for SSR array
lengths in E. coli (i.e., 9 bp for mononucleotides and 12 bp
for SSRs with longer motifs; Fig. 1) suggests that the tendency for
repeat length at a locus to rise via mutation is counteracted by
selection. Such selection might operate through an uncharacterized
mechanism on the length of the SSR sequence itself or on gene
expression as affected by the SSR sequence at issue.
Interacting processes of mutation and selection can be invoked to explain observations regarding motif length and repeat number at SSR tracts. Slipped-strand mispairing is more likely for mononucleotide SSRs than for higher-order SSRs, because both strand separation and slippage are more likely. This is particularly important for poly(A) and poly(T), as strand separation is easier than for poly(C) and poly(G). For higher-order SSRs having small repeat number, there is very little mutability in repeat number. Thus, selection would have considerable opportunity to operate only against larger repeat numbers. In coding regions, variation in mononucleotide SSR repeat number causes frame-shift, nonsense mutations and, hence, will be selected against strongly. Thus, there will be a balance between production of SSRs and selection against them. In noncoding regions, any effects of mononucleotide SSR repeat number variation are less obvious. The tremendous lack of poly(C) and poly(G) SSR tracts is remarkable and requires explanation.
Expectations of SSR frequencies were calculated on the basis of the
genome-wide nucleotide composition of ORFs and noncoding regions.
Within ORFs, this approach does not reflect differential nucleotide
composition at the first, second, and third codon positions (Andachi et
al. 1987
); differential nucleotide compositions among different classes
of genes (Hirosawa et al. 1997
); and the effect of codon usage bias on
the distribution of polynucleotides within ORFs (Sharp 1991
; Sharp et
al. 1995
). The importance of these effects remains to be evaluated in
future study.
Locations of SSRs Relative to ORFs
To assess the likelihood that SSRs might affect gene expression, we
examined the positions of all SSRs in the genome with regard to ORFs.
In Figure 2, we show the distribution of SSRs upstream of ORFs in
relation to the first ATG codon of ORFs. Substantial numbers of SSR
tracts are localized up to 200 bp from the ATG. The DNA sequence
immediately upstream of an ORF contains proximal regulatory elements
that play an important role in controlling expression of the gene. In
E. coli, mononucleotide SSRs occurred in noncoding regions
more frequently than expected by chance. Given the compact nature of
the E. coli genome, almost any genetic variation might affect
gene function; however, variation in SSR arrays at regulatory regions
of genes must affect gene expression in a way that can be tolerated by
the host (Kashi et al. 1997
; King et al. 1997
). Variation at or near
regulatory elements can influence gene expression by affecting binding
of regulatory elements (Bewley et al. 1998
), distance between
regulatory elements, bending of DNA (Perez-Martin et al. 1994
),
blocking of DNA replication elongation (Krasilnikov et al. 1999
),
phasing on the DNA helix, formation of unusual DNA structures
(Williamson 1994
; Soyfer and Potaman 1995
), DNA coiling, DNA packaging
(Pettijohn 1988
), or other mechanisms (Kashi 1998
). Some of these
variations affect gene expression in a gross on-off manner (Rosenberg
et al. 1994
; Moxon and Wills 1999
), whereas others affect fine-tuning
of the level of gene expression (Kashi et al. 1997
; King et al. 1997
). Hypervariable SSR loci serve as transcriptional or translational switches in a variety of pathogenic (Himmelreich et al. 1996
; Karlin et
al. 1996
; Henaut et al. 1998
) and nonpathogenic (Field and Wills 1998
)
microbes. Our computer-based screening showed that large repeat tracts
with motifs of 2 or more bp do not occur in the E. coli K12
genome. It has been shown in eukaryotes that tracts of certain types of
repetitive DNA are localized to the 5' or 3' flanking regions
of genes, where they may affect nucleosome organization, recombination,
or regulation of gene expression or gene product activity (Tripathi and
Brahamachari 1977
; Kashi et al. 1997
; King et al. 1997
; Kashi and
Soller 1998
), suggesting the need for further study in prokaryotes.
Practical and Evolutionary Implications of SSR Polymorphisms
Observation of repeat number variation at SSR loci in E. coli suggests that SSRs may prove a ready source of polymorphisms for marking its genome. SSR loci in other prokaryotes also have been
shown to exhibit length polymorphisms (for review, see Moxon et al.
1994
; van Belkum et al. 1998
). For example, variation for specific
trinucleotide repeats of very large tract size was shown for
Neisseria meningitidis, Mycoplasma genitalium, and
Mycobacterium leprae (Field and Wills 1996
), and SSR variation
has been observed in Staphylococcus aureus and Hemophilus
influenzae (van Belkum et al. 1996
, 1997a
,b
). Polymorphism of SSR
tracts in prokaryotes poses both practical and evolutionary implications.
Although E. coli is part of the normal human microflora, there
are pathogenic strains for which rapid detection and strain identification are important. Present-day approaches for typing of
prokaryotes (Vanderzant and Splittstoesser 1992
) have limited ability
to distinguish among E. coli strains and are time consuming (Padhye and Doyle 1992
; Yu and Kaper 1992
; Witham et al. 1996
). Screenings of SSR variation may provide the basis for rapid and sensitive identification of pathogenic and nonpathogenic E. coli strains. Polymorphic mononucleotide sites found in E. coli exhibited 1-4 bp size differences. The small numbers of
repeats are well suited for development of SSR allele-specific
oligonucleotides (ASOs). Such ASOs may be used, for example, as PCR
primers, or as hybridization probes that can be spotted on DNA
microarrays (Southern 1996
; Marshall and Hodgson 1998
; Ramsey 1998
) for
rapid, automated characterization of variation at a given set of loci for purposes of DNA fingerprinting of E. coli strains.
Similarly, knowledge of SSR variation in other pathogenic microbes,
such as H. influenzae (van Belkum et al. 1997a
), Candida
albicans (Field et al. 1996
; Bretagne et al. 1997
),
Bacteroides fragilis and Bacteroides thetaiotaomicron
(Claros et al. 1997
), Helicobacter pylori (Marshall et al.
1996
), and N. meningitidis (Tonjum et al. 1998
), has been or
could be applied for rapid detection and strain characterization. A
DNA-fingerprinting approach based on SSR polymorphism also can be used
for epidemiological purposes, for example, to determine whether a
pathogenic E. coli strain detected in a patient matches a
known or suspected source of a given disease outbreak. SSRs have been
used as markers for such purposes for several pathogenic microbes (for
review, see van Belkum 1999
), including Mycobacterium tuberculosis (van Soolingen et al. 1993
), H. pylori
(Marshall et al. 1996
), and H. influenzae (van Belkum et al.
1997a
). To demonstrate a similar approach in E. coli, a
collection of allelic SSR markers distinguishing relevant strains will
have to be developed. Recent work with hypervariable markers in
pathogenic microbes (van Belkum et al. 1996
; Moxon and Wills 1999
)
shows that the variability at particular markers will have to be
evaluated to determine that it reflects the overall rate of evolution
of the E. coli genome.
SSRs can be screened to determine whether such molecular variation
gives rise to phenotypic variation. For example, SSR variability poses
clear implications for virulence in pathogenic microbes. Tracts of SSRs
have been found within confirmed or potential virulence genes of
H. influenzae (Karlin et al. 1997
); Neisseria sp.,
Hemophilus parainfluenzae, and Moraxella catarrhalis
(Peak et al. 1996
), and repeat number variation seems to be related to
modulation of expression of virulence factors. Contingency genes
containing SSRs exhibit high mutation rates, allowing the bacterium to
respond rapidly to challenging environmental conditions (Moxon et al. 1994
). Locating SSR repeat arrays by computerized search of the genomic
sequence and localization of such arrays with regard to expressed
genes, as we report here, could provide a basis for discovering new
virulence- or other key phenotype-determining loci in bacteria.
All of the SSR polymorphisms observed at the arbitrarily chosen sites
screened in this study were in noncoding regions. Over an evolutionary
time frame, E. coli has allowed these polymorphisms to
persist. Allelic variation mostly was conserved within each E. coli strain that we screened. These SSR sites were not
hypervariable, as were SSRs at contingency genes in pathogens such as
H. influenzae (Karlin et al. 1997
; van Belkum et al. 1997a
;
Moxon and Wills 1999
). These observations may support the hypotheses
(Moxon et al. 1994
; Kashi et al. 1997
; King et al. 1997
; Moxon and
Wills 1999
) that mutation rates are higher in genes whose products
interact with the environment in unpredictable ways and that SSRs
affect mutability so that different classes of genes have adaptively appropriate mutation rates. Mutability rate may be mediated by SSR
motif length and overall tract length. We hypothesize that certain SSR
variation drives fine-tuning of gene expression as well as variation of
key phenotypes, providing an important target for natural selection,
thereby affecting evolution of both pathogenic and nonpathogenic
E. coli strains. It is possible that some portion of
between-strain functional variation in E. coli results from differences in SSR repeat number in gene regulatory regions. Further DNA sequencing in genomes of pathogenic E. coli strains could yield insights into relative rates of mutability among SSR loci and
into the phenotypic consequences of SSR variation.
| |
METHODS |
|---|
|
|
|---|
DNA Sequence Analysis Software
We developed DNA sequence analysis software in the programming language C that screens entire genomes for SSRs and reports motif, number of repeats, and genomic position. It is available for downloading from our university's ftp site at ftp://ftp.technion.ac.il/pub/supported/biotech/ssr.exe. It searches for all of the SSRs with motif lengths up to 10 bp; records motif, repeat number, and genomic location; and reports the results in an output file. The complete genomic sequence of E. coli was obtained from http://mol.genes.nig.ac.jp/ecoli/ and screened for SSRs, their motif sequence, number of repeats, and genomic locations.
A second program in the programming language C characterizes the locations of SSR arrays in relation to ORFs in genomic sequence data sets. It reports the numbers of occurrences of SSRs of specified motif length and repeat number in both ORFs and noncoding sequences. For SSRs occurring upstream of ORFs, it reports the number of nucleotides between the SSR tract and the ATG codon marking the start of translation.
Statistical Testing of SSR Frequencies
To determine whether frequencies of SSRs of given motif length and repeat number occurred as expected by chance, ten simulated genomes were constructed by randomly choosing nucleotides at the frequencies characterizing the E. coli genome. The simulated genomes then were analyzed using the genome scanning software described above to determine the number of SSRs of given motif length and repeat number. Results of the ten runs were summarized in terms of means and standard errors, yielding expected numbers of tracts of given motif length and repeat number. Departures of observed numbers of SSRs of given motif length and repeat number from expectations were tested using parametric statistics.
Were all nucleotides equally frequent in the genome, the relative frequencies of the six possible combinations of nucleotides in dinucleotide SSRs all would equal 0.167. However, because frequencies of the respective nucleotides were not equal, expectations for the relative frequencies of particular dinucleotides (E) were adjusted, as E = (fN1 + fN2) × 2 × 0.167, where fN1 and fN2 are the frequencies of nucleotides 1 and 2, respectively. For example, the frequencies of both C and G in ORFs are 0.26, and we seek the frequencies of CG and GC dinucleotides. Hence, E = (0.26 + 0.26) × 2 × 0.167 = 0.173.
Screening for Variability of SSRs Among E. coli Strains
Nonpathogenic and pathogenic strains of E. coli screened
for variation at SSR loci included K12 (DH5
, W3110), B (SR9b,
SR9c), E (1, 7, 11, 18, 47, 52, 54, 63, 68, 69); EHEC O157:H7 (FEB,
Rowe no. E304810, HER 1057, 1058, 1261, 1265, 1266), EPEC [serotype O111ac (Rowe no. E639616)], ETEC [serotype O78:H (Rowe no. E10407)]. The K and B strains were obtained from the microbiology laboratory collection of our department. The E strains were isolated by and obtained from Ochman and Selander (1984)
. The EHEC O157:H7 HER strains
were isolated by and obtained from Ahmed et al. (1987)
. Cultures for
DNA extraction were grown on Luria broth agar plates for 24 hr at
37°C. A large loop of colonies from the plate was transferred to a
microcentrifuge tube containing 500 µl of TE buffer (pH 7.5) and
vortexed thoroughly. Bacterial cells were lysed at 80°C for 10 min
and centrifuged for 10 min at 14,000 rpm (20,800g). The pellet
was suspended in 100 µl of TE, boiled for 5 min, and centrifuged at
14,000 rpm for 2 min (Kirschner and Bottger 1996
). The supernatant was
held at
20°C until used for PCR.
Fourteen SSR loci of E. coli were selected for detailed analysis. The forward (F) and reverse (R) PCR primer sequences for the loci examined were as follows: ycgW (F = 5'-GATTTTGCATATGAGTATATTAC-3', R = 5'-TTAATTACAGGATGTTCAGTC-3'); yaiN (F = 5'-AATTTATCCGGTGAATGTGGT-3', R = 5'-CAACTTAATCTCGGGCTGAC-3'); serW (F = 5'-TTCCACAGGTAACATACTCCAC-3', R = 5'-TTTGGTGAGGTCTCCGAG-3'); YjiD (F = 5'-TACATGGCTGATTATGCGG-3', R = 5'-TCGCTATGAATATCTACTGAC-3'); aidB (F = 5'-GTCAGAGCAGATCCAGAATG-3', R = 5'-TCTACAGCAAATGAACAATG-3'); molR_1 (F = 5'-GGTCATCAGGTGAAATAATC-3', R = 5'-CGTCCTGATAGATAAAGTGTC-3'); ftsZ (F = 5'-CAATGGAACTTACCAATGAC-3', R = 5'-TACCGCGAAGAATTCAACAC-3'); b1668 (F = 5'-AGCATCAGCGCACAATGCAC-3', R = 5'-TGTATGCAGGCTGGCACAAC-3'); yiaB (F = 5'-ATAACGATCTCCATATCTAC-3', R = 5'-CTCTATCAGCAACTTCTGCC-3'); hisC (F = 5'-ATCCGCAGGATTTTCGCACC-3', R = 5'-TGCCAGCGTAAATCCGCAAC-3'); MhpR (F = 5'-AATCACCCGTTGTTCACT-3', R = 5'-CGGAACAAGACCGCAAGGA-3'); b0829 (F = 5'-ACCGCAACATCCTTACAC-3', R = 5'-TGACAAGATTACGCACTC-3'); yibA (F = 5'-AATCGGACTTTCCTACAGA-3', R = 5'-AACTCACGCTATGAACGC-3'); and caiF (F = 5'-TGAATGCCGATGCGACTG-3', R = 5'-GTATGCAACTTCACCGTC-3').
Five microliters of DNA extract (~50 ng), 2.5 µl of 10× PCR
buffer (ProMega, 25 mM Mg2+ added), 0.2 µl of
25mM dNTPs, 1.0 units of Taq polymerase (Promega), and l0 pmoles each of F and R primers were brought to a final volume of
25 µl with sterile ddH2O. Mineral oil (15-20 µl) was added for PCR in a MJ Research thermocycler without a heating cover.
The cycling conditions for PCR consisted of denaturation at 95°C for
5 min, followed by 5 cycles (1 min at 95°C, 1 min at
Tm, and 1 min at 72°C), 20 cycles (1 min at
95°C, 1 min at Tm
5°C, and 1 min at
72°C), a final step of 7 min at 72°C, and cooling to room temperature.
Methods for radioactive PCR were as follows: To label primers, 2 µl
(1 ng) of primer DNA, 2 µl of 10× T4 kinase buffer
(NEB), 4 µl of [
-35S] ATP (250 mCi, NEN), and 1 µl (10 units) of T4 DNA kinase (NEB) were brought to a
final volume of 20 µl with sterile ddH2O. The contents
were mixed and incubated at 37°C for 1 hr. The reaction was stopped
by incubation at 70°C for 10 min. For the radioactive PCR reaction,
0.5 µl of nonradioactive and 0.5 µl of radioactive primer
(together, 10 pmoles) were used following the PCR protocol described
above. To observe small size differences among PCR products, electrophoresis of radioactive products was carried out in a 5% denaturing TBE acrylamide gel. The gels were dried (80°C for 1.5 hr)
and exposed to a PhosphorImager cassette, and the results were read
using a PhosphorImager (Bas reader 100, Fuji).
PCR products were eluted from electrophoretic gels using Jetsorb (Genomed) and sequenced by the dideoxy-chain termination method using an ABI automated sequencing machine (Biological Services, Weizmann Institute, Rehovot, Israel).
| |
ACKNOWLEDGMENTS |
|---|
This research was supported in part by the Technion Otto Meyerhof Center for Biotechnology, established by the Minerva Foundation, Germany. R.G.-A. was supported by the Food Control Administration in the Israel Ministry of Health. E.H. was supported by Virginia Polytechnic Institute and State University and by the U.S. Fulbright Senior Scholars Program. We are grateful to A. Korol, T. Haran, M. Soller, N. Ulitzur, and two anonymous reviewers for constructive comments on drafts of the manuscript.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 These authors contributed equally to this work.
4 Corresponding author.
E-MAIL kashi{at}tx.technion.ac.il; FAX 972-4-8320742.
| |
REFERENCES |
|---|
|
|
|---|
Received May 14, 1999; accepted in revised form October 14, 1999.
This article has been cited by other articles:
![]() |
M. GuhaMajumdar, E. Dawson-Baglien, and B. B. Sears Creation of a Chloroplast Microsatellite Reporter for Detection of Replication Slippage in Chlamydomonas reinhardtii Eukaryot. Cell, April 1, 2008; 7(4): 639 - 646. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Diancourt, V. Passet, C. Chervaux, P. Garault, T. Smokvina, and S. Brisse Multilocus Sequence Typing of Lactobacillus casei Reveals a Clonal Population Structure with Low Levels of Homologous Recombination Appl. Envir. Microbiol., October 15, 2007; 73(20): 6601 - 6611. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Y. Broza, Y. Danin-Poleg, L. Lerner, M. Broza, and Y. Kashi Vibrio vulnificus Typing Based on Simple Sequence Repeats: Insights into the Biotype 3 Group J. Clin. Microbiol., September 1, 2007; 45(9): 2951 - 2959. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Mrazek, X. Guo, and A. Shah Simple sequence repeats in prokaryotic genomes PNAS, May 15, 2007; 104(20): 8472 - 8477. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhang, Y. Deng, J. Tan, S. Hu, J. Yu, and Q. Xue A Genome-wide Microsatellite Polymorphism Database for the Indica and Japonica Rice DNA Res, April 23, 2007; (2007) dsm005v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Danin-Poleg, L. A. Cohen, H. Gancz, Y. Y. Broza, H. Goldshmidt, E. Malul, L. Valinsky, L. Lerner, M. Broza, and Y. Kashi Vibrio cholerae Strain Typing and Phylogeny Study Based on Simple Sequence Repeats J. Clin. Microbiol., March 1, 2007; 45(3): 736 - 746. [Abstract] [Full Text] [PDF] |
||||
![]() |
|