|
|
|
|
Vol. 9, Issue 12, 1305-1312, December 1999
METHODS
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
The availability of comprehensive sets of genes has prompted the researchers to carry out systematic collection of gene expression data. RT-PCR has the highest specificity and sensitivity for transcript detection among all available methods. Low throughput, especially when quantitative data are desired, has precluded RT-PCR from genome-wide application. Here we report a PCR-based expression profiling method, introduced amplified fragment length polymorphism (iAFLP), that has the same specificity and sensitivity as RT-PCR and a throughput level comparable to that of DNA-microarray hybridization. In this method, restricted ends of total cDNAs from six sources were ligated with adaptors having various length of short insertions to a common sequence (polymorphic adaptors). Amplification of a pool of these differentially adapted cDNAs with a gene-specific primer and an adaptor primer allows us to quantitate the abundance of any transcript in six mRNA sources. Using three different primer colors this technique allows quantitation of expression of 864 genes across six different sources per day with a single autosequencer, which is comparable to the throughput of microarray analysis in terms of number of genes × number of sources.
| |
INTRODUCTION |
|---|
|
|
|---|
To date, the complete genome sequences of 17 organisms have been
reported and expressed sequence tags (ESTs)
collection for the last decade has produced partial sequence of at
least 50,000 human genes (Deloukas et al. 1998
).
Availability of a comprehensive set of genes has prompted the
researchers to carry out systematic collection of gene expression data.
Various methods for analyzing gene expression (Adams et al. 1992
; Liang
and Pardee 1992
; Okubo et al. 1992
; Kato 1995
), hybridization of
high-density arrays of DNA, oligomers (McGall et al. 1996
; Wodicka et
al. 1997
) or PCR products (Lockhart et al. 1996
; Iyer et al. 1999
),
have been used, because they provide high throughput in the numbers of
genes tested at one time. Although a single transcript in a cell
theoretically may be detected (Duggan et al. 1999
), it is still unclear
what proportion of arrayed DNAs give informative signals. Individual
gene probes in labeled cDNA/cRNA have unique secondary structures,
melting temperatures, and reassociation rates (Southern et al. 1999
),
which makes hybridization of all gene probes under optimum condition
nearly impossible. In contrast, RT-PCR is the most sensitive and
specific of all techniques available for transcript detection
(Becker-Andre et al. 1989
). The major drawback of genome-wide
application of RT-PCR is low throughput, especially when quantitative
data are desired. For the analysis of thousands of transcripts with
concentrations that differ as much as 1000-fold (Okubo et al. 1992
),
noncompetitive PCR has the critical disadvantage that it requires a
standard curve for every primer pair (Singer-Sam et al. 1990) and
quantitation by competitor fragments for each gene is not realistic.
Kato proposed that engineered total cDNA from different sources replace
the competitor DNA for every transcript species (Kato 1997
). We have expanded this idea and developed a protocol to provide high throughput. Here we report a novel PCR-based protocol, introduced-amplified fragment length polymorphism (iAFLP), that allows relative quantitation of transcript concentration across as many as six mRNA sources in a
single PCR reaction. We also describe a method to apply this protocol
to minute amounts of sample and to increase the throughput to levels
comparable to those of DNA chip hybridization.
| |
RESULTS |
|---|
|
|
|---|
Expression Profiling With iAFLP
A set of adaptors that mimic a length-polymorphic locus were designed. These adaptors have common sequences at both ends and sequences of different lengths internally. The polymorphic portion of the adaptors was composed with degenerate A, C, and T residues to neutralize sequence-specific effects of the adaptors on the amplification kinetics (Fig. 1B). Ligation of these adaptors to restricted cDNA fragments from multiple sources yielded source-specific length-polymorphism on the cDNA ends. From a pool of such adapted cDNAs, any gene sequence can be amplified as length-polymorphic fragments when a gene-specific primer and an adaptor primer [F-T7(25)] are used (Fig. 1A).
|
Because of the nearly identical sequence content and common primer
annealing sites, the polymorphic fragments should have similar
amplification efficiencies. Therefore, the relative concentrations of
the transcript from different sources should be reflected in the amount
of corresponding polymorphic products. The relationship between the
ratio of polymorphic products (iAFLP profile) and the ratio of targets
in mRNAs was confirmed by two methods. First, MboI-digested
human hepatoblastoma hepG2 cDNA was divided between six tubes and
ligated to different iAFLP adaptors. These ligation mixtures were
pooled at 24 different ratios by adding two- to fivefold excess of one
of each adapted cDNA into an even mixture of all adapted cDNAs. In
these mixtures, every transcript has controlled relative abundance
across differentially adapted cDNAs. As an example, the concentration
of GAPDH was measured by PCR with antisense GAPDH primer and
dye-labeled adaptor primer F-T7(25) (Fig. 2A). Within
the mixing range of 1:2 to 1:5, all cDNA pools gave iAFLP
profiles proportional to the original mixing ratios (Fig. 2B). Similar
profiles were obtained with other gene-specific primers tested (data
not shown). The quantitation errors for the highest peaks [(observed
excess ratio/true excess ratio)
1] in 24 pools were +12% on
average with a standard deviation of 4.2%.
|
Then, the results of iAFLP profiles were compared with results of Northern hybridization. Equal amounts of poly(A) RNAs from six human tissues were converted to cDNAs and ligated with one of the polymorphic adaptors after cleavage with MboI. These six ligation mixtures were pooled equally and used as a template. The iAFLP profiles of four genes with various expression patterns, GAPDH, troponin C, neurofilament L, and cathepsin L, were compared with Northern hybridization signals from the corresponding probes (Fig. 3). The intensities of hybridization signals and peak areas of polymorphic fragments correlated very well within a wide range of expression levels.
|
Enhancing the Throughput of iAFLP
Using the present protocol, which allows direct comparison of six sources, unlimited numbers of RNAs can be compared by including one reference cDNA sample in different pools. We tested the relative quantitation of transcript in 30 tissues by making six pools (pools A to F) of six adapted cDNAs (Fig. 4B). As a reference cDNA, we pooled 30 cDNAs before adaptor ligation and included in every pool as one component. Antisense primers were designed based on the sequence of the most 3' MboI fragment of cDNAs for GAPDH, apolipoproten B, neurofilament L, and troponin C. Each primer was used to amplify pools A through F with F-T7 primer (Fig.4A). To save the lanes required for the analysis, a primer with 5-Fam (blue) label was used for pools A and D, Hex (green) for B and E, and Tamra (yellow) for C and F. After PCR, reactions with different dyes were pooled and loaded onto one lane. By normalizing the peak area for each source relative to the reference peak in the same pool, concentrations of transcripts in different pools can be compared (Fig.4C). The relative concentration of gene transcripts in 30 human tissues were determined (Fig.4C). The sum of normalized peak areas across 30 tissues were 25 (NF-L, Apo B), 28 (troponin C), and 32 (GAPDH). Because the reference is an even pool of 30 cDNAs, these values, which were close to the expected value of 30, suggest that quantitation is reliable. Moreover, the reference allowed accurate comparison by dividing the difference in expression levels into two smaller factors. For example, the very weak expression of troponin C in adipose tissue probably originated from a rich vascular bed (pool F, third position), is titrated as 1:75 of expression in skeletal muscle (pool E, second position).
|
Application to Microsamples
To expand the application of this protocol to minute
amounts of sample, we tested PCR amplification of total cDNA
before iAFLP adaptor ligation. First, we evaluated the
distortion in composition of total cDNA made by PCR
amplification. From a fraction of an adapted total hepG2 cDNA, the most
3' fragments were amplified ~1000-fold with primers that anneal
to the adaptor and vector-primer. After amplification, the adaptor
portion was cleaved off with MboI and replaced with another
iAFLP adaptor. From the 3' ESTs of hepG2 cells deposited in
GenBank, 192 independent sequences were selected for analysis. The
concentrations of these 192 sequences in the cDNA before and after the
amplification were compared by iAFLP
(Fig.5). Among 192 gene-specific primers, 175 primers yielded PCR products with expected sizes. The ratios of
post-PCR to pre-PCR concentration of these 175 sequences ranged from
0.6 to 1.5, with an average of 1.1 and standard deviation of 0.24. We
found a moderate correlation between amplification efficiency and
lengths of cDNA fragments. If concentrations are compared between
amplified total cDNAs, the error would probably be smaller because the
preferred sequence would be amplified efficiently in any sample. For
example, a mouse embryo at 9.5 dpc (13 somite stage) was dissected into five different parts and the anatomical distribution of two morphogenic gene transcripts, hindbrain-specific krox (Nonchev et al. 1996
) and
somite-specific Mox-1 (Candia et al. 1992
), were monitored with
amplified iAFLP. The reported distribution of each of these transcripts
is well reproduced (Fig. 6). For krox20 gene, second primer (krox20-2), which anneals to 18 bp distal to the first one
(krox20), was used to confirm the expression pattern.
|
|
| |
DISCUSSION |
|---|
|
|
|---|
In competitive PCR, the ratio of products sometimes does not reflect
the ratio of competitor to target when the reactions are run to
saturation (Volkenandt et al. 1992
; Pannetier et al. 1993
) . Morrison
and Gannon (1994)
clearly demonstrated that the plateau onset is
simultaneous for all amplicons. Therefore, if the competitor and target
have similar amplification efficiencies, the ratio of products reflects
the ratio of targets well after the reaction reaches saturation.
McCulloch et al. (1995)
showed that homologous fragments with size
differences smaller than 50 bp meet this requirement. In our protocol,
the ratio of differentially adapted gene-specific amplicons in the PCR
product, with size differences of at most 15 bp, reflected very well
the initial ratio of targets.
In comparative gene expression analysis, throughput should be compared in terms of number of genes tested × number of sources compared. Obviously, the throughput of our protocol is limited by the separation and detection of PCR products on an autosequencer. The necessity of size separation precluded the possibility of using colorimetric detection or real-time quantitation. Nevertheless, we can use three colors for dye-primers that permit the application of 288 reactions on a single sequencing gel, leaving one color for size standards. As shown in Figure 4, spectrum leaks in the neighboring windows can be eliminated by using a properly calculated matrix. Therefore, the throughput for a single 2-hr run is 96 genes × 6 sources × 3 colors = 1728 gene × sources. Assuming three runs a day, 5184 genes × sources can be analyzed per day per machine. This throughput is comparable to that of a single microarray hybridization. The number of PCR reactions required for sample preparation is 864, which is easily generated by one thermocycler that accepts two 384-well plates.
There are several advantages of our protocol over microarray-based expression studies. The signal for any gene-specific primer is first validated by the size of the PCR product, which is determined to a single nucleotide resolution in our protocol. If necessary, further confirmation is obtained by using another primer to the same gene, as shown for krox20. The quantitative aspect is validated by calculating the ratio of the total amount of products from individual sources to the product from the mixture of these sources. This is advantageous over microarray hybridization, which is weak in gene-by-gene data validation.
Moreover, the system setup is much simpler than microarray. When cDNA
sequences or 3' EST sequences are selected, primers can be designed
using simple computer programs and custom primers are available from
many manufacturers in the 96-well format at properly adjusted
concentrations. Thus, once the pool of adapted cDNAs are made, the PCR
reactions are simply set up by adding the primers to the master mix.
Gene selection is highly flexible, and newly discovered genes are
easily tested by researchers. For a single microarray hybridization,
2-5 µg of poly(A) RNA is required for probe preparation (Duggan et
al. 1999
), in which up to 10,000 genes can be tested. Using our
protocol, a similar number of genes can be tested from 1 µg of
poly(A) RNA without any preamplification step.
In summary, we developed the protocol iAFLP which allows highly specific and sensitive profiling of gene expression. One autosequencer can produce expression data for 5184 genes × sources per day. Once an antisense oligomer is synthesized for all genes in the scale of micromoles, it will become a permanent resource for expression tests in genomes. Unlike microarrays, the genes tested at one time are flexible in variety and number.
| |
METHODS |
|---|
|
|
|---|
Oligomers
iAFLP oligomers are as follows: LP40, 5'-ACGACTCACTATAGGGAGATTACTTHHHHHCAGTCAGGAT; LP43, 5'-ACGACTCACTATAGGGAGATTACTTHHHHHHHHCAGTCAGGAT; LP46, 5'-ACGACTCACTATAGGGAGATTACTTHHHHHHHHHHHCAGTCAGGAT; LP49, 5'-ACGACTCACTATAGGGAGATTACTTHHHHHHHHHHHHHHCAGTCAGGAT; LP52, 5'-ACGACTCACTATAGGGAGATTACTTHHHHHHHHHHHHHHHHHCAGTCAGGAT; LP55, 5'-ACGACTCACTATAGGGAGATTACTTHHHHHHHHHHHHHHHHHHHHCAGTCAGGAT (H = A, C, or T). NH2-terminated oligomer (NH1400: GATCATCCTGACTG-NH2) complement to the 3' termini of iAFLP oligonucleotides was kinated by T4 kinase and mixed with equimolar amounts of iAFLP oligomer. These pairs of oligomers were used as iAFLP adaptors.
5-Fam-, Hex-, and Tamra-labeled T7 primers [F-T7(25): F-ACGACTCACTATAGGGAGATTACTT] were custom-made by PE Biosystems. Bam7T:5'-GACGCGGATCCAAGCTTGCATGCCTGCATTTTTTT Gene-specific antisense primers were designed based on cDNA sequences in GenBank. The distance from the GATC to the annealing site is indicated: human GAPDH (GenBank accession no. M17851), GTGCTAAGCAGTTGGTGGT (23 nucleotides from GATC); human apolipoprotein B(J02775); CAAGTTTAGCAAAATAACTCAGA (3 nucleotides); human neurofilament L(X05608), GATGATTCACATTGCCGTAGA (3 nucleotides); human troponin C(M37984), GATTTGCATCCCCCAGGACAGA (3 nucleotides); mouse GAPDH(M32599), TGTCATCATACTTGGCAGGT (23 nucleotides); mouse krox20 (X06746) 1, GTTAACCTTTGACTCATTTCAAGAGA (3 nucleotides) and 2, CCCCCTTTAAATAAGTTAGTT (24 nucleotides) mouse MOX1 (Z15103), GTAAACACTCACAATCAGATC (1 nucleotide); and mouse WNT3a (X56842), GACAGGGTGGTGGCCAACGGA (3 nucleotides).
Preparation of Template Pools
Human RNAs were purchased from Clontech. RNA from hepG2 cells was
described previously (Okubo et al. 1992
). For all cDNA synthesis, a
pUC19-based vector primer (Okubo et al. 1992
) was used to monitor synthesis and cleavage of cDNA on agarose gel. Poly(A) RNA or total RNA
was annealed with an equal amount or 1:20 amount of vector primer,
respectively, in 13 µl of distilled water at 37°C and 2×
reaction mixture was added to yield final concentration of 0.5 mM dNTP, 1× RT buffer, and 0.1 U/µl of SuperscriptII
and incubated at 42°C for 60 min. Second strand was synthesized as recommended by the manufacturer's instructions (GIBCO-BRL, MD). After
purification by phenol-chloroform extraction and ethanol precipitation, cDNA was dissolved in an appropriate volume of distilled
water. Concentrations of cDNAs were adjusted by staining with
cybergreen I and measuring the fluorescence intensities using a
fluoroimage analyzer (Fuji LAS1000). A standard curve made with samples
of known concentrations of DNAs was used for calculation of DNA
concentrations from fluorescence intensities. One microgram of cDNA was
cleaved with 5 units of MboI [New England Biolabs (NEB), MA]
in 20 µl of recommended buffer for 60 min at 37°C. After heat
inactivation for 20 min at 70°C, 1 µl of 10 mM ATP and
100 pmoles of kinated adaptor cassette, 100 units of T4 ligase (NEB)
were added to the reaction mixture and incubated at 16°C for 3 hr.
Equal volumes of ligation reaction were pooled and the concentration of
the pools of cDNAs was adjusted to 1 ng/µl with 0.1 µg/µl
of glycogen solution before use.
Profiling PCR Reactions
The profiling PCR reactions contained 2 pmoles each of F-T7(25) and a gene-specific antisense primer, 0.1 ng of template pool, and 0.1 unit of Platinum Taq polymerase (GIBCO-BRL) in 10 µl of reaction mixture recommended by the manufacturer. The template concentration was set to detect 1 in 100,000 transcript in a source RNA. Thermal profile was 35 cycles of 94°C, 55°C, and 72°C for 30 sec each with an initial 2 min at 94°C and a final 72°C for 7 min. An autosequencer 377XL and GeneScan program (PE Biosystems) were used for sizing and quantitation of dye-labeled fragments.
PCR mixture (0.25 µl) was mixed with 3 µl of formamide-loading buffer (PE Biosystems, CA) containing 1/4 volume of internal size marker (Tamra 350, PE Biosystems). After denaturation at 96°C for 3 min, 0.5 µl of sample was separated on 10% acrylamide 6 M urea gel with a 15-cm well-to-detector distance. Analysis was done according to the manufacturer's instructions for GeneScan.
Northern Hybridization
Human poly(A) RNAs were purchased from Clontech. Two micrograms each of poly(A) RNA was separated on formalin-agarose gels and transferred to Hybond-N membrane (Amersham Pharmacia). Probes were synthesized by PCR with appropriate sets of gene-specific primers. Biotin-dUTP was incorporated by random-priming DNA synthesis using a commercial kit (Boehringer Mannheim). Hybridization was performed according to the manufacturer's instructions. Chemiluminescence signals were detected and analyzed by Fuji LAS1000.
Amplification of Total cDNA
After adaptor ligation, 1 ng of adapted vector-primed cDNA was added to the 100 µl of PCR mixture containing 5 units of Platinum Taq (GIBCO BRL), 0.2 mM dNTP, 1× Platinum Taq buffer, and 10 pmoles each of primers T7(25) and Bam7T. Amplification conditions were 94°C for 3 min, 15 cycles of 94°C for 30 sec, 55°C for 30 sec, 72°C for 2 min, and final incubation at 72°C for 10 min. After cleavage with 10 units of MboI for 1 hr at 37°C, reaction mixture was extracted once with phenol and once with chloroform. DNA was recovered by ethanol precipitation. After dissolving in 100 µl of distilled water, DNA concentration was measured by LAS1000 as described above.
Independent 192 3' ESTs from hepG2 cells were selected from GenBank and one antisense primer was synthesized for each. Accession numbers of ESTs and primer sequences are obtainable through the world wide web at http://imcb.osaka-u.ac.jp/okubo/materials/primer_GR
| |
ACKNOWLEDGMENTS |
|---|
This work was supported by grant 08283105, Grant-in-Aid for Scientific Research on Priority Areas from the Ministry of Education, Science and Culture of Japan and grant 96L00104 from the Research for the Future (RFTF) of Japan Society for the Promotion of Science.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Corresponding author.
E-MAIL kousaku{at}imcb.osaka-u.ac.jp; FAX +81-6-877-1922.
| |
REFERENCES |
|---|
|
|
|---|
Received July 13, 1999; accepted in revised form September 21, 1999.
This article has been cited by other articles:
![]() |
E. Sekiyama, T. Nakamura, L. J. Cooper, S. Kawasaki, J. Hamuro, N. J. Fullwood, and S. Kinoshita Unique distribution of thrombospondin-1 in human ocular surface epithelium. Invest. Ophthalmol. Vis. Sci., April 1, 2006; 47(4): 1352 - 1358. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kawasaki, H. Tanioka, K. Yamasaki, N. Yokoi, A. Komuro, and S. Kinoshita Clusters of corneal epithelial cells reside ectopically in human conjunctival epithelium. Invest. Ophthalmol. Vis. Sci., April 1, 2006; 47(4): 1359 - 1367. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tanino, M.-A. Debily, T. Tamura, T. Hishiki, O. Ogasawara, K. Murakawa, S. Kawamoto, K. Itoh, S. Watanabe, S. J. de Souza, et al. The Human Anatomic Gene Expression Library (H-ANGEL), the H-Inv integrative display of human gene expression across disparate technologies and platforms Nucleic Acids Res., January 1, 2005; 33(suppl_1): D567 - D572. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Suzuki, R. Yamashita, M. Shirota, Y. Sakakibara, J. Chiba, J. Mizushima-Sugano, K. Nakai, and S. Sugano Sequence Comparison of Human and Mouse Genes Reveals a Homologous Block Structure in the Promoter Regions Genome Res., September 1, 2004; 14(9): 1711 - 1718. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Chiba, H. Michibata, K. Wakimoto, M. Seishima, S. Kawasaki, K. Okubo, H. Mitsui, H. Torii, and Y. Imai Cloning of a Gene for a Novel Epithelium-specific Cytosolic Phospholipase A2, cPLA2{delta}, Induced in Psoriatic Skin J. Biol. Chem., March 26, 2004; 279(13): 12890 - 12897. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kawamoto, J. Yoshii, K. Mizuno, K. Ito, Y. Miyamoto, T. Ohnishi, R. Matoba, N. Hori, Y. Matsumoto, T. Okumura, et al. BodyMap: A Collection of 3' ESTs for Analysis of Human Gene Expression Information Genome Res., November 1, 2000; 10(11): 1817 - 1827. [Abstract] [Full Text] |
||||
![]() |
Y. Suzuki, T. Tsunoda, J. Sese, H. Taira, J. Mizushima-Sugano, H. Hata, T. Ota, T. Isogai, T. Tanaka, Y. Nakamura, et al. Identification and Characterization of the Potential Promoter Regions of 1031 Kinds of Human Genes Genome Res., May 1, 2001; 11(5): 677 - 684. [Abstract] [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||