|
|
|
|
Vol. 9, Issue 2, 182-188, February 1999
METHODS
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Gene isolation methods used during positional cloning rely on physical contigs consisting of bacterial artificial chromosomes, P1, or cosmid clones. However, in most instances, the initial framework for physical mapping consists of contigs of yeast artificial chromosome (YACs), large vectors that are suboptimal substrates for gene isolation. Here we report a strategy to identify gene sequences contained within a YAC by using cDNA representational difference analysis (RDA) to directly isolate transcripts expressed from the YAC in mammalian cells. The RDA tester cDNAs were generated from a previously reported hamster cell line derived by stable transfer of a 590-kb YAC (911D5) that expressed NPC1, the human gene responsible for Niemann-Pick type C (NP-C). The driver cDNAs were generated from a control hamster cell line that did not contain the YAC that expressed NPC1. Among the gene fragments obtained by RDA, NPC1 was the most abundant product. In addition, two non-NPC1 fragments were isolated that were mapped to and expressed from 911D5. One of these RDA gene fragments (7-R) spans more than one exon and has 98% sequence identity with a human cDNA clone reported previously as an expressed sequence tag (EST), but not mapped to a chromosomal region. The other fragment (2-R) that had no significant sequence similarities with known mammalian genes or ESTs, was further localized to the region of overlap between YACs 911D5 and 844E3. The latter YAC is part of a contig across the NP-C candidate region, but does not contain NPC1. This two-part approach in which stable YAC transfer is followed by cDNA RDA should be a useful adjunct strategy to expedite the cloning of human genes when a YAC contig is available across a candidate interval.
[The sequence data described in this paper have been submitted to GenBank under accession nos. AF117641 and AF117642.]
| |
INTRODUCTION |
|---|
|
|
|---|
The identification of genes responsible for
inherited human disorders is an integral part of the Human Genome
Project. There are several widely applied methods for gene isolation
associated with conventional positional cloning, including a
combination of sequencing, database searching and candidate gene
analysis, exon trapping (Duyk et al. 1990
; Buckler et al. 1991
; Krizman and Berget 1993
), and direct selection (Lovett 1994
; Osborne-Lawrence et al. 1995
; Simmons et al. 1995
; Del Mastro and Lovett 1997
). The
effective use of most of these methods relies on physical contigs
constructed from genomic clones contained within vectors of manageable
sizes, such as bacterial artificial chromosomes (BACs), P1 clones, or
cosmids. However, in most positional cloning efforts, the initial
framework for physical mapping consists of contigs of yeast artificial
chromosomes (YACs). Unfortunately, these large vectors are suboptimal
substrates for most of the commonly used gene isolation techniques
listed above.
Here we report a novel strategy designed to accelerate gene
identification, which was developed with resources from the positional cloning of NPC1, the gene responsible for Niemann Pick Type C [(NP-C), Carstea et al. 1997
]. In the process of identifying
NPC1, the candidate interval was substantially narrowed by
complementation of the NP-C phenotype by stable integration of a 590-kb
YAC (911D5) into the genome of CT60, an NP-C cell line (Gu et
al. 1997
) derived by chemical mutagenesis of the Chinese hamster ovary
(CHO) cell line, 25-RA, which has a normal phenotype with respect to
NP-C (Cadigan et al. 1990
). One advantage of this successful YAC
complementation cloning strategy was that the search for candidate
genes was quickly focused on only a few hundred kilobase of DNA.
However, it was still necessary to generate smaller insert contigs
across the narrowed interval to identify NPC1 (Carstea et al.
1997
). To preclude this time-consuming step in future studies, we
sought to demonstrate that YACs such as 911D5 could be used
directly for the isolation of human transcripts. The current study
combines YAC complementation cloning with cDNA representational
difference analysis (RDA), a technique that relies on the generation of
tester and driver representations from cDNAs generated from two types
of cell populations expected to differentially express specific
transcripts (Lisitsyn et al. 1993
; Hubank and Schatz 1994
; Braun et al.
1995
; Chu and Paul 1997
; Gress et al. 1997
). RNA from the NPC1
complemented cell lines in which YAC 911D5 was stably
integrated (911D5A1 and 911D5A13) was used to
generate the tester representations, and a noncomplemented hamster cell
line (CFTRA1) was used to generate the driver. Because RDA
eliminates fragments present at similar concentrations in both
populations, leaving only the differences, NPC1 and other
genes expressed from 911D5 should be identified among the
endogenous hamster transcripts in the complemented cell lines.
| |
RESULTS |
|---|
|
|
|---|
Generation of First and Second Difference Products
The cDNA RDA procedure used in the present study was based closely
on the protocol described by Hubank and Schatz (1994)
. Our first goal
was to use RDA to identify NPC1, known to be on YAC
911D5. Tester cDNAs were synthesized separately from two
clonal cell lines derived by spheroplast fusion of 911D5 with
CT60 cells. Both of the clonal cell lines, 911D5A1 and
911D5A13, express NPC1 at a level detectable by
Northern blot hybridization (Gu et al. 1997
). The driver cDNA was
synthesized from a control clonal cell line (CFTRA1) that does
not express NPC1, derived by spheroplast fusion of CT60 with a
325-kb YAC that contains the cystic fibrosis transmembrane conductance
regulator gene [(CFTR), Mogayzel et al. 1996
; Gu et al.
1997
]. Each of the cell lines expresses a gene
(neor) that confers resistance to the
neomycin analog, G-418. Tester and driver cDNAs were digested with the
restriction enzyme DpnII to generate small fragments (mean
length 256 bp) that should be easily amplified by PCR. Because there
are 10 DpnII sites in the full-length NPC1 cDNA
generating fragments ranging from 37-1200 bp, NPC1 should be
readily amplifiable by the PCR conditions used. Tester and driver cDNA
representations were generated by PCR and then denatured and
reassociated to generate the first difference products (DP I). The
procedure was repeated to generate the second difference products (DP II).
The difference products (I and II) were initially evaluated by agarose
gel electrophoresis (Fig. 1A). The complexity of DP I
was reduced compared with the amplicons from tester cDNAs from 911D5A1 or 911D5A13. The complexity was further
reduced in DP II as indicated by the distinct bands that appeared on
the ethidium bromide-stained gel. The staining pattern of the
difference products obtained by use of either 911D5A1 or
911D5A13 as the tester cDNAs looked similar, consistent with
the likelihood that these two clonal cell lines are sibling clones (Gu
et al. 1997
).
|
A probe derived from the full-length NPC1 cDNA (Carstea et al.
1997
) hybridized to the original DpnII digested amplicons from both tester cDNAs, but not to the driver cDNA (Fig. 1B, cf. lanes 2 and
3 with lane 4). The same probe hybridized with progressively stronger
intensity to the DP I and DP II, respectively, derived from the cDNA
from both testers (Fig. 1B, lanes 5-8), indicating that NPC1
is enriched in the difference products. The smaller fragments appear to
be preferentially enriched after each round of subtraction and
amplification. This result is consistent with the use of PCR conditions
that are likely to favor smaller fragments during the generation of the
difference products. These data demonstrate that RDA can be used to
detect a gene expressed as a consequence of integrated heterologous
DNA. We attempted to distinguish the specific bands predicted for
DpnII digestion of the NPC1 cDNA by altering gel
running conditions and/or film exposure time, but were unsuccessful.
Cloning the Second Difference Products
The second goal of the study was to isolate transcripts expressed from genes other than NPC1 that might be contained on YAC 911D5 (summarized in Fig. 2). Because NPC1 spans <100 kb of this 590-kb YAC (J.A. Morris and E.D. Carstea, unpubl.), we expected that the YAC could contain up to 10-20 additional genes (assuming an average gene density of one gene per 30 kb). First, the gene fragments represented in DP II were subcloned and colony lifts were prepared and screened with the full-length NPC1 cDNA probe. The majority (>75%) of the RDA gene fragments represented by the recombinant colonies hybridized to the NPC1 probe. Then, the RDA gene fragments isolated from 911D5A1 and 911D5A13 (12 and 22 colonies, respectively) that did not hybridize to NPC1 were further evaluated (Fig. 2). To eliminate redundancy among the non-NPC1 RDA gene fragments, two fragments were randomly chosen (one each from the DP II from 911D5A1 and 911D5A13) as probes to hybridize to an arrayed panel of the 34 non-NPC1 colonies. This procedure reduced the number of nonredundant RDA gene fragments to 11. Sequence analysis revealed that 7 of these 11 fragments were unique, 3 others were redundant with each other, and 1 fragment yielded poor quality sequence data. On the basis of this data, PCR primers were synthesized for the 8 unique gene fragments.
|
Gene Fragment Assessment: Localization on YAC 911D5, Expression and Sequence Analysis
On the basis of the premise of the cDNA RDA procedure, the eight
fragments were obtained because they were present to a greater extent
in the cDNA of the tester compared with that of the driver. The next
step was to determine if the differential expression of these eight
fragments was a direct consequence of expression of a gene contained
within YAC 911D5. First, the primer pairs designed for each of
the eight unique gene fragments were used in PCR assays to determine
whether the fragments mapped back to the YAC. Two of the eight RDA gene
fragments (2-R and 7-R; GenBank accession nos. AF117641 and AF117642,
respectively) were found to map back to YAC 911D5 (Figs.
2-4). For 2-R, a PCR product of
the expected size (261 bp) was amplified from genomic DNA from YAC
911D5 and the cell lines in which 911D5 was stably
integrated [the testers 911D5A1 and 911D5A13 and
another NPC1 complemented cell line, 911D5B5 (Gu et
al. 1997
)]. No product was amplified from genomic DNA from the
parental cell lines CT60 and 25-RA. Interestingly, a 261-bp product was
also amplified from genomic DNA from YAC 844E3 as well as the
cell line 844E3A5 in which 844E3 was stably
integrated (Fig. 3A). YAC 844E3 does not contain NPC1 but overlaps 911D5 as part of the YAC contig encompassing the NP-C locus (Gu et al. 1997
). Thus, 2-R maps to the region in which 844E3 and 911D5 overlap. These mapping results were
supported by the RT-PCR data that indicated that 2-R was expressed as
expected in the testers 911D5A1 and 911D5A13 as well
as in 844E3A5, but not in the parental lines CT60 and 25-RA or
the driver CFTRA1 (Fig. 3B). Further confirmation of the
origin of the 2-R gene fragment was obtained by comparing the sequence
of 2-R with the sequences of 911D5 and human genomic DNA. A
223-bp sequence was obtained that was 100% homologous among 2-R,
911D5, and human genomic DNA (data not shown). Further
sequence analysis revealed that gene fragment 2-R did not have
significant sequence similarities when BLAST (Altschul et al. 1990
)
searched against GenBank nonredundant EST division (BLASTN-dbEST), the
nonredundant GenBank+EMBL+DDBJ+PDB sequences (BLASTN-NR) and the
nonredundant protein sequence databases (BLASTX-NR).
|
|
By use of the primers designed to amplify 7-R, PCR products were obtained from genomic DNAs from YAC 911D5, normal human, and 911D5A1 and 911D5A13, but not from the YAC 844E3 or the parental cell lines 25-RA and CT60 (Fig. 4A). Thus, in contrast to 2-R, 7-R did not localize to the overlapping region between 844E3 and 911D5. The PCR products from the genomic DNAs were all the same size but were ~900-bp larger than expected and obtained with gene fragment 7-R as the template (154 bp; Fig. 4A, lane 8). The data suggested that the 7-R gene fragment spanned more than one exon and that the genomic DNA PCR products contained intronic sequence. To assess this issue, RT-PCR and sequence analysis were performed. The RT-PCR data demonstrated the expected 154-bp fragment for each of the hamster cell lines, including the parental line CT60 (Fig. 4B). This data supported the conclusion that gene fragment 7-R spanned more than one exon, but also raised the possibility that expression from the hamster homolog accounted for the detection of 7-R by RDA, even though no product had been obtained by PCR using CT60 genomic DNA. To assess this possibility, the sequence of clone 7-R was compared with the sequences of 911D5 genomic DNA and subcloned RT-PCR products from CT60 obtained using primers designed to amplify 7-R. The data indicated that fragment 7-R was obtained during the RDA procedure as a consequence of expression from a gene on 911D5. There were 84 bp of 100% homologous sequence between YAC 911D5 and 7-R. Within those 84 bp, there were 11 mismatches in the CT60 sequence compared with 7-R. Further sequence analysis revealed a splice donor site in 911D5 and a splice junction in 7-R, supporting the contention that the human 7-R fragment spans more than one exon (data not shown). The lack of amplification of a product from CT60 genomic DNA with 7-R primers (as described above and shown in Fig. 4A) likely reflects that the hamster genomic fragment contains a larger intron than the human DNA.
The sequence of gene fragment 7-R identified a human cDNA clone (Soares
ovary tumor NbHOT cDNA clone 741737, GenBank accession no. AA402970,
WashU-Merck EST project 1997, unpubl.) with 98% sequence identity.
Therefore, this previously unmapped human cDNA clone was placed on YAC
911D5 on human chromosome 18q11-12 (Gu et al. 1997
). Finally,
Northern analysis confirmed that 7-R was differentially expressed in
the testers 911D5A1 and 911D5A13 (as well as
911D5B5), compared with the driver, CFTRA1 and the
noncomplemented cell line 844E3A5 (Fig. 4C).
As further confirmation of their human chromosomal origin, both 2-R and 7-R were analyzed by radiation hybrid mapping. By use of the Stanford G3 Radiation Hybrid Mapping Panel (webmaster{at}shgc.stanford.edu) the gene fragments mapped to two neighboring index markers on chromosome 18q (CHLC.GATA41G05 and SHGC-3938). These markers are separated by 4 cR or ~100 kb, suggesting that the two gene fragments themselves are not far from each other. However, we cannot determine the actual distance between the fragments from this data.
The remaining six RDA gene fragments did not map back to YAC 911D5 (Fig. 2), suggesting that they represented genes upregulated in the hamster genome after YAC transduction. BLAST search showed that three of these fragments had significant sequence similarities with other mammalian genes. The highest nucleotide identity scores were obtained for 5-R (Mus musculus mRNA for Ki-67, 85% identity), 6-R (rat Y-b3 glutathione S-tranferase, 81% identity), and 8-R (rat cholecystokinin receptor mRNA, 89% identity).
| |
DISCUSSION |
|---|
|
|
|---|
We have shown that we can directly isolate transcripts expressed from genes present on YACs using complementation cloning and cDNA RDA. Among the isolated transcripts were those from NPC1, which is known to be on YAC 911D5, and two others not localized previously. One of the non-NPC1 fragments, 2-R, is a novel sequence that was further localized during the present study to the region of overlap between two YACs in the NP-C contig used for complementation. The other non-NPC1 fragment, 7-R, spans more than one exon and shares 98% sequence identity with a previously unmapped human EST. These data suggest the usefulness of this procedure as an adjunct strategy to positional cloning.
This approach has several advantages that complement conventional gene
isolation techniques from specific genomic clones: (1) For many regions
on human chromosomes, the number of mapped genes or expressed sequence
tags (ESTs) is scarce, and thus, database searching and candidate
analysis may not be effective. The use of cDNA RDA provides a
relatively quick way of identifying exon sequences (Lisitsyn et al.
1993
; Hubank and Schatz 1994
; Braun et al. 1995
; Chu and Paul 1997
;
Gress et al. 1997
), as an addition or alternative to direct selection
or techniques such as exon trapping that usually rely on smaller insert
clones; (2) RDA theoretically allows genes expressed at relatively low
levels to be isolated (Hubank and Schatz 1994
). By use of RDA, the
absolute abundance of a gene becomes less critical, whereas the degree
of difference between the tester and driver cDNAs becomes crucial; (3)
large-scale sequencing of genomic DNA for gene identification is still
time consuming and costly. cDNA RDA identifies fragments of expressed genes before any sequencing is performed.
The combination of YAC transfer and cDNA RDA has a potential advantage over exon trapping or other techniques in which YACs can be used directly to isolate cDNAs (e.g., direct hybridization to cDNA libraries or cDNA selection). The success of these other methods depends on the presence of specific transcripts in particular cDNA libraries. Without prior knowledge of the tissue-specific expression of the genes on the YAC, there is no easy way to narrow down which libraries to screen. The approach described here does not utilize cDNA libraries or depend on prior knowledge of gene expression patterns: If a gene is on the YAC and is expressed after transfer to mammalian cells, it will be present in the tester representation. On the other hand, if genes contained within a specific YAC are expressed in a cell-type-specific fashion, the cells chosen for transfer of the YAC may be critical to the success of the experiment. This problem could be addressed by utilizing different cell types for YAC transfer and then performing RDA.
Whereas the two goals of the present study were clearly achieved with
cDNA RDA, there are potential limitations of this methodology for
exhaustive gene isolation. The primary concern is that the RDA gene
fragments may represent only a subpopulation of the genes expressed
from the YAC. There are several reasons for this limitation. (1) Some
of the genuine differences between tester and driver cDNAs may not be
selectively amplified if the cDNAs do not contain sufficient sites for
the restriction endonuclease used for the initial digestion. (2) The
restriction fragments may be too big for efficient amplification by
PCR. Therefore, it may be necessary to perform parallel RDA experiments
with multiple restriction enzymes to increase the likelihood of
obtaining a more complete array of products (Braun et al. 1995
). (3) A
low yield of novel genes may correlate with the abundance of a single
transcript. For example, in the present study, ~75% of the RDA
products obtained were fragments from NPC1. A second RDA in
which the driver is supplemented with full-length cDNA of the known
abundant transcript might increase the yield of novel genes (Hubank and
Schatz 1994
).
The flexibility of the RDA methodology that allows for variation in the
stringency of hybridization could influence detection of subtle
differences in gene expression between tester and driver cDNAs. When
difficulties are encountered detecting differential expression, the
ratio of the tester and driver cDNAs used to generate the difference
products becomes critical (Hubank and Schatz 1994
). Too much driver
cDNA can cause insufficient enrichment of the targets, rendering
differences invisible. In contrast, too little driver cDNA may cause
insufficient exhaustion of common (but differentially expressed)
sequences in the tester cDNA, generating background such as the gene
fragments isolated, but not present, in the target YAC 911D5.
These fragments could represent genes upregulated in the hamster cells
after YAC integration, or genes whose protein products are involved in
the same pathway, but downstream to elevated gene expression from the
YAC. Given these possibilities, it is crucial to do a detailed
assessment of the sequence of the fragments.
Thus, there are several ways to modify cDNA RDA to address problems particular to tester and driver representations generated from YAC transduction of mammalian cells. In the present study, the identification of NPC1 and these two novel non-NPC1 gene fragments suggests that RDA combined with YAC transfer to mammalian cells is useful for gene identification and could be used when only a YAC physical contig exists, but no phenotype is available to monitor. The current strategy is advantageous because it allows for the search for genes to begin at the point that a YAC contig is available, thus accelerating the gene identification process.
| |
METHODS |
|---|
|
|
|---|
cDNA Synthesis and Generation of Difference Products
All cell lines were grown as monolayers in Ham's F-12 medium (Biofluids, Rockville, MD) supplemented with 10% fetal bovine serum (FBS, Hyclone), 2 mm glutamine, 100 U/ml penicillin, and 100 µg/ml streptomycin. RNA was extracted from cell monolayers by use of TRIZOL (GIBCO-BRL, Gaithersberg, MD), following the product instructions. Poly(A)+ mRNA was separated from total RNA by the Oligo (dT) Columns mRNA Purification Kit (Pharmacia, Piscataway, NJ). Double-stranded cDNA was prepared by reverse transcription by use of the Universal RiboClone cDNA Synthesis System (Promega, Madison, WI).
cDNA representational difference analysis was performed on the basis of
the protocol by Hubank and Schatz (1994)
with slight modifications.
Double-stranded cDNA (2 µg) was digested with DpnII to
generate tester and driver cDNA representations. In the second round of
subtraction hybridization, the tester/driver cDNA ratio was increased
to 1:40,000 and mung bean nuclease digestion of PCR products was omitted.
Cloning and Characterization of DP II
Second difference products were directly cloned into pCR 2.1 TA
cloning vectors and transformed into INV
F' One Shot competent cells with the TA Cloning Kit (Invitrogen, Carlsbad, CA). The cloned
gene fragments were plated on LB/Ampicilin plates and colony lifts were
prepared for subsequent screenings with the NPC1 gene or other
probes. Representative gene fragments were sequenced and the obtained
sequences were BLAST (Altschul et al. 1990
) searched against GenBank to
identify sequence similarities. PCR primers used to determine the
physical locations of the gene fragments were as follows: 2-R-f
5'-CAATCAGAGTGAAGCCTGGGAC-3'; 2-R-r
5'-TCTTGGGGTAGAAACCTACGTCAC-3'; 7-R-f
5'-CGTCAAACTCTCCCCGTAACTTG-3'; and 7-R-r
5'-CCACGAGAAGGTGCCTGTAAAAAG-3'.
For RT-PCR using 7-R primers, total RNA was treated with RNase-free DNase (RQ1, Promega) at 37°C for 30 min, and then at 90°C for 10 min to inactivate the DNase. Reverse transcription and PCR were performed by the SUPERSCRIPT Preamplification System (GIBCO-BRL, Gaithersburg, MD) following the manufacturer's instructions.
DNA Sequencing
DNA fragments were amplified by PCR from human genomic DNA and YAC 911D5 with the primer pairs, 2-R-f/2-R-r and 7-R-f/7-R-r, gel purified and directly sequenced by automated sequencing. DNA fragments amplified by RT-PCR from CT60 by use of the same primers were cloned into T-Adv vector (Clontech, Palo Alto, CA). Six CT60 clones containing the 2-R insert and eight clones containing the 7-R insert were sequenced by automated sequencing. Sequences were aligned by the MacVector 6.0.1 Align program (Oxford Molecular, Ltd.).
Northern Blot Analysis
For each RNA sample, 10 µg was loaded on a 1.2%
agarose/formaldehyde gel. Northern blotting and hybridization were
performed as described (Mogayzel et al. 1996
). Probes were labeled by
random oligo extension (Ready-To-Go kit, Pharmacia Biotech), following the kit instructions. After a 10-min denaturation period, repeated sequences were blocked by incubating the probe with unlabeled human
placental and Cot-1 DNA at 65°C for 30 min.
| |
ACKNOWLEDGMENTS |
|---|
We thank L. Staudt (National Cancer Institute) for providing a protocol for cDNA RDA, F. Collins, J. Trent, D. Tagle, and S. Chandrasekharappa for their contributions to this study, K. Henning, P. Mogayzel, and P. Liu for critical reading of the manuscript, B. Pike for performing the radiation hybrid mapping, P. Pentchev, E. Carstea, and J. Morris (National Institute of Neurological Disorders and Stroke) for providing the NPC1 full-length cDNA probe, J. Morris and E. Carstea for providing unpublished data, and D. Leja for graphics assistance. J.G. was supported by a grant from the Ara Parseghian Medical Research Foundation.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Present address: Genome Therapeutics Corporation, Department of Human Genetics, Waltham, Massachusetts 02154 USA.
4 Corresponding author.
E-MAIL melis{at}nhgri.nih.gov; FAX (301) 492-4929.
| |
REFERENCES |
|---|
|
|
|---|
Received September 11, 1998; accepted in revised form December 5, 1998.
This article has been cited by other articles:
![]() |
J. A. Maguire and J. W. Reagan Jr. Silencing of the mutant SCAP allele accounts for restoration of a normal phenotype in CT60 cells selected for NPC1 expression J. Lipid Res., September 1, 2005; 46(9): 1840 - 1848. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Walter, J. P. Davies, and Y. A. Ioannou Telomerase immortalization upregulates Rab9 expression and restores LDL cholesterol egress from Niemann-Pick C1 late endosomes J. Lipid Res., February 1, 2003; 44(2): 243 - 253. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. FROHME, B. SCHARM, H. DELIUS, R. KNECHT, and J. D. HOHEISEL Use of Representational Difference Analysis and cDNA Arrays for Transcriptional Profiling of Tumor Tissue Ann. N.Y. Acad. Sci., June 1, 2000; 910(1): 85 - 105. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||