|
|
|
|
Vol. 11, Issue 11, 1871-1877, November 2001
METHODS
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
A retroviral gene trap containing a human CD2 cell surface antigen/neomycin-phosphotransferase fusion gene in the U3 region of its LTR (U3Ceo) was used to screen the mammalian genome for genes encoding secreted and/or transmembrane proteins that are repressed by oncogenic transformation. From an integration library consisting of cells transformable by the insulin-like growth factor 1 (IGF-1), a collection of neomycin resistant (NeoR) clones was obtained; 86% also expressed the CD2 cell surface antigen. Molecular analysis of a random sample of NeoR clones revealed that the U3Ceo gene trap preferentially disrupted genes coding for secreted and transmembrane proteins. In each case, the signal sequence of the endogenous gene was fused in-frame to the CD2/neomycin-phosphotransferase reporter gene due to a cryptic splice acceptor site embedded in the coding region of the CD2 cDNA. When the library was transformed by IGF-1 and selected against CD2 expression, integrations were obtained in genes that are repressed by transformation. Molecular analysis of six randomly chosen integrations revealed that, in each case, U3Ceo captured a signal sequence from proteins involved in oncogenic transformation and metastatic spread.
| |
INTRODUCTION |
|---|
|
|
|---|
Because defects in intracellular signaling can
cause cancer, components of signal transduction pathways are common
targets of antineoplastic drugs. Among these, secreted and cell surface proteins are preferred because they are easily accessible to specific agonists and antagonists. For example, many epithelial cancers show
constitutive activation of the tyrosine kinase receptors for epidermal
growth factor (EGF) or insulin-like growth factor (IGF) (Schlessinger
2000
); this has led to the development of drugs that interfere with
these receptors. Drugs like ZD1839/Iressa, which antagonizes EGF
receptor activation and signaling, are presently in clinical
development (Arteaga 2000
).
The targeting of proteins to the secretory pathway requires a short
N-terminal sequence called a "signal" or "leader" peptide (von
Heijne 1985
), which also determines the protein's orientation across
the cellular membranes. This signal sequence is conserved in secreted
and membrane-spanning proteins and has been exploited in all strategies
developed to isolate or identify signal sequence genes. Thus, signal
sequence traps consisting of reporter or selectable marker genes, whose
expression is dependent on the acquisition of a signal sequence, or
antibodies raised against signal sequence peptides, have been used with
variable success (Tashiro et al. 1993
; Skarnes et al. 1995
; Imai et al.
1996
; Klein et al. 1996
; Scherer et al. 1998
; Lim and Garzino-Demo
2000
; Mitchell et al. 2001
). To develop a strategy that would be
applicable to genomewide screens for secreted and transmembrane
proteins in living cells, we used a gene trap approach.
Gene traps insert a reporter gene into mostly random chromosomal sites,
including transcriptionally active regions. By selecting for gene
expression, recombinants are obtained in which the reporter gene is
fused to the regulatory elements of endogenous genes. Transcripts
generated by these fusions faithfully reflect the activity of the
tagged cellular gene and thus provide an effective means to study the
expression of genes in their normal chromosomal location (Friedrich and
Soriano 1991
; Skarnes et al. 1992
; von Melchner et al. 1992
). By using
appropriate reporter systems, we and others have developed gene trap
strategies to identify genes regulated during important biological
processes or by exogenous stimuli (Reddy et al. 1992
; Forrester et al.
1996
; Russ et al. 1996a
; Thorey et al. 1998
; Medico et al. 2001
).
To develop a gene trap strategy that would select for integrations into regulated genes encoding secreted and/or transmembrane proteins, we used the human CD2 cell surface antigen fused in-frame to an Escherichia coli neomycin-phosphotransferase (Ceo) as a reporter gene in the retroviral gene trap U3Ceo. Because the signal peptide sequence of the CD2 cDNA terminates in a cryptic splice acceptor site (GenBank accession no. XM_002141), we assumed that it would be removed by splicing from U3Ceo integrations into introns of expressed genes. Consequently, the cell surface expression of the CD2-neo fusion protein would rely on the acquisition of a signal sequence from an endogenous gene.
We show here that the retroviral gene trap U3Ceo effectively selects for integrations into genes encoding secreted and/or transmembrane proteins that are repressed by oncogenic transformation. This enables a genomewide screen for proteins with putative antineoplastic functions and for targets that are easily accessible to anti-cancer drugs.
| |
RESULTS |
|---|
|
|
|---|
Construction of a Gene Trap Retrovirus Expressing a CD2/Neomycin-Phosphotransferase Fusion Gene (U3Ceo)
To construct a gene trap that would allow selection for integrations
into genes with signal sequences, we cloned the combined reporter/selectable marker gene Ceo into the U3-region of a
promoter/enhancer-deleted retroviral vector based on pBabePuro
(Morgenstern and Land 1990
). The Ceo gene was an in-frame fusion
between the genes encoding the human T-cell-specific CD2 antigen (CD2)
(Seed and Aruffo 1987
) and the E. coli
neomycin-phosphotransferase (neo) (Colbere-Garapin et al. 1981
). It
was obtained by fusing an ATG-deleted and truncated CD2 cDNA to the neo
gene immediately downstream of its first ATG (Fig.
1A). The truncated CD2 cDNA, which consists
of nucleotides 15-782 (GenBank accession no. XM_002141), encodes the
entire extracellular and transmembrane domains, but only a short
segment of the intracellular domain. As has been shown in previous
studies, virus replication and long terminal repeat (LTR)-mediated
duplication places sequences inserted into U3 just 30 nucleotides
downstream of the flanking chromosomal DNA (von Melchner and Ruley
1989
). Because of a cryptic splice acceptor sequence located in the CD2
coding sequence 58 nucleotides downstream of the ATG, and a branch site
consensus sequence located 21 nucleotides upstream of the splice site
(nucleotides 41-47), gene trap integrations into the introns of
transcribed genes were expected to express cell-provirus fusion
transcripts in which the upstream exons are spliced to CD2. Because
this excises CD2's signal sequence, we anticipated that U3Ceo
expression would be dependent on an in-frame fusion to a signal
sequence of a cellular gene (Fig. 1B). Selection for U3Ceo expression
would then enrich for integrations into genes with signal sequences.
|
Development of a Cell Line Susceptible to Reversible Oncogenic Transformation
To identify genes that are repressed by oncogenic transformation, we
required a cell system that would allow a controlled and reversible
switching from a normal to a transformed state. A cell line (N93.1/28)
with such properties was derived from NIH3T3 fibroblasts after
transducing a tetracycline-sensitive (tet-off system) human IGF-1
receptor (IGF-1R) gene and selecting for clones with a tight
anhydrotetracycline (ATC) repressible promoter (Baasner et al. 1996
).
When N93.1/28 cells were exposed to IGF-1, they converted to
anchorage-independent growth and formed foci in standard focus-forming
assays (Fig. 2). Conversion was dose
dependent and could be readily reversed by ATC, indicating that IGF-1R
signaling is required for transformation (Fig. 2B) (Baserga et al. 1997
).
|
U3Ceo Selects for Integrations into Genes Encoding Secreted and Transmembrane Proteins
To test the ability of U3Ceo to capture cellular signal sequences,
we infected nontransformed N93.1/28 cells with U3Ceo virus at a simple
multiplicity of infection (MOI = 1) and selected in G418.
From a library of 1 × 106 independent integrations, we
obtained 400 G418-resistant (NeoR) clones, indicating that
only 1 in 2500 integrations are transcribed. This suggested that only a
small subset of genes expressed in N93.1/28 cells are capable of
activating U3Ceo, and led to the assumption that this subset represents
the signal sequence genes. If this is indeed the case, the
G418-resistant clones should also express CD2 as a result of in-frame
fusions between an endogenous signal sequence peptide and the Ceo
protein. To test this, we treated pooled NeoR clones with the
anti-CD2-specific monoclonal antibody Leu5b and analyzed by flow
cytometry for CD2 expression. Figure 3
shows that >86% of the NeoR cells were positive for CD2,
indicating that the majority of U3Ceo fusion proteins have captured an
endogenous signal peptide. This high frequency of CD2-expressing
NeoR cells indicated that the majority of Ceo fusion proteins
expressed from gene trap integrations into genes without a signal
sequence are unstable (Skarnes et al. 1995
). To investigate this
further, cell-provirus fusion transcripts from eight randomly selected NeoR clones were amplified by 5'RNA-ligase mediated (RLM)
RACE (Maruyama and Sugano 1994
) and sequenced. In each case, cellular
sequences were fused to nucleotide 69 in the CD2 cDNA, which is
immediately downstream of the cryptic splice acceptor. Splicing of
cellular exons to the Ceo fusion gene deleted the first 90 nucleotides of U3Ceo, including the CD2 signal sequence and the cell DNA-provirus junction. Database analysis of the fusion transcripts revealed that
U3Ceo acquired signal sequences from six previously characterized proteins and from an anonymous EST (Table
1). Thus, taken together, the results
indicate that U3Ceo effectively selects for integrations into genes
encoding secreted and/or membrane-spanning proteins.
|
|
Selection for and against U3Ceo Expression Identifies Genes Repressed by Oncogenic Transformation
The value of secreted and cell surface proteins that are involved in
transformation is presently unmatched. Therefore, we asked whether
U3Ceo could be used to identify proteins whose expression is repressed
in transformed cells. For this, we infected 1 × 107
N93.1/28 cells with U3Ceo at a MOI = 1 and selected the integration library in G418. The G418-resistant library with U3Ceo integrations in
expressed genes was then subjected to several rounds of positive and
negative selection using antibody-mediated cell adhesion (panning) with
the Leu-5b anti-CD2 antibody (Seed and Aruffo 1987
). As shown in Figure
4A, after a first round of positive
panning, most cells expressed the CD2 cell surface antigen. However,
expression levels varied broadly, as one would expect from a polyclonal
population with U3Ceo integrations into genes with variable promoter
strength. The CD2-positive library was then transformed by IGF-1 and
selected against CD2 expression to eliminate all integrations in
constitutively expressed genes (Fig. 4 B,C). Finally, to ensure that
only truly regulated genes were recovered, the negatively selected
library was reversed to its nontransformed state by withdrawing IGF-1 and again selecting for CD2 expression. As shown in Figure 4D, the CD2
receptor returned to the cell surface in largely variable amounts,
indicating that polyclonality was maintained despite multiple rounds of
selection. Moreover, CD2 expression was promptly repressed following
IGF-1 addition, implying that the recovered U3Ceo integrations are in
tightly regulated genes (Fig. 4E).
|
To identify some of the regulated genes trapped by U3Ceo, six clones
were isolated from the preselected library by limiting dilution, and
their cell-provirus fusion transcripts were amplified and sequenced.
Database analysis revealed that, in each case, U3Ceo had disrupted a
gene encoding a signal peptide protein involved in oncogenic
transformation and metastatic spread (Table
2). Each of these genes was tightly
regulated by IGF-1, indicating that gene repression was either caused
directly by IGF-1 or, alternatively, occurred as a consequence of
oncogenic transformation (Fig. 5, and data not shown).
|
|
| |
DISCUSSION |
|---|
|
|
|---|
In the present study we have developed a gene trap strategy enabling genomewide screening for putative tumor suppressor genes encoding secreted and/or cell surface proteins in living cells. By infecting a conditionally transformable cell line with the retroviral gene trap U3Ceo and selecting in G418, recombinants were obtained that expressed a functional CD2/neomycin-phosphotransferase fusion protein on the cell surface. In each case, transport to the cell membrane was enabled by the acquisition of an N-terminal signal peptide encoded by an endogenous gene.
Proteins targeted to the secretory pathway have generated considerable
interest for two main reasons. First, most fundamental biological
processes involve secretory proteins and membrane receptors. Second,
secreted and membrane-bound proteins are easily accessible to exogenous
drugs and are thus preferred targets for drug development. To identify
such proteins, several strategies have been developed that all take
advantage of a characteristic N-terminal signal peptide sequence that
is required for channeling through the secretory pathway (Tashiro et
al. 1993
; Skarnes et al. 1995
; Imai et al. 1996
; Klein et al. 1996
;
Scherer et al. 1998
; Lim and Garzino-Demo 2000
; Mitchell et al. 2001
).
In most cases, reporter- and/or selectable marker genes encoding a
transmembrane domain but lacking a signal sequence have been used to
identify complementing cDNAs from cDNA libraries (Tashiro et al. 1993
;
Imai et al. 1996
; Klein et al. 1996
; Lim and Garzino-Demo 2000
).
Similarly, reporter genes dependent on signal sequences for expression
have been used to directly screen the genome for secretory proteins.
For example, a gene trap encoding a
-galactosidase/neomycin-phosphotransferase (
geo) fusion protein
into which a CD4 transmembrane domain had been inserted was shown to
enrich for integrations into signal sequence genes expressed during
mouse development (Skarnes et al. 1995
; Mitchell et al. 2001
). However,
because the efficiency of gene trap activation by signal sequence
capture was <20%, this gene trap seems unlikely to be suitable for
large-scale functional genomics. In contrast, U3Ceo, with its signal
sequence capture frequency of >86%, seems ideally suited for this
purpose. Thus, by using U3Ceo gene traps for each reading frame, it
seems possible to tag all secreted and transmembrane proteins expressed
in the mammalian genome.
The most attractive feature of U3Ceo is its ability to select for a
subset of genes that are not only encoding secreted and/or cell surface
proteins but are also regulated during a biological process. For
example, by selecting for and against Ceo expression in a reversible
transformation model, we expected to recover genes that are involved in
oncogenesis and tumor suppression. Indeed, each of the genes disrupted
by U3Ceo in this model had been previously shown to interfere with
tumor growth and metastatic spread. The human homolog of the thrombin
receptor (PAR-1) and the PDGF A-chain homodimer induce cell cycle
arrest and apoptosis by up-regulating cyclin-dependent kinase
inhibitors and caspases (Huang et al. 2000
; Yu et al. 2000
). The
-2
type 1 collagen and the Ly-6A.2 alloantigen inhibit cellular
transformation and proliferation (Travers et al. 1996
; Satoh et al.
1997
). Moreover, Ly-6A.2, also known as stem cell antigen-1 (Sca-1) is
expressed by primitive stem cells of the hematopoietic system and
mediates cell adhesion (English et al. 2000
). Similarly, cell adhesion
is promoted by the membrane-bound receptor-like protein tyrosine
phosphatase
(R-PTP-
), indicating that both proteins are likely
to reduce metastatic spread (Fuchs et al. 1996
). Finally, the class I
major histocompatibility complex (MHC-1) is essential for the
recognition of tumor antigens by the immune system. In many cancers,
down-regulation of MHC-1 by tumor cells enables them to evade a
specific immune response (Seliger et al. 2000
). Taken together, these
results underscore the ability of U3Ceo to selectively identify genes with type-2 tumor suppressor function (Lee et al. 1991
).
In conclusion, the gene trap strategy described here provides a means
to perform genomewide screens for secreted and/or transmembrane proteins regulated during a specific biological process. Although reversible transformation systems such as the one used here are particularly attractive for drug-target discovery in cancer research, the approach seems flexible enough to be adapted to almost any other
biological system in which altered signal transduction leads to a
detectable phenotype. Particularly interesting in this regard are the
postgenomic large-scale mouse mutagenesis programs that are certain to
benefit from the unique features of the U3Ceo gene trap (Wiles et al.
2000
; Mitchell et al. 2001
).
| |
METHODS |
|---|
|
|
|---|
Plasmids and Viruses
A CD2/neomycin-phosphotransferase fusion gene (Ceo) was inserted
into the NheI cloning site of a pBabeSin retroviral vector to obtain pBabeU3Ceo. pBabeSin was derived from pBabePuro (Morgenstern and Land 1990
) by removing SV40puro and the U3 promoter/enhancer sequences from the 3'LTR. To this end, pBabePuro was cleaved with SalI/XhoI and NheI/BanII,
respectively, and religated. To obtain Ceo, CD2 and
neomycin-phosphotransferase coding sequences were amplified by PCR
using the primer pairs
5'-attctagattccatgt aaatttgtagccagcttcc-3'/5'-cgcgggggatccgtagctactct gtgggctcttgtctc-3' and
5'-cgcgggggatcccattgaacaagatg gattgcacgcagg-3'/5'-cgcggggaattctctagattag-aag aactcgtcaagaaggcg-3',
respectively. The CD2-amplification product was ligated as an
XbaI/BamHI fragment to the 5' BamHI site of the amplified neomycin-phosphotransferase sequence and the fusion gene
was cloned as an XbaI/EcoRI fragment into pBluescript
KS. After verifying the in-frame fusion by sequencing (ABI310 Genetic Analyzer, Applied Biosystems), the fusion gene was ligated as an
XbaI fragment into the NheI site of pBabe Sin.
High-titer U3Ceo retroviral supernatants were generated by transient
transfection of pBabeU3Ceo into BOSC23 packaging cells and used for
infections as described previously (Russ et al. 1996b
).
Cell Cultures and Colony Assays
Cells were grown in Dulbecco's Modified Eagles Medium
supplemented with 10% newborn calf serum (NCS). NIH3T3 cells
overexpressing the human IGF-1 receptor under control of the
tetracycline-regulated promoter were generated by stable transfection
as previously described (Baasner et al. 1996
). Clones isolated by
limiting dilution were tested for tetracycline-regulated IGF1-R
expression by Northern blotting and by reversible tumorigenicity in
vitro as described previously (Andreu et al. 1998
). The cell line
N93.1/28 with tightly regulated IGF-1R expression was selected for all experiments.
Anchorage-independent growth was analyzed in soft agar colony assays by plating 1.5 × 103 cells into 96 well plates containing 0.5% w/v DMEM/agar supplemented with 10% (v/v) NCS. After incubating for 7 d, cell proliferation was estimated using the XTT-Cell Proliferation Kit (Roche Diagnostics, Mannheim) according to the manufacturer`s instructions. Light absorption reflecting colony proliferation was measured at 490 nm using a Victor 1420 multilabel counter (Wallac, Turku).
Panning and Antibody Treatment
The positive and negative panning procedures with the human CD2-specific mouse monoclonal antibody Leu-5b (Becton Dickinson, Heidelberg) were performed as follows: 100-mm bacterial plates were treated with 10 mL of a 20 µg/mL solution of anti-mouse-IgG-antibody (Cappel, Durham). After incubating for 1.5 h, plates were washed three times with 0.15 M NaCl and overlaid with 10 mL of 1% (w/v) solution of bovine serum albumin in PBS. After incubating overnight at room temperature, the bovine serum albumin was removed and plates were stored at -20°C until use. To identify genes repressed by IGF-1, we infected 1 × 107 N93.1/28 cells with U3Ceo at MOI = 1 and first selected for 10 d in G418 (800µg/mL). For panning, cells were suspended at a concentration of 5 × 106/mL and incubated for 30 min at room temperature in a 300 ng/mL Leu-5b antibody solution. After washing in PBS, the antibody-treated cells were placed on top of the anti-mouse-IgG antibody coated plates. After incubating for 1 h at room temperature, plates were gently washed with 10 mL PBS to discard the nonadherent cells. Adherent cells were subsequently harvested in DMEM medium supplemented with 10% NCS and exposed to 100 ng/mL recombinant human IGF-1 (Sigma, Deisenhofen) for 3 d. Transformed cells were subjected to negative panning essentially as described earlier except that this time the nonadherent cells were harvested and the adherent cells discarded.
Flow Cytometry
1 × 106 cells were suspended in 50 µL FACS-Wash solution (Becton Dickinson, Heidelberg) and incubated for 30 min at 4°C with 2.5 µg/mL of CD2-specific mouse monoclonal antibody Leu-5b. Following washings, the cells were incubated for 30 min at 4°C in 50 µL FACS-Wash solution containing 10% (v/v) FITC-conjugated anti-mouse-IgG antibody (Roche Diagnostics, Mannheim). Finally, 2 × 104 cells were suspended in 500 µL FACS-Wash solution and analyzed with a FACSCALIBUR (Becton Dickinson) flow cytometer using FITC-specific settings.
5'-RACE
5'-RACE was performed on 5 µg of total RNA using the FirstChoice RLM-RACE-Kit (Ambion) according to the manufacturer's instructions. CD2 specific primers were 5'-caagttgatgtc ctgacccaag-3' and 5'-ggtttccaaggcattcgtaatctc-3' (nested). The first amplification was for 35 cycles, at 96°C for 30 sec, 60°C for 30 sec, and 72°C for 2 min, whereas the second (nested) amplification was for 35 cycles at 96°C for 30 sec, 62°C for 30 sec, and 72°C for 2 min. RACE products were cloned into the pGEM-Teasy vector (Promega) and inserts of at least two independent E. coli clones were sequenced using a T7-specific primer (Promega).
| |
ACKNOWLEDGMENTS |
|---|
We thank E. Thoenes, C. Magel, B. Szabo, and D. Bauer for expert technical assistance. The vector pBabePuro was kindly provided by H. Land (Imperial Cancer Research Fund, London, UK). The CD2 cDNA was obtained from B. Seed (Harvard University, Boston) and the tet-off system from H. Bujard (ZMBH, Heidelberg). This work was supported in part by a grant from the Bundesministerium für Bildung und Forschung (BMBF), Bonn, Germany, to H.vM.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Corresponding author.
E-MAIL melchner{at}em.uni-frankfurt.de; FAX: 49-69-63016390.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.202601.
| |
REFERENCES |
|---|
|
|
|---|
2 (COL1A2) are epidermal growth factor-regulated growth repressors.
J. Biol. Chem.
273:
13848-13854
2(I) collagen.
Cell Growth Differ.
7:
1353-1360[Abstract].
activates c-Jun NH2-terminal kinase-1 and antagonizes PDGF receptor-
-induced phenotypic transformation.
J. Biol. Chem.
275:
19076-19082Received June 27, 2001; accepted in revised form August 7, 2001.
This article has been cited by other articles:
![]() |
S. De-Zolt, F. Schnutgen, C. Seisenberger, J. Hansen, M. Hollatz, T. Floss, P. Ruiz, W. Wurst, and H. von Melchner High-throughput trapping of secretory pathway genes in mouse embryonic stem cells Nucleic Acids Res., February 13, 2006; 34(3): e25 - e25. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Park, Y. Hu, T. V. S. Murthy, F. Vannberg, B. Shen, A. Rolfs, J. E. Hutti, L. C. Cantley, J. LaBaer, E. Harlow, et al. Building a human kinase gene repository: Bioinformatics, molecular cloning, and functional validation PNAS, June 7, 2005; 102(23): 8114 - 8119. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Schnutgen, S. De-Zolt, P. Van Sloun, M. Hollatz, T. Floss, J. Hansen, J. Altschmied, C. Seisenberger, N. B. Ghyselinck, P. Ruiz, et al. Genomewide production of multipurpose alleles for the functional analysis of the mouse genome PNAS, May 17, 2005; 102(20): 7221 - 7226. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hansen, T. Floss, P. Van Sloun, E.-M. Fuchtbauer, F. Vauti, H.-H. Arnold, F. Schnutgen, W. Wurst, H. von Melchner, and P. Ruiz A large-scale, gene-driven mutagenesis approach for the functional analysis of the mouse genome PNAS, August 19, 2003; 100(17): 9918 - 9922. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||