|
|
|
|
Genome Res. 13:1307-1317, 2003 ©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00 Cytokine-Related Genes Identified From the RIKEN Full-Length Mouse cDNA Data Set1Institute for Infocomm Research, Singapore 119613 2Medical Informatics Centre, Division of Science and Design, University of Canberra, Bruce ACT 2617, Australia 3Autoimmunity Research Unit, The Canberra Hospital, Garran ACT 2606, Australia 4John Curtin School of Medical Research, Australian National University, Canberra ACT 2601, Australia 5Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan 6Genome Science Laboratory, RIKEN, Hirosawa, Wako, Saitama 351-0198, Japan 7Biomedical Knowledge Discovery Team, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), Yokohama 230-0045, Japan
To identify novel cytokine-related genes, we searched the set of 60,770 annotated RIKEN mouse cDNA clones (FANTOM2 clones), using keywords such as cytokine itself or cytokine names (such as interferon, interleukin, epidermal growth factor, fibroblast growth factor, and transforming growth factor). This search produced 108 known cytokines and cytokine-related products such as cytokine receptors, cytokine-associated genes, or their products (enhancers, accessory proteins, cytokine-induced genes). We found 15 clusters of FANTOM2 clones that are candidates for novel cytokine-related genes. These encoded products with strong sequence similarity to guanylate-binding protein (GBP-5), interleukin-1 receptor-associated kinase 2 (IRAK-2), interleukin 20 receptor isoform 3, a member of the interferon-inducible proteins of the Ifi 200 cluster, four members of the membrane-associated family 1-8 of interferon-inducible proteins, one p27-like protein, and a hypothetical protein containing a Toll/Interleukin receptor domain. All four clones representing novel candidates of gene products from the family contain a novel highly conserved cross-species domain. Clones similar to growth factor-related products included transforming growth factor -inducible early growth response protein 2 (TIEG-2), TGF -induced factor 2, integrin -like 1, latent TGF-binding protein 4S, and FGF receptor 4B. We performed a detailed sequence analysis of the candidate novel genes to elucidate their likely functional properties.
Cytokines are polypeptides secreted by immune cells that function as humoral regulators modulating functional activities of cells and tissues including cellular interactions and processes. Cytokine genes often encode functional variants through alternative splicing, resulting in proteins that may have slightly different biological activities (Ibelgaufts 1999
Activated cytokine-producing cells often release many different cytokines, resulting in complex networks of interacting signals. Cytokines are crucial in immune cell activation, cell-to-cell communication, signal transduction, mitosis, cell survival, death, and transformation. In addition to their involvement in initiation and regulation of immune responses, cytokines are also important in embryogenesis, organ development, and neuronal processes (Ibelgaufts 1999 The function of a particular cytokine can be assessed in vivo in transgenic or gene knock-out animals by studying the effects of an excess or lack of a particular cytokine gene product. It is of great importance for cytokine research, therefore, that the counterparts of human cytokine, cytokine receptors, and cytokine-related genes are identified in animals (e.g., the mouse) suitable for genetic manipulation. Because of the remarkable similarity between human and mouse genomes and their closely related biochemical, physiological, and pathological pathways, the mouse is an extremely useful animal model for immunological studies.
The FANTOM2 collaboration (Okazaki et al. 2002
Of the 60,770 FANTOM2 clones, 256 had annotations that contained a cytokine name. A total of 222 clones had high-sequence identity (>96% over 50% length of the reference sequence) to 108 known mouse genes, as listed in the major databases. Of these, clones represented 27 known interferon-related genes (Table 1). The known mouse genes in the FANTOM2 set comprised 1 interferon (IFN- ), 2 IFN receptors (IFN![]() receptor, and IFN![]() receptor 2), 20 inducible/activatable genes, and 4 other IFN-related genes. Search of the protein translation of the GenBank database (Benson et al. 2002
A total of 77 clones represented 39 known interleukin-related genes (Table 2). The known mouse genes in the FANTOM2 set comprised 11 interleukins (IL) or chains thereof (IL-1
A total of 61 clones represented 23 known growth factor (EGF, FGF, and TGF)-related genes (Table 3). The known mouse genes in the FANTOM2 set comprised 5 growth factors (EGF, FGF 14, FGF 15, TGF
Finally, 30 clones represented 19 known small inducible cytokine-related genes in mouse (Table 4). Search of the protein translation of the GenBank database revealed that there are at least 38 nonredundant entries of various small inducible cytokines in mouse.
Clones representing 17 known mouse cytokine-related genes were derived from cDNA libraries from the non-obese diabetic (NOD) mouse. All of the remaining cytokine-related clones were derived from the C57BL/6 mouse strain. Six known genes were only found from the NOD mouse libraries. Clones representing a total of 11 known mouse genes were found in both C57BL/6 and NOD mouse cDNA libraries. The clones representing known cytokine-related genes were derived from a broad variety of RIKEN cDNA libraries. These libraries were generated from various tissues and developmental stages. Because this analysis focused on name matching from both the FANTOM2 data set and GenBank database, we can estimate that FANTOM2 clones cover We have also identified 15 candidate novel cytokine or cytokine-related genes (Table 5), including 7 candidate novel mouse IFN-related genes (13 clones), 3 candidates for novel IL-related genes (5 clones), and 4 candidates for novel growth factor-related genes (3 TGF- and 1 EGF-related) (16 clones). All of the clones representing candidate novel genes were derived from the C57BL/6 mouse and showed ≤96% identity over >50% length of the reference sequence.
Interferon-related Novel Gene Candidates
Sequence of the clone 2310061N23 showed similarity to the gene for the IFN-
Clone A43007
[GenBank]
5K09 represents a potential new member of a group of structurally related interferon-inducible proteins, the Ifi 200 cluster, that regulate biological activities of IFNs. The IFNs are immune response modulating cytokines that exhibit antiviral effects, induce expression of class I or class II major histocompatibility complex proteins, and help activate macrophages, natural killer cells, and neutrophils. Sequence of the clone A43007
[GenBank]
5K09 shows similarity (84.9%ID, 425 amino acids, 100% of the total length) to the Ifi 205, the new member of the Ifi 200 family (Table 5 and Fig. 4, supplemental research data; available online at www.genome.org). Little is known about the precise function of the Ifi 205. However, it contains the PAAD/DAPIN/Pyrin domain (Pfam 02758), which is present in other members of the Ifi 200 family, indicating a possible proteinprotein interaction site. This domain was predicted to have the same six
Sequence of the clone 5330409J06 showed a weak similarity (Table 5 and Fig. 5, supplemental research data) to the guanylate-binding protein 5GBP5 (P26376
[GenBank]
), published only as a database entry (GenBank AF288815
[GenBank]
). To our knowledge, the function and structure of GBP5 has not been described or published. In humans, the members of the GBP family are the most abundant class of proteins induced by IFN-
Interleukin-Related Novel Gene Candidates
Sequence of the clone 4732448K15 is similar (69% identity in full-length sequence of 598 AA) to the human IL-1 receptor-associated kinase-2 (IRAK-2), a proximal mediator of IL-1 signaling (Table 5 and Fig. 7, supplemental research data). IRAK-2 binds to the IL-1 receptor and is involved in IL-1-dependent signaling (Muzio et al. 1997
Sequence of the clone E230031K19 is weakly similar (76.3% identity over 93.3% length) to the human interleukin 20 receptor
Growth Factor-Related Novel Gene Candidates
Sequence of the clone 4921501K24 showed similarity 93.7% identity over the full length(Table 5 and Fig. 10, supplemental research data) to TGF-
Little is known about the integrin
The conceptual translation of clone E33001
[GenBank]
2C08 showed similarity (Table 5 and Fig. 12, supplemental research data) to the human latent TGF-
Finally, the conceptual translation of clone 9130025L13 showed similarity (Table 5 and Fig. 13, supplemental research data) to the human FGF receptor 4BFGFR4b (Q9BXY2). FGF receptors are involved in multiple hormonal and proliferative processes of hormonal cells (Yu et al. 2002
The mouse is the most popular model organism for both basic and applied immunological studies, including the study of cytokines. Our analysis of the FANTOM2 data set focused on the subset of transcripts that showed similarity to the cytokine-related products, and which had cytokine names or abbreviations in the name or description fields of the corresponding database entries (the cytokine-name criterion). Our study is only partial because databases include cytokine-related entries of genes and proteins that do not have an explicit cytokine name in the name or description fields. However, this study is representative because it focused on a well-defined subset of cytokine-related genes from which we can make an estimate of the coverage of cytokine-related genes in the FANTOM2 data set.
In comparison with database entries that conform to the cytokine-name criterion, we estimate that the ratio of cytokine-related products represented in the FANTOM2 data set is of the order of 20% of the total known cytokine-related products. This estimate is not surprising, because cytokines are inducible and often tissue-specific products. In addition, even when induced, the expression level of cytokines is often very low. The RIKEN libraries, although diverse, may not represent the optimal conditions required for induction of cytokines and their related genes. The regulatory processes involving cytokines are complex processes involving activation and suppression of multiple genes (Doly et al. 1998
It is not always possible, particularly in large-scale studies, to derive clear conclusions and explanations of the data. Our selection of the threshold that distinguishes novel from the known genes or gene candidates is arbitrary. The clone representing integrin
Computational analysis has been applied previously to identification of novel cytokines from EST libraries, such as cardiotrophin-like cytokine (Shi et al. 1999 We identified 15 candidates for novel cytokine-related genes. Because many cytokine-related genes do not contain cytokine names in their description, this should represent only a fraction of the novel cytokine-related candidate genes captured by the FANTOM2 library. Further strategies will involve making comprehensive lists of cytokine-related genes followed by comparison of these genes with the existing cDNA libraries and future cDNA libraries designed for capturing specific gene expression during the immune processes.
The FANTOM2 set of full-length mouse cDNA clones, referred to as clones in this text, contains 60,770 sequences with the total length of almost 120 million base pairs and more than 12 million conceptually translated amino acids. The FANTOM2 clones were functionally annotated using automatic computational annotation followed by expert human curation (Okazaki et al. 2002
Sequence Analysis
Functional Annotation and Identification of FANTOM2 Clones
Diego G. Silva is the recipient of a scholarship from the Canberra Hospital Salaried Specialists Private Practice Fund.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1016503.
10 Corresponding author.
9 Piero Carninci, Jun Kawai, and Yoshihide Hayashizaki. [Supplemental material is available online at www.genome.org]
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403 -410.[CrossRef][Medline] Anderson, K.V 2000. Toll signaling pathways in the innate immune response. Curr. Opin. Immunol. 12: 13-19.[CrossRef][Medline] Anderson, S.L., Carton, J.M., Lou, J., Xing, L., and Rubin, B.Y. 1999. Interferon-induced guanylate binding protein-1 (GBP-1) mediates an antiviral effect against vesicular stomatitis virus and encephalomyocarditis virus. Virology 256: 8-14[CrossRef][Medline]
Apweiler, R., Attwood, T.K., Bairoch, A., Bateman, A., Birney, E., Biswas, M., Bucher, P., Cerutti, L., Corpet, F., Croning, M.D., et al. 2001. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 29:37
-40.
Bairoch, A. and Apweiler, R. 2000. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28:45
-48.
Bange, J., Prechtl, D., Cheburkin, Y., Specht, K., Harbeck, N., Schmitt, M., Knyazeva, T., Muller, S., Gartner, S., Sures, I., et al. 2002. Cancer progression and tumor cell motility are associated with the FGFR4 Arg (388) allele. Cancer Res.
62:840
-847.
Bateman, A., Birney, E., Cerruti, L., Durbin, R., Etwiller, L., Eddy, S.R., Griffiths-Jones, S., Howe, K.L., Marshall, M., and Sonnhammer, E.L. 2002. The Pfam protein families database. Nucleic Acids Res. 30:276
-280.
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Rapp, B.A., and Wheeler, D.L. 2002. GenBank. Nucleic Acids Res. 30:17
-20.
Blake, J.A., Richardson, J.E., Bult, C.J., Kadin, J.A., Eppig, J.T., and the Mouse Genome Database Group. 2002. The Mouse Genome Database (MGD): The model organism database for the laboratory mouse. Nucleic Acids Res. 30:113
-115. Cao, L., Kulmburg, P., Veelken, H., Mackensen, A., Mezes, B., Lindemann, A., Mertelsmann, R., and Rosenthal, F.M. 1998. Cytokine gene transfer in cancer therapy. Stem Cells 16:251 -260.
Cook, T., Gebelein, B., Mesa, K., Mladek, A., and Urrutia, R. 1998. Molecular cloning and characterization of TIEG2 reveals a new subfamily of transforming growth factor-
Denovan-Wright, E.M., Ferrier, G.R., Robertson, H.A., and Howlett, S.E. 2000. Increased expression of the gene for Doly, J., Civas, A., Navarro, S., and Uze, G. 1998. Type I interferons: Expression and signalization. Cell. Mol. Life. Sci. 54:1109 -1121.[CrossRef][Medline] Dumoutier, L. and Renauld, J.C. 2002. Viral and cellular interleukin-10 (IL-10)-related cytokines: From structures to function. Eur. Cytokine Netw. 13: 5-15.[Medline] Gribaudo, G., Ravaglia, S., Guandalini, L., Riera, L., Gariglio, M., and Landolfo, S. 1997. Molecular cloning and expression of an interferon-inducible protein encoded by gene 203 from the gene 200 cluster. Eur. J. Biochem. 249:258 -264.[Medline] Gribaudo, G., Riera, L., De Andrea, M., and Landolfo, S. 1999. The antiproliferative activity of the murine interferon-inducible Ifi 200 proteins depends on the presence of two 200 amino acid domains. FEBS Lett. 456: 31-36.[CrossRef][Medline] Guenzi, E., Topolt, K., Cornali, E., Lubeseder-Martellato, C., Jorg, A., Matzen, K., Zietz, C., Kremmer, E., Nappi, F., Schwemmle, M., et al. 2001. The helical domain of GBP-1 mediates the inhibition of endothelial cell proliferation by inflammatory cytokines. EMBO J. 20:5568 -5577.[CrossRef][Medline] Han, B.H., Park, D.J., Lim, R.W., Im, J.H., and Kim, H.D. 1998. Cloning, expression, and characterization of a novel guanylate-binding protein, GBP3 in murine erythroid progenitor cells. Biochim. Biophys. Acta 1384:373 -386.[CrossRef][Medline] Hanks, S.K. and Hunter, T. 1995. Protein kinases 6. The eukaryotic protein kinase superfamily: Kinase (catalytic) domain structure and classification. FASEB J. 9: 576-596.[Abstract]
Hayzer, D.J., Brinson, E., and Runge, M.S. 1992. A rat Hofmann, K. and Tschopp, J. 1995. The death domain motif found in Fas (Apo-1) and Tnf receptor is present in proteins involved in apoptosis and axonal guidance. FEBS Lett. 371:321 -323.[CrossRef][Medline] Homey, B., Muller, A., and Zlotnik, A. 2002. Chemokines: Agents for the immunotherapy of cancer? Nature Rev. Immunol. 2:175 -184.[CrossRef][Medline]
Hubbard, T., Barker, D., Birney, E., Cameron, G., Chen, Y., Clark, L., Cox, T., Cuff, J., Curwen, V., Down, T., et al. 2002. The Ensembl genome database project. Nucleic Acids Res.
30: 38-41. (www.ensembl.org). Ibelgaufts, H. 1999. Cytokine Online Pathfinder Encyclopaedia. (www.copewithcytokines.de) Imoto, I., Pimkhaokham, A., Watanabe, T., Saito-Ohara, F., Soeda, E., and Inazawa, J. 2000. Amplification and overexpression of TGIF2, a novel homeobox gene of the TALE superclass, in ovarian cancer cell lines. Biochem. Biophys. Res. Commun. 276:264 -270.[CrossRef][Medline] Jeschke, M.G., Herndon, D.N., Baer, W., Barrow, R.E., and Jauch, K.W. 2001. Possibilities of non-viral gene transfer to improve cutaneous wound healing. Curr. Gene Ther. 1: 267-278.[Medline]
Kawaji, H., Schonbach, C., Matsuo, Y., Kawai, J., Okazaki, Y., Hayashizaki, Y., and Matsuda, H. 2002. Exploration of novel motifs derived from mouse cDNA sequences. Genome Res.
12:367
-378. Knutson, K.L. and Disis, M.L. 2001. Expansion of HER2/neu-specific T cells ex vivo following immunization with a HER2/neu peptide-based vaccine. Clin. Breast Cancer 2: 73-79.[Medline] Lewin, A.R., Reid, L.E., McMahon, M., Stark, G.R., and Kerr, I.M. 1991. Molecular analysis of a human interferon-inducible gene family. Eur. J. Biochem. 199:417 -423.[Medline]
Lo Conte, L., Brenner, S.E., Hubbard, T.J., Chothia, C., and Murzin, A.G. 2002. SCOP database in 2002: Refinements accommodate structural genomics. Nucleic Acids Res.
30:264
-267.
Melhuish, T.A., Gallo, C.M., and Wotton, D. 2001 TGIF2 interacts with histone deacetylase 1 and represses transcription. J. Biol. Chem. 276:32109
-32114. Miyazaki, H. and Kato, T. 1999. Thrombopoietin: Biology and clinical potentials. Int. J. Hematol. 70:216 -225.[Medline] Muller, Y.A., Ultsch, M.H., and de Vos, A.M. 1996. The crystal structure of the extracellular domain of human tissue factor refined to 1.7 Å resolution. J. Mol. Biol. 256:144 -159.[CrossRef][Medline]
Munshi, N., Yie, Y., Merika, M., Senger, K., Lomvardas, S., Agalioti, T., and Thanos, D. 1999. The IFN-
Muzio, M., Ni, J., Feng, P., and Dixit, V.M. 1997. IRAK (Pelle) family member IRAK-2 and MyD88 as proximal mediators of IL-1 signaling. Science 278:1612
-1615. Okazaki, Y., Furuno, M., Kasukawa, T., Adachi, J., Bono, H., Kondo, S., Nikaido, I., Osato, N., Saito, R., Suzuki, H., et al. 2002. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420:563 -573.[CrossRef][Medline] Pearson, W.R., Wood, T., Zhang, Z., and Miller, W. 1997. Comparison of DNA sequences with protein sequences. Genomics 46:24 -36.[CrossRef][Medline] Prakash, B., Renault, L., Praefcke, G.J., Herrmann, C., and Wittinghofer, A. 2000. Triphosphate structure of guanylate-binding protein 1 and implications for nucleotide binding and GTPase mechanism. EMBO J. 19:4555 -4564.[CrossRef][Medline]
Pru, J.K., Austin, K.J., Haas, A.L., and Hansen, T.R. 2001. Pregnancy and interferon-
Quackenbush, J., Cho, J., Lee, D., Liang, F., Holt, I., Karamycheva, S., Parvizi, B., Pertea, G., Sultana, R., and White, J. 2001. The TIGR Gene Indices: Analysis of gene transcript sequences in highly sampled eukaryotic species. Nucleic Acids Res. 29:159
-164.
Rasmussen, U.B., Wolf, C., Mattei, M.G., Chenard, M.P., Bellocq, J.P., Chambon, P., Rio, M.C., and Basset, P. 1993. Identification of a new interferon- Rosenberg, S.A. 2001. Progress in the development of immunotherapy for the treatment of patients with cancer. J. Intern. Med. 250:462 -475.[CrossRef][Medline]
Saharinen, J., Taipale, J., Monni, O., and Keski-Oja, J. 1998. Identification and characterization of a new latent transforming growth factor-
Schwemmle, M. and Staeheli, P. 1994. The interferon-induced 67-kDa guanylate-binding protein (hGBP1) is a GTPase that converts GTP to GMP. J. Biol. Chem.
269:11299
-11305. Shi, Y., Wang, W., Yourey, P.A., Gohari, S., Zakauskas, D., Zhang, J., Ruben, S., and Alderson, R.F. 1999. Computational EST database analysis identifies a novel member of the neuropoietic cytokine family. Biochem. Biophys. Res. Commun. 262:132 -138.[CrossRef][Medline]
Slack, J.L., Schooley, K., Bonnert, T.P., Mitcham, J.L., Qwarnstrom, E.E., Sims, J.E., and Dower, S.K. 2000. Identification of two major sites in the type I interleukin-1 receptor cytoplasmic region responsible for coupling to pro-inflammatory signaling pathways. J. Biol. Chem.
275:4670
-4678. Smith, B.R. 1990. Regulation of hematopoiesis. Yale J. Biol. Med. 63:371 -380.[Medline] Staudt, L.M. and Brown, P.O. 2000. Genomic views of the immune system. Annu. Rev. Immunol. 18:829 -859.[CrossRef][Medline]
Sterner-Kock, A., Thorey, I.S., Koli, K., Wempe, F., Otte, J., Bangsow, T., Kuhlmeier, K., Jin, S., Keski-Oja, J., and Melchner, H. 2002. Disruption of the gene encoding the latent transforming growth factor-
Thompson, J.D., Higgins, D.G., and Gibson, T.J. 1994. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res.
22:4673
-4680. Vanden Berghe, W., Vermeulen, L., De Wilde, G., Bosscher, K., Boone, E., and Haegeman, G. 2000. Signal transduction by tumor necrosis factor and gene regulation of the inflammatory cytokine interleukin-6. Biochem. Pharmacol. 60:1185 -1195.[CrossRef][Medline] Vestal, D.J., Gorbacheva, V.Y., and Sen, G.C. 2000. Different subcellular localizations for the related interferon-induced GTPases, MuGBP-1 and MuGBP-2: Implications for different functions? J. Interferon Cytokine Res. 20:991 -1000.[CrossRef][Medline]
Wheeler, D.L., Church, D.M., Lash, A.E., Leipe, D.D., Madden, T.L., Pontius, J.U., Schuler, G.D., Schriml, L.M., Tatusova, T.A., Wagner, L., et al. 2002. Database resources of the National Center for Biotechnology Information: 2002 update. Nucleic Acids Res. 30:13
-16.
Wu, C.H., Huang, H., Arminski, L., Castro-Alvear, J., Chen, Y., Hu, Z.Z., Ledley, R.S., Lewis, K.C., Mewes, H.W., Orcutt, B.C., et al. 2002. The Protein Information Resource: An integrated public resource of functional annotation of proteins. Nucleic Acids Res. 30:35
-37.
Yu, S., Asa, S.L., and Ezzat, S. 2002. Fibroblast growth factor receptor 4 is a target for the zinc-finger transcription factor Ikaros in the pituitary. Mol. Endocrinol.
16:1069
-1078.
Received November 27, 2002;
accepted in revised format March 6, 2003.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||