|
|
|
|
Published online before print
December 19, 2005, 10.1101/gr.4106106 Genome Res. 16:149-156, 2006 ©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
Massive genome erosion and functional adaptations provide insights into the symbiotic lifestyle of Sodalis glossinidius in the tsetse host1 Kitasato Institute for Life Sciences, Kitasato University, Sagamihara, Kanagawa 228-0829, Japan 2 Center for Basic Research, Kitasato Institute, Minato-ku, Tokyo 108-8641, Japan 3 Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, Connecticut 06510, USA 4 Genome Core Technology Facility, RIKEN Genomic Sciences Center, Yokohama, Kanagawa 230-0045, Japan
Sodalis glossinidius is a maternally transmitted endosymbiont of tsetse flies (Glossina spp.), an insect of medical and veterinary significance. Analysis of the complete sequence of Sodalis' chromosome (4,171,146 bp, encoding 2,432 protein coding sequences) indicates a reduced coding capacity of 51%. Furthermore, the chromosome contains 972 pseudogenes, an inordinately high number compared with that of other bacterial species. A high proportion of these pseudogenes are homologs of known proteins that function either in defense or in the transport and metabolism of carbohydrates and inorganic ions, suggesting Sodalis' degenerative adaptations to the immunity and restricted nutritional status of the host. Sodalis possesses three chromosomal symbiosis regions (SSR): SSR-1, SSR-2, and SSR-3, with gene inventories similar to the Type-III secretion system (TTSS) ysa from Yersinia enterolitica and SPI-1 and SPI-2 from Salmonella, respectively. While core components of the needle structure have been conserved, some of the effectors and regulators typically associated with these systems in pathogenic microbes are modified or eliminated in Sodalis. Analysis of SSR-specific invA transcript abundance in Sodalis during host development indicates that the individual symbiosis regions may exhibit different temporal expression profiles. In addition, the Sodalis chromosome encodes a complete flagella structure, key components of which are expressed in immature host developmental stages. These features may be important for the transmission and establishment of symbiont infections in the intra-uterine progeny. The data suggest that Sodalis represents an evolutionary intermediate transitioning from a free-living to a mutualistic lifestyle.
Through diverse processes and complex interactions, beneficial microbes contribute to the health, evolution, and diversity of ecosystems. Despite their wide occurrence, insights into the genomic and functional aspects of their biology are only recently accumulating. Members of the Class Insecta are among the taxa that have succeeded in often resource-limited and inhospitable ecological terrains, due in part to fitness advantages granted by symbiosis. Tsetse flies (Diptera: Glossinidia), vectors of pathogenic African trypanosomes that cause sleeping sickness in humans and nagana in animals, rely on symbiotic flora for important physiological processes such as fecundity.
Sodalis glossinidius is one of three vertically transmitted microbial symbionts in tsetse (Dale and Maudlin 1999
Three additional lines of evidence support Sodalis' recent divergence from a free-living ancestor. The first is the ability to successfully cultivate Sodalis on insect cells (Welburn et al. 1987
Despite its recent association, Sodalis does not exhibit a pathogenic association with its tsetse host. In fact, in experiments where Sodalis was selectively eliminated from tsetse by using the sugar analog and antibiotic streptozotocin, host longevity was significantly reduced (Dale and Welburn 2001
Unlike the diverse microbial flora harbored by animals with multiple diets, analysis to date indicates that tsetse flies are colonized by a small number of symbiotic microbes. This phenomenon is likely a consequence of the highly sterile nature of tsetse's vertebrate blood-specific diet, upon which it depends during all developmental stages. In addition to Sodalis, tsetse flies harbor the intracellular obligate mutualist Wigglesworthia glossinidia, which has coevolved with its tsetse host for >80 Myr (Chen et al. 1999 Here we report on the genome sequence of Sodalis, the first from an insect mutualist symbiont. We present characteristics of the Sodalis genome that represent the dynamic process of transitioning from a free-living to a mutualistic lifestyle. Furthermore, we discuss aspects of an active gene decay process and functional adaptations that enable the persistence of a successful symbiotic relationship.
The Sodalis proteome and its comparison with closely related bacteria The Sodalis genome consists of one circular chromosome of 4,171,146 bp with an average G+C content of 54.7%. In addition, Sodalis has three extrachromosomal plasmids designated pSG1, pSG2, and pSG3, as well as a phage, SG1. The chromosome exhibits a GC skew pattern typical of prokaryotic genomes that have two major shifts, one near the origin and one near the terminus of replication, with dnaA assigned as base pair 1 of the chromosome (Fig. 1). Of the 2432 putative protein-coding sequences (CDSs) annotated on the chromosome, 1465 (60%) are assigned to putative functions on the basis of homology to other known proteins, 484 (20%) are conserved in several bacteria but have unknown functions, 262 (11%) are phage related, and 221 (9%) have no homology to entries in the public databases. In addition to these CDSs, the chromosome also contains 972 pseudogenes distributed throughout the chromosome (Fig. 1). These segments are likely rendered nonfunctional by virtue of the accumulation of various mutations compared with corresponding functional homologs. The predicted CDSs cover only 51% of the chromosome, making this genome one of the least dense bacterial genomes in terms of coding capacity. The general features of the Sodalis chromosome and the associated plasmids are summarized in Table 1.
Phylogenetic analysis of Sodalis from tsetse species in different subgenera has indicated a single highly related lineage in Enterobacteriacaea, affirming its recent establishment in tsetse (Chen et al. 1999
When compared with the proteomes of related free-living microbes E. coli K-12, Salmonella, and Yersinia, the size distribution of Sodalis' CDSs indicates that its genome encodes a lower percentage of large protein products (Fig. 2). These data suggest that the Sodalis genome contains a significant number of fragmented CDSs or pseudogenes. Alignment of these pseudogenes with their functional homologs validated this hypothesis. Most of the pseudogenes were found to be generated by substitutions, insertions, or deletions of single nucleotides or large segments of DNA or by the incorporation of insertion elements. The majority of Sodalis' pseudogenes were created when a functional CDS was divided into more than three fragments, or through the loss of a large segment of DNA. Furthermore, we identified several loci with functional homologs in other species, but those in Sodalis were divided into two fragments due to a single mutation. These loci were considered pseudogenes if the CDS was less than half the length of its functional homolog in related species. On the other hand, if the CDS was greater than half the length of its functional homolog, the loci were not considered pseudogenes. Hence, some of Sodalis' assigned CDSs may in fact be pseudogenes, and future functional studies will be required to determine the precise composition of Sodalis' proteome. Based on this definition of a pseudogene, we mapped 972 loci rendered nonfunctional on the Sodalis chromosome (and some in the three plasmids) (Table 1).
Sodalis' low gene density is further demonstrated when its putative protein products are categorized into clusters of orthologous groups (COGs) and compared with those from closely related bacteria (Fig. 3). This low coding density (51%) is equivalent to that of the obligate intracellular human parasite Mycobacterium leprae (50%) (Cole et al. 2001
Biosynthetic capabilities of Sodalis reflect adaptations to host environment
Sodalis' gene set responsible for altering its cellular behavior in response to environmental cues is highly streamlined. Its genome has retained only five sigma factors (the primary RpoD, RpoE, RpoH, RpoN, and FliA) and eight signal transduction systems, in contrast to the human commensal microbe Bacteroides thetaiotaomicron, which has 50 ECF-type sigma factors and 32 one-component systems (Xu et al. 2003
Analysis of genes from the defense mechanisms COG indicates that Sodalis' need for multidrug resistance genes may have been eliminated, possibly because it resides in such a specialized environment. For example, -lactamase and its regulator, as well as multidrug efflux transporters, appear to be nonfunctional (Fig. 3; Supplemental Fig. 3). Despite its existence in the hemolymph and gut, the presence of Sodalis apparently does not invoke either a systemic or epithelial immune response in the fly (Hao et al. 2001
Sodalis genes associated with virulence in related pathogenic organisms
Preliminary analysis of the TTSSs indicate that the envelope-associated structural components, encoded within tightly linked operons, have been conserved, while some of the secreted effectors and transcriptional regulators found at positions adjacent to the core components appear absent, nonfunctional, or under relaxed selection. The nonfunctional proteins include sipACD (from SSR-1) and invE and hilA (from SSR-2), while those that display relaxed selection include prgIJ, spaN, and invB from SSR-2 (Dale et al. 2005 Despite an apparent lack of motility associated with Sodalis either in vivo or under in vitro cultivation conditions, 90 flagellar-related CDSs, contained on two distinct clusters, were identified on Sodalis' chromosome (Supplemental Fig. 4). The first of these (35.3 kb in length, SG0021SG0059) is predicted to encode a complete flagellar apparatus with motility proteins MotA and MotB, transcriptional regulators FlhCD, and the sigma factor FliA. In addition, the putative chemotaxis-transducing proteins CheW and CheZ are also present. The second locus (SG2052SG2068, 27.1 kb) contains 15 pseudogenes and thus encodes an incomplete flagellar apparatus.
Expression of functions in Sodalis typically associated with virulence in related pathogenic microbes
Conclusions Eubacterial genomes range in size from 0.45 Mb to over 10 Mb (Wernegreen 2002
The Sodalis genome still retains a robust DNA-repair system and extensive synteny with genomes of closely related enterics, possibly again indicative of its recent symbiotic affiliation. Comparative analysis of the putative proteomes of related Yersinia and Photorhabdus indicates that almost a third of Sodalis' capabilities are unique. About two-thirds of the unique gene set corresponds to phage-like sequences and symbiosis region genes, while the remainder comprises putative proteins with no known homologs in the databases. Their future functional characterization can provide insights into the exceptional aspects of Sodalis' symbiotic biology. Interestingly, Sodalis shares all of Wigglesworthia's gene contents except those encoding thiamine, cobalamine, and molybdopterin biosynthesis pathways. One of Wigglesworthia's presumed functions is the supplementation of tsetse's restricted diet with a plethora of vitamins known to be important for fertility (Nogge 1981
The genome of entomopathogenic Photorhabdus encodes many adhesins, toxins, hemolysins, proteases, and lipases and contains a wide array of antibiotic synthesizing genes. These products are likely to play a role in the elimination of competitors, as well as in colonization, invasion, and degradation of the host insect cadaver (Duchaud et al. 2003
Future Sodalis functional studies will be aided by the availability of an in vitro culture system, episomal and chromosomal transformation systems, and a well-established protocol for reconstituting flies with genetically modified symbionts. Finally, since Sodalis lives in close proximity to pathogenic trypanosomes, a paratransgenic strategy (where trypanocidal products are expressed in tsetse midgut by recombinant Sodalis) is entertained as a means to control parasite infections in the fly. The eventual replacement of natural parasite-susceptible vector populations with refractory flies harboring modified Sodalis could provide an additional strategy to reduce disease transmission (Rio et al. 2004
Insect and bacterial cultivation Glossina morsitans morsitans flies were maintained at 24°C with 55% relative humidity and received defibrinated bovine blood through an artificial membrane system every 48 h. The symbiont, S. glossinidius, was isolated from surface-sterilized pupae of G. m. morsitans and cultured on a feeder layer of Aedes albopictus C6/36 cells as described previously (Dale and Maudlin 1999
Genome sequence analysis
Bioinformatics Specific loci were designated as pseudogenes when, compared with their functional homologs in related microbes, they encoded small fragmented ORFs or were interrupted by the loss or addition of a large portion of DNA.
Gene expression analysis
We thank K. Furuya, C. Yoshino, A. Nakazawa, Y. Yamashita, and N. Itoh for technical assistance. This work was supported in part by Grant-In-Aid of the Ministry of Education, Culture, Sports, Science and Technology, Japan; Grant of the 21st Century COE Program, Ministry of Education, Culture, Sports, Science and Technology, Japan; and Research for the Future Program from the Japan Society for the Promotion of Science and NIH/NIAID grant AI-34033 and NSF/MCB 0237305.
Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4106106.
5 Corresponding authors. [Supplemental material is available online at www.genome.org. The sequence data from this study have been submitted to GenBank under accession nos. AP008232 [GenBank] , AP008233 [GenBank] , AP008234 [GenBank] , and AP008235 [GenBank] for the chromosome, and plasmids pSG1, pSG2, and pSG3, respectively.]
Akman, L. and Aksoy, S. 2001. A novel application of gene arrays: Escherichia coli array provides insight into the biology of the obligate endosymbiont of tsetse flies. Proc. Natl. Acad. Sci. 98: 7546-7551. Akman, L., Rio, R.V.M., Beard, C.B., and Aksoy, S. 2001. Genome size determination and coding capacity of Sodalis glossinidius, an enteric symbiont of tsetse flies, as revealed by hybridization to Escherichia coli gene arrays. J. Bacteriol. 183: 4517-4525. Akman, L., Yamashita, A., Watanabe, H., Oshima, K., Shiba, T., Hattori, M., and Aksoy, S. 2002. Genome sequence of the endocellular obligate symbiont of tsetse, Wigglesworthia glossinidia. Nat. Genet. 32: 402-407.[CrossRef][Medline] Aksoy, S. 1995. Molecular analysis of the endosymbionts of tsetse flies: 16S rDNA locus and over-expression of a chaperonin. Insect Mol. Biol. 4: 23-29.[Medline] . 2000. Tsetse: A haven for microorganisms. Parasitol. Today 16: 114-118.[CrossRef][Medline] Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402. Amann, R.I., Ludwig, W., and Schleifer, K.H. 1995. Phylogenetic identification and in situ detection of individual microbial cells without cultivation. Microbiol. Rev. 59: 143-169. Beard, C.B., O'Neill, S.L., Mason, P., Mandelco, L., Woese, C.R., Tesh, R.B., Richards, F.F., and Aksoy, S. 1993. Genetic transformation and phylogeny of bacterial symbionts from tsetse. Insect Mol. Biol. 1: 123-131.[Medline] Buchrieser, C., Glaser, P., Rusniok, C., Nedjari, H., D'Hauteville, H., Kunst, F., Sansonetti, P., and Parsot, C. 2000. The virulence plasmid pWR100 and the repertoire of proteins secreted by the type III secretion apparatus of Shigella flexneri. Mol. Microbiol. 38: 760-771.[CrossRef][Medline] Charles, H., Heddi, A., and Rahbe, Y. 2001. A putative insect intracellular endosymbiont stem clade, within the Enterobacteriaceae, infered from phylogenetic analysis based on a heterogeneous model of DNA evolution. C.R. Acad. Sci. III 324: 489-494. Chen, X.A., Li, S., and Aksoy, S. 1999. Concordant evolution of a symbiont with its host insect species: Molecular phylogeny of genus Glossina and its bacteriome-associated endosymbiont, Wigglesworthia glossinidia. J. Mol. Evol. 48: 49-58.[CrossRef][Medline] Cheng, Q. and Aksoy, S. 1999. Tissue tropism, transmission and expression of foreign genes in vivo in midgut symbionts of tsetse flies. Insect Mol. Biol. 8: 125-132.[CrossRef][Medline] Cheng, Q., Ruel, T.D., Zhou, W., Moloo, S.K., Majiwa, P., O'Neill, S.L., and Aksoy, S. 2000. Tissue distribution and prevalence of Wolbachia infections in tsetse flies, Glossina spp. Med. Vet. Entomol. 14: 44-50.[CrossRef][Medline] Cole, S.T., Eiglmeier, K., Parkhill, J., James, K.D., Thomson, N.R., Wheeler, P.R., Honore, N., Garnier, T., Churcher, C., Harris, D., et al. 2001. Massive gene decay in the leprosy bacillus. Nature 409: 1007-1011.[CrossRef][Medline] Dale, C. and Maudlin, I. 1999. Sodalis gen. nov. and Sodalis glossinidius sp. nov., a microaerophilic secondary endosymbiont of the tsetse fly Glossina morsitans morsitans. Int. J. Systematic Bacteriol. 49: 267-275. Dale, C. and Welburn, S.C. 2001. The endosymbionts of tsetse flies: Manipulating host-parasite interactions. Int. J. Parasitol. 31: 628-631.[Medline] Dale, C., Young, S.A., Haydon, D.T., and Welburn, S.C. 2001. The insect endosymbiont Sodalis glossinidius utilizes a type III secretion system for cell invasion. Proc. Natl. Acad. Sci. 98: 1883-1888. Dale, C., Plague, G.R., Wang, B., Ochman, H., and Moran, N.A. 2002. Type III secretion systems and the evolution of mutualistic endosymbiosis. Proc. Natl. Acad. Sci 99: 12397-12402. Dale, C., Jones, T., and Pontes, M. 2005. Degenerative evolution and functional diversification of type-III secretion systems in the insect endosymbiont Sodalis glossinidius. Mol. Biol. Evol. 22: 758-766. Delcher, A.L., Harmon, D., Kasif, S., White, O., and Salzberg, S.L. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 27: 4636-4641. Deng, W., Burland, V., Plunkett, G., Boutin, A., Mayhew, G.F., Liss, P., Perna, N.T., Rose, D.J., Mau, B., Zhou, S., et al. 2002. Genome sequence of Yersinia pestis KIM. J. Bacteriol. 184: 4601-4611. Duchaud, E., Rusniok, C., Frangeul, L., Buchrieser, C., Givaudan, A., Taourit, S., Bocs, S., Boursaux-Eude, C., Chandler, M., Charles, J.F., et al. 2003. The genome sequence of the entomopathogenic bacterium Photorhabdus luminescens. Nat. Biotechnol. 21: 1307-1313.[CrossRef][Medline] Foultier, B., Troisfontaines, P., Muller, S., Opperdoes, F.R., and Cornelis, G.R. 2002. Characterization of the ysa pathogenicity locus in the chromosome of Yersinia enterocolitica and phylogeny analysis of type III secretion systems. J. Mol. Evol. 55: 37-51.[CrossRef][Medline] Gordon, D., Desmarais, C., and Green, P. 2001. Automated finishing with autofinish. Genome Res. 11: 614-625. Hacker, C.S. and Kilama, W.L. 1974. The relationship between Plasmodium gallinaceum density and the fecundity of Aedes aegypti. J. Invert. Pathol. 23: 101-105.[CrossRef][Medline] Hao, Z., Kasumba, I., Lehane, M.J., Gibson, W.C., Kwon, J., and Aksoy, S. 2001. Tsetse immune responses and trypanosome transmission: Implications for the development of tsetse-based strategies to reduce trypanosomiasis. Proc. Natl. Acad. Sci. 98: 12648-12653. Hensel, M., Shea, J.E., Raupach, B., Monack, D., Falkow, S., Gleeson, C., Kubo, T., and Holden, D.W. 1997. Functional analysis of ssaJ and the ssaK/U operon, 13 genes encoding components of the type III secretion apparatus of Salmonella Pathogenicity Island 2. Mol. Microbiol. 24: 155-167.[CrossRef][Medline] Hu, Y. and Aksoy, S. 2005. An antimicrobial peptide with trypanocidal activity characterized from Glossina morsitans morsitans. Insect Biochem. Mol. Biol. 35: 105-115.[CrossRef][Medline] Kuhle, V. and Hensel, M. 2004. Cellular microbiology of intracellular Salmonella enterica: Functions of the type III secretion system encoded by Salmonella pathogenicity island 2. Cell Mol. Life Sci. 61: 2812-2826.[CrossRef][Medline] Lostroh, C.P. and Lee, C.A. 2001. The Salmonella pathogenicity island-1 type III secretion system. Microbes Infect. 3: 1281-1291.[CrossRef][Medline] Lowe, T.M. and Eddy, S.R. 1997. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25: 955-964. Ma, W.C. and Denlinger, D.L. 1974. Secretory discharge and microflora of milk gland in tsetse flies. Nature 247: 301-303.[CrossRef] Maudlin, I., Welburn, S.C., and Mehlitz, D. 1990. The relationship between rickettsia-like organisms and trypanosome infections in natural populations of tsetse in Liberia. Trop. Med. Parasitol. 41: 265-267.[Medline] Moran, N.A. 2002. Microbial minimalism: Genome reduction in bacterial pathogens. Cell 108: 583-586.[CrossRef][Medline] Nogge, G. 1981. Significance of symbionts for the maintenance of an optimal nutritional state for successful reproduction in haematophagous arthropods. Parasitology 82: 101-104. Parkhill, J., Dougan, G., James, K.D., Thomson, N.R., Pickard, D., Wain, J., Churcher, C., Mungall, K.L., Bentley, S.D., Holden, M.T., et al. 2001a. Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature 413: 848-852.[CrossRef][Medline] Parkhill, J., Wren, B.W., Thomson, N.R., Titball, R.W., Holden, M.T., Prentice, M.B., Sebaihia, M., James, K.D., Churcher, C., Mungall, K.L., et al. 2001b. Genome sequence of Yersinia pestis, the causative agent of plague. Nature 413: 523-527.[CrossRef][Medline] Rio, R.V., Hu, Y., and Aksoy, S. 2004. Strategies of the home-team: Symbioses exploited for vector-borne disease control. Trends Microbiol. 12: 325-336.[CrossRef][Medline] Sahu, S.N., Acharya, S., Tuminaro, H., Patel, I., Dudley, K., LeClerc, J.E., Cebula, T.A., and Mukhopadhyay, S. 2003. The bacterial adaptive response gene, barA, encodes a novel conserved histidine kinase regulatory switch for adaptation and modulation of metabolism in Escherichia coli. Mol. Cell. Biochem. 253: 167-177.[CrossRef][Medline] Sakiyama, T., Takami, H., Ogasawara, N., Kuhara, S., Kozuki, T., Doga, K., Ohyama, A., and Horikoshi, K. 2000. An automated system for genome analysis to support microbial whole-genome shotgun sequencing. Biosci. Biotechnol. Biochem. 64: 670-673.[CrossRef][Medline] Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28. Welburn, S.C. and Maudlin, I. 1999. Tsetse-typanosome interactions: Rites of passage. Parasitol. Today 15: 399-403.[CrossRef][Medline] Welburn, S.C., Maudlin, I., and Ellis, D.S. 1987. In vitro cultivation of rickettsia-like organisms from Glossina spp. Ann. Trop. Med. Parasitol. 81: 331-335.[Medline] Wernegreen, J.J. 2002. Genome evolution in bacterial endosymbionts of insects. Nat. Rev. Genet. 3: 850-861.[CrossRef][Medline] Xu, J., Bjursell, M.K., Himrod, J., Deng, S., Carmichael, L.K., Chiang, H.C., Hooper, L.V., and Gordon, J.I. 2003. A genomic view of the human Bacteroides thetaiotaomicron symbiosis. Science 299: 2074-2076. Zhou, D., Hardt, W.D., and Galan, J.E. 1999. Salmonella typhimurium encodes a putative iron transport system within the centisome 63 pathogenicity island. Infect. Immun. 67: 1974-1981.
Received May 5, 2005; accepted in revised format September 19, 2005. This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||