|
|
|
|
Published online before print
November 5, 2007, 10.1101/gr.6531807 Genome Res. 17:1809-1822, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00 OPEN ACCESS ARTICLE
Resource Comparative genome analysis across a kingdom of eukaryotic organisms: Specialization and diversification in the Fungi1 School of Computer Science, University of Manchester, Manchester M13 9PL, United Kingdom; 2 Faculty of Life Sciences, University of Manchester, Manchester M13 9PL, United Kingdom; 3 Department of Biosciences, University of Exeter, Exeter EX4 4QD, United Kingdom; 4 Department of Biochemistry, University of Cambridge, Cambridge CB2 1 GA, United Kingdom
The recent proliferation of genome sequencing in diverse fungal species has provided the first opportunity for comparative genome analysis across a eukaryotic kingdom. Here, we report a comparative study of 34 complete fungal genome sequences, representing a broad diversity of Ascomycete, Basidiomycete, and Zygomycete species. We have clustered all predicted protein-encoding gene sequences from these species to provide a means of investigating gene innovations, gene family expansions, protein family diversification, and the conservation of essential gene functions—empirically determined in Saccharomyces cerevisiae—among the fungi. The results are presented with reference to a phylogeny of the 34 fungal species, based on 29 universally conserved protein-encoding gene sequences. We contrast this phylogeny with one based on gene presence and absence and show that, while the two phylogenies are largely in agreement, there are differences in the positioning of some species. We have investigated levels of gene duplication and demonstrate that this varies greatly between fungal species, although there are instances of coduplication in distantly related fungi. We have also investigated the extent of orthology for protein families and demonstrate unexpectedly high levels of diversity among genes involved in lipid metabolism. These analyses have been collated in the e-Fungi data warehouse, providing an online resource for comparative genomic analysis of the fungi.
The Fungi represent a single eukaryotic kingdom, characterized by an osmotrophic growth habitat in which extracellular enzymes are secreted to break down complex substrates, the resulting simple sugars and amino acids being taken up by the growing fungus. Fungi exist in two distinct morphological growth forms, the unicellular yeasts (which grow by budding or simple fission) and the filamentous fungi (which produce polarized hyphal strands that aggregate to form a network called a mycelium). The osmotrophic growth habit of fungi is extremely effective for colonizing diverse habitats and has made the fungi the principal degraders of biomass in all terrestrial ecosystems (de Boer et al. 2005
The yeasts and filamentous fungi cover a huge evolutionary range. The Pezizomycotina (filamentous ascomycetes) and the Saccharomycotina (budding yeasts), for example, diverged from one another some 900–1000 million years ago (Mya) (Hedges et al. 2004
It is also striking that among the 431 published studies describing completed genome sequences (October 2006), 361 describe bacterial genomes, 28 Archaeal genomes, and only 42 are genomes from eukaryotic organisms (for compendium, see http://www.tigr.org/tdb/ and the genomes online database http://www.genomesonline.org). Of the eukaryotic genome sequences currently available, more than half come from the kingdom Fungi, making them the only credible candidates for a kingdom-wide exploration of genomic relatedness. In addition to sampling this vast evolutionary diversity, there are complete genome sequences now available for very closely related fungal species—for instance, the Saccharomyces "sensu stricto" species group whose members are so closely related that they can mate to produce viable hybrids (Naumov et al. 1997
The majority of the publicly available fully sequenced fungal genomes come from Ascomycete species, but we have also investigated two sequenced Basidiomycete fungi, Ustilago maydis and Phanerochaete chrysosporium, plus the Zygomycete Rhizopus oryzae and the Microsporidian Encephalitozoon cuniculi. In addition, we have included two non-fungal species, the Oomycetes Phytophthora sojae and Phytophthora ramorum. Oomycetes are heterokonts that diverged from the Fungi very early in the evolution of eukaryotes. Nevertheless, in terms of their biological activity, they demonstrate convergent evolution with fungal plant pathogens. For example, oomycetes share a filamentous, osmotrophic growth habit and are capable of invasive growth and colonization of living plant tissue (Money et al. 2004 We have used the features of e-Fungi to ask some fundamental questions about fungal evolutionary biology, including patterns of gene loss and gene duplication, extent of protein family conservation among the kingdom Fungi, and the distinctive genomic features that separate yeasts and filamentous species. Our study represents the most comprehensive analysis of the kingdom Fungi carried out to date and has highlighted some surprising levels of hitherto unrecognized functional and biochemical diversity among fungal species.
Comparative genomics of the Fungi Our first aim was to organize all available genomic data into a form that could be readily used to assess the evolutionary relatedness among fungal species and also to generate clusters of related sequences. The 36 genomes used in this analysis are listed in Table 1. There are clearly large differences in the size of fungal genomes and the number of predicted protein sequences. They range in size from 2.5 Mb, encoding 1996 predicted open reading frames (ORFs), for the microsporidian E. cuniculi, to >46 Mb, encoding some 17,467 ORFs, for R. oryzae. Among the Ascomycete species, the Pezizomycotina have larger genomes than the Saccharomycotina: in general, the Saccharomycotina genomes have sizes between 8 and 12 Mb, while the Pezizomycotina are between 29 and 39 Mb. The exception is Yarrowia lipolytica, which has a 20-Mb genome—nearly twice as large as that of the next largest in the Saccharomycotina. However, despite being larger, the Y. lipolytica genome does not appear to encode significantly more protein sequences than the other Saccharomycotina genomes. There are 6544 ORFs predicted for Y. lipolytica compared to 6317 in Debaromyces hansenii.
In order to generate clusters of proteins that are conserved among particular fungal species, translated amino acid sequences were first compared using BLASTP (Altschul et al. 1990
Phylogenies of the fungi based on concatenated protein sequences and Dollo parsimony
Protein families were selected for constructing the sequence similarity trees using protein clusters that spanned all the genomes but showed minimal duplication within genomes. The proteins obtained (see Supplemental Table 1) are generally involved in fundamental processes such as translation, transcription, and DNA replication and recombination. Because of the great divergence between species in this analysis, and the small size of the E. cuniculi genome, two sets of protein families were used to resolve the phylogeny. A well-resolved broader species tree based on concatenated sequences from 12 protein families, which included Homo sapiens and the model plant species Arabidopsis thaliana, clearly separates fungal, animal, and plant taxa with maximum bootstrap support, as shown in Figure 1A. The oomycete species are shown to have diverged prior to the divergence of fungi and animals. This is in agreement with previous phylogenetic analysis based on small subunit rRNA sequences (Baldauf et al. 2000
A second set of 29 protein families was used to generate a tree that resolved the deeper nodes, as shown in Figure 1B. Using this larger protein set, high levels of support have been achieved for the majority of branches. The tree shows that the divergence of the Zygomycetes followed by that of the Basidiomycetes mark the earliest bifurcations in the fungal tree, followed by the split of S. pombe (Taphrinomycotina) from the rest of the Ascomycetes. The Ascomycetes then divide into the Pezizomycotina and the Saccharomycotina. There is support for this tree from previous sequence-based phylogenetic studies. These results are also consistent with other recent phylogenies of the fungi (Fitzpatrick et al. 2006
Phylogenetic profiling, which uses the presence and absence of derived character states such as genes and gene families, is commonly used for tree construction (Fitz-Gibbon and House 1999
Gene duplication in the Fungi
A previous study (Dujon et al. 2004 Gene duplication among the Pezizomycotina, in general, appears to be slightly higher than among the Saccharomycotina. The exceptions are N. crassa and C. immitis (372 and 374 clusters, respectively), which both possess fewer duplication-containing clusters than S. cerevisiae. Among the Aspergillus genomes, Aspergillus niger and Aspergillus oryzae (which are positioned next to each other on the phylogenetic tree) possess the most duplication-containing clusters. In the Basidiomycetes, duplication in the P. chrysosporium genome (885 clusters) appears much higher than in that of U. maydis (300 clusters). There is little duplication in the E. cuniculi genome, a likely reflection of its very small genome size. In contrast, the Zygomycete R. oryzae possesses by far the most duplication clusters (2481) of all the fungi analyzed, almost three times as many as the next highest, P. chrysosporium. In order to ensure that these results were not due to the presence of transposons, the analysis was repeated with protein clusters containing likely transposons removed. This did not greatly alter the results of the analysis (see Supplemental Fig. 1B); although some fungal genomes appear to contain large numbers of transposons, the sequence similarity of these elements means that they occur in only a small number of clusters.
As discussed above, the low number of duplications in the N. crassa genome is a result of RIP activity. Orthologs of the enzyme responsible, a DNA methyltransferase encoded by the rid gene, have been identified in other Pezizomycotina (Galagan and Selker 2004
Gene duplication can lead to an organism possessing genes of novel function, potentially improving adaptation to its environment. It has therefore been proposed that duplication of different sets of genes may promote speciation (Lynch and Force 2000
We have investigated parallel duplication between pairs of fungal species by identifying instances of coduplication in protein clusters. The frequency with which these clusters occurred was compared to the expected frequency based on duplications occurring independently in each species using a
To investigate the genetic basis of the yeast and filamentous growth habits by fungi, we analyzed Pfam motifs associated with proteins from Saccharomycotina and Pezizomycotina species, to identify those motifs for which there is expansion in protein numbers for the Pezizomycotina compared to the Saccharomycotina and vice versa. Table 3 lists examples of the motifs identified. Many of the motifs expanded in the Pezizomycotina indicate increased metabolic flexibility compared to the Saccharomycotina. For example, there are expansions in protein families involved in transport into and out of the cell. Major facilitator superfamily transporters (PF07690), involved in the transport of small solutes, including sugar uptake and drug efflux (Maiden et al. 1987
In addition to the expansion in Pfam motifs associated with responses to environmental stresses and resources, there is an expansion in the motifs associated with regulation of gene expression. Analysis of 84 Pfam motifs associated with DNA binding shows that, on average, Pezizomycotina species possess almost twice as many proteins containing these motifs as Saccharomycotina species. This expansion includes the most frequently identified Pfam motifs—PF00172, PF04082, PF00096, PF00098, and PF00170.
Protein clusters that have expanded in the Saccharomycotina compared to the Pezizomycotina are rarer, a likely reflection of the fact that the Saccharomycotina have fewer predicted ORFs. The four clusters that we have identified are all associated with cell wall structure and biosynthesis. There is a large expansion in the number of proteins containing PIR motifs (PF00399). These motifs directly link to the 1,3-beta-glucan and are important for cell wall anchoring (Kapteyn et al. 1999
Protein family conservation and diversification among the Fungi
The composition of protein clusters encoded by S. cerevisiae and conserved in 4–6, 9–11, and 13–15 species has been determined. For the clusters in these peaks, each genome was scored as present or absent, and then the percentage of clusters in which a given genome was present was calculated. The results are displayed in Figure 2B. The clusters containing 4–6 genomes are clearly composed of proteins from the most closely related members of the Saccharomyces "sensu stricto" complex—Saccharomyces paradoxus, Saccharomyces mikatae, Saccharomyces bayanus, and Saccharomyces kudriavzevii. It is apparent that S. kudriavzevii has a lower score than the other "sensu stricto" genomes, reflecting the lower number of ORFs reported for this species (Cliften et al. 2003 11% and 1.4%, respectively). It appears, therefore, that the peak at 4–6 genomes represents a group of proteins specific to members of the Saccharomyces "sensu stricto" complex, which are largely absent from other budding yeasts.
We analyzed a set of 977 clusters containing essential S. cerevisiae proteins. The distribution of cluster sizes obtained using this set is different from that obtained using all S. cerevisiae proteins, the peak at 4–6 proteins is absent (see Supplemental Fig. 4). Only three essential genes that are peculiar to the Saccharomyces "sensu stricto" clade were identified: KAR1 (YNL188W), SPC29 (YPL124W), and TEN1 (YLR010C). Kar1p is an essential protein involved in nuclear fusion during mating and in spindle pole body duplication during mitosis (Pereira et al. 1999 Analysis of protein clusters containing 9–11 genomes shows that proteins from S. castellii and S. kluyveri now occur in the majority of the clusters, along with those from C. glabrata, E. gossypii, and K. lactis. The fraction of clusters in which a genome is found reduces sharply at C. albicans and D. hansenii. Again, this is consistent with the phylogenetic relationships. Analysis of clusters containing 13–15 genomes shows that C. albicans, D. hansenii, and Y. lipolytica now occur in the majority of the clusters, and the fraction now reduces sharply at the Pezizomycotina. It is interesting to note that despite S. pombe, P. chrysosporium, U. maydis, and R. oryzae being more diverged from S. cerevisiae than these filamentous fungi, they occur in a greater fraction of these protein clusters. This result is in agreement with the positioning S. pombe closer than expected to S. cerevisiae in the Dollo parsimony tree and the higher than expected coduplication results for S. cerevisiae, S. pombe, and R. oryzae. It suggests that targeted gene loss may have occurred, indicating that some non-Ascomycete species have retained some sequences found in the Saccharomycotina that have been lost in the Pezizomycotina. Gene Ontology annotation of proteins was used to identify GO terms that are over-represented in clusters associated with Saccharomycotina (those containing 1–15 species) compared to those associated with all fungal species (containing 30–36 species). There is an inherent bias in this analysis, since proteins that do not possess orthologs beyond the Saccharomycotina are less likely to have GO annotations. For Saccharomycotina-specific clusters, GO terms associated with the fungal cell wall (GO:0009277), with Saccharomycotina-specific transcription factors (GO:0003704, GO:0016455), and components of the spindle pole body (GO:0005816) appear to be over-represented. As Figure 2A shows, the frequency distribution of protein clusters obtained for S. pombe is quite different from that obtained for the budding yeasts. There is still the broad peak on the righthand side, representing proteins that are common to many or all fungal genomes, but there are no sharp peaks representing branches in the phylogenetic tree. Instead, there is a broad peak between six and 15 genomes. Further analysis shows that this is actually several overlapping peaks. Figure 2C shows the species represented in clusters containing either S. pombe and S. cerevisiae proteins, or proteins from S. pombe and the filamentous fungus G. zeae. The results show clearly that these clusters can be divided into subgroups of budding-yeast-specific proteins and filamentous-fungi-type proteins. This implies that, although S. pombe split from the rest of the fungi prior to the divergence of the budding yeasts and filamentous fungi, it has retained a set of budding-yeast-type proteins (which were subsequently lost from the filamentous fungi) and a set of filamentous-fungi-type proteins (which have been lost from the budding yeasts). The GO annotations associated with proteins in these clusters were also analyzed. There are 102 clusters specific to S. pombe and the Saccharomycotina compared to 464 specific to S. pombe and the Pezizomycotina. Among the S. pombe and Saccharomycotina clusters, we identified four containing nuclear pore proteins (GO:0006913). There were no S. pombe/Pezizomycotina clusters with this annotation. Among the S. pombe/Pezizomycotina clusters, we identified 16 clusters associated with cellular transport (GO:0006810), 16 with DNA repair (GO:0006281), nine with meiosis, and seven associated with progression through S phase in the mitotic cell cycle (GO:0000084). Analysis of clusters containing Y. lipolytica proteins shows that it also contains a set of Pezizomycotina-associated proteins that have been lost from other Saccharomycotina species (see Supplemental Fig. 5).
Analysis of fatty acid β-oxidation pathways in the Fungi
Fatty acid β-oxidation is a process that degrades fatty acids into acetyl-CoA, which is then fed into the TCA cycle to produce ATP or (in plants and fungi) can be used to synthesize carbohydrates via the glyoxylate cycle and gluconeogenesis. Fatty acids are activated by the addition of coenzyme A, in a reaction catalyzed by fatty acyl-CoA ligases that is often concomitant with transport. Subsequent steps bring about the complete oxidation of the β-carbon and the release of acetyl-CoA. The resulting acyl-CoA, shortened by two carbon units, can undergo additional β-oxidation cycles. In mammals, β-oxidation occurs in both the mitochondria and peroxisomes (Kunau et al. 1995
By surveying the available fungal genomes for the presence or absence of acyl-CoA dehydrogenases and acyl-CoA oxidases, it can be established whether each organism has peroxisomal or non-peroxisomal pathways for the β-oxidation of fatty acids (Fig. 3). The results suggest that all members of the class Saccharomycetes, except Y. lipolytica, possess only the peroxisomal β-oxidation pathway. The S. cerevisiae genome contains only one acyl-CoA oxidase gene, which encodes an enzyme that can accept fatty acid chains of differing lengths (Hiltunen et al. 2003
It is striking, however, that the fission yeast S. pombe and the obligate intracellular parasite E. cuniculi do not seem to possess either β-oxidation pathway. This observation has been confirmed by the lack of a homolog of the S. cerevisiae FOX2 gene (which encodes a multifunctional β-oxidation protein) in the genomes of either of these two fungi. The lack of β-oxidation has been noted before in the case of E. cuniculi and is not surprising, since this unicellular eukaryote does not possess mitochondria or peroxisomes (Katinka et al. 2001
The genome of the dimorphic yeast Y. lipolytica contains six acyl-CoA oxidase-encoding genes. Five have been cloned and their products shown to have different substrate chain-length preferences (Wang et al. 1999
The fact that mammals and many fungi possess both peroxisomal and non-peroxisomal β-oxidation pathways suggests that, during the evolution of some species of fungi, one or both of these pathways have been lost. Relating this to the fungal phylogeny generated in this study (Fig. 1), the evidence suggests that non-peroxisomal β-oxidation was lost after the divergence of the ancestor of Y. lipolytica from the rest of the Saccharomycetes. Strictly peroxisomal β-oxidation seems to have been lost after the divergence of the ancestor of N. crassa and M. grisea from the ancestor of G. zeae, as well as in the lineage of T. reesei, and may have been replaced with the glyoxysomal derivative of this metabolic pathway. Interestingly, recent evidence has shown that M. grisea still requires the peroxisomal biogenesis machinery to carry out fatty acid β-oxidation; therefore, the compartments are highly conserved (Bhambra et al. 2006
Comparative genomics is a powerful technique for identifying orthologous genes in different species (Tatusov et al. 1997 We have chosen to construct a bioinformatics platform for carrying out comparative functional genomics with the Fungi for several reasons. The Fungi are a group of organisms of great scientific, medical, agricultural, and industrial importance. There is a wealth of genome sequence and EST data for the Fungi, including for important model organisms (e.g., S. cerevisiae, S. pombe, and N. crassa), human (e.g., C. albicans, A. fumigatus), and plant (e.g., M. grisea, U. maydis) pathogens, as well as major process organisms in industry (e.g., A. niger), and others that play major ecological roles in the recycling of plant materials (e.g., P. chrysosporium). Moreover, techniques for the genetic manipulation of many of these fungi have been developed and are rapidly improving in both efficiency and scope. Finally, it is only in the Fungi that it is possible, at the moment, to perform comparative genomic analyses over an entire eukaryotic kingdom. The fungal kingdom shows great diversity, and thus comparisons at both short and long evolutionary distances may be made. This holds the prospect that the results of our analyses, and the tools that we develop to perform them, will have both general biological interest and utility as further eukaryotic genome sequences are generated.
The e-Fungi data warehouse is an object-oriented data warehouse based on the Genome Information Management System (GIMS) architecture (Cornell et al. 2003 The Fungi are not only an evolutionarily diverse group, but also their genomes span a size range of some 18-fold from the smallest (that of the microsporidian E. cuniculi) to the largest (the Zygomycete, R. oryzae). Our whole-genome-based phylogenetic analysis demonstrates that the Oomycetes are only very distant relatives of the fungi, as previously predicted from SSU rDNA analysis. Oomycetes diverged from the most recent common ancestor of both groups much earlier than the split between the fungi and the plants. Moreover, it is interesting to note that, despite mycology being the traditional province of departments of botany, animals are more closely related to the true fungi than plants.
For all this, the influence of overall size on the construction of whole-genome phylogenies should not be overlooked. Differences in total gene numbers complicate the construction of whole-genome phylogenies. E. cuniculi could only be placed in the coarse-grained tree, based on 12 protein families (Fig. 1A), since we could not find sufficient representatives in this genome of the 29 families used in the higher-resolution tree (Fig. 1B). Yet other problems associated with total gene numbers arise when the gene presence/absence criterion of the Dollo parsimony approach is used. Here, there were difficulties in the resolution of the Saccharomyces "sensu stricto" clade, which were the consequence of the major differences in gene annotation between two different sequencing centers, despite their very similar sequence coverage. This emphasizes that the genomes chosen as the model for the construction of microarrays for sequence comparisons using the CGH approach (Edwards-Ingram et al. 2004 The relative sizes of genomes reflects, at least in part, differences in the levels of gene duplication between species—a contention that is confirmed by the fact that E. gossypii has both the smallest genome and the lowest level of gene duplication of any of the Saccharomycotina. In general, among the Ascomycetes, the filamentous Pezizomycotina have greater levels of gene duplication than do the yeast-like Saccharomycotina. However, it appears that the levels of duplication in the genome of R. oryzae far exceed those in any of the fungal species in this study. As well as differences in the amounts of gene duplication there appear to be differences in the loss of duplicates. Analysis of sequence similarities within Pezizomycotina species suggests differences in the amount of RIP activity, and there appear to be differences in gene loss following the WGD in the genomes of S. cerevisiae and C. glabrata. We have also demonstrated that there is evidence of coevolution, as measured by coduplication for S. cerevisiae and S. pombe and, unexpectedly, for S. cerevisiae and R. oryzae. Protein clusters that are expanded in the Pezizomycotina, in comparison to the Saccharomycotina, include various families of transport proteins, proteins involved in the utilization of different carbon sources, and transcription factors. All of this reflects the greater metabolic versatility and broader substrate range of the filamentous Ascomycetes compared to the yeasts. An exception to this general rule is the dimorphic Saccharomycotina Y. lipolytica, which shares a number of gene families with the Pezizomycotina that have been lost from other members of the Saccharomycotina. Protein clusters that are expanded in the Saccharomycotina, as compared to the Pezizomycotina, are (predictably) rare, and the examples that we found are involved in cell wall structure. This may reflect the very different chemical composition of the cell walls of yeasts and fungal hyphae. This point is emphasized by the fact that the genomes of both filamentous Ascomycetes and Basidiomycetes have expanded families of chitosanases—chitin is the principal polysaccharide of hyphal cell walls, and chitosan is found at discrete stages of the development of most fungi, but both are only minor components of yeast cell walls.
The representation of the different fungal species among the various protein clusters that we have defined is instructive. The protein products of essential genes of S. cerevisiae (Giaever et al. 2002 The study has also highlighted the metabolic diversity of fungi reflected in the gene inventories associated with particular metabolic processes such as fatty acid β-oxidation, which appears to be absent as a process in the Taphrinomycotina S. pombe, and compartmentalized distinctly in the budding yeasts and filamentous fungi. This demonstrates that comparative genomic analysis can reveal new fundamental characteristics of even well-studied fungal species such as S. pombe. In summary, the e-Fungi data warehouse has been designed to provide an e-science framework for analysis of the diverse functional genomic information currently being generated for the Fungi. Using the transcriptional profiling, proteomic, and deletion-mutant data that have been extensively compiled for S. cerevisiae as a reference set, we aim to build a resource into which newly acquired data from filamentous fungi, including pathogenic species and industrially relevant fungi, can be integrated. The ability to carry out extensive, multifaceted querying is a key feature of the data warehouse, which is being implemented using e-science protocols that harness data stored remotely in distinct databases, allowing them to be interrogated at a single site. This study has provided the first evidence of the utility of such a resource for investigating fundamental features of the fungi and exploring their genetic and genomic relatedness.
Orthology assignments Predicted protein sequences for 36 genomes were downloaded from respective sequencing project repositories (see Table 1). All-against-all BLAST searches (Altschul et al. 1990
Formulating data sets for fungal species tree
Phylogenetic analysis procedure to generate fungal species trees
Dollo parsimony analyses
Pfam domain assignments
Identification of Pfam domains expanded in the Pezizomycotina and Saccharomycotina
Identification of transposon sequences in fungal species
Identification of over-represented GO terms
We thank the Broad Institute and the DOE Joint Genome Institute for releasing data ahead of publication. We acknowledge the financial support of the Biotechnology and Biological Sciences Research Council (BBSRC). The development of the e-Fungi database has been funded by the BBSRC Bioinformatics and e-Science programme II.
5 Present address: Department of Biochemistry, University of Cambridge, Cambridge CB2 1 GA, UK.
E-mail Steve.oliver{at}mole.bio.cam.ac.uk; fax 44-1223-766-002. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6531807
Alam, I., Fuellen, G., Rehmsmeier, M., and Dress, A. 2004. Comparative homology agreement search: An effective combination of homology-search methods. Proc. Natl. Acad. Sci. 101: 13814–13819. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403–410.[CrossRef][Medline] Baldauf, S.L., Roger, A.J., Wenk-Siefert, I., and Doolittle, W.F. 2000. A kingdom-level phylogeny of eukaryotes based on combined protein data. Science 290: 972–977.[CrossRef][Medline] Bhambra, G.K., Wang, Z.-Y., Soanes, D.M., Wakley, G.E., and Talbot, N.J. 2006. Peroxisomal carnitine acetyl transferase is required for elaboration of penetration hyphae during plant infection by Magnaporthe grisea. Mol. Microbiol. 61: 46–60.[CrossRef][Medline] Bowman, S.M. and Free, S.J. 2006. The structure and synthesis of the fungal cell wall. Bioessays 28: 799–808.[CrossRef][Medline] Castillo, L., Martinez, A.I., Garcera, A., Elorza, M.V., Valentin, E., and Sentandreu, R. 2003. Functional analysis of the cysteine residues and the repetitive sequence of Saccharomyces cerevisiae Pir4/Cis3: The repetitive sequence is needed for binding to the cell wall beta-1,3-glucan. Yeast 20: 973–983.[CrossRef][Medline] Cliften, P.F., Hillier, L.W., Fulton, L., Graves, T., Miner, T., Gish, W.R., Waterston, R.H., and Johnston, M. 2001. Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res. 11: 1175–1186. Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston, R., Cohen, B.A., and Johnston, M. 2003. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 301: 71–76. Cornell, M., Paton, N.W., Hedeler, C., Kirby, P., Delneri, D., Hayes, A., and Oliver, S.G. 2003. GIMS: An integrated data storage and analysis environment for genomic and functional data. Yeast 20: 1291–1306.[CrossRef][Medline] de Boer, W., Folman, W.B., Summerbell, R.C., and Boddy, L. 2005. Living in a fungal world: The impact of fungi on soil bacterial niche development. FEMS Microbiol. Rev. 29: 795–811.[CrossRef][Medline] Deng, J., Carbone, I., and Dean, R.A. 2007. The evolutionary history of cytochrome P450 genes in four filamentous Ascomycetes. BMC Evol. Biol. 7: 30. doi: 10.1186/1471-2148-7-30.[CrossRef][Medline] Dujon, B., Sherman, D., Fischer, G., Durrens, P., Casaregola, S., Lafontaine, I., De Montigny, J., Marck, C., Neuveglise, C., Talla, E., et al. 2004. Genome evolution in yeasts. Nature 430: 35–44.[CrossRef][Medline] Eddy, S.R. 1998. Profile hidden Markov models. Bioinformatics 14: 755–763. Edwards-Ingram, L.C., Gent, M.E., Hoyle, D.C., Hayes, A., Stateva, L.I., and Oliver, S.G. 2004. Comparative genomic hybridization provides new insights into the molecular taxonomy of the Saccharomyces "sensu stricto" complex. Genome Res. 14: 1043–1051. Elliott, S., Knop, M., Schlenstedt, G., and Schiebel, E. 1999. Spc29p is a component of the Spc110p subcomplex and is essential for spindle pole body duplication. Proc. Natl. Acad. Sci. 96: 6205–6210. Farris, J.S. 1977. Phylogenetic analysis under Dollos Law. Syst. Zool. 26: 77–88. Felsenstein, J. 1978. Cases in which parsimony or compatibility methods will be positively misleading. Syst. Zool. 27: 401–410. Felsenstein, J. 1989. PHYLIP—Phylogeny inference package (version 3.2). Cladistics 5: 164–166. Finn, R.D., Mistry, J., Schuster-Bockler, B., Griffiths-Jones, S., Hollich, V., Lassmann, T., Moxon, S., Marshall, M., Khanna, A., Durbin, R., et al. 2006. Pfam: Clans, web tools and services. Nucleic Acids Res. 34: D247–D251. doi: 10.1093/nar/gkj149. Fitz-Gibbon, S.T. and House, C.H. 1999. Whole genome-based phylogenetic analysis of free-living microorganisms. Nucleic Acids Res. 27: 4218–4222. Fitzpatrick, D.A., Logue, M.E., Stajich, J.E., and Butler, G. 2006. A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis. BMC Evol. Biol. 6: 99. doi: 10.1186/1471-2148-6-99.[CrossRef][Medline] Galagan, J.E. and Selker, E.U. 2004. RIP: The evolutionary cost of genome defense. Trends Genet. 20: 417–423.[CrossRef][Medline] Galagan, J.E., Henn, M.R., Ma, L.J., Cuomo, C.A., and Birren, B. 2006. Genomics of the fungal kingdom: Insights into eukaryotic biology. Genome Res. 15: 1620–1631.[CrossRef] Giaever, G., Chu, A.M., Ni, L., Connelly, C., Riles, L., Veronneau, S., Dow, S., Lucau-Danila, A., Anderson, K., Andre, B., et al. 2002. Functional profiling of the Saccharomyces cerevisiae genome. Nature 418: 387–391.[CrossRef][Medline] Giles, P.F., Soanes, D.M., and Talbot, N.J. 2003. A relational database for the discovery of genes encoding amino acid biosynthetic enzymes in pathogenic fungi. Comp. Funct. Genomics 4: 4–15.[CrossRef] Goffeau, A. 2004. Evolutionary genomics: Seeing double. Nature 430: 25–26.[CrossRef][Medline] Goto, S., Okuno, Y., Hattori, M., Nishioka, T., and Kanehisa, M. 2002. LIGAND: Database of chemical compounds and reactions in biological pathways. Nucleic Acids Res. 30: 402–404. Grandin, N., Damon, C., and Charbonneau, M. 2001. Ten1 functions in telomere end protection and length regulation in association with Stn1 and Cdc13. EMBO J. 20: 1173–1183.[CrossRe | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||