|
|
|
|
Published online before print
March 9, 2007, 10.1101/gr.6111907 Genome Res. 17:492-502, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Letter Computational and experimental approaches double the number of known introns in the pathogenic yeast Candida albicansDepartment of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, California 94143-2200, USA
Candida albicans is the most common fungal pathogen of humans. Frequently found as a commensal within the digestive tracts of healthy individuals, C. albicans is an opportunistic pathogen that causes a wide variety of clinical syndromes in immuno-compromised individuals. A comprehensive annotation of the C. albicans genome sequence was recently published. Because many C. albicans coding sequences are interrupted by introns, proper intron annotation is essential for the accurate definition of genes in this pathogen. Intron annotation is also important for identifying potential targets of splicing regulation, a common mechanism of gene control in eukaryotes. In this study, we report an improved annotation of C. albicans introns. In addition to correcting the existing intron annotations, 25% of which were incorrect, we have used novel computational and experimental approaches to identify new introns, bringing the total to 415, almost double the number previously known. Our identification methods focus primarily on intron features rather than protein-coding features, overcoming biases of traditional intron annotation methods. Introns are not randomly distributed in C. albicans, and are over-represented in genes involved in specific cellular processes, such as splicing, translation, and mitochondrial respiration. This nonrandom distribution suggests functional roles for these introns, and we demonstrate that splicing of two transcripts whose introns have unusual sequence features is responsive to environmental factors.
The yeast Candida albicans commonly inhabits the mucous membranes and digestive tracts of healthy individuals. Perturbation of a hosts immune defenses, however, can cause a dramatic shift to invasive, pathogenic growth. Particularly susceptible are those receiving immunosuppressive therapies, such as cancer and transplant patients (Kullberg and Filler 2002
Eukaryotic genes are often interrupted by introns, which must be spliced out of gene transcripts for coding sequences to be fully expressed. The regulation of intron splicing can also play an important role in controlling gene expression (for reviews, see Black 2003
The genome of C. albicans strain SC5314 was recently sequenced using a whole-genome shotgun approach (Jones et al. 2004
Accurate intron identification has long been a challenge for genome annotators (for review, see Brent 2005
The primary reliance on coding-sequence alignments introduces serious biases to the identification of introns. In particular, introns in poorly conserved genes or near the ends of even well-conserved genes are easily overlooked using traditional alignment-based methods. This bias may be particularly problematic for the Hemiascomycetes; in S. cerevisiae, introns very close to the 5'ends of coding sequences are quite common (Spingola et al. 1999 Given the inherent difficulties with traditional methods of intron identification, it is not surprising that the existing C. albicans intron annotations are far from complete. In the course of correcting the existing intron annotations, it became apparent to us that many had been overlooked. This prompted us to undertake a genome-wide search for undiscovered introns, including in our analyses two methods that do not rely on strong coding-sequence conservation. Here, we present a high-confidence set of 415 introns, almost doubling the number of known introns in C. albicans. We believe that this represents a nearly complete catalog of C. albicans introns. We show that these introns are not randomly distributed, but are dramatically over-represented in genes within specific functional categories. This nonrandom distribution suggests that some introns are under selective pressure, perhaps for roles in regulating gene expression. To test this idea, we have examined two genes, RPL30 and SPR28, whose introns have unusual sequence features, and have shown that their splicing is, indeed, responsive to environmental signals.
Refinement of intron predictions in published annotations Based on the recently published annotation of the C. albicans genome (Braun et al. 2005
Many of the originally annotated introns did not pass our filters and were removed from our assignments. Most often these questionable introns were introduced by previous annotators to correct for stop codons or frameshifts within the coding sequences. DNA alignments with the close relative Candida dubliniensis suggest that many of these stop codons and frameshifts are simply the result of sequencing errors, without which the open reading frames would be continuous in the absence of splicing. Throughout our studies, we found that such errors are quite common in the C. albicans genome sequence, in contrast to the sequenced genomes of other Candida species, such as C. dubliniensis (http://www.sanger.ac.uk) and Candida tropicalis (http://www.broad.mit.edu).
Overall, we found that 59 of the published intron annotations (
Candidate gene approach for intron identification
We were able to identify putative C. albicans orthologs for 70% of S. cerevisiae intron-containing genes. Because many genes have duplicated since the divergence of these two yeasts, particularly in the S. cerevisiae lineage (Byrne and Wolfe 2005
Nine of the introns we identified using our candidate approach reside outside of the coding sequences, within the 5'-untranslated regions (UTRs) of the RNAs. 5'-UTR introns can be difficult to validate without direct experimental data, as they are not surrounded by the protein-coding sequences that normally lend support to an intron assignment. In S. cerevisiae, however, the few 5'-UTR introns that have been identified have strong phylogenetic and experimental support (Spingola et al. 1999
Experimental approach for intron identification
In the process of pre-mRNA splicing, introns are liberated as lariat structures, in which the 5'-end is joined by a 5'2'-phosphodiester linkage to an internal branch point nucleotide (Fig. 2A). Lariat structures are subsequently linearized by a debranching enzyme, Dbr1, and rapidly degraded from their free 5'- and 3'-ends by exonucleases. Deletion of DBR1 in S. cerevisiae results in a dramatic accumulation of branched intron sequences, resistant to exonucleolytic degradation because they lack free ends (Chapman and Boeke 1991
To identify introns that accumulate in our dbr1 mutant, we designed a microarray with probes flanking all predicted C. albicans open reading frames (for details, see Methods). While such an array is not comprehensive, we reasoned that the most likely locations for undiscovered introns would be adjacent to predicted coding sequences. If a probe does hybridize to an expressed intron, it should show increased signal from dbr1 RNA compared to wild-type RNA, provided the probe is of sufficient sequence complexity to give a signal with specificity for the intron (Fig. 2B).
We synthesized cDNA from RNA from both wild-type and dbr1 We manually inspected the remaining sequences that exhibited a greater than twofold increase, using our phylogenetic criteria to look for independent evidence of introns in the nearby genes (Fig. 2B). We were able to identify 26 new introns with this approach (Supplemental Table S1). Many of the newly discovered introns were confirmed by hybridization to multiple probes. Only 15% of the probes with a greater than twofold increase identified regions in which we could find no independent evidence of an intron. The relative expression values for these false positives all fell within the bottom quartile of up-regulated probes.
In addition to identifying candidate introns, our microarray data also provide corroborating evidence that a sequence is spliced. Eleven of the 26 new introns we identified in our dbr1
Because our original microarray compared only yeast growing in standard media at 30°C, it would fail to identify introns that are not spliced or whose genes are not expressed in this condition. We therefore grew wild-type and dbr1
Global computational approach for intron identification
Our first approach was to model the C. albicans intron structure with a Hidden Markov Model (HMM), as has been done previously to highlight common features of introns in S. cerevisiae (Spingola et al. 1999 We generated a set of scored intron predictions, applying our algorithm to all predicted C. albicans open reading frames (ORFs). For every ORF, we included both the predicted coding sequence and 500 nt of upstream sequence. Because of the short length and 5' bias of most introns, we reasoned this would likely capture the majority of undiscovered intron sequences. We ranked all genes according to their intron scores, and inspected the top-scoring genes manually for new introns (Fig. 1). For each gene, we used the intron predicted by our algorithm as a starting point, but did not limit our inspection to this sequence. Importantly, our algorithm effectively distinguished the majority of known intron-containing genes from most of the rest of the genome (Fig. 3). More than 90% of our previously identified intron-containing genes scored within the top 6% of genes.
We inspected the remaining genes that received high scores from our algorithm, beginning with the top score and working down until false positives predominated (see below). Among the genes that scored within the top 6%, we were able to identify 94 new introns that passed our criteria for strong phylogenetic support (outlined in Fig. 1). The intron predictions that accompanied these scores accurately identified the precise splice junctions more than 85% of the time. In the remaining cases, the failure was almost always in predicting the correct 3'-splice site. This is not surprising, given that the 3'-splice site has the lowest information content, and accurate annotation often requires information from phylogenetic comparisons. Because we included upstream sequence to generate our intron predictions, several of the top scores were due to introns in upstream genes. In two cases, this identified new genes that had been missed in the genome annotation. Both genes are <100 codons in length and, had we not identified the introns first, would have been easily overlooked. One, orf19.2965.1, is a homolog of Homo sapiens SERF2, and the other, orf19.3223.1, encodes the 12-kDa subunit of mitochondrial NADH-ubiquinone oxidoreductase. Six of the intron predictions within the top 6% would reside within 5'-UTRs. Because these introns lack adjacent coding sequence, they failed to meet our criteria for strong phylogenetic support. To determine whether any of these were actually introns, we tested them directly, using RT-PCR to span the predicted splice junctions. We found that all of them were, indeed, spliced (Supplemental Fig. S1A).
Finally, some of the predictions for the genes we examined are clearly not introns. These false predictions are generally inconsistent with phylogenetic datathe putative splice sites are not conserved in closely related species, and splicing of the putative introns would disrupt conserved coding sequences. Our algorithm does not predict a discrete boundary between genes that do and genes that do not contain introns (Fig. 3), and we therefore used these false positives to determine how far down our list we would search. In the end, we examined the top-scoring We used the new introns identified by our computational approach to refine our intron model, and applied the refined model to the C. albicans ORFs for a second round of predictions. To capture additional introns we may have missed in our first run, we also included 500 nt of downstream sequence for each ORF, and extended the upstream sequence from 500 to 1000 nt. Since the median lengths of upstream and downstream intergenic sequences are 600 nt and 300 nt, respectively (as calculated from annotations at http://www.candidagenome.org), this should be sufficient for most genes. Manual inspection of the top-scoring genes revealed 16 new introns. Two of these introns had unusually degenerate splice sites, and received higher scores in our second analysis because of the refinements of our model. The other 14 were detected because of the additional sequence we included with each ORF: two were within downstream sequence, and 12 were within upstream sequence, including one 5'-UTR intron we confirmed by RT-PCR (Supplemental Fig. S1A). As before, some of the upstream introns were within previously unannotated genes. These included homologs of the S. cerevisiae genes ATP18, ATP19, DAD4, INO4, and SLX9, and a gene with similarity to H. sapiens BLOC1S2 (encoded by orf19.2018.1). As an independent test of our predictions, we examined several candidate introns directly using RT-PCR. We concentrated primarily on the introns with particularly unusual splice sites (and therefore low scores from our algorithm), figuring these should represent the weakest of our predictions. Of the eight genes we examined, we were able to confirm the existence of an intron of the predicted size in every case (Supplemental Fig. S1B). Interestingly, most of these introns were inefficiently spliced, perhaps because of, in part, the unusual splice sites.
Secondary candidate approach for intron identification Of the 338 D. hansenii genes whose annotations include introns, we found that 35 were likely misannotated, and probably did not contain introns. For three genes, we were unable to identify any C. albicans homolog. Of the remaining 300 genes, 176 had C. albicans homologs in which introns had already been identified, 56 of which were first identified as containing introns in our experimental and computational screens described above. For another 114 genes, we found that the C. albicans homologs did not contain introns. Interestingly, seven of the remaining 10 D. hansenii genes identified a homolog (and in one case two homologs) that had been overlooked in the C. albicans gene annotations. These genes have short ORFs and are interrupted by introns, which is presumably why they had not been discovered previously. Unlike the eight new genes we discovered using our computational approach, these genes are not close enough to a previously annotated ORF to have been identified in our computational search. The final three D. hansenii genes identify new introns in known C. albicans genes. These genes received scores below the cutoff in our computational approach (although two of the three were within the top 8%). The low scores were due to highly unusual features of the introns in C. albicans: one has a unique branch site sequence, while the other two have both a noncanonical splice site and an unusually long distance between the branch site and the 3'-splice site. Thus, there are almost certainly introns that remain to be discovered in C. albicans. We believe, however, that the small number of introns identified with this last approach suggests that our overall analysis has identified most.
Categories of intron-containing genes
The over-representation of introns within certain gene categories demonstrates that the widespread loss of introns within the hemiascomycetous lineage has not been random. This, in turn, suggests that the retention of certain introns is under selective pressure. One possible explanation is that these introns are involved in regulating the expression of the genes in which they reside, as has already been demonstrated for several S. cerevisiae introns (discussed above).
What predictions can we make about regulated splicing in C. albicans on the basis of sequence features alone? While not always the case, regulated splicing in S. cerevisiae often involves introns with noncanonical splice sites (Li et al. 1995
Alternative splicing of RPL30 pre-mRNA The published annotation of the RPL30 (orf19.3788.1) intron in C. albicans was incorrect, but the corrected sequence revealed that the Rpl30 RNA-binding site is conserved with S. cerevisiae. It appears that the splicing regulation may also have been conserved in C. albicans, as we were able to detect both RPL30 mRNA and pre-mRNA by RT-PCR (Fig. 5A). While inspecting the intron sequence, however, we found a canonical 5'-splice site sequence in the center of the RPL30 intron. Consistent with this observation, we detected an additional RT-PCR product of intermediate size, and sequencing identified it as an alternatively spliced mRNA arising through use of this internal 5'-splice site. As far as we are aware, this is the only known case of alternative splicing in C. albicans. Both the alternative mRNA and the unspliced pre-mRNA are unproductive transcripts, interrupted by non-coding sequence and therefore unable to produce Rpl30 protein. Interestingly, which of the two unproductive forms predominates appears to be influenced by growth temperature. When cells were shifted to 37°C, we detected more of the alternatively spliced form, and when cells were shifted to 16°C, we detected more of the unspliced form (Fig. 5A). This is in contrast to other transcripts we examined, for which splicing appears to be unaffected by growth temperature.
Splicing regulation of a septin pre-mRNA SPR28 (orf19.4266) is one of seven genes that encode septin proteins in C. albicans. Septins are structural proteins that form a network of filaments at the inner surface of the plasma membrane, and are required for a broad range of dynamic membrane events (for review, see Douglas et al. 2005 The SPR28 gene has several unusual features in C. albicans. While 93% of intron-containing genes have only a single intron, SPR28 has two. The first has a noncanonical branch site (TAT TAAC) located an unusually long distance (54 nt) from the 3'-splice site. The second intron has a noncanonical 5'-splice site (GTGAGT) found in only two other introns. These features suggested to us that SPR28 pre-mRNA might be inefficiently spliced. They also suggested that SPR28 gene expression might be regulated via splicing. To test these ideas, we examined SPR28 splicing by RT-PCR, using primers whose PCR products span both introns. We detected four distinct products, corresponding to unspliced pre-mRNA, fully spliced mRNA, and partially spliced RNA in which either the first or the second intron has been retained (Fig. 5B). Even with the inherent bias of PCR against amplification of longer products, we detected predominantly unspliced RNA and relatively little spliced mRNA. This is in contrast to several genes with canonical splice sites, for which we detected very little or no unspliced RNA by RT-PCR (data not shown).
Expression of the splicing factors LEA1 and SLU7 is regulated by exposure of C. albicans cells to
Using both experimental and computational approaches, we have corrected and extended the existing C. albicans intron annotations, creating a high-confidence, and we believe nearly comprehensive set of 415 introns (Supplemental Table S1). Several introns we found join together exons previously thought to represent separate genes. In other cases, we found exons that were not previously annotated. Some of these identify new protein domains and therefore new functions for the genes that contain them. For example, three genes (orf19.1604, orf19.6781, and orf19.6888) have Gal4-like DNA-binding domains in their upstream exons, identifying them as probable transcription factors. We also found 16 genes that had been previously overlooked, presumably because their relatively short coding sequences are interrupted by introns. The curators of the Candida Genome Database have kindly agreed to host our intron annotations on their Web site (http://www.candidagenome.org).
Contrasts between the introns of S. cerevisiae and C. albicans
There are several differences in the characteristics of the splice-site sequences. The most dramatic is the distance between the branch point nucleotide (the final adenosine of the branch site) and the 3'-splice site, with the median distance in S. cerevisiae nearly twice as long (Fig. 6A). There are more subtle differences in the amount of variation within the splice-site sequences (Fig. 6C). C. albicans 5'-splice sites adhere slightly more to consensus, while the 3'-splice sites show more variation at the 3 position. It will be interesting to determine whether these differences correlate with changes in spliceosomal components involved in splice site recognition. We have already found numerous unexpected differences within the small nuclear RNAs (snRNAs) that function at the catalytic core of the spliceosome (Q. Mitrovich, A. Johnson, and C. Guthrie, in prep.).
Our GO term analysis identified several categories over-represented among the intron-containing genes of C. albicans (Fig. 4). Such over-representation suggests functional roles for these introns, perhaps in the regulation of gene expression. Categories in common with S. cerevisiae intron-containing genes may represent shared splicing regulatory pathways. Transcripts in both of the common categories listedribosomal protein genes and meiosis factorsare known to be regulated by splicing in S. cerevisiae (Engebrecht et al. 1991
Introns with functional roles in gene expression
The first example of regulated splicing in C. albicans involves the intron in the ribosomal protein gene RPL30. In higher eukaryotes, excess ribosomal proteins can inhibit their own synthesis by promoting the use of unproductive alternative splice sites within their pre-mRNAs. This results in mRNAs whose coding sequences are disrupted by intron fragments and are therefore unable to express functional protein (Mitrovich and Anderson 2000
SPR28 is one of seven septin genes in C. albicans, and one of three that contain introns. In contrast, none of the S. cerevisiae septin genes contains introns. This over-representation in C. albicans suggests that splicing regulation may play an important role in septin biology. SPR28 pre-mRNA contains two introns with unusual sequence features, and its splicing is highly inefficient. This inefficient splicing is inhibited further by exposure of cells to
New approaches to intron discovery
We have shown that a debranchase mutant is a powerful tool for identifying introns experimentally. Previous studies have used other techniques to visualize the distribution of introns in S. cerevisiae dbr1 mutants by gel electrophoresis (Ruskin et al. 1984
Our global computational approach was very successful, yielding 116 previously undiscovered introns. This approach is particularly powerful for the Hemiascomycetes, in which alternative splicing is exceedingly rare (Davis et al. 2000 The overall success of our approaches suggests that they will be useful for generating high-confidence intron annotations for newly sequenced genomes, as well as for refining the intron sets of previously annotated genomes. For organisms with highly stereotyped introns, our computational approach focusing primarily on intron features rather than coding-sequence alignments should be particularly useful. For organisms in which the debranchase enzyme can be deleted or inactivated, our experimental approach holds great potential. The availability of increasingly high-density microarrays should allow for more comprehensive use of this method in the future. Finally, as comprehensive intron annotations are made available, they will inform the annotations of related genomes by providing candidate intron sets. We believe that our annotation of C. albicans introns will thus provide an additional benefit by informing the annotations of emerging Candida genomes.
See Supplemental Methods for details regarding yeast strains and cell culture, RNA extraction and cDNA synthesis, RT-PCR analysis, and GO term analysis.
Phylogenetic refinement and validation of intron predictions Once candidate introns are identified, we examine homologous genes from other species to determine whether there is strong phylogenetic support for any of the introns. We identify homologs using BLAST; the specific program we use is determined by phylogenetic distance from C. albicans (BLASTN for C. dubliniensis, TBLASTN for C. tropicalis, and BLASTP for D. hansenii and more distantly related species). For us to consider an intron validated, it must be supported by at least one of two sets of criteria, as follows:
Microarray design, hybridization, and analysis
cDNAs for microarrays were labeled with either Cy3 (DBR1) or Cy5 (dbr1
Microarray data were extracted from scans using GenePix Pro v5.1. Data were subjected to intensity-dependent loess normalization using Goulphar v1.1 (http://www.transcriptome.ens.fr/goulphar) and ranked by normalized intensity ratios. Probes exhibiting a greater than twofold dbr1
Intron search algorithm The score for a putative intron sequence is calculated as the sum of its three PSSM scores and two distance constraint scores. Each input sequence was searched exhaustively for a maximally scoring putative intron. Finally, the log of the squared input sequence length was subtracted from each score to correct for the increased likelihood of false positives in longer ORFs.
We thank Jing Zhu for assistance with C. albicans ortholog identification, Burk Braun for technical assistance and sharing of unpublished data, Oliver Homann and Hao Li for advice on computational analyses, and members of the Johnson and Guthrie labs for many helpful discussions and critical reading of our manuscript. Funding for this work was provided by NIH research grants AI49187 (A.J.) and GM21119 (C.G.), an NSF Predoctoral Graduate Fellowship (B.T.), and a Sandler Postdoctoral Research Fellowship (Q.M.).
1 Corresponding author.
E-mail quinn.mitrovich{at}ucsf.edu; fax (415) 502-4315. [Supplemental material is available online at www.genome.org. The microarray data from this study are available at ArrayExpress (http://www.ebi.ac.uk), accession no. E-MEXP-1003.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6111907
Alonso-Valle, H., Acha, O., Garcia-Palomo, J.D., Farinas-Alvarez, C., Fernandez-Mazarrasa, C., and Farinas, M.C. 2003. Candidemia in a tertiary care hospital: Epidemiology and factors influencing mortality. Eur. J. Clin. Microbiol. Infect. Dis. 22: 254257.[Medline] Ares Jr., M., Grate, L., and Pauling, M.H. 1999. A handful of intron-containing genes produces the lions share of yeast mRNA. RNA 5: 11381139.[CrossRef][Medline] Bennett, R.J. and Johnson, A.D. 2006. The role of nutrient regulation and the Gpa2 protein in the mating pheromone response of C. albicans. Mol. Microbiol. 62: 100119.[CrossRef][Medline] Black, D.L. 2003. Mechanisms of alternative pre-messenger RNA splicing. Annu. Rev. Biochem. 72: 291336.[CrossRef][Medline] Blencowe, B.J. 2006. Alternative splicing: New insights from global analyses. Cell 126: 3747.[CrossRef][Medline] Bon, E., Casaregola, S., Blandin, G., Llorente, B., Neuveglise, C., Munsterkotter, M., Guldener, U., Mewes, H.W., Van Helden, J., Dujon, B., et al. 2003. Molecular evolution of eukaryotic genomes: Hemiascomycetous yeast spliceosomal introns. Nucleic Acids Res. 31: 11211135. Braun, B.R., Het van Hoog, M., dEnfert, C., Martchenko, M., Dungan, J., Kuo, A., Inglis, D.O., Uhl, M.A., Hogues, H., Berriman, M., et al. 2005. A human-curated annotation of the Candida albicans genome. PLoS Genet. 1: 3657.[CrossRef][Medline] Brent, M.R. 2005. Genome annotation past, present, and future: How to define an ORF at each locus. Genome Res. 15: 17771786. Byrne, K.P. and Wolfe, K.H. 2005. The Yeast Gene Order Browser: Combining curated homology and syntenic context reveals gene fate in polyploid species. Genome Res. 15: 14561461. Chapman, K.B. and Boeke, J.D. 1991. Isolation and characterization of the gene encoding yeast debranching enzyme. Cell 65: 483492.[CrossRef][Medline] Chibana, H., Oka, N., Nakayama, H., Aoyama, T., Magee, B.B., Magee, P.T., and Mikami, Y. 2005. Sequence finishing and gene mapping for Candida albicans chromosome 7 and syntenic analysis against the Saccharomyces cerevisiae genome. Genetics 170: 15251537. Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston, R., Cohen, B.A., and Johnston, M. 2003. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 301: 7176. Cuccurese, M., Russo, G., Russo, A., and Pietropaolo, C. 2005. Alternative splicing and nonsense-mediated mRNA decay regulate mammalian ribosomal gene expression. Nucleic Acids Res. 33: 59655977. Davis, C.A., Grate, L., Spingola, M., and Ares Jr., M. 2000. Test of intron predictions reveals novel splice sites, alternatively spliced mRNAs and new introns in meiotically regulated genes of yeast. Nucleic Acids Res. 28: 17001706. De Virgilio, C., DeMarini, D.J., and Pringle, J.R. 1996. SPR28, a sixth member of the septin gene family in Saccharomyces cerevisiae that is expressed specifically in sporulating cells. Microbiology 142: 28972905.[Abstract] Douglas, L.M., Alvarez, F.J., McCreary, C., and Konopka, J.B. 2005. Septin function in yeast model systems and pathogenic fungi. Eukaryot. Cell 4: 15031512. Dujon, B. 2006. Yeasts illustrate the molecular mechanisms of eukaryotic genome evolution. Trends Genet. 22: 375387.[CrossRef][Medline] Durbin, R., Eddy, S.R., Krogh, A., and Mitchison, G. 1998. Biological sequence analysis: Probabilistic models of proteins and nucleic acids. Cambridge University Press, Cambridge, UK. Engebrecht, J.A., Voelkel-Meiman, K., and Roeder, G.S. 1991. Meiosis-specific RNA splicing in yeast. Cell 66: 12571268.[CrossRef][Medline] Faustino, N.A. and Cooper, T.A. 2003. Pre-mRNA splicing and human disease. Genes & Dev. 17: 419437. Fidel Jr., P.L. 2002. Immunity to Candida. Oral Dis. 8: 6975.[Medline] Hull, C.M. and Johnson, A.D. 1999. Identification of a mating type-like locus in the asexual pathogenic yeast Candida albicans. Science 285: 12711275. Johnson, A. 2003. The biology of mating in Candida albicans. Nat. Rev. Microbiol. 1: 106116.[CrossRef][Medline] Jones, T., Federspiel, N.A., Chibana, H., Dungan, J., Kalman, S., Magee, B.B., Newport, G., Thorstenson, Y.R., Agabian, N., Magee, P.T., et al. 2004. The diploid genome sequence of Candida albicans. Proc. Natl. Acad. Sci. 101: 73297334. Juneau, K., Miranda, M., Hillenmeyer, M.E., Nislow, C., and Davis, R.W. 2006. Introns regulate RNA and protein abundance in yeast. Genetics 174: 511518. Kellis, M., Patterson, N., Endrizzi, M., Birren, B., and Lander, E.S. 2003. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature 423: 241254.[CrossRef][Medline] Calderone, R.A. 2002. Candidemia. In Candida and Candidiasis, pp. 327340. ASM Press, Washington, DC. Li, Z., Paulovich, A.G., and Woolford Jr., J.L. 1995. Feedback inhibition of the yeast ribosomal protein gene CRY2 is mediated by the nucleotide sequence and secondary structure of CRY2 pre-mRNA. Mol. Cell. Biol. 15: 64546464.[Abstract] Maxwell, E.S. and Fournier, M.J. 1995. The small nucleolar RNAs. Annu. Rev. Biochem. 64: 897934.[CrossRef][Medline] Mitrovich, Q.M. and Anderson, P. 2000. Unproductively spliced ribosomal protein mRNAs are natural targets of mRNA surveillance in C. elegans. Genes & Dev. 14: 21732184. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||