|
|
|
|
Genome Res. 13:1290-1300, 2003 ©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00 Impact of Alternative Initiation, Splicing, and Termination on the Diversity of the mRNA Transcripts Encoded by the Mouse Transcriptome1Laboratory of Computational Genomics, The Rockefeller University, New York, New York 10021-6399, USA 2Laboratory for Genome Exploration Research Group, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan 3Biomedical Knowledge Discovery Team, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan 4ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane Q4072, Australia 5Genome Science Laboratory, RIKEN, Hirosawa, Wako, Saitama 351-0198, Japan
We analyzed the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences. We developed a new computational procedure to identify and classify the forms of splice variation evident in this data set and organized the results into a publicly accessible database that can be used for future expression array construction, structural genomics, and analyses of the mechanism and regulation of alternative splicing. Statistical analysis shows that at least 41% and possibly as much as 60% of multiexon genes in mouse have multiple splice forms. Of the transcription units with multiple splice forms, 49% contain transcripts in which the apparent use of an alternative transcription start (stop) is accompanied by alternative splicing of the initial (terminal) exon. This implies that alternative transcription may frequently induce alternative splicing. The fact that 73% of all exons with splice variation fall within the annotated coding region indicates that most splice variation is likely to affect the protein form. Finally, we compared the set of constitutive (present in all transcripts) exons with the set of cryptic (present only in some transcripts) exons and found statistically significant differences in their length distributions, the nucleotide distributions around their splice junctions, and the frequencies of occurrence of several short sequence motifs.
Numerous databases of alternative splice forms have been generated in recent years (Stamm et al. 1994
Exon inclusion into mature mRNA is a complex choice whose outcome is influenced by a variety of factors. Many exons are "cryptic" in the sense that they are included in some but not all transcripts, through mechanisms that are presently not completely understood. Cryptic exons have been reported to be shorter, on average, than constitutive exons (Berget 1995
To make effective use of the large repertoire of transcripts, a cell must be able to tightly regulate their expression. With the completion of the mouse genome sequence (ftp://wolfram.wi.mit.edu/pub/mousecontigs/MGSCV3 Here we report the results of our analysis of splice variation in the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences.
Frequency of Alternative Splicing in the Mouse Transcriptome Of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences, 101,356 aligned to 36,617 genomic loci. A locus was defined based on the genomic map of the transcripts: Two transcripts were clustered together if their genomic maps overlapped by at least one nucleotide in at least one exon. We defined the genomic locus of a transcript cluster as the shortest genomic interval that contains the genomic maps of all of the transcripts in the cluster. We found that 77,640 of the cDNA sequences mapped to the genome at >95% identity over their entire length, with every exon being mapped at 95% identity or with at most five errors. Of these transcripts, 23,150 were either unspliced or came from single-exon genes. Of the remaining 54,490 multiexon transcripts, 7293 did not cluster with any other transcripts (we called these transcripts singletons), whereas 47,197 formed 11,677 multitranscript clusters, which we analyzed for splice variation. We constructed the union of the genomic mappings of all transcripts in a cluster and denoted the contiguous genomic regions in this union of mappings as "genomic exons." A genomic exon represented in some, but not all, transcripts in a cluster was denoted "cryptic." A genomic exon with different 5' and/or 3' boundaries in different transcripts, was denoted an exon with "alternative 3'- and/or 5'-splice sites." We chose not to report cases in which the variation could be explained by an intron inclusion, because we cannot readily distinguish computationally between intron inclusion and incomplete splicing. Genomic exons that were represented with the same splice junctions in all transcripts that were long enough to have contained them were denoted "constitutive." A cluster with at least one variant genomic exon was denoted "variant." Of the 11,677 multitranscript clusters, 41% (4750) were variant (Table 1).
This value is larger than our previous estimate on a smaller data set of full-length mouse cDNAs (30%; Zavolan et al. 2002
From our (Zavolan et al. 2002
Characterization of Splice Variants
We sought to independently validate the exon variants using mouse EST sequences from the dbEST database. EST mappings were filtered with the same stringency as the cDNA mappings, and were searched for the presence of the variant exon forms. An exon variation was considered confirmed when all the exon forms used to infer the variation were present in the EST data. That is, a confirmed cryptic exon is an exon that is present in some EST but skipped in others. A cryptic exon with alternative 5'-splice sites is confirmed when both forms of splice site usage, as well as the skipped form occur in the EST data. Table 2 summarizes the results. It is not surprising that multiple variations are confirmed at lower rates than single variations. Given that most ESTs are the result of 3'- or 5'-end sequencing, it is also to be expected that internal cryptic exons have lower confirmation rates than initial or terminal cryptic exons. Exons with multiple variations have, however, lower confirmation rates than we would predict if each variation requires independent confirmation. This indicates that certain of the variant exon forms are less common than others. For instance, such forms may be expressed in only a few cell types, or are generated through infrequent errors of the spliceosome (see below). Kan et al. (2002
Functional Splice Variation Versus Noise in the Splicing Process We observed that alternative 5'/3'-splice sites are frequently only a few nucleotides apart, indicating that some of the variation may be caused by random use of 5'- and 3'-splice sites within a short region around the exon boundary. To document this effect, we isolated exons with alternative 5'- and 3'-splice sites that matched the genome perfectly for at least 10 nt from the splice junction, and were preceded/followed by invariant exons with perfect alignment to the genome for at least 10 nt from the splice junction. This selection ensures that the variation in the placement of the splice junction is not caused by sequencing, assembly, or mapping errors. We then computed the distribution of distances between these alternative splice sites (Fig. 1). In 83 (18%) of the 452 cases of alternative 5'-splice sites and 237 (40%) of the 587 cases of alternative 3'-splice sites, the distance between alternative sites was <10 nt.
Whereas alternative 3'-splice sites seem to be selected for preserving the reading frame, alternative 5' sites are not. In 310 (52.8%, p-value < 2.2 x 10-16) of the 587 cases of exons with alternative 3'-splice sites, the distance between these sites was a multiple of 3, compared with 159 (35.2%, p-value = 0.4247) of the 452 alternative 5'-splice sites. The asymmetry is, however, entirely due to the frequent use (161 of 587 cases) of alternative 3'-splice sites that are only 3 nt apart. In summary, our data indicate that the spliceosome may frequently "slip" around the splice junction, and that, at the 3' site, slips that preserve the reading frame are more common than others.
Impact of Splice Variation on the Proteome
Representation of Gene Ontology Categories in Variant and Invariant Clusters To explore the functional repertoire of splice variants, we extracted from the variant and invariant clusters 4236 and 6234 representative transcripts (FANTOM2 Consortium and the RIKEN GSC Genome Exploration Group 2002
Comparative Analysis of Cryptic and Constitutive Exons The mechanism of exon selection has been extensively studied (Robberson et al. 1990
A Kolmogorov-Smirnov test indicates that the length distribution of constitutive exons differs significantly from that of cryptic exons (Fig. 2, p-value = 2.075 x 10-12). Contrary to our expectation that cryptic exons would be shorter than constitutive exons (Berget 1995
Figure 3 shows the distribution of nucleotides found around the splice sites in constitutive and cryptic exons (see Methods). The slightly larger size of the letters in the upper panels indicates that the splice junctions flanking constitutive exons show more conservation than those flaking cryptic exons. Statistically significant differences (see Methods) are apparent at positions +1, +4, and +5 in the 5'-splice site and positions -5 and -6 in the 3'-splice site (Fig. 4). These results are consistent with previous observations that cryptic exons have "weaker" splice sites (Stamm et al. 2000
We sought to identify sequence motifs that are over- or underrepresented in cryptic exons relative to constitutive exons, and are therefore candidate regulatory elements responsible for the inclusion of the cryptic exon in the mature mRNA. We extracted candidate motifs as described in Methods. In short, we select all motifs that occur significantly more frequently in constitutive than cryptic exons (or vice versa), and whose frequency is significantly different from that expected from the mononucleotide frequencies in at least one of the two sets of exons. We classified these motifs based on (1) the neighboring splice site (5' or 3'); (2) over- or underrepresentation in constitutive versus cryptic; (3) over- or underrepresentation of the motif when compared with the frequency expected from the mononucleotide frequencies in the exon set.
The most striking patterns that emerged are the following. Some of the previously reported splice enhancers (Fairbrother et al. 2002
In contrast, cryptic exons are enriched in pyrimidine-rich motifs, reminiscent of the SRp20-specific motifs described by Schaal and Maniatis (1999
Examples
Here we present one example to illustrate the breadth of strategies by which transcript and protein diversity appear to be achieved in mouse. In separate analyses, we have noted that alternative splicing is relatively common among the members of the very large zinc finger family of transcription factors, commonly leading to inclusion or exclusion of individual zinc fingers or other functional domains (Ravasi et al. 2003
As an example, Figure 5
shows different splice forms of the polypyrimidine-tract-binding protein (PTB), also known as hnRNP1, that are observed in our data set. This is an RNA-binding protein that controls 3'-end processing, internal initiation of translation, and RNA localization, in addition to splice acceptor recognition. As the figure indicates, this gene, involved in the control of alternative splicing, is itself extensively alternatively spliced. Human PTB is known to be expressed in at least three isoforms that differ by the insertion of 19 and 26 amino acids, respectively, between the second and the third RNA-recognition-motif domains (Romanelli et al. 2000
MouSDB: The Database of Splice Variants Identified in the Mouse Transcriptome All multitranscript clusters that we identified in this data set have been deposited in a Postgres database that can be queried via a Web interface at http://genomes.rockefeller.edu/MouSDB
Our study provides the first database and analysis of alternative splice forms identified in a large set of full-length cDNAs. The data set used in this analysis is unique in the following ways. With ESTs one generally has to infer the full transcript by combining sequence from several ESTs, and there is no guarantee that the inferred transcript occurs in vivo. In contrast, full-length cDNAs directly reveal the spectrum of splice forms that are realized in vivo. Additionally, the full-length cDNAs and 5'- and 3'-end EST sequences generated by the FANTOM project (FANTOM2 Consortium and the RIKEN GSC Genome Exploration Group 2002
One complication for the analysis of splice variation in this data set is that the sequences have been generated during the course of a project aimed at maximizing the nonredundant coverage of the mouse transcriptome. For this reason, the data set does not cover transcripts in proportion to their representation in the body (Konno et al. 2001 We developed a novel computational tool to automate the analysis of splice variation in cDNA and EST sequences. It maps the sequences to the genome, clusters sequences mapping to overlapping regions in the genome, produces multiple alignments of the sequences within a cluster, and annotates them in terms of splice variation. Our methods are highly conservative: To reduce the frequency of false positives, we only use sequences that map to the genome at very high percentage identity and coverage. This methodology can be applied to any set of EST or cDNA sequences for which the corresponding genome has been sequenced.
We organized the data into a publicly accessible resource that can be used to study specific genes of interest, for expression array construction, and for further analyses of the mechanism and regulation of alternative splicing. To increase our coverage of splice forms in the mouse transcriptome, we also incorporated 5'- and 3'-end RIKEN EST sequences and EST sequences from dbEST. Both data sets (cDNA sequences only and cDNA + EST sequences) are accessible at http://genomes.rockefeller.edu/MouSDB We found that 41% of the loci for which multiple spliced transcripts were present in the data set had multiple splice forms. We took advantage of the functional annotation of these sequences to confirm that most of the splice variation occurs inside the protein-coding region. Interestingly, in almost half of all transcription units that show splice variation, we found cases in which an apparent alternative transcription start (stop) site is associated with an alternative splice in the initial (terminal) exon. The use of alternative transcription in these cases indicates that the alternative splice forms are differentially expressed and are, therefore, functional. We also found numerous examples of exons whose alternative splice sites are only a few nucleotides apart, indicating that the spliceosome can slide around the splice junction.
Finally, given the high frequency of alternative splicing in mammalian genomes, the regulation of alternative splicing has now become a prominent question. How does the cell ensure that the appropriate splice forms are expressed in the right place and at the right time? Our analysis confirms that the splice junctions around cryptic exons deviate from those flanking constitutive exons, and we identified the precise positions at which significant deviations occur. We also found compositional differences between cryptic and constitutive exons, one notable difference being the enrichment of pyrimidine-rich motifs in cryptic exons. Other motifs, previously reported to act as splice enhancers (Fairbrother et al. 2002
Data Sets and Mapping The cDNA collection used in this study comprised the RIKEN set of 60,770 full-length mouse cDNA sequences and 44,122 public domain mouse mRNA sequences from LocusLink and from the non-EST divisions of the Mouse Gene Index and GenBank (FANTOM2 Consortium and the RIKEN GSC Genome Exploration Group 2002
EST sequences from dbEST (May 2002 release) and from the RIKEN data set of 5'- and 3'-end sequences were repeat-masked using the Paracel filtering package (Paracel Inc.) and aligned to the mouse genome using Paracel BLAST and BLAT (Kent 2002
Transcript Clustering Based on the Genome Mapping
Characterization of Splice Variation Clusters with multiple spliced transcripts were analyzed for the presence of cryptic exons and exons with alternative 5'/3'-splice sites. We compared all exons of all transcripts in a cluster in a pairwise fashion to identify those that used alternative 5'- and 3'-splice sites (Fig. 6). For each exon of a transcript, we tabulated all splice sites from the other transcript falling inside the exon. If the first of these splice sites was a 3'-splice site, both exons were marked as having alternative 3'-splice sites. If the last of these splice sites was a 5'-splice site, both exons were marked as having alternative 5'-splice sites. Note that this scheme does not classify intron inclusions as alternative splicing, because the above condition will not be satisfied. Our classification scheme did not consider intron inclusions as alternative splicing, because we could not distinguish them (computationally) from incomplete mRNA processing. Additionally, we identified cryptic exons, defined as exons that are present in some and skipped in other transcripts, in a cluster. We distinguished between cryptic exons internal to a transcript and initial or terminal cryptic exons. We believe that initial and terminal cryptic exons often occur in transcripts with alternative transcription, and we wanted to recognize these as a separate category for further investigation.
To avoid biases arising from cDNA library construction and cDNA amplification, we reported variation at the level of genomic exons. A genomic exon is defined as the union of all overlapping exons identified in the cDNAs of a cluster. We propagated the alternative splice information from transcript exons to the level of genomic exons as follows. A genomic exon is considered cryptic or constitutive depending on whether or not it is skipped in some of the transcripts in the cluster. In cases in which the cryptic exon occurred in both internal and initial/terminal positions in the transcripts, we called the genomic exon cryptic internal. If the genomic exon contained transcript exons with alternative 3'- and/or 5'-splice sites, the genomic exon was considered to have alternative 5'- and/or 3'-splice sites.
Estimation of the Frequency of Alternative Splicing in the Mouse Transcriptome
We assume that the prior P(n) is a so-called scale-prior, P(n) For the prior P(p1, p2, ···, pn n), we take a uniform distribution. This again puts a relatively high a priori weight on very skewed frequencies pi, and thus increases the a priori probability of observing only one splice form even if alternative splice forms exist. Thus, our calculation can be interpreted as providing an upper bound on the number of mouse genes that have alternative splice forms even though only one splice form occurred in the data. On the other hand, assuming that none of these genes has alternative splice forms provides a lower bound.
Given n and pi, the probability of observing k times the same splice form is given by
, .
The posterior probability that the gene has n distinct splice forms given that k identically spliced transcripts have been observed is
This calculation assumes that each transcript in our data is a sample of an independent mature mRNA. However, because we cannot exclude that the same transcript was amplified multiple times through PCR, only transcripts from different libraries are guaranteed to be independent. Assuming that only transcripts from different libraries are independent, we have 5996 multiple-transcript invariant clusters of which an expected 37% have splice variation according to our model. Combining this again with the transcripts for which splice variation was observed, we obtain an overall upper bound of 60% for the frequency of splice variation. Note that transcripts that differ by an intron inclusion are also guaranteed to be independent and are counted as such, and that single-transcript clusters are excluded from all these calculations because they do not contain any information about splice variation. Finally, note that our "upper bound" is not a formal upper bound; for example, it is in principle conceivable (although extremely unlikely) that every gene has an alternative splice form that occurs at such low frequencies that it would never be observed in our data set.
Relationship Between Exon Skipping and Alternative Splice Site Usage
Let n00, n10, n01, and n11 be the number of constitutive exons with invariant splice sites, cryptic exons with invariant splice sites, constitutive exons with alternative splice site usage, and cryptic exons with alternative splice site usage. Under model 1, let p be the probability that an exon is cryptic and q the probability that it has alternative splice sites. The likelihood of the observed data D given p and q is
Impact of Alternative Splicing on the Proteome Distinct proteins forms will be generated when cryptic exons are present within the genomic map of the CDS, and when alternative 5'- and/or 3'-splice sites fall within the genomic map of the CDS. All genomic exons falling into one of these categories were counted as "CDS exons." Genomic exons mapped upstream of the translation start of the representative transcript were counted as "5'-UTR exons," and exons mapped downstream of the translation stop were counted as "3'-UTR exons." Some clusters did not contain any transcripts with a CDS annotation, and for these, CDS status was deemed "unknown."
EST Confirmation of Splice Variants
Comparison of Splice Signals Flanking Cryptic and Constitutive Exons
.
We additionally want to identify positions in the splice signal for which there is a significant difference between the nucleotide distributions of constitutive and cryptic exons. To this end, we computed for each position i the likelihoods of the observed nucleotide frequencies assuming that (1) the two data sets had different underlying nucleotide frequencies at that position, and (2) the two data sets were generated using the same underlying nucleotide frequencies at that position. Under the first model, there are independent frequencies p
i
i and q i, we obtain
, and the integrals are over the simplex .
Similarly, if we assume that the nucleotide counts at position i
in the constitutive and cryptic exons derive from the same underlying distribution pi
i, we obtain
Comparative Analysis of Motif Composition in Cryptic and Constitutive Exons We further purged this list of motifs by removing those that are contained within longer motifs that also occur in the list. Finally, we compare the frequencies of the remaining motifs with the frequencies that we would expect based on the nucleotide frequencies in constitutive and cryptic exons. We again calculate the z-statistics and only retain the motifs that are significantly over- or underrepresented in at least one set of exons with respect to a mononucleotide frequency model. In short, we collect all motifs that have significantly different frequencies of occurrence in constitutive versus cryptic exons, and whose frequencies of occurrence are significantly different from the frequencies expected based on the mononucleotide frequencies in at least one set of exons. The resulting 90 motifs are shown in Supplementary Figure 1.
M.Z. thanks Henry Prince and Ben Snyder for assistance in developing the database, Magda Konaska for valuable suggestions, and Erik van Nimwegen for critically reading the manuscript. This research has been supported in part by National Cancer Institute Grant R33-CA84699 and National Science Foundation Grant DBI9984882 to T.G., and by the Rockefeller University Lita Annenberg Hazen Presidential Fellowship (M.Z.). The RIKEN data sets of full-length cDNA sequences and 5'- and 3'-end EST sequences have been generated by the Genomic Sciences Center RIKEN Yokohama Institute in Japan and by the Functional annotation of the Mouse Genome (FANTOM) Consortium.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1017303.
7 Corresponding author.
6 Takahiro Arakawa, Piero Carninci, and Jun Kawai. [Supplemental material is available online at www.genome.org.]
Basu, A., Dong, B., Krainer, C., and Howe, A.R. 1997. The intracisternal A-particle proximal enhancer-binding protein activates transcription and is identical to the RNA- and DNA-binding protein p54nrb/NonO. Mol. Cell. Biol. 17:677 -686.[Abstract]
Berget, S. 1995. Exon recognition in vertebrate splicing. J. Biol. Chem.
270:2411
-2414. Blencowe, B., Bauren, G., Eldridge, A., Issner, R., Nickerson, J., Rosonina, E., and Sharp, P. 2000. The SRm160/300 splicing coactivator subunits. RNA 6: 111-120.[Abstract] Brett, D., Hanke, J., Lehmann, G., Haase, S., Delbruck, S., Krueger, S., Reich, J., and Bork, P. 2000. EST comparison indicates 38% of human mRNAs contain possible alternative splice forms. FEBS Lett. 474:83 -86.[CrossRef][Medline] Brett, D., Pospisil, H., Valcarcel, J., Reich, J., and Bork, P. 2002. Alternative splicing and genome complexity. Nat. Genet. 30:29 -30.[CrossRef][Medline] Burd, C. and Dreyfuss, G. 1994. RNA binding of hnRNP A1: Significance of hnRNP A1 high-affinity binding sites in pre-mRNA splicing. EMBO J. 13:1197 -1204.[Medline]
Burke, J., Wang, H., Hide, W., and Davison, D. 1998. Alternative gene form discovery and candidate gene selection from gene indexing projects. Genome Res.
8: 276-290. Caceres, J. and Krainer, A. 1997. Mammalian pre-mRNA splicing factors. In Eukaryotic mRNA processing (ed. A. Krainer), pp. 174-182. Oxford IRL Press, New York. Carninci, P., Kvam, C., Kitamura, A., Ohsumi, T., Okazaki, Y., Itoh, M., Kamiya, M., Shibata, K., Sasaki, N., Izawa, M., et al. 1996. High-efficiency full-length cDNA cloning by biotinylated CAP trapper. Genomics 37:327 -336.[CrossRef][Medline] Chabot, B., Blanchette, M., Lapierre, I., and La Branche, H. 1997. An intron element modulating 5' splice site selection in the hnRNP A1 pre-mRNA interacts with hnRNP A1. Mol. Cell. Biol. 17:1776 -1786.[Abstract] Croft, L., Schandorff, S., Clark, F., Burrage, K., Arctander, P., and Mattick, J. 2000. ISIS, the intron information system, reveals the high frequency of alternative splicing in the human genome. Nat. Genet. 24:340 -341.[CrossRef][Medline]
Dominski, Z. and Kole, R. 1992. Cooperation of pre-mRNA sequence elements in splice site selection. Mol. Cell. Biol. 12:2108
-2114.
Eldridge, A., Li, Y., Sharp, P., and Blencowe, B. 1999. The SRm160/300 splicing coactivator is required for exon-enhancer function. Proc. Natl. Acad. Sci.
96:6125
-6130.
Fairbrother, W. and Chasin, L. 2000. Human genomic sequences that inhibit splicing. Mol. Cell. Biol.
20:6816
-6825.
Fairbrother, W., Yeh, R., Sharp, P., and Burge, C. 2002. Predictive identification of exonic splicing enhancers in human genes. Science
297:1007
-1013. FANTOM2 Consortium and the RIKEN GSC Genome Exploration Group. 2002. Analysis of the mouse transcriptome based upon functional annotation of 60,770 full length cDNAs. Nature 420:563 -573.[CrossRef][Medline] Florea, L., Hartzell, G., Zhang, Z., Rubin, G., and Miller, W. 1999. A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. 8: 967-974. Furuno, M., Kasukawa, T., Saito, R., Adachi, J., Suzuki, H., Baldarelli, R., Hayashizaki, Y., and Okazaki, Y. 2003. CDS annotation in full-length cDNA sequence. Genome Res. (this issue).
Gelfand, M., Dubchak, I., Dralyuk, I., and Zorn, M. 1999. ASDB: Database of alternatively spliced genes. Nucleic Acids Res. 27:301
-302.
Henriksen, M.A., Betz, A., Fuccillo, M., and Darnell, J. 2002. Negative regulation of STAT92E by an N-terminally truncated STAT protein derived from an alternative promoter site. Genes & Dev. 16:2379
-2389.
Hoffman, B. and Grabowski, P. 1992. U1 snRNP targets an essential splicing factor, U2AF65, to the splice site by a network of interactions spanning the exon. Genes & Dev.
6:2554
-2568.
Ji, H., Zhou, Q., Wen, F., Xia, H., Lu, X., and Li, Y. 2001. AsMamDB: An alternative splice database of mammals. Nucleic Acids Res. 29:260
-263. |