Genome Res. 13:1290-1300, 2003
©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00
Impact of Alternative Initiation, Splicing, and Termination on the Diversity of the mRNA Transcripts Encoded by the Mouse Transcriptome
Mihaela Zavolan1,7,
Shinji Kondo2,
Christian Schönbach3,
Jun Adachi2,
David A. Hume4,
RIKEN GER Group2,
GSL Members5,6,
Yoshihide Hayashizaki2,5 and
Terry Gaasterland1
1Laboratory of Computational Genomics, The Rockefeller University, New York, New York 10021-6399, USA
2Laboratory for Genome Exploration Research Group, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
3Biomedical Knowledge Discovery Team, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
4ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane Q4072, Australia
5Genome Science Laboratory, RIKEN, Hirosawa, Wako, Saitama 351-0198, Japan
We analyzed the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences. We developed a new computational procedure to identify and classify the forms of splice variation evident in this data set and organized the results into a publicly accessible database that can be used for future expression array construction, structural genomics, and analyses of the mechanism and regulation of alternative splicing. Statistical analysis shows that at least 41% and possibly as much as 60% of multiexon genes in mouse have multiple splice forms. Of the transcription units with multiple splice forms, 49% contain transcripts in which the apparent use of an alternative transcription start (stop) is accompanied by alternative splicing of the initial (terminal) exon. This implies that alternative transcription may frequently induce alternative splicing. The fact that 73% of all exons with splice variation fall within the annotated coding region indicates that most splice variation is likely to affect the protein form. Finally, we compared the set of constitutive (present in all transcripts) exons with the set of cryptic (present only in some transcripts) exons and found statistically significant differences in their length distributions, the nucleotide distributions around their splice junctions, and the frequencies of occurrence of several short sequence motifs.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1017303.
7 Corresponding author. E-MAIL mihaela{at}genomes.rockefeller.edu; FAX (212) 327-7765.
6 Takahiro Arakawa, Piero Carninci, and Jun Kawai.
[Supplemental material is available online at www.genome.org.]

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
M. Hiller, K. Szafranski, R. Sinha, K. Huse, S. Nikolajewa, P. Rosenstiel, S. Schreiber, R. Backofen, and M. Platzer
Assessing the fraction of short-distance tandem splice sites under purifying selection
RNA,
April 1, 2008;
14(4):
616 - 629.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T.-M. Chern, N. Paul, E. van Nimwegen, and M. Zavolan
Computational Analysis of Full-length cDNAs Reveals Frequent Coupling Between Transcriptional and Splicing Programs
DNA Res,
April 1, 2008;
15(2):
63 - 72.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Zhang, M. L. Hastings, A. R. Krainer, and M. Q. Zhang
Dual-specificity splice sites function alternatively as 5' and 3' splice sites
PNAS,
September 18, 2007;
104(38):
15028 - 15033.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K.-W. Tsai, W.-Y. Tarn, and W.-c. Lin
Wobble Splicing Reveals the Role of the Branch Point Sequence-to-NAGNAG Region in 3' Tandem Splice Site Selection
Mol. Cell. Biol.,
August 15, 2007;
27(16):
5835 - 5848.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Garg and P. Green
Differing patterns of selection in alternative and constitutive splice sites
Genome Res.,
July 1, 2007;
17(7):
1015 - 1022.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. G. Leparc and R. D. Mitra
Non-EST-based prediction of novel alternatively spliced cassette exons with cell signaling function in Caenorhabditis elegans and human
Nucleic Acids Res.,
May 11, 2007;
35(10):
3192 - 3202.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Y. Gracey
Interpreting physiological responses to environmental change through gene expression profiling
J. Exp. Biol.,
May 1, 2007;
210(9):
1584 - 1592.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Hiller, S. Nikolajewa, K. Huse, K. Szafranski, P. Rosenstiel, S. Schuster, R. Backofen, and M. Platzer
TassDB: a database of alternative tandem splice sites
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D188 - D192.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
U. Ohler
Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction
Nucleic Acids Res.,
November 6, 2006;
34(20):
5943 - 5950.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Gustincich, A. Sandelin, C. Plessy, S. Katayama, R. Simone, D. Lazarevic, Y. Hayashizaki, and P. Carninci
The complexity of the mammalian transcriptome
J. Physiol.,
September 1, 2006;
575(2):
321 - 332.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Hasegawa, S. Fukuda, K. Shimokawa, S. Kondo, N. Maeda, and Y. Hayashizaki
A RecA-mediated exon profiling method
Nucleic Acids Res.,
August 8, 2006;
34(13):
e97 - e97.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Thill, V. Castelli, S. Pallud, M. Salanoubat, P. Wincker, P. de la Grange, D. Auboeuf, V. Schachter, and J. Weissenbach
ASEtrap: A biological method for speeding up the exploration of spliceomes
Genome Res.,
June 1, 2006;
16(6):
776 - 786.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Nagasaki, M. Arita, T. Nishizawa, M. Suwa, and O. Gotoh
Automated classification of alternative splicing and transcriptional initiation and construction of visual database of classified patterns
Bioinformatics,
May 15, 2006;
22(10):
1211 - 1216.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. K. Shah and P. Bork
LSAT: learning about alternative transcripts in MEDLINE
Bioinformatics,
April 1, 2006;
22(7):
857 - 865.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. J. Law, E. M. Labut, R. D. Adams, and J. L. Merchant
An isoform of ZBP-89 predisposes the colon to colitis
Nucleic Acids Res.,
March 3, 2006;
34(5):
1342 - 1350.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Florea
Bioinformatics of alternative splicing and its regulation
Brief Bioinform,
March 1, 2006;
7(1):
55 - 69.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Werner and P. J. Nelson
Joining high-throughput technology with in silico modelling advances genome-wide screening towards targeted discovery
Brief Funct Genomic Proteomic,
March 1, 2006;
5(1):
32 - 36.
|
 |
|

|
 |

|
 |
 
R. Shemesh, A. Novik, S. Edelheit, and R. Sorek
Genomic fossils as a snapshot of the human transcriptome
PNAS,
January 31, 2006;
103(5):
1364 - 1369.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Kimura, A. Wakamatsu, Y. Suzuki, T. Ota, T. Nishikawa, R. Yamashita, J.-i. Yamamoto, M. Sekine, K. Tsuritani, H. Wakaguri, et al.
Diversification of transcriptional modulation: Large-scale identification and characterization of putative alternative promoters of human genes
Genome Res.,
January 1, 2006;
16(1):
55 - 65.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Ravasi, H. Suzuki, K. C. Pang, S. Katayama, M. Furuno, R. Okunishi, S. Fukuda, K. Ru, M. C. Frith, M. M. Gongora, et al.
Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome
Genome Res.,
January 1, 2006;
16(1):
11 - 19.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. S. Siddiqui, J. Khattra, A. D. Delaney, Y. Zhao, C. Astell, J. Asano, R. Babakaiff, S. Barber, J. Beland, S. Bohacec, et al.
A mouse atlas of gene expression: Large-scale digital gene-expression profiles from precisely defined developing C57BL/6J mouse tissues and cells
PNAS,
December 20, 2005;
102(51):
18485 - 18490.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. L. ZHENG, X.-D. FU, and M. GRIBSKOV
Characteristics and regulatory elements defining constitutive splicing and different modes of alternative splicing in human and mouse
RNA,
December 1, 2005;
11(12):
1777 - 1787.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Hiller, K. Huse, M. Platzer, and R. Backofen
Non-EST based prediction of exon skipping and intron retention events using Pfam information
Nucleic Acids Res.,
October 4, 2005;
33(17):
5611 - 5621.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Kan, P. W. Garrett-Engele, J. M. Johnson, and J. C. Castle
Evolutionarily conserved and diverged alternative splicing events show different expression and functional profiles
Nucleic Acids Res.,
September 29, 2005;
33(17):
5659 - 5666.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Rosonina, J. Y. Y. Ip, J. A. Calarco, M. A. Bakowski, A. Emili, S. McCracken, P. Tucker, C. J. Ingles, and B. J. Blencowe
Role for PSF in Mediating Transcriptional Activator-Dependent Stimulation of Pre-mRNA Processing In Vivo
Mol. Cell. Biol.,
August 1, 2005;
25(15):
6734 - 6746.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. D. Choi, L. A. Underkoffler, A. J. Wood, J. N. Collins, P. T. Williams, J. A. Golden, E. F. Schuster Jr., K. M. Loomes, and R. J. Oakey
A Novel Variant of Inpp5f Is Imprinted in Brain, and Its Expression Is Correlated with Differential Methylation of an Internal CpG Island
Mol. Cell. Biol.,
July 1, 2005;
25(13):
5514 - 5522.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. A. Sharov, D. B. Dudekula, and M. S.H. Ko
Genome-wide assembly and analysis of alternative transcripts in mouse
Genome Res.,
May 1, 2005;
15(5):
748 - 754.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Gough
Convergent evolution of domain architectures (is rare)
Bioinformatics,
April 15, 2005;
21(8):
1464 - 1471.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. B. Wahl, U. Heinzmann, and K. Imai
LongSAGE analysis significantly improves genome annotation: identifications of novel genes and alternative transcripts in the mouse
Bioinformatics,
April 15, 2005;
21(8):
1393 - 1400.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Kim, S. Shin, and S. Lee
ECgene: Genome-based EST clustering and gene modeling for alternative splicing
Genome Res.,
April 1, 2005;
15(4):
566 - 576.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Kasai, S.-i. Hashimoto, T. Yamada, J. Sese, S. Sugano, K. Matsushima, and S. Morishita
5'SAGE: 5'-end Serial Analysis of Gene Expression database
Nucleic Acids Res.,
January 1, 2005;
33(suppl_1):
D550 - D552.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Ding, A. Sabo, N. Berkowicz, R. R. Meyer, Y. Shotland, M. R. Johnson, K. H. Pepin, R. K. Wilson, and J. Spieth
EAnnot: A genome annotation tool using experimental evidence
Genome Res.,
December 1, 2004;
14(12):
2503 - 2509.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Kim, S. Shin, and S. Lee
ASmodeler: gene modeling of alternative splicing from genomic alignment of mRNA, EST and protein sequences
Nucleic Acids Res.,
July 1, 2004;
32(suppl_2):
W181 - W186.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. G. Noble, P. A. Walker, L. J. Calder, and I. A. Taylor
Rna14-Rna15 assembly mediates the RNA-binding capability of Saccharomyces cerevisiae cleavage factor IA
Nucleic Acids Res.,
June 23, 2004;
32(11):
3364 - 3375.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Nagashima, H. Matsuda, D. G. Silva, N. Petrovsky, RIKEN GER Group, GSL Members, A. Konagaya, and C. Schonbach
FREP: a database of functional repeats in mouse cDNAs
Nucleic Acids Res.,
January 1, 2004;
32(90001):
D471 - 475.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Shiraki, S. Kondo, S. Katayama, K. Waki, T. Kasukawa, H. Kawaji, R. Kodzius, A. Watahiki, M. Nakamura, T. Arakawa, et al.
Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage
PNAS,
December 23, 2003;
100(26):
15776 - 15781.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. M. Johnson, J. Castle, P. Garrett-Engele, Z. Kan, P. M. Loerch, C. D. Armour, R. Santos, E. E. Schadt, R. Stoughton, and D. D. Shoemaker
Genome-Wide Survey of Human Alternative Pre-mRNA Splicing with Exon Junction Microarrays
Science,
December 19, 2003;
302(5653):
2141 - 2144.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|