Vol. 9, Issue 12, 1288-1293, December 1999
LETTER
ABSTRACT
Alternative splicing can produce variant proteins and expression patterns as different as the products of different genes, yet the prevalence of alternative splicing has not been quantified. Here the spliced alignment algorithm was used to make a first inventory of exon-intron structures of known human genes using EST contigs from the TIGR Human Gene Index. The results on any one gene may be incomplete and will require verification, yet the overall trends are significant. Evidence of alternative splicing was shown in 35% of genes and the majority of splicing events occurred in 5' untranslated regions, suggesting wide occurrence of alternative regulation. Most of the alternative splices of coding regions generated additional protein domains rather than alternating domains.
INTRODUCTION
The total size of human genomic DNA sequences in GenBank
exceeds 100 million bases and is rising
exponentially. However, the majority of human genomic sequences are
uncharacterized or characterized incompletely. Thus, although a large
amount of data has been published about alternative splicing of
individual genes (Gelfand et al. 1999), this information remains mostly
anecdotal and does not allow for any generalizations. On the other
hand, it has been estimated that at least half of the human genes are
represented in the existing EST collections (Schuler et al. 1996).
Since these collections are created by partial sequencing of mRNAs from
many different tissues and developmental stages, one would expect that the diversity of alternative splicing variants in EST data banks would
be larger than in the standard samples of annotated human genes.
The problem of using ESTs for genomic DNA annotation and prediction of
exon-intron structure is not trivial. It has been studied by several
groups, most notably GRAIL (Xu and Uberbacher 1997). One of the main
difficulties is that a considerable number of ESTs map to intergenic or
intronic regions, or could be products of aberrant or incomplete
splicing. It is likely that these matches constitute at least one fifth
of the existing EST databases (Wolfsberg and Landsman 1997). Thus, the
most informative ESTs are those that correspond to several exons.
However, in this case simple matching of ESTs to genomic sequences by
BLAST-like programs is not sufficient because BLAST does not map
exactly the exon-intron boundaries (Altschul et al. 1990). Recently two
programs were published that align EST sequences with genomic DNA (Mott
1997; Florea et al. 1998).
We have developed a program for prediction of the exon-intron structure
of genomic DNA fragments using EST data. The program Procrustes-EST is
based on the modified spliced alignment algorithm (Gelfand et al. 1996).
When applied to known human genes and TIGR EST assemblies (Adams
et al. 1995), the program found a large number of alternatively spliced
genes (~35%). Most of the alternative splicing events occurred in
5'-untranslated regions. In many cases the use of the program
allowed for linking and merging multiple existing assemblies into
single contigs.
RESULTS
Superstructures and EST Contigs
After aligning EST contigs to genomic DNA, the latter was used as an anchor for additional clustering and assembly of ESTs. The partial gene structures generated by spliced alignment were merged whenever they shared consecutive splicing sites spanning an intron. The superstructures so formed correspond to all possible gene structures for which each complete exon is supported by at least one of the alignments (Methods). On the EST level this leads to formation of superassemblies. Each superassembly is a merge of initial EST contigs matching a predicted superstructure. Note that linking EST contigs to the genomic sequence, together with the requirement that all splicing sites in merged EST contigs coincide, precludes formation of spurious superassemblies.
Table 1 presents the distribution of the number of
EST contigs that are merged to form one superassembly. In ~50% of
cases, no further merging could be done. Because the procedure for
creating superstructures is local (Methods), 10% of all
superstructures are chimeric in the sense that the full superstructure
is not supported by any one of the original EST contigs and thus
possibly includes exons from different splicing variants. The remaining 40% of superassemblies are formed by more than one contig, showing that matching ESTs to the genome allows a significant amount of additional assembly.
Table 2 describes the number of superassemblies of
which each EST contig is a part (equivalently, the number of
superstructures to which each EST contig maps). More than half of
contigs (55%) are a part of only one superassembly, and slightly more
than one fourth of contigs (27%) are orphans (having no common
complete exons or introns with any of the genes in the starting set).
Approximately 3% of contigs are part of complex alternative splicing
events (being a part of four or more variant superassemblies).
Alternative Splicing
Table 3 presents the number of alternative
exon-intron structures predicted per gene. More than one-third of
genes have at least two variants of exon-intron structures. The
alternative structures were classified initially from the point of view
of mature mRNAs. Thus we distinguish alternatives at the 5' end
(5' forks), alternatives at the 3' end (3' forks) and
internal alternatives (loops including bulges). 5' forks occurred
in 73 genes (54% of alternatively spliced genes), loops in 41 genes
(30%), and 3' forks in 64 genes (47%) (the total exceeds 100%
because these cases are not mutually exclusive). Gautheret et al. (1998)
found that in 1000 EST clusters, 189 showed clear evidence of
alternative polyadenylation. These results are not directly comparable
to ours, as we did not attempt to determine the location of
polyadenylation sites.
We then analyzed the distribution of particular variants of alternative splicing, where 23% of loops were generated by alternative acceptors, 16% were generated by alternative donors, and 27% were exons that were present in one of the two structures and absent in the other one. There were rare instances of retained introns, alternative introns, and alternative exons. Of those examined, 25% were complex cases that could not be classified because they combined several elementary events of alternative splicing. Furthermore, 22% of 5'-forks were alternative 5' exons, 18% had different transcription start points and an additional intron in one of the variants, and the rest were complex cases. Finally, 11% of 3' forks were alternative terminal exons, 35% had different end points (polyadenylation sites) and an additional intron in one variant, and the rest were complex cases.
Classifying the alternatives by functional region rather than by location in the alignment, we saw that 80% of alternatively spliced genes had an alternative in the 5'-untranslated region, whereas only 20% had alternatives in the coding region as described in GenBank, and 19% had alternatives in the 3'-untranslated region (the total exceeds 100% since alternatives may occur in two or all three of these regions).
True Alternatives or Splicing Errors?
Intron retention, through either genomic contamination or incomplete/incorrect splicing, is perhaps the most likely artifact that could cause misleading results. However, we placed strict conditions on the inclusion/formation of superstructures (Methods) and in the final data observed only four cases where comparison of superstructures showed one retaining an intron relative to the other (considering not only coding regions, but the entire transcript). Thus, the possibility of intron contamination can be ruled out in the vast majority of the gene structures we considered.
We also performed additional analysis, considering the influence of discovered alternatives on reading frame for those cases (161 genes) where the alternative regions were situated completely within the annotated coding region. In 95 cases (59%) the alternative influenced an integer number of codons. Of these, 23 cases involved multiple (usually two) compensated events, for example, an alternative exon and an alternative site in the next exon. Noncompensated frameshifting (40 cases of added/lost exons, 74 cases of alternative choice of sites) usually happens near the 3' end of the coding region, and thus it affects only the carboxyl terminus of the protein. It is interesting to note that more than one third of frameshifts in predicted structures can be eliminated, preserving a strong EST contig to genome alignment, if we allow splicing at noncanonical sites and do not force the introns to start at GT and end at AG.
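The reading-frame check described above reduces to arithmetic modulo 3 on the lengths added or removed by each alternative. The following is a minimal Python sketch under stated assumptions: the function name `frame_effect`, the signed-length input format, and the three category labels are hypothetical and chosen for illustration; they are not taken from the analysis pipeline used in this study.

```python
def frame_effect(length_changes):
    """Classify the combined effect of alternative events on the reading frame.

    length_changes: signed nucleotide length differences, one per elementary
    event inside the coding region (hypothetical input format).
    """
    if sum(length_changes) % 3 != 0:
        return "frameshift"      # net change is not a whole number of codons
    if all(delta % 3 == 0 for delta in length_changes):
        return "in-frame"        # every event on its own preserves the frame
    return "compensated"         # individual shifts cancel each other out


# Skipping a 159-nucleotide exon removes exactly 53 codons.
print(frame_effect([-159]))
# Two events that each shift the frame but together restore it.
print(frame_effect([+4, -1]))
```

In this sketch, both "in-frame" and "compensated" cases affect an integer number of codons overall, which mirrors the way compensated events are counted among the frame-preserving cases in the text.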
Of course, to distinguish with certainty between true alternative splicing and artifactual sequences, one has to perform detailed case by case analysis including experimental work; for example, if some variant persists in a particular tissue, it is likely to be functional. However, all of the above evidence, even if circumstantial, does suggest that we are observing true splice variants in most cases.
Examples of Individual Cases
Sixteen genes from our sample (<5%) had alternative splicing
variants found by preliminary analysis described explicitly, or at
least mentioned in GenBank annotations (Gelfand et al. 1996; Sze and
Pevzner 1997). In four cases no alternatives were constructed, in four
cases the predicted set of alternative structures coincided with the
GenBank annotation, and in eight cases additional splicing variants
were found. The latter group is described in Table 4.
In particular, we have observed three alternative acceptor sites of exon 3 of somatotropin and somatotropin variant genes. The sequences of these two genes are very close. Two variants of this site were annotated for each gene and we have observed only one of them (Fig. 1). The last exon of both these genes has an alternative intron in the 3'-untranslated region. Pulmonary surfactant protein C gene has an alternative donor site of the last exon. In addition, its exon 2 can be spliced out (its length, 159 nucleotides, is a multiple of 3), and there is an alternative intron with alternative donor sites in the 3'-untranslated region (Fig. 2). In the fragile X mental retardation syndrome gene, in addition to known variants generated by alternative acceptor sites of exons 15 and 17, exon 12 can be spliced out. In the sex hormone-binding globulin known variants are generated by alternative first exons; newly discovered alternative splicing is the result of splicing out of exon 7. Other new variants of genes with known alternative splicing result from alternative splicing of untranslated regions (Table 4).
DISCUSSION
Relatively few genes have been investigated for alternative splice forms, and it has been difficult to estimate the extent and trends of alternative splicing in human genes. We have presented a quantitative study of the prevalence of alternative splicing across many gene families. The results on any one gene may be incomplete and will require verification, yet the overall trends are significant. The results suggest that at least one-third of human genes are alternatively spliced. In particular, we have observed frequent alternative splicing in untranslated regions, specifically in the 5' UTR. The alternative splicing at the 5' end coupled to different starting points of transcription is probably a mechanism that allows the cell to use several differently regulated promoters for the same gene. The majority of alternative splicing events within the coding regions produces additional protein domains rather than alternating domains.
The problem of mapping ESTs to genomic sequences is addressed by
several different programs, in particular EST_GENOME (Mott 1997) and
SIM4 (Florea et al. 1998). The main difference between our approach and
straightforward application of these and other tools is in the
postprocessing step used to filter out unreliable EST hits. Moreover,
the use of genomic data has allowed us to merge EST contigs in the
situations where the EST overlaps alone provide insufficient evidence
for contig construction. Indeed, 40% of superassemblies were produced
by more than one contig.
Fraction of Genes with Alternative Splicing Is Probably Underestimated
A study such as this has many possible sources of error. However, because we took a very conservative approach, it is unlikely that genes for which we found alternative superstructures actually have no alternative splicing (although we may have missed some cases of genuine alternative splicing). To the best of our knowledge, we used the most conservative collection of EST contigs and found no case of an EST contig with distant genomic matches implying incorrect assembly. Alignments between EST contigs and genomic sequence were examined individually if there was any sign that the automated alignment was incorrect. When multiple EST contigs were merged, we guarded against merging of contigs from different genes by anchoring the assembly to genomic sequence. To prevent, insofar as possible, the inclusion of sequence from genomic clones or incompletely/incorrectly spliced mRNA, we only merged exons into gene structures when the overlap included splice junctions spanning an intron. The fact that only four genes showed structures with retained introns, and that alternative structures often seemed to be constrained by the reading frame, suggests that our safeguard measures were successful.
Interference among members of multigene families should not produce additional splicing variants. Indeed, since we used strict thresholds on relative alignment score in order to accept a prediction, and in addition checked local drops of similarity, interference would require extremely strong conservation of intron sequences. This can happen only for very close and recently duplicated genes (e.g., somatotropin and somatotropin variant genes, shown in Fig. 1). It is very likely that splicing alternatives in such cases are the same. The interference of nearly identical genes may have led to an overestimation of the number of EST contigs that can be merged using genomic sequences. However, such cases are rare and the overestimation is most likely small.
Our main conclusion is that alternative splicing is likely to occur in at least one-third of all genes; however, the actual fraction could be significantly higher. This is evidenced by the fact that in 4 of 16 cases with known alternative splicing, only 1 variant was found in our analysis. The underestimation is unavoidable in that many variants can have very limited tissue or stage specificity. However, in taking a number of conservative steps, we may have further reduced the estimation.
Possible Overestimation of the Number of Gene Structures per Gene
Alternative splicing events in different parts of a gene may not be independent. In our study we combined all events independently, even when no single EST contig supported the full structure. Thus, our estimation of the number of alternative splice forms per gene may be high. This does not, however, affect our main conclusions regarding the extent and classification of the alternative splicing events themselves.
This problem cannot be resolved by computer analysis. Indeed, even the construction of EST contigs from relatively short ESTs can produce chimeric contigs. Only sequencing of full-length mRNAs or directed RT-PCR-based analysis using primers to alternating regions can resolve these cases. However, this does not influence our main conclusion about the frequency of alternative splicing and its prevalence in 5' UTRs. The latter may even be underestimated because we did not count as alternatives the hanging ends of EST targets that cannot be matched to the sequenced portion of genomic DNA.
CONCLUSIONS
Case by case analysis of many individual genes, including experimental verification, will refine our understanding of human alternative splicing significantly. We hope to begin to test our main conclusions on a set of unannotated cosmid-sized sequences. Nevertheless, we believe that the joint accumulation of EST and genomic data has provided a sufficient basis to gain some important new insights into the extent and style of human alternative splicing. On the computational side, further research will be aimed at improvement of methods for distinguishing between true alternative and aberrant splicing as well as algorithms for support of experiments on identification of alternative splicing variants.
METHODS
Human genomic DNA fragments containing complete multiexon genes
were compiled by merging samples (Gelfand et al. 1996; Kulp et al.
1996). Genes were considered to be duplicates if their described
exon-intron structures were identical (minor differences in intron
lengths were allowed) and the longest representative from each group of
duplicates was selected. The final sample consisted of 392 genes.
Repeats were filtered from the genomic sequences by RepeatMasker (Smit
1999). EST contigs corresponding to a gene were selected from the TIGR
Human Gene Index (Adams et al. 1995) using BLASTN (Altschul et al.
1990). The E-value threshold was set to 10^-50.
For our upper limit, 10 of the highest scoring contigs (targets) per
gene were retained. This limit, originally set arbitrarily to reduce
the volume of output, turned out to apply rarely and did not affect the
conclusions. All types of contigs were used, including singletons and
contigs containing full-length cDNA.
Genes having at least one common target were grouped into clusters (i.e., two genes were linked if they had at least one common target and clusters were defined as maximal connected components in the obtained graph). The target sets for each cluster were merged and ascribed to each member of the cluster. Finally, the sequences complementary to the targets were added to the target sets.
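The clustering step above amounts to finding connected components in a graph whose nodes are genes and whose edges join genes that share a target. The following is a minimal Python sketch, assuming the BLASTN hits are available as a dictionary from gene name to a set of target (EST contig) identifiers. The function name `cluster_genes`, the input format, and the gene and contig names in the example are hypothetical; this illustrates the idea and is not the code used in the study. The final step of adding complementary sequences is not shown.

```python
from collections import defaultdict


def cluster_genes(hits):
    """Group genes into clusters (connected components) by shared targets.

    hits: dict mapping gene name -> set of target ids (hypothetical format).
    Returns a sorted list of (sorted gene list, merged target set) pairs.
    """
    parent = {gene: gene for gene in hits}

    def find(gene):
        # Union-find root lookup with path halving.
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]
            gene = parent[gene]
        return gene

    # Link each gene to the first gene seen with the same target.
    first_gene_for_target = {}
    for gene, targets in hits.items():
        for target in targets:
            if target in first_gene_for_target:
                parent[find(gene)] = find(first_gene_for_target[target])
            else:
                first_gene_for_target[target] = gene

    members = defaultdict(list)
    for gene in hits:
        members[find(gene)].append(gene)

    clusters = []
    for genes in members.values():
        # The merged target set is ascribed to every member of the cluster.
        merged_targets = set().union(*(hits[g] for g in genes))
        clusters.append((sorted(genes), merged_targets))
    return sorted(clusters, key=lambda cluster: cluster[0])


example_hits = {
    "geneA": {"contig1", "contig2"},
    "geneB": {"contig2", "contig3"},
    "geneC": {"contig9"},
}
for genes, targets in cluster_genes(example_hits):
    print(genes, sorted(targets))
```

Here geneA and geneB fall into one cluster because they share contig2, and both receive the merged target set; geneC remains a cluster of its own.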
Exon-intron structures were predicted using Procrustes-EST. This
program predicts candidate splicing sites with a very weak threshold
and then finds a chain of exons with the highest local similarity to
the target using the spliced alignment algorithm (Gelfand et al. 1996).
The following parameters were used: match weight = 1, mismatch
penalty = 1, gap initiation penalty = 4, and gap extension
penalty = 2. Introns are not considered as gaps by Procrustes-EST, so
that gaps in the alignments produced are actually very rare and the
alignments are robust with regard to a particular choice of gap
penalties within a reasonable range (data not shown). Relative
similarity between a predicted gene and a target was defined as the
ratio of the spliced alignment score and the score of the trivial
alignment of the target with itself (Mironov et al. 1998).
The threshold for accepting the prediction was set to 80% relative
similarity to avoid interference of members of multigene families.
Cases with local drops of similarity between predicted genes and
targets (defined as 10 out of 25 mismatching nucleotides) were
analyzed manually and 61 prediction errors caused by loss of sites or
spurious short exons at prediction termini were corrected. Ends of EST
contigs that did not match to the genomic sequence were ignored. Such
ends could correspond to unsequenced distal ends of the gene or be the
consequence of deteriorated sequence quality at 5' ends of EST
reads. As we could not distinguish between these possibilities, we did
not count such cases as alternative splicing.
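The two acceptance checks described above, relative similarity and local drops of similarity, can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the Procrustes-EST implementation: the function names and the representation of an alignment as a list of per-column match flags are hypothetical, and the local-drop criterion is read here as "at least 10 mismatches in some window of 25 aligned positions", which is one interpretation of the definition given in the text.

```python
MATCH_WEIGHT = 1          # match weight stated in the parameters above
ACCEPT_THRESHOLD = 0.80   # 80% relative similarity


def relative_similarity(alignment_score, target_length):
    """Spliced alignment score divided by the target's self-alignment score."""
    # In the trivial self-alignment every base matches, so the score is
    # the match weight times the target length.
    self_score = MATCH_WEIGHT * target_length
    return alignment_score / self_score


def local_drops(match_flags, window=25, min_mismatches=10):
    """Return start positions of windows holding at least `min_mismatches`.

    match_flags: list of booleans, one per aligned column (True = match).
    """
    starts = []
    mismatches = sum(not flag for flag in match_flags[:window])
    for start in range(len(match_flags) - window + 1):
        if start > 0:
            # Slide the window by one column.
            mismatches -= not match_flags[start - 1]
            mismatches += not match_flags[start + window - 1]
        if mismatches >= min_mismatches:
            starts.append(start)
    return starts


score = relative_similarity(alignment_score=450, target_length=500)
print(score >= ACCEPT_THRESHOLD)

flags = [True] * 30 + [False] * 10 + [True] * 30
print(local_drops(flags))
```

A prediction that passes the global threshold can still contain a flagged window; in the study such cases were inspected manually rather than rejected automatically.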
At the postprocessing stage predicted exon-intron structures corresponding to one gene were merged into superstructures if they intersected without local contradictions. Superstructures were constructed as follows. All triples (intron-exon-intron) from predicted structures were considered (this step did not depend on annotated coding sequence). Two triples were merged if the right intron of the first triple coincided with the left intron of the second triple. Thus, even short overlaps between exons were accepted if they were supported by reliable exon-intron junctions. On the other hand, even long simple matches within exons were not sufficient for construction of superstructures if they did not span an intron, as they are often caused by unspliced ESTs (see discussion of orphans, below).
This procedure was performed until no triples could be added to the constructed superstructure. All possible superstructures were constructed. Because alternative splicing in different parts of pre-mRNA may not be independent, creation of chimeric superstructures not corresponding to any mRNA is possible. However, comparison with EST contigs formed from shorter ESTs is not a good method for analysis of long-range correlations between splicing events, and in the absence of full-length mRNA sequences further conclusions cannot be reached.
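The merging of triples described above can be pictured as path enumeration in a directed graph whose nodes are introns and whose edges are triples. The Python sketch below makes several assumptions that are not in the original text: an intron is encoded as a (donor, acceptor) coordinate pair, each predicted structure is an ordered list of such introns, a triple is encoded by its two flanking introns, and the function name `build_superstructures` is hypothetical. It is an illustration of the principle, not the program used in the study.

```python
def build_superstructures(structures):
    """Enumerate maximal intron chains obtained by merging triples.

    structures: list of predicted structures, each an ordered list of
    introns given as (donor, acceptor) coordinate pairs (assumed encoding).
    """
    next_introns = {}        # intron -> introns that may follow it in a triple
    has_predecessor = set()  # introns that are the right intron of some triple
    for introns in structures:
        for left, right in zip(introns, introns[1:]):
            next_introns.setdefault(left, set()).add(right)
            next_introns.setdefault(right, set())
            has_predecessor.add(right)

    chains = []

    def extend(chain):
        successors = next_introns[chain[-1]]
        if not successors:
            chains.append(tuple(chain))  # no triple can be added: maximal
            return
        for following in sorted(successors):
            extend(chain + [following])

    # Start from introns that are not the right intron of any triple.
    for start in sorted(i for i in next_introns if i not in has_predecessor):
        extend([start])
    return chains


contig_a = [(100, 200), (300, 400), (500, 600)]
contig_b = [(300, 400), (500, 600), (700, 800)]
for chain in build_superstructures([contig_a, contig_b]):
    print(chain)
```

In this example the two structures share the exon flanked by introns (300, 400) and (500, 600), so they merge into a single four-intron chain. That chain is not supported in full by either input alone, which is the sense in which a superstructure can be chimeric.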
Contigs or superstructures that intersected neither the annotated coding sequence nor any other superstructure in a common complete exon or intron were termed orphans and were not counted. There are two types of such superstructures. First, they could lie completely outside all other superstructures and coding sequence. These superstructures are likely to correspond to parts of unannotated genes in the analyzed fragments. Second, such superstructures, usually consisting of just one exon, could lie completely within an intron of a known gene, partially overlap with a known exon, or span without interactions several consecutive exons and introns. These cases probably correspond to mis-spliced pre-mRNAs (products of aberrant or incomplete splicing) or to antisense transcripts.
ACKNOWLEDGMENTS
This work was supported partially by the Russian State Scientific Program Human Genome, the Russian Fund of Basic Research, and the U.S. Department of Energy. We are grateful to R. Guigo, S. Hannenhalli, P. Pevzner, M. Roytberg, and S. Sze for useful discussions.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
FOOTNOTES
4 Corresponding author.
E-MAIL mgelfand{at}anchorgen.com; FAX (310) 434-0120.
Received March 22, 1999; accepted in revised form October 1, 1999.