|
|
|
Published online before print
August 16, 2001, 10.1101/gr.190501
Vol. 11, Issue 9, 1520-1526, September 2001
LETTER
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Alternate polyadenylation affects a large fraction of higher eucaryote mRNAs, producing mature transcripts with 3' ends of variable length. This variation is poorly represented in the current transcript catalogs derived from whole genome sequences, mostly because such posttranscriptional events are not detectable directly at the DNA level. Alternate polydenylation of an mRNA is better understood by comparision to EST databases. Comparing ESTs to mRNAs, however, is a difficult task subjected to the pitfalls of internal priming, presence of intron sequences, repeated elements, chimerical ESTs or matches with EST from paralogous genes. We present here a computer program that addresses these problems and displays ESTs matches to a query mRNA sequence to predict alternate polyadenylation and to suggest library-specific forms. The output highlights effective polyadenylation signals, possible sources of artifacts such as A-rich stretches in the mRNA sequences, and allows for a direct visualization of EST libraries using color codes. Statistical biases in the distribution of alternative mRNA forms among EST libraries were systematically sought. About 1450 human and 200 mouse mRNAs displayed such biases, suggesting in each case a tissue- or disease-specific regulation of polyadenylation.
| |
INTRODUCTION |
|---|
|
|
|---|
Most eukaryotic pre-mRNAs contain long 3'
untranslated regions (UTRs) spanning hundreds of nucleotides, and
undergoing cleavage and polyadenylation at one or several
polyadenylation sites (PAS). Poly(A) sites are defined by a hexameric
polyadenylation signal (AAUAAA or a one-base variant thereof), located
~15 bases upstream of the cleavage site and, sometimes, a GU
(Guanosyl Uridy-R)-rich element located 20-40 bases downstream of the
site (for reviews, see Proudfoot 1991
; Colgan and Manley 1997
). A
significant fraction of UTRs has two or more functional, producing
mature mRNAs with 3' regions of variable lengths. As UTRs may contain
regulatory elements affecting mRNA stability or translation efficiency,
the choice of alternate polyadenylation sites may strongly affect the
final expression of the gene. Indeed, differential polyadenylation has
been shown repeatedly to occur in a tissue- or disease-specific manner
(Edwalds-Gilbert et al. 1997
).
Although genome sequencing projects are now polishing complete gene
catalogs for several animal species, including human, transcript
catalogs covering every polyadenylation or splice variant are still far
from completion. Alternate polyadenylation cannot be predicted from the
genomic sequence alone, since polyadenylation signals, or GU-rich
regions do not carry enough information to constitute useful
signatures. The most reliable data on mRNA 3' ends is experimental, and
available in the form of expressed sequence tags (ESTs). The dbEST
database (Boguski et al. 1993
), currently contains 7.3 million partial
cDNAs. These data are highly redundant, the 3 million human ESTs
available representing ~100 times the estimated number of human genes
(Lander et al. 2001
; Venter et al. 2001
). A large fraction of ESTs are
sequenced from the 3' end of mRNAs, and this redundant coverage of the
3' region often comprises several polyadenylation variants. Computer
analyses of EST databases have improved our understanding of
polyadenylation signals and alternate polyadenylation (Gautheret et al.
1998
; Graber et al. 1999
). Studies based on ESTs evaluated that over 29% of human mRNAs had multiple polyadenylation sites (Beaudoing et
al. 2000
), or >40% if one considers alternative cleavage sites occurring downstream of a single polyadonylation signal. (Pauws et al. 2001
).
EST-based annotation requires aligning the mRNA or gene under study to
EST sequences. Standard sequence alignment tools such as
BLAST (Altschul et al. 1997
) can be used for this purpose,
provided that certain pitfalls of EST comparisons are dealt with
properly. This includes the detection of internally primed ESTs (which
can be mistaken for true mRNA 3' ends), chimeras, and ESTs from
paralogous genes. We developed a program (ESTparser) that
performs BLAST searches against EST databases and filters
the output to produce a general picture of alternatively polyadenylated
forms and the in which tissues they occur. We applied this program to a
database of human 3' UTRs (Pesole et al. 1999
) and systematically
sought instances of tissue-specific 3' variants. This procedure
identified over 3500 events of statistically significant biases. Each
bias does not necessarily imply a true differential polyadenylation
event because library-specific artifacts may affect the accuracy of
ESTs counts. However, outputs of ESTparser show a large
number of intriguing cases that combine evidences for alternate poly(A
sites and suggestions of tissue- or) disease-specific forms, thus
prompting further experimental validations.
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
We analyzed ~13,000 human and 6000 mouse UTRs using the October
2000 release of dbEST. The number of UTRs displaying two or more
putative polyadenylation sites was 5127 for human and 1296 for mouse
sequences. From the library information in dbEST (4960 human and 468 mouse libraries), we classified ESTs into 117 tissue-types, subdivided
into 14 categories or organ systems (Table
1). Among UTRs with multiple poly(A) sites,
we then sought biases in tissue-distribution. Fisher's Exact tests
(Agresti 1992
) were performed systematically for each pair of poly(A)
sites in the same UTR as described in Methods. We observed 3619 biases
in polyadenylation site usage in 1438 different human UTRs (Table
2) and 310 biases in 189 different mouse
UTRs (Table 3). A single UTR may display
several biases as each poly(A) site and library is tested
independently. The number of observed biases for each tissue type is
roughly proportional to the number of ESTs and/or libraries available for this tissue, which could be expected because biases are sought on a
library-by-library basis.
|
|
|
We did not observe a strong positional preference for the differentially polyadenylated forms, except that the shortest UTR form was preferred in two-thirds of the biased libraries. We inspected the UTR sequences between alternate polyadenylation sites for the presence of ARE destabilization elements (AU-rich elements of the type AUUUA or UUAUUUA[U/A][U/A]). The density of ARE in these segments did not differ significantly from that in other UTR regions (data not shown).
A representative output is shown in Figure
1. In this example, the 3' UTR sequence of
a zinc-finger DNA-binding protein mRNA (Muraosa et al. 1996
) was
analyzed. The red line on top represents the UTR sequence, numbered
from zero at Stop codon. Fifty ESTs (color lines) were found to match
this UTR within the required length and identity criteria. Color coding
is described in the figure legend. ESTs shown with dashed lines are
from cancer libraries. There is evidence for three polyadenylation
signals, at positions 1111, 1292, and 1532. The signals at 1111 and
1532 are AATAAA (blue box) and the signal at 1292 is ATTAAA (orange
box). The thickened black underlines indicate regions of query masking, which means the program would not consider hits contained entirely in
this region as significant because of the presence of a low complexity
region, vector sequence, or human repeat such as Alu. The open
circle near position 1100 indicates a poly(A) stretch in the query
sequence, that is, a possible source of internal priming. Four ESTs
(AL119620, H01828, T94752, and WW00668) appear to have been produced by
internal priming at this site. Dots at the extremities of ESTs indicate
that a fragment larger than 20 nt or 15 nt, respectively at the 3' or
5' end of the EST, does not match the query sequence. Dots appearing
past the 5' end of the query indicate ESTs extending into the coding
region (e.g., the first three ESTs). Dots present within the limits of the query sequence indicate discrepancies between the EST and query
(e.g., EST T94751). The most common explanation for these is the poor
sequence quality of EST extremities, but other phenomena, such as
chimeras, presence of intronic sequences, or alternative exons may also
produce such mismatches. Therefore, these dubious ESTs should not be
considered in alternative form counts.
|
ESTs from libraries with a 3' end bias are shown boxed. Here, three ESTs from Soares fetal heart library NbHH19W have their 3' end at signal 1532 (red, boxed ESTs), whereas no EST from this library ends at signal 1111 or 1292. When combining all other tissues, the number of ESTs with a 3' end at 1111 and 1532 is 17 and 3, respectively. Fisher's exact value for the quadruplet (0,3,17,4) is 0.017. Thus there is a statistically significant bias for ESTs from Soares fetal heart library NbHH19W to use the polyadenylation signal at position 1532 rather than the signal at 1111. Comparing sites 1532 and 1292 would not give a significant bias.
Among the most interesting cases of differential polyadenylation are
those linked to human pathologies. Distinct causes, such as alterations
of the 3' regions of genes or changes in the expression of UTR-binding
proteins, induce variations in polyadenylation site selection and
processing or stability of transcripts that have been linked to a
number of diseases (for review, see Conne et al. 2000
). These different
phenomena may all affect the distribution of alternate mRNA forms and
should be detectable when transcriptional profiles from affected and
unaffected tissues are compared. ESTs from the Cancer Genome Anatomy
Project (CGAP; Strausberg et al. 1997
) and other EST sequencing efforts
(e.g., Simpson 1999
; Sese et al. 2001
) now offer this opportunity. CGAP
has produced, to date, >2.4 million EST sequences from cancer and
normal cells, constituting an invaluable source of expression data in
pathological tissues. Our analysis identified 1030 biases involving
human cancer libraries, distributed in 504 UTRs.
An example of potential cancer-specific polyadenylation is shown in
Figure 2 for mRNA KIAA0764, coding for an
unknown protein (Nagase et al. 1998
). The UTR is 2673 bp long and shows
multiple polyadenylation signals. The strongest sites are observed
after signals AATATA 404, AATAAA 1199, and ATTAAA 2644. Minor sites are
also observed around positions 102 (no signal), 215 (GATAAA), 465 (no
signal), 1100 (AATATA), 2290 (GATAAA), and 2450 (AATACA). Interestingly, most of the polyadenylation signals in this UTR differ
from the canonical AATAAA and ATTAAA sequences and would have been
overlooked in the absence of EST information. The most significant bias
involves ESTs from lung carcinoid tissue library NCI_CGAP_Lu24
(Strausberg et al. 1997
), represented with dashed light-blue lines.
Eleven ESTs from this library and eight ESTs from other libraries use
the poly(A) signal at 2644. In comparison, the poly(A) signal at
position 404 has no EST from library NCI_CGAP_Lu24 (or from another
lung cancer library) and has 47 ESTs from other libraries. This
distribution obtains a Fisher's Exact P value <10
6. Approximately one-half of the biases in our analysis
involve cancer libraries similar to the ones in this case.
|
Conclusion
Even though reasonably accurate gene models can now be obtained from complete genome sequences, reconstructing the 3' UTR and its alternative forms remains a challenging task. To date, this task is best performed using the experimental expression data available in the form of ESTs. The present software should help in identifying actual polyadenylation sites and in providing insight into possible tissue-specific 3' ends. Running the program in batch mode on complete mRNA datasets from the newly sequenced eucaryotic genomes, we also expect to acquire a better understanding of alternate polyadenylation in general and its functional implications.
| |
METHODS |
|---|
|
|
|---|
Polyadenylation Site Identification
Human 3' UTR sequences were obtained from UTRdb-nr release 13 (Pesole et al. 2000
), a nonredundant database of eukaryotic UTRs
generated by parsing the Feature table in the EMBL database (ftp://area.ba.cnr.it/pub/embnet/database/utr). We compared the 13,681 human and 6016 mouse UTRs to 2,452,892 human and 1,657,567 mouse ESTs
from dbEST (October 2000 release) based on the sequence comparison
procedure defined previously (Gautheret et al. 1998
; Beaudoing et al.
2000
) and summarized hereafter. UTR sequences were masked for common
repeats and low complexity sequences using Repbase, Nov. 2000 release
(Jurka 2000
), and for vector sequences. ESTs were required to match the
UTR sequence with at least 95% identity, encompassing the entire
length of the EST sequence (at least 40 nucleotides), except for
allowed 25 nt and 5 nt mismatches at the EST 5' and 3' sides,
respectively, as revealed by the boundaries of the BLAST
hit. This was intended to dismiss probable chimerical ESTs, ESTs
produced from alternatively spliced or unspliced RNAs and ESTs
exhibiting lane tracking errors or high error rates in the terminal
region. Poly(A) and poly(T) trailers were removed from EST sequences
prior to BLAST runs to avoid additional dangling regions.
Internal priming (cDNA primers hybridized to internal poly(A) stretches
instead of the actual poly(A) tail) was assessed by seeking adenine
stretches in the UTR region flanking the 3' extremity of the EST.
Polyadenylation sites flanking eight or more consecutive adenines, or
nine adenines in a 10-nucleotide window within +/
15 bases of a
poly(A) signal were considered artifactual, except when the poly(A)
stretch formed the tail of the query sequence. Further, one of the two
following conditions was required to validate a polyadenylation
site:(1) two or more ESTs ending within 30 nt downstream of an AAUAAA
polyadenylation signal or any single-base variant described by
Beaudoing et al. (2000)
. In this case, the 3' base of the signal was
selected as the transcript end; (2) in the absence of signal, two or
more ESTs ending at the exact same 3' position. In this case, the
transcript end was taken as the EST extremity (such signal-less
polyadenylation sites are frequent and should be allowed (Beaudoing et
al. 2000
).
Finally, when two or more predicted poly(A) sites occurred <30 nt from
each other, only the one with the largest number of associated ESTs was
retained. Since alternative poly(A) sites have been observed <30 nt
apart (see Pauws et al. 2001
), we left this minimal distance as a
user-defined parameter on the Web interface. However, nearby poly(A)
sites are less likely to be functionally important and their analysis
will be hampered by error-prone 3' ends in nonpolyadenylated ESTs.
Tissue Biases in 3' End Usage
Organ and tissue data in dbEST reports are present under the "Library Description" section. These data, however, are inconsistently annotated in fields "Name," "Organ," "Development Stage," "Cell line," or "Tissue." We extracted this information using a Perl script identifying a number of representative keywords, and categorized it into 117 tissues and 14 tissue categories or organ systems, as described in Table 1. For each EST, the library name, tissue, and organ system were recorded. After putative poly(A) sites were identified in a given UTR, biased site usage with respect to EST libraries were sought as follows: Let Si, Sj a pair of polyadenylation sites and Ni, Nj their respective number of ESTs (that is, the ESTs that permitted to identify the sites). Let any EST library L, represented by ni ESTs at site Si and nj ESTs at site Sj. A preference for polyadenylation site Si in library L is computed using Fisher's Exact test (2-tail) on the quadruplet {ni, Ni-ni, Nj, Nj-nj} This actually compares the occurrence of library L to that of all other libraries combined. This turned out to be more practicable than comparing all libraries pairwise, which increased considerably the number of tests and produced too many uninteresting hits. Also, we treated poly(A) sites independently instead of comparing one site against the others. This last option would probably have brought to light a few more interesting cases, but it would have masked others: for instance when one library is overrepresented at more than one site. Fisher's exact test calculations were performed using the C code provided by T. Kadosawa (http://infofarm.cc.affrc.go.jp/~kadosawa/fishertest.htm). Any value <0.05 was considered significant and was highlighted in the graphical user interface. Detailed output for all significant biases was observed in human and mouse 3' UTR are available at http://tagc.univ-mrs.fr/bioinfo/ESTparser.Graphical User Interface
A graphical user interface (GUI) has been specifically designed to highlight polyadenylation signals/sites and tissue biases. Any cDNA or mRNA sequence (intronless) can be used as input. An example output is shown in Figure 1. Graphical and color symbols are explained in Figure 1 legend. A Web server (http://tagc.univ-mrs.fr/bioinfo/ESTparser) allows a user to perform the whole analysis on any user-defined mRNA sequence. The sequence analysis program and GUI were both developed in Perl on Linux workstations.| |
ACKNOWLEDGMENTS |
|---|
E.B. was supported by a Ph.D. studentship from Association pour la Recherche sur le Cancer. The authors thank Rémi Houlgatte for critical reading of the manuscript
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
1
Corresponding author.
E-MAIL
gautheret{at}esil.univ-mrs.fr; FAX 33-491-82-8621.
Article
published on-line before print: Genome Res., 10.1101/gr. 190501.
Article and publication are at www.genome.org/cgi/doi/10.1101/gr.190501.
| |
REFERENCES |
|---|
|
|
|---|
database for expressed sequence tags.
Nat. Genet.
4:
332-333.
Received March 30, 2001; accepted in revised form June 12, 2001.
This article has been cited by other articles:
![]() |
R. Sandberg, J. R. Neilson, A. Sarma, P. A. Sharp, and C. B. Burge Proliferating Cells Express mRNAs with Shortened 3' Untranslated Regions and Fewer MicroRNA Target Sites Science, June 20, 2008; 320(5883): 1643 - 1647. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Guan, R. M. Caratozzolo, R. Goraczniak, E. S. Ho, and S. I. Gunderson A bipartite U1 site represses U1A expression by synergizing with PIE to inhibit nuclear polyadenylation RNA, December 1, 2007; 13(12): 2129 - 2140. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. P. Thomas, J. I. Andrews, and K. Z. Liu Intronic polyadenylation signal sequences and alternate splicing generate human soluble Flt1 variants and regulate the abundance of soluble Flt1 in the placenta FASEB J, December 1, 2007; 21(14): 3885 - 3895. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hellquist, M. Zucchelli, K. Kivinen, U. Saarialho-Kere, S. Koskenmies, E. Widen, H. Julkunen, A. Wong, M.-L. Karjalainen-Lindsberg, T. Skoog, et al. The human GIMAP5 gene has a common polyadenylation polymorphism increasing risk to systemic lupus erythematosus J. Med. Genet., May 1, 2007; 44(5): 314 - 321. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Moucadel, F. Lopez, T. Ara, P. Benech, and D. Gautheret Beyond the 3' end: experimental validation of extended transcript isoforms Nucleic Acids Res., March 19, 2007; 35(6): 1947 - 1957. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Jaworski, M. Beem-Miller, G. Lluri, and R. Barrantes-Reynolds Potential regulatory relationship between the nested gene DDC8 and its host gene tissue inhibitor of metalloproteinase-2 Physiol Genomics, January 17, 2007; 28(2): 168 - 178. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Liu, J. M. Brockman, B. Dass, L. N. Hutchins, P. Singh, J. R. McCarrey, C. C. MacDonald, and J. H. Graber Systematic variation in mRNA 3'-processing signals during mouse spermatogenesis Nucleic Acids Res., January 12, 2007; 35(1): 234 - 246. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Nagaraj, R. B. Gasser, and S. Ranganathan A hitchhiker's guide to expressed sequence tag (EST) analysis Brief Bioinform, January 1, 2007; 8(1): 6 - 21. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Lopez, S. Granjeaud, T. Ara, B. Ghattas, and D. Gautheret The disparate nature of "intergenic" polyadenylation sites RNA, October 1, 2006; 12(10): 1794 - 1801. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Hernandez-Sanchez, O. Bartulos, A. I. Valenciano, A. Mansilla, and F. de Pablo The regulated expression of chimeric tyrosine hydroxylase-insulin transcripts during early development Nucleic Acids Res., July 13, 2006; 34(12): 3455 - 3464. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Parra, A. Reymond, N. Dabbouseh, E. T. Dermitzakis, R. Castelo, T. M. Thomson, S. E. Antonarakis, and R. Guigo Tandem chimerism as a means to increase protein complexity in the human genome Genome Res., January 1, 2006; 16(1): 37 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. H. Pratt, C. Liang, M. Shah, F. Sun, H. Wang, St. P. Reid, A. R. Gingle, A. H. Paterson, R. Wing, R. Dean, et al. Sorghum Expressed Sequence Tags Identify Signature Genes for Drought, Pathogenesis, and Skotomorphogenesis from a Milestone Set of 16,801 Unique Transcripts Plant Physiology, October 1, 2005; 139(2): 869 - 884. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. R. Henderson, F. Liu, S. Drea, G. G. Simpson, and C. Dean An allelic series reveals essential roles for FY in plant development in addition to flowering-time control Development, August 15, 2005; 132(16): 3597 - 3607. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. de la Grange, M. Dutertre, N. Martin, and D. Auboeuf FAST DB: a website resource for the study of the expression regulation of human gene products Nucleic Acids Res., July 28, 2005; 33(13): 4276 - 4284. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. Sharov, D. B. Dudekula, and M. S.H. Ko Genome-wide assembly and analysis of alternative transcripts in mouse Genome Res., May 1, 2005; 15(5): 748 - 754. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nakao, R. A. Barrero, Y. Mukai, C. Motono, M. Suwa, and K. Nakai Large-scale analysis of human alternative protein isoforms: pattern classification and correlation with subcellular localization signals Nucleic Acids Res., April 28, 2005; 33(8): 2355 - 2363. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. B. Wahl, U. Heinzmann, and K. Imai LongSAGE analysis significantly improves genome annotation: identifications of novel genes and alternative transcripts in the mouse Bioinformatics, April 15, 2005; 21(8): 1393 - 1400. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kim, S. Shin, and S. Lee ECgene: Genome-based EST clustering and gene modeling for alternative splicing Genome Res., April 1, 2005; 15(4): 566 - 576. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yan and T. G. Marr Computational analysis of 3'-ends of ESTs shows four classes of alternative polyadenylation in human, mouse, and rat Genome Res., March 1, 2005; 15(3): 369 - 375. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sachchithananthan, S. J. Stasinopoulos, J. Wilusz, and R. L. Medcalf The relationship between the prothrombin upstream sequence element and the G20210A polymorphism: the influence of a competitive environment for mRNA 3'-end formation Nucleic Acids Res., February 17, 2005; 33(3): 1010 - 1020. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Tian, J. Hu, H. Zhang, and C. S. Lutz A large-scale analysis of mRNA polyadenylation of human and mouse genes Nucleic Acids Res., January 12, 2005; 33(1): 201 - 212. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhang, J. Hu, M. Recce, and B. Tian PolyA_DB: a database for mammalian mRNA polyadenylation Nucleic Acids Res., January 1, 2005; 33(suppl_1): D116 - D120. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Premzl, J. E. Gready, L. S. Jermiin, T. Simonic, and J. A. Marshall Graves Evolution of Vertebrate Genes Related to Prion and Shadoo Proteins--Clues from Comparative Genomic Analysis Mol. Biol. Evol., December 1, 2004; 21(12): 2210 - 2231. [Abstract] [Full Text] [PDF] |
||||
![]() |
The Ludwig-FAPESP Transcript Finishing Initiative, M. C. Sogayar, and A. A. Camargo A Transcript Finishing Initiative for Closing Gaps in the Human Transcriptome Genome Res., July 1, 2004; 14(7): 1413 - 1423. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Louie, J. Ott, and J. Majewski Nucleotide Frequency Variation Across Human Genes Genome Res., December 1, 2003; 13(12): 2594 - 2601. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Cui and C. L. Denis In Vivo Evidence that Defects in the Transcriptional Elongation Factors RPB2, TFIIS, and SPT5 Enhance Upstream Poly(A) Site Utilization Mol. Cell. Biol., November 1, 2003; 23(21): 7887 - 7901. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Iseli, B. J. Stevenson, S. J. de Souza, H. B. Samaia, A. A. Camargo, K. H. Buetow, R. L. Strausberg, A. J.G. Simpson, P. Bucher, and C. V. Jongeneel Long-Range Heterogeneity at the 3' Ends of Human mRNAs Genome Res., July 1, 2002; 12(7): 1068 - 1074. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||