Vol. 10, Issue 7, 1001-1010, July 2000
LETTER
ABSTRACT
The formation of mature mRNAs in vertebrates involves the cleavage and polyadenylation of the pre-mRNA, 10-30 nt downstream of an AAUAAA or AUUAAA signal sequence. The extensive cDNA data now available shows that these hexamers are not strictly conserved. In order to identify variant polyadenylation signals on a large scale, we compared over 8700 human 3' untranslated sequences to 157,775 polyadenylated expressed sequence tags (ESTs), used as markers of actual mRNA 3' ends. About 5600 EST-supported putative mRNA 3' ends were collected and analyzed for significant hexameric sequences. Known polyadenylation signals were found in only 73% of the 3' fragments. Ten single-base variants of the AAUAAA sequence were identified with a highly significant occurrence rate, potentially representing 14.9% of the actual polyadenylation signals. Of the mRNAs, 22.3% displayed two or more polyadenylation sites. In these mRNAs, the poly(A) sites proximal to the coding sequence tend to use variant signals more often, while the 3'-most site tends to use a canonical signal. The average number of ESTs associated with each signal type suggests that variant signals (including the common AUUAAA) are processed less efficiently than the canonical signal and could therefore be selected for regulatory purposes. However, the position of the site in the untranslated region may also play a role in polyadenylation rate.
INTRODUCTION
The 3' untranslated regions (UTRs) of eukaryotic mRNAs contain regulatory elements affecting mRNA translation, stability, and transport. Mature 3' UTRs are formed by polyadenylation of the pre-mRNA, a coupled reaction involving endonucleolytic cleavage followed by poly(A) synthesis. A significant fraction of mRNAs display multiple polyadenylation sites (Gautheret et al. 1998). The choice of poly(A) sites may influence the stability, translation efficiency, or localization of an mRNA in a tissue- or disease-specific manner (Edwalds-Gilbert et al. 1997). In the mammalian system, effective polyadenylation requires two main sequence components: a highly conserved AAUAAA signal located 10-30 nucleotides 5' to the cleavage site and a more variable GU-rich element, 20-40 bases 3' of the site (see Proudfoot 1991; Colgan and Manley 1997 for reviews). Although the AAUAAA signal is often considered to be present in 90% of the mRNAs and replaced by an AUUAAA variant in the other 10% (Wahle and Keller 1996; Colgan and Manley 1997), alternate signals are certainly present in a significant fraction of the 3' ends (Claverie 1997; Gautheret et al. 1998; Tabaska and Zhang 1999; Graber et al. 1999).
The expressed sequence tag (EST) database, dbEST (Boguski et al. 1993), which contains highly redundant partial cDNAs, especially from the 3' UTRs, is a rich source of information on mRNA 3' ends. Analyzing clustered EST sequences, we previously identified multiple cases of alternate polyadenylation in mRNA (Gautheret et al. 1998). Based on a public EST collection now containing over 1.4 million human sequences, the present work focuses on the region immediately upstream of the cleavage sites, collecting statistics on the most frequent polyadenylation signals, their position in the UTR, and their frequency of use in UTRs with multiple cleavage sites. In order to compensate for the low accuracy of EST sequences, we selected ESTs with near perfect matches to UTR sequences from Genbank and used the Genbank sequence as the reference, which minimizes sequence errors. This study provided evidence for the existence of 10 variant polyadenylation signals that may be responsible for up to 14.9% of the mRNA 3' ends. We then analyzed the distribution of noncanonical signals in UTRs with alternate poly(A) sites and assessed the processing efficiency of polyadenylation signals as a function of their sequence and their position in the UTR. Significant biases were observed, with interesting consequences for the regulation of mRNA 3' end formation.
RESULTS
The comparison of 8775 human UTR sequences to the 157,775 ESTs with a poly(A) or poly(T) extremity was performed using the criteria described in Methods to reduce experimental artifacts, including internal priming, partial matches from chimeric ESTs, and confusion between ESTs from paralogous genes. This selected 4344 UTRs with at least one putative polyadenylation site. The number of polyadenylation sites per mRNA molecule is distributed as shown in Table 1. A total of 3377 sequences (77.7%) have one putative poly(A) site, and 967 sequences (22.3%) have two sites or more. This figure supersedes our previous minimum estimate of 18.9% alternatively polyadenylated mRNAs (Gautheret et al. 1998). The total number of putative poly(A) sites observed is 5647. The 50-nucleotide fragment preceding each of these sites was collected, producing a database of 5647 sequences.
Hexanucleotide frequencies in this 3' fragment database were analyzed as described in Methods. Results are shown in Tables 2 and 3. The AAUAAA and AUUAAA polyadenylation signals are by far the most frequently found hexamers, present in 58.2% and 14.9% of the 3' fragments, respectively. The remaining 26.8% of the 3' fragments do not contain a usual polyadenylation signal. Analyzing 5-mer, 7-mer, and 8-mer frequencies did not identify any recurrent word other than combinations of the hexameric motifs, such as AAAUAAA (data not shown).
Variant Signals
The right part of Table 2 shows the distribution of hexamer positions over the 3' fragment (the position of the sixth nucleotide of the hexamer is plotted). The AAUAAA and AUUAAA hexamers are clearly clustered 15 to 16 nt upstream of the putative poly(A) site, as expected from experimentally validated signals (Chen and Shyu 1995). In a preliminary analysis, several motifs with highly significant P-values were found scattered along the 3' segment. The absence of spatial preference with respect to the poly(A) site suggested that these motifs were not involved in any specific interaction with the polyadenylation machinery. Since our primary focus was on polyadenylation-related motifs, we first sought spatially "clustered" motifs. We did so based on the standard deviation (SD) around the mean motif position (see Methods). The list of significant motifs in Table 2 comprises only those motifs with P < 10^-5 and SD < 9 nt. Variant hexamers are also clustered 15 to 20 nt upstream of the site. The most significant motifs with SD > 9 nt are shown separately (Table 3). The first two motifs in this table most frequently occur 15 to 20 nt upstream of the site, albeit less obviously than the previous motifs.
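The clustering criterion can be illustrated with a minimal Python sketch. It assumes each fragment is the 50-nt segment ending immediately before the poly(A) site, so that the last base of the fragment lies 1 nt upstream; the function names, the distance convention, and the use of the population SD are illustrative assumptions, not details given in the text.

```python
from statistics import pstdev


def motif_end_positions(fragments, motif):
    """Distance (nt) from the last base of each motif hit to the poly(A) site.

    Assumption: the fragment ends right before the site, so its last base
    is at distance 1. Overlapping occurrences are all reported.
    """
    positions = []
    for frag in fragments:
        start = frag.find(motif)
        while start != -1:
            end = start + len(motif)          # index just past the motif
            positions.append(len(frag) - end + 1)
            start = frag.find(motif, start + 1)
    return positions


def is_clustered(positions, max_sd=9.0):
    """True when the positional spread is below the SD cutoff (9 nt in the text)."""
    return len(positions) > 1 and pstdev(positions) < max_sd
```

A motif whose hits concentrate near one distance passes the filter, while a motif spread evenly over the 50-nt segment does not.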
Even though the 50-nt UTR fragments used for hexamer searches are from Genbank rather than EST sequences, one may argue that unexpected hexamers could result from sequencing errors in the UTR sequences, especially when these hexamers have a single base difference from the common AAUAAA signal. This hypothesis can be rejected on the basis of the very good agreement between UTR and EST sequences. A control analysis that required a 99% similarity between UTR and EST sequences (instead of 95%) produced nearly the same proportion of noncanonical hexamers (data not shown). Further, alignments of UTRs with their corresponding ESTs were inspected visually for agreement at the level of noncanonical hexamers. AAUACA hexamers (70 UTR sequences; Table 2) and AAUAGA hexamers (43 UTR sequences; Table 2) were confirmed by at least one EST in 92% and 93% of the cases, respectively.
Hexamers AGUAAA, UAUAAA, CAUAAA, GAUAAA, AAUAUA, AAUACA, AAUAGA, and ACUAAA are significantly overrepresented near the polyadenylation site, and their spatial distribution (Table 2) closely follows that of known poly(A) signals (Chen and Shyu 1995). Both facts strongly suggest that these motifs are widespread polyadenylation signals in human mRNA. The penultimate motif (AAAAAG) is actually a statistical artifact caused by the high rate of the AAGAAA motifs within this region (Table 3, see below). Although the motifs shown in Table 3 are more scattered along the 3' segment, the AAGAAA and AAUGAA hexamers display minor but distinguishable peaks 15 to 20 nt upstream of the site, which is best explained by their role as a polyadenylation signal. Combined, and neglecting statistical noise and sequence errors, the 10 variant motifs could account for 14.9% of the putative mRNA 3' ends, which potentially represents a considerable number of mRNA forms in the whole transcriptome.
Positional Preferences
Messenger RNAs with two or more putative poly(A) sites account for 967 mRNAs (22.3%) and 2270 poly(A) sites (40.2%) in our study. This data set allows a large-scale analysis of alternatively polyadenylated mRNAs for possible biases in poly(A) signal sequence and position.
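These proportions follow directly from the counts reported in Results. The short Python check below uses only figures quoted in the text; the variable names are illustrative.

```python
total_utrs = 4344          # UTRs with at least one putative poly(A) site
single_site_utrs = 3377    # UTRs with exactly one site
total_sites = 5647         # all putative poly(A) sites

multi_site_utrs = total_utrs - single_site_utrs
# Each single-site UTR contributes exactly one site, so the rest
# belong to UTRs with two or more sites.
multi_site_sites = total_sites - single_site_utrs

pct_multi_utrs = 100 * multi_site_utrs / total_utrs
pct_multi_sites = 100 * multi_site_sites / total_sites

print(f"{multi_site_utrs} mRNAs ({pct_multi_utrs:.1f}%), "
      f"{multi_site_sites} sites ({pct_multi_sites:.1f}%)")
# prints: 967 mRNAs (22.3%), 2270 sites (40.2%)
```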
Figure 1 presents the average position of putative polyadenylation sites on 3' UTRs as a function of the number of observed alternative sites. The high standard deviations (error bars) indicate that locations of poly(A) sites are highly variable. Indeed, the observed distribution resembles that expected from a random selection of n points in the same sequence set. When four random points are picked in our sequence set (with the fourth point taken at the end of the sequence), the average positions and standard deviations of the first to fourth sites are 518 ± 472, 1064 ± 1023, 1635 ± 1231, and 2059 ± 1220, which is very similar to the result shown in Figure 1 for mRNAs with four poly(A) sites. Multiple sites are interspersed on average every 600 bp on the 3' UTR. This average number, however, could be affected by the presence of yet unidentified sites in UTRs. What appears to be the "first" site may actually be the second one, and so forth.
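The random-placement control can be sketched as follows. This is a minimal Python illustration, not the procedure actually used: the text does not say how the random points were drawn, so uniform sampling without replacement and the population SD are assumptions, and `random_site_positions` is a hypothetical name.

```python
import random
from statistics import mean, pstdev


def random_site_positions(utr_lengths, n_sites, rng):
    """Mean and SD of the 1st..nth site when sites are placed at random.

    For each UTR, n_sites - 1 distinct positions are drawn uniformly and
    the last site is fixed at the end of the sequence, as in the text.
    UTRs too short to hold n_sites distinct positions are skipped; at
    least one UTR must be long enough, otherwise mean() has no data.
    """
    per_rank = [[] for _ in range(n_sites)]
    for length in utr_lengths:
        if length < n_sites:
            continue
        internal = sorted(rng.sample(range(1, length), n_sites - 1))
        for rank, pos in enumerate(internal + [length]):
            per_rank[rank].append(pos)
    return [(mean(p), pstdev(p)) for p in per_rank]
```

Calling it with the real UTR lengths and `n_sites=4` would give four (mean, SD) pairs to set against the observed values in Figure 1.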
Table 4 presents, for mRNAs with a given number of putative poly(A) sites, the average number of sites per molecule containing AAUAAA, AUUAAA, or other signals. "Signals" are understood here as in Tables 2 and 3, that is, found in the 50-nt segment upstream of an EST-supported poly(A) site and in the absence of a more frequent signal. For instance, mRNAs with three poly(A) sites have on average 1.30 AAUAAA signals and 0.58 AUUAAA signals. It appears that, as the number of poly(A) sites in an mRNA molecule increases, the proportion of canonical AAUAAA signals decreases (see the ratio AAUAAA/AUUAAA on the last line). In other words, mRNAs with multiple poly(A) sites tend to use a higher proportion of noncanonical signals. This is true for all noncanonical signals, including the common AUUAAA.
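The rule that a signal is assigned "in the absence of a more frequent signal" can be sketched in Python. The priority used here (AAUAAA first, then AUUAAA, then any of the 10 variants pooled into one class) is an assumption consistent with the frequencies reported above; the relative order among the variants is not given in the text, so they are not ranked. Function names are hypothetical.

```python
from collections import Counter, defaultdict

VARIANTS = ("AGUAAA", "UAUAAA", "CAUAAA", "GAUAAA", "AAUAUA",
            "AAUACA", "AAUAGA", "ACUAAA", "AAGAAA", "AAUGAA")


def classify_fragment(fragment):
    """Assign one signal class to a 3' fragment, most frequent signal first."""
    seq = fragment.upper().replace("T", "U")   # accept DNA or RNA alphabet
    if "AAUAAA" in seq:
        return "AAUAAA"
    if "AUUAAA" in seq:
        return "AUUAAA"
    if any(v in seq for v in VARIANTS):
        return "variant"
    return "other"


def mean_signals_by_site_count(fragments_by_mrna):
    """Average number of sites of each class per mRNA, grouped by site count."""
    totals = defaultdict(Counter)   # n_sites -> summed class counts
    n_mrnas = Counter()             # n_sites -> number of mRNAs
    for fragments in fragments_by_mrna.values():
        n = len(fragments)
        n_mrnas[n] += 1
        totals[n].update(classify_fragment(f) for f in fragments)
    return {n: {cls: c / n_mrnas[n] for cls, c in totals[n].items()}
            for n in totals}
```

The input is assumed to map each mRNA identifier to the list of 50-nt fragments preceding its poly(A) sites.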
We then counted occurrences of each type of polyadenylation signal at different sites on the UTR (Fig. 2). There is a striking difference between the 3'-most distal site and other sites closer to the Stop codon. The 3' distal site predominantly uses a canonical signal, while all other sites predominantly use noncanonical signals, particularly one-base variants of the AAUAAA sequence. Unidentified signals ("Others" in Fig. 2), which represent a significant fraction of the poly(A) sites closer to the Stop codon, should be taken cautiously because they could result from internally primed ESTs that have escaped our filtering procedure. In any case, putting aside other signals, the one-base variants of the AAUAAA signal are more represented than the canonical signal in sites proximal to the Stop codon.
Processing Efficiency
Highly expressed mRNAs are commonly expected to result in a higher number of ESTs than weakly expressed ones. However, because normalization procedures have been applied to most EST libraries, artificially reducing EST levels for certain types of mRNAs, biases in EST counts are not always meaningful. In this context, can we use EST counts as a rough estimate of the polyadenylation rate at various sites? Here, we will not compare the expression of different mRNAs but, instead, the efficiency of different types of poly(A) sites, whatever mRNA or EST library is considered. Answers to this question should be less affected by biases induced by library construction protocols.
Table 5 shows the mean number of ESTs associated with each putative poly(A) signal (hereafter called "revealing" ESTs). For instance, putative poly(A) sites with an AAUAAA hexamer are supported on average by 5.4 ESTs. The number of revealing ESTs is higher with the AAUAAA signal than with any other signal. This effect cannot be attributed to canonical signals being associated with abundantly expressed genes, as it is also observed when both types of signals are found on the same gene. For instance, mRNAs having both an AAUAAA and a noncanonical signal in their UTR have nearly twice as many ESTs associated with the canonical signal on average (data not shown). This strongly suggests that sites with noncanonical signals are processed less efficiently than those with a canonical signal. Interestingly, the common AUUAAA signal falls in the same range as the less frequent variant AGUAAA.
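Both comparisons, across all sites and within the same mRNA, reduce to simple averages. The Python sketch below shows one way to compute them; the input layout (pairs of signal label and EST count) and the function names are assumptions made for illustration, and no real data are included.

```python
from collections import defaultdict
from statistics import mean


def mean_ests_per_signal(sites):
    """Mean number of revealing ESTs per signal type.

    sites: iterable of (signal_label, n_revealing_ests) pairs.
    """
    by_signal = defaultdict(list)
    for signal, n_ests in sites:
        by_signal[signal].append(n_ests)
    return {signal: mean(counts) for signal, counts in by_signal.items()}


def within_mrna_ratio(sites_by_mrna, canonical="AAUAAA"):
    """Ratio of mean EST counts, canonical over noncanonical sites,
    restricted to mRNAs that carry both kinds of site.

    Returns None when no mRNA carries both kinds.
    """
    canon, other = [], []
    for sites in sites_by_mrna.values():
        c = [n for s, n in sites if s == canonical]
        o = [n for s, n in sites if s != canonical]
        if c and o:
            canon.extend(c)
            other.extend(o)
    if not canon:
        return None
    return mean(canon) / mean(other)
```

Restricting the second function to mRNAs with both kinds of site removes differences in gene expression level from the comparison, which is the point made in the paragraph above.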
We finally asked how processing efficiency varied with the position of poly(A) sites in alternatively polyadenylated mRNAs. Histograms in Figure 3 give the number of revealing ESTs associated on average with each polyadenylation site in mRNAs with one, two, three, and four observed polyadenylation sites. Sites with canonical or other poly(A) signals are distinguished. The hierarchy of canonical and noncanonical signals with respect to polyadenylation rate is maintained independently of the cleavage position. However, the 3'-most distal cleavage sites generally have more revealing ESTs than sites closer to the Stop codon, suggesting that 3'-terminal sites are processed more efficiently. A possible pitfall in this conclusion would be the presence of erroneous 3' ends among sites closer to the Stop codon. Such incorrect poly(A) sites would have fewer associated ESTs, lowering average EST counts. If this were true, poly(A) sites with a canonical signal would not be lowered since they most likely correspond to true 3' ends. However, when the signal closest to the Stop codon is AAUAAA, there are also fewer revealing ESTs, further suggesting a position dependency of poly(A) site processing efficiency.
DISCUSSION
The human polyadenylation signals identified in this study are summarized in Figure 4. Until recently, only a single variant hexamer, AGUAAA, had been identified as a possibly recurrent signal in human mRNA (Gautheret et al. 1998; Tabaska and Zhang 1999). After this work was completed, a study by Graber et al. (1999) was published identifying variant polyadenylation signals in 3' EST sequences from diverse species. The set of 4427 human ESTs selected in that study was analyzed independently of reference mRNA or genomic sequences, which probably raised the sequence error rate (only 53.2% of 3' ends had an AAUAAA signal vs. 58.2% in our study). Nevertheless, these authors did use reference genomic sequences to analyze Drosophila ESTs and obtained a list of variant poly(A) signals very similar to ours (Graber et al. 1999). The effect of mutations in poly(A) signals on polyadenylation and cleavage rates has been studied experimentally in vitro (Sheets et al. 1990). Comparing in silico and in vitro results, Graber et al. noted that the natural frequency of variant signals in Drosophila was closely related to in vitro polyadenylation rate (Graber et al. 1999). This striking observation also applies to the human poly(A) signals.
With respect to in vivo studies, a literature search for the 10 variant signals reveals that most have been occasionally reported as forming "unusual" poly(A) signals in mammalian or mammalian virus mRNAs (Table 6). The agreement between our results and the literature is excellent: those naturally occurring polyadenylation signals that do not figure in our list are either weakly active, deleterious, or found only in plants. Interestingly, the AAUAGA motif was reported as functional solely in flatworms (Wahlberg and Johnson 1997), while its presence in human β-globin mRNA in replacement of the canonical signal is a known cause of β-thalassemia (Jankovic et al. 1990; van Solinge et al. 1996). Mutations in poly(A) signals causing α- and β-thalassemia result in elongated mRNAs (Orkin et al. 1985; Smetanina et al. 1996), meaning the poly(A) signal either is not functional or is used inefficiently. The situation is similar for the AAGAAA and AAUGAA motifs. AAGAAA is reportedly an active polyadenylation signal in a mammalian mRNA (Anand et al. 1997), but this motif is also commonly used in replacement of canonical signals in order to inactivate polyadenylation sites in DNA viruses (Moore et al. 1988; Wilusz and Shenk 1988). Likewise, AAUGAA is a potentially deleterious polyadenylation signal (Jankovic et al. 1990; Yuregir et al. 1992), but is nevertheless functional in two mammalian mRNAs (Martins et al. 1995; Battersby et al. 1999). There is no reason to believe that the AAUAGA, AAGAAA, or AAUGAA signals we observed are inactive since all correspond to experimentally identified (in the form of ESTs) mature mRNA terminations. However, their possibly deleterious effects suggest that either their function is context dependent (e.g., external factors might inactivate them) or their efficiency is intrinsically different from that of a canonical signal.
The principal components of the polyadenylation machinery in mammals are the two cleavage factors CFI and CFII, the poly(A) polymerase (PAP), and two factors involved in RNA sequence recognition: CstF (Cleavage Stimulation Factor), which binds the downstream GU-rich region, and CPSF (Cleavage/Polyadenylation Specificity Factor), which binds the polyadenylation signal. Given the variability of polyadenylation signals, can we suggest the existence of several cognate CPSFs? Probably not. All the observed signals are single-base variants of the canonical AAUAAA hexamer. Positions 3, 4, and 6 are highly conserved, while positions 1, 2, and 5 are tolerant to point mutations (Fig. 4). Combinations of two or more mutations have not been observed at a significant level. For instance, although AUUAAA is observed 843 times (Table 2), we did not find the prefix AUU associated with any of the other possible suffixes (ACA, AUA, AGA, or GAA). This suggests a model where a unique polyadenylation machinery is tolerant to a limited level of mutation in its regular signal.
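The single-base relationship between the observed signals and AAUAAA can be checked mechanically. The Python sketch below enumerates the 18 possible single-base variants of the canonical hexamer and reports which position each observed signal changes; the hexamer list is taken from the text, while the function names are illustrative.

```python
CANONICAL = "AAUAAA"

# AUUAAA plus the 10 variant signals named in Results.
OBSERVED = ["AUUAAA", "AGUAAA", "UAUAAA", "CAUAAA", "GAUAAA", "AAUAUA",
            "AAUACA", "AAUAGA", "ACUAAA", "AAGAAA", "AAUGAA"]


def single_base_variants(hexamer, alphabet="ACGU"):
    """All sequences differing from hexamer at exactly one position."""
    return {hexamer[:i] + base + hexamer[i + 1:]
            for i in range(len(hexamer))
            for base in alphabet if base != hexamer[i]}


def mutated_position(variant, reference=CANONICAL):
    """1-based position of the single difference, or None otherwise."""
    diffs = [i + 1 for i, (a, b) in enumerate(zip(variant, reference))
             if a != b]
    return diffs[0] if len(diffs) == 1 else None


for signal in OBSERVED:
    print(signal, mutated_position(signal))
```

With this list, no observed signal changes position 6, and positions 3 and 4 are changed only by AAGAAA and AAUGAA, the two more scattered motifs of Table 3.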
The mRNAs with multiple poly(A) sites tend to use noncanonical polyadenylation signals (including the common AUUAAA) more often than mRNAs with a single poly(A) site (Table 4). Why would variant signals be selected in these mRNAs? The prevailing hypothesis for the occurrence of variant polyadenylation signals is that variation of control sequences mediates variation in polyadenylation rate, thus regulating gene expression (Edwalds-Gilbert et al. 1997; Graber et al. 1999). Expressed sequence tag counts, used as a measure of polyadenylation rate, provide in silico evidence in favor of this hypothesis. Table 5 and Figure 3 show that poly(A) sites with a noncanonical signal (including AUUAAA) were usually revealed by a lower number of ESTs than poly(A) sites with an AAUAAA signal. This observation holds independently of the number and position of the sites on the mRNA (Fig. 3) and cannot be explained by a bias in EST library construction or in our poly(A) site selection procedure. This suggests that variant signals are not processed as efficiently as the AAUAAA signal. This differential rate is of functional interest for mRNAs with multiple poly(A) sites since it provides a means to regulate synthesis of specific mRNA forms. The mRNAs with multiple sites may then use noncanonical (presumably weaker) signals because it is easier to regulate alternative polyadenylation with these weak signals. An additional form of regulation could be that 3'-terminal sites are processed more efficiently, as suggested by results in Figure 3. Current models for the binding of the polyadenylation machinery to its targets on the 3' UTR (hexameric signal, GU-rich region, and cleavage site; Colgan and Manley 1997) do not help to explain this phenomenon.
Another factor probably contributes to a higher polyadenylation rate at 3'-terminal sites. When observing the distribution of signals in alternatively polyadenylated mRNAs (Fig. 2), we noticed that AAUAAA signals, which are generally processed more efficiently, are more frequent at 3'-terminal sites. We may predict from this body of expression data that the major form of alternatively polyadenylated mRNAs will in general be the longest one. This high rate of long versus short 3' UTRs might denote a better stability of the longer mRNA form. However, long 3' UTRs are not necessarily more stable than shorter ones, especially since they often contain destabilization signals (Gautheret et al. 1998). This predominance of long forms may thus suggest the future discovery of stabilization signals in extended 3' UTR fragments.
CONCLUSION
Ten variant polyadenylation signals characterized by a significant overrepresentation in EST-supported mRNA 3' ends and by a peak of occurrence around position 15-17 (last base of signal) upstream of the putative poly(A) site have been identified. This information on poly(A) signal variation, combined with that of other polyadenylation control elements, should be incorporated in gene-detection programs, which perform very poorly in delineating 3' UTRs. The consensus sequences or position weight matrices used for polyadenylation signal detection (Salamov and Solovyev 1997; Tabaska and Zhang 1999) can be adapted to agree with these observations. Similarly, statistics on the differential use of alternate sites can be incorporated in these programs.
On the biological side, two interesting questions are now raised. First, only 88% of the mRNA 3' ends studied contained a characteristic poly(A) signal or signal variant, leaving 678 putative 3' ends with no detectable polyadenylation signal. A fraction of these may be artifactual 3' ends (e.g., internally primed) that went through our selection procedure, but we cannot exclude that radically different signals or mechanisms may be used for the polyadenylation of this class of mRNAs. A detailed study of these unusual mRNAs, their function and pattern of expression, must be carried out to address this question. The second issue is that of the regulation of polyadenylation rate at different sites on the same mRNA. Is there a higher processing efficiency at the 3'-most poly(A) site, as our results suggest? Which unknown mechanism could produce this effect? While extensive experimental data is available on the processing of polyadenylation control elements in a single-site context, little is known about the effect of the relative position of multiple polyadenylation signals (including the downstream GU-rich region). This question is closely related to that of the kinetics and mechanisms of control sequence recognition by the polyadenylation machinery.
METHODS
Human 3' UTR sequences were taken from UTRdb-nr release 10 (Pesole et al. 2000), a nonredundant database of eukaryotic UTRs generated by parsing the feature keys in the EMBL database. UTRdb can be retrieved from ftp://area.ba.cnr.it/pub/embnet/database/utr.
We compared the 8775 human UTRs to ESTs from dbEST (July 1999 release), using a variant of the sequence comparison procedure presented previously (Gautheret et al. 1998). Based on the gapped BLAST program (Altschul et al. 1997), this procedure seeks 3' ESTs corresponding to mature mRNA 3' ends. A typical dbEST match to an mRNA or UTR sequence contains a mixture of 5' and 3' ESTs, spurious hits from low complexity or repeated sequences, chimeric ESTs, and ESTs resulting from internal priming. Our goal was to identify in this mixture those ESTs resulting from bona fide mRNA 3' ends.
As a first criterion, and since actual 3' ESTs are not consistently annotated in the database, we selected ESTs with a poly(T) or poly(A) extremity of length 10 or more. This filter retained only 157,775 of the original 1,561,241 human ESTs. Untranslated region sequences were masked for common human repeats, low complexity, and vector sequences. We then required ESTs to match the template mRNA sequence with at least 95% identity (a tolerance needed to accommodate errors in EST sequences) over the entire length of the EST sequence, except for allowed mismatches of 25 nt and 5 nt at the EST 5' and 3' sides, respectively, as revealed by the boundaries of the BLAST hit. This last requirement dismisses about 23% of the ESTs, comprising probable chimeric ESTs, ESTs produced from alternatively spliced RNAs, and ESTs exhibiting lane tracking errors or high error rates in the terminal region. Poly(A) and poly(T) trailers were removed from EST sequences before running BLAST to ensure these tails did not create additional dangling regions. Internal priming, that is, cDNA primers hybridized to internal poly(A) stretches instead of the actual poly(A) tail, was assessed by seeking adenine stretches in the UTR region flanking the 3' extremity of the EST. Six or more consecutive adenines, or eight adenines in a 10-nt window, were considered a possible source of internal priming, and the corresponding EST sequence was discarded. Finally, the use of UTRs instead of complete mRNAs as query sequences eliminated the risk of identifying false 3' ends in coding regions.
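The three filters above can be written as small predicates. The following Python sketch is only an interpretation of the description: the function names, the coordinate convention (1-based, inclusive hit boundaries on an EST oriented like the mRNA), the reading of the allowed end mismatches as unaligned ends, and the length of the flanking region passed to the internal-priming test are all assumptions.

```python
def has_poly_tail(est_seq, min_len=10):
    """True if the EST starts or ends with a poly(A) or poly(T) run."""
    s = est_seq.upper()
    runs = ("A" * min_len, "T" * min_len)
    return s.startswith(runs) or s.endswith(runs)


def match_spans_est(est_len, hit_start, hit_end, identity,
                    min_identity=95.0, max_5p=25, max_3p=5):
    """True if the BLAST hit covers the EST apart from short unaligned ends.

    hit_start and hit_end are 1-based, inclusive positions on the EST.
    """
    unaligned_5p = hit_start - 1
    unaligned_3p = est_len - hit_end
    return (identity >= min_identity
            and unaligned_5p <= max_5p
            and unaligned_3p <= max_3p)


def possible_internal_priming(flank, run=6, window=10, min_in_window=8):
    """True if the UTR region flanking the EST 3' end is adenine rich:
    `run` consecutive A's, or `min_in_window` A's within `window` nt."""
    s = flank.upper()
    if "A" * run in s:
        return True
    return any(s[i:i + window].count("A") >= min_in_window
               for i in range(len(s) - window + 1))
```

An EST would be kept when the first two predicates are true and the third is false for the UTR region next to its 3' end.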
Any EST respecting the above constraints was considered indicative of a polyadenylation site at the 3' end of the match. When several putative polyadenylation sites occurred in a region of 30 nt or less, we retained the site represented by the highest number of ESTs. Each potential polyadenylation site was recorded (mRNA, position, number of revealing ESTs), and the 50-nt segment preceding the site in the UTR was extracted (3' fragment database) and searched for recurrent sequence motifs.
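The merging of nearby sites can be sketched as a single pass over sorted EST end positions. This Python example is a minimal interpretation: the text does not specify how the 30-nt region is anchored or how ties are resolved, so the grouping rule (distance from the first site of the current group) and the tie-break (most upstream position) are assumptions, and `merge_sites` is a hypothetical name.

```python
from collections import Counter


def merge_sites(est_end_positions, max_region=30):
    """Collapse putative sites lying within max_region nt of each other.

    est_end_positions: UTR coordinates of the 3' ends of accepted ESTs,
    one entry per EST. Returns (position, n_ests) for the retained site
    of each group, where n_ests counts ESTs ending exactly there.
    """
    counts = Counter(est_end_positions)
    merged, group = [], []
    for pos in sorted(counts):
        # Start a new group once the span from its first site exceeds the limit.
        if group and pos - group[0] > max_region:
            best = max(group, key=lambda p: counts[p])
            merged.append((best, counts[best]))
            group = []
        group.append(pos)
    if group:
        best = max(group, key=lambda p: counts[p])
        merged.append((best, counts[best]))
    return merged
```

Whether the ESTs of discarded neighbors should be pooled into the retained site's count is left open here, since the text does not say.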
Significant 6-nt patterns were identified by comparing hexamer frequencies in the 3' fragment database to those expected by chance from its nucleotide composition. Probabilities were computed assuming a cumulative binomial distribution (Press et al. 1992). Significant hexamers were collected iteratively as follows. After the most significant hexamer (lowest P-value) was identified, all 3' fragments containing this motif were removed from the database before the next most frequent hexamer was sought. This procedure ensured that sequences overlapping the most frequent motifs (such as AUAAAN or NAAUAA for AAUAAA) were not improperly selected. The spatial distribution of motifs along the 50-nt segment was also considered in our selection of significant hexamers. The mean position of each motif in the 50-mer was computed, and the standard deviation (SD) around this average was used as a measure of scattering. Motifs with SD > 9 nt (empirical value) were considered as "scattered" and less likely to form a polyadenylation signal.
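The iterative selection can be sketched in Python as below. Several details are assumptions rather than statements from the text: each hexamer window is treated as one binomial trial, the chance probability of a hexamer is the product of its base frequencies, and the composition is recomputed after each removal. The function names are hypothetical, and the positional SD filter is left out.

```python
from collections import Counter
from math import exp, lgamma, log


def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), summed in log space for stability."""
    if k <= 0 or p >= 1.0:
        return 1.0
    log_p, log_q = log(p), log(1.0 - p)
    total = 0.0
    for i in range(k, n + 1):
        log_pmf = (lgamma(n + 1) - lgamma(i + 1) - lgamma(n - i + 1)
                   + i * log_p + (n - i) * log_q)
        total += exp(log_pmf)
    return min(1.0, total)


def significant_hexamers(fragments, max_p=1e-5, k=6):
    """Pick the most significant hexamer, drop fragments containing it, repeat."""
    remaining = list(fragments)
    selected = []
    while remaining:
        composition = Counter("".join(remaining))
        n_bases = sum(composition.values())
        counts = Counter(f[i:i + k] for f in remaining
                         for i in range(len(f) - k + 1))
        n_windows = sum(counts.values())
        if n_windows == 0:
            break

        def p_value(hexamer):
            expected = 1.0            # chance probability from base composition
            for base in hexamer:
                expected *= composition[base] / n_bases
            return binomial_tail(counts[hexamer], n_windows, expected)

        scored = {h: p_value(h) for h in counts}
        best = min(scored, key=scored.get)
        if scored[best] >= max_p:
            break
        selected.append((best, counts[best], scored[best]))
        remaining = [f for f in remaining if best not in f]
    return selected
```

The explicit tail sum keeps the sketch dependency-free but is slow on thousands of fragments; a library routine such as the survival function of `scipy.stats.binom` could replace `binomial_tail` if SciPy is available.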
ACKNOWLEDGMENTS
We thank Stéphane Audic for his advice and for sharing useful Perl Scripts with us.
FOOTNOTES
2 Isis Pharmaceuticals, 2292 Faraday Avenue, Carlsbad, California 92008, USA.
3 Corresponding author.
E-MAIL gauthere{at}igs.cnrs-mrs.fr; FAX 33 4 91 16 45 49.
REFERENCES
database for expressed sequence tags. Nat. Genet. 4: 332-333.
(+)-thalassemia. Br. J. Haematol. 75: 122-126.
-globin mutation interacting with other genetic elements. Eur. J. Pediatr. 574: 574-576.
-globin gene mutation co-inherited with haemoglobin E-disease. Eur. J. Clin. Chem. Clin. Biochem. 34: 949-954.