|
|
|
|
Genome Res. 13:2042-2051, 2003 ©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00 Letter Widespread Selection for Local RNA Secondary Structure in Coding Regions of Bacterial GenesDepartment of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
Redundancy of the genetic code dictates that a given protein can be encoded by a large collection of distinct mRNA species, potentially allowing mRNAs to simultaneously optimize desirable RNA structural features in addition to their protein-coding function. To determine whether natural mRNAs exhibit biases related to local RNA secondary structure, a new randomization procedure was developed, DicodonShuffle, which randomizes mRNA sequences while preserving the same encoded protein sequence, the same codon usage, and the same dinucleotide composition as the native message. Genes from 10 of 14 eubacterial species studied and one eukaryote, the yeast Saccharomyces cerevisiae, exhibited statistically significant biases in favor of local RNA structure as measured by folding free energy. Several significant associations suggest functional roles for mRNA structure, including stronger secondary structure bias in the coding regions of intron-containing yeast genes than in intronless genes, and significantly higher folding potential in polycistronic messages than in monocistronic messages in Escherichia coli. Potential secondary structure generally increased in genes from the 5' to the 3' end of E. coli operons, and secondary structure potential was conserved in homologous Salmonella typhi operons. These results are interpreted in terms of possible roles of RNA structures in RNA processing, regulation of mRNA stability, and translational control.
Single-stranded RNA molecules can form local secondary structures through the interactions of complementary segments. These secondary structure elements may influence many cellular processes, including mRNA stability and localization, transcription, RNA processing, and translation. Functionally important RNA secondary structures can be found in untranslated regions (UTRs), introns, and coding sequences. For example, in eukaryotes, stem-loop structures in 5' UTRs may prevent association of the 40S ribosomal subunit with the mRNA, inhibiting translation initiation (Gray and Wickens 1998
Most previous computational analyses of mRNA secondary structures have focused on the 5' and 3' UTRs, excluding coding regions. It is widely believed that secondary structure in ORFs can interfere with translation (Klionsky et al. 1986
We sought to resolve this contradiction by developing a new protocol, called DicodonShuffle, which randomizes mRNAs preserving dinucleotide composition (as in Workman and Krogh 1999 In contrast to previous analyses, we studied local RNA structuresstructures formed within short sequence regions of 50 basesrather than the structures predicted for folding of an entire mRNA. Such local structures are more likely to form in vivo in actively translating mRNAs. Finally, the sets of mRNAs used by the two previous groups contained a random collection of sequences from organisms as diverse as Escherichia coli and Homo sapiens, potentially blurring any effects that might be present in prokaryotes but absent from eukaryotes, for example. Therefore, we analyzed each organism separately. We took advantage of the recent availability of numerous complete genomes and analyzed thousands of coding regions from 28 different organisms with several representatives each from Archaea, Eukaryota, and Eubacteria. Overall, coding regions were found to contain significantly more local RNA secondary structure than expected in most eubacterial species studied, but this phenomenon was observed only sporadically in archaeal and eukaryotic organisms. To investigate the possible roles of secondary structures in coding regions, we systematically studied the folding potential of mRNAs in relation to their expression levels, half-lives, positions in bacterial operons, and other properties.
Secondary Structure Bias Exists in Coding Regions of Many Organisms To determine whether coding regions of genes in E. coli are biased against (or for)local RNA secondary structure potential, the folding free energies of native mRNAs were compared with folding free energies computed from sequences randomized by three different methods.
The first randomization procedure, which we call CodonShuffle, preserves the same encoded protein sequence and the same codon usage of the native mRNA, as in Seffens and Digby (1999 Next, the comparison between native and shuffled mRNAs was repeated with sequences randomized preserving the same dinucleotide frequencies, encoded protein, and codon usage as the native mRNA using a newly developed method that we call DicodonShuffle (see Methods). DicodonShuffle generates sequences that are, on average, more similar to the corresponding native coding regions than CodonShuffle (85% versus 80% identity; data not shown) because of the additional constraint that must be satisfied. Nevertheless, native sequences again had significantly different (more negative)average free energies than DicodonShuffled mRNA controls (P < 2.2e-16; Tables 1 and 2).
For completeness, the sequences were also randomized preserving only dinucleotide composition (but not encoded amino acid sequence or codon usage)using a protocol called DiShuffle (Methods). DiShuffled sequences have essentially no detectable sequence similarity to the native mRNA (just slightly above 25% identity). A significant bias toward secondary structure in native sequences was still present ( , , P < 2.2e-16).
Applying the three shuffling protocols to human mRNAs revealed a significant bias toward RNA structure in native mRNAs when compared with CodonShuffled sequences (data not shown), but this effect disappeared when DiShuffle or DicodonShuffle was used (data not shown; Table 2, respectively), indicating that it is an artifact resulting from the failure to control for dinucleotide composition. These results are similar to those obtained previously (Seffens and Digby 1999
The general result reported above for E. coli mRNAs was invariant over a range of window sizes tested (Table 1), with native mRNAs always showing a highly significant bias toward more negative folding free energy (higher secondary structure potential). For most subsequent calculations, we chose a window size of 50 bases (with step size of 10 bases), as most known functionally important secondary structures are small and local, and because RNA folding algorithms are most accurate for short sequences (Mathews et al. 1999
To investigate the generality of the bias toward local secondary structures observed in E. coli mRNAs, the folding potential was computed for native and randomized mRNAs in the genomes of 27 additional species chosen to represent evolutionarily diverse taxa (Table 2). For each organism, the average folding energies were determined for a set of native and DicodonShuffled mRNAs, and the excess folding potential,
Secondary Structure Bias at the Gene Level
Calculating Z scores is more computationally intensive by an order of magnitude than the simpler EFP statistic, so we applied this type of analysis to only a few representative organisms. In all cases tested, there was good agreement between these two measures of secondary structure bias. Specifically, mean Z-score values of 0.45, 1.04, 0.44, and 0.25 were determined for Bacillus subtilis, Mycoplasma pneumoniae, Treponema pallidum, and S. cerevisiae, respectively, all organisms that have significant bias toward secondary structure according to the EFP statistic (Table 2). On the other hand, Helicobacter pylori had a mean Z-score of 0.01, consistent with the absence of significant bias for secondary structure observed using the EFP statistic (Table 2).
Distribution of Secondary Structure Along the Coding Regions
Can One Region Per Gene Account for the Bias in Structure? Next, we asked whether a single 50-bp window per gene could potentially account for the observed EFP in coding regions, or whether it must result from an additive effect of several nonoverlapping sequence windows. For every coding sequence in S. cerevisiae and E. coli, the 50-bp window with most negative folding free energy was removed, as well as all overlapping 50-base windows. As controls, the corresponding sequence segments were removed from the DicodonShuffled version of each mRNA.
Comparing the average folding free energy over all remaining windows in native sequences to the corresponding regions of randomized controls showed that in yeast, the EFP was reduced by two-thirds (EFP = 0.4 kcal/mole/kb, versus 1.2 kcal/mole/kb in Table 2), whereas in E. coli, it was reduced by one-half (EFP = 2.4 kcal/mole/kb, versus 4.8 in Table 2). This result indicates that the secondary structure bias observed in yeast and bacteria can potentially be attributed to as few as one or two short ( To identify highly folded subregions in individual genes, we used a local Z statistic, in which the free energy of folding of a 50-base window is compared with the mean and standard deviations of folding free energies for the corresponding window in DicodonShuffled sequences (see Methods). Such local Z scores were calculated for every 50-base window in ORFs from the E. coli, S. cerevisiae, and B. subtilis genomes. Some examples of genes/windows with extremely low local Z scores from these organisms are listed in Supplemental Tables 13 (available online at www.genome.org). These genes and regions provide potential targets for the experimental investigation of the roles of local mRNA structures in gene expression.
Secondary Structure in Bacterial Operons
Similar results were obtained for the genome of Salmonella typhi, using operon annotation inferred by homology to the better-annotated E. coli genome (see Methods). Specifically, secondary structure potential also tended to increase from the first gene in operons to the sixth (Fig. 3B). Given these results, it was of obvious interest to ask whether there was evidence for conservation of secondary structure potential between homologous E. coli/S. typhi genes, above and beyond that resulting from conservation of the encoded protein sequence. For this purpose a program, EvolveGene, was developed to generate synthetic S. typhi mRNAs, designated S. typhiS, from alignments of homologous E. coli/S. typhi genes. In brief, EvolveGene generates a random synthetic S. typhi homolog of a given E. coli gene, which has exactly the same degree of sequence similarity to the E. coli gene as the true S. typhi homolog, and the same amino acid and codon usage as the S. typhi homolog (see Methods for details). For this experiment, we focused on the subset of 147 polycistronic E. coli genes that had high-folding potential (Z score < 2)and also had clear S. typhi orthologs. The result was that the mean Z score for the native S. typhi homologs of these E. coli genes was significantly more negative than for the synthetic S. typhiS genes ( s.typhi = 2.06 < s.typhiS = 1.75, P < 0.01). This result provides evidence that natural selection is acting to preserve RNA secondary structure in the coding regions of polycistronic genes in these two bacteria.
Six operons contained multiple genes with significant conserved folding potential (Z score < 2 for both E. coli and S. typhi homologs), the atp operon (genes atpH, atpG, atpD), the lpx operon (lpxB, dnaE), the hyb operon (hybB, hybE), the nuo operon (nuoF, nuoH), the S10 operon (rplD, rplB), and the spc operon (rplX, rplR). Interestingly, at least three of these operonsatp, S10 and spcare known to be regulated at the level of translation rate and/or differential mRNA stability (Freedman et al. 1987
High Secondary Structure Potential in Mitochondrial-Encoded mRNAs
Folding Potential of Yeast mRNAs Correlates With Presence of Introns, But Not With Measures of Expression Level or Half-Life To explore the biological significance of mRNA secondary structure in eukaryotes, we focused on S. cerevisiae, as this organism exhibits a bias toward RNA structure (Table 2) and has been the subject of a number of large-scale studies of gene expression. Two publicly available data sets were used to study the relationship between expression level, half-life, and folding potential of mRNAs. First, the transcriptional profile of 1003 genes measured under different conditions by Northern analysis (Brown et al. 2001 = 0.50 versus 0.24, respectively; P = 0.004), suggesting that secondary structure in yeast exons might play a role in nuclear RNA processing. Although the set of S. cerevisiae genes that contain introns is heavily biased toward ribosomal protein genes, the mean Z-score for ribosomal proteins was 0.20, comparable with that observed for intronless genes, implying that the secondary structure bias is associated with presence of an intron rather than with this particular functional group of genes.
In a large-scale study using thousands of mRNA sequences derived from 28 different species, we found evidence of selection for local RNA structure in the coding regions of a number of different organisms (Table 2). In 10 of 14 eubacterial species, mRNA coding regions had a small, but significant bias toward more local secondary structure potential than expected (Table 2), and yeast nuclear and mitochondrial genes share this bias (Results). Our method for measuring secondary structure bias randomizes mRNAs, while preserving the same encoded protein sequence (thereby controlling for selection acting at the protein sequence or protein structure level), as well as the same codon usage and dinucleotide composition as in the native mRNA. Codon usage may be under selection related to translation rate and is, therefore, a variable that should be controlled for in this context. Dinucleotide composition was not controlled for in a previous study (Seffens and Digby 1999 What aspects of mRNA biogenesis or function could account for the observed biases toward local secondary structure potential in many yeast and bacterial ORFs? Formally, secondary structure might play a role at any step in the lifetime of an mRNA molecule, for example, transcription, processing/modification, subcellular localization, translation, or degradation. We consider these possibilities in turn, omitting mRNA localization, for which no large-scale data sources are currently available.
RNA Secondary Structure and Transcription
RNA Secondary Structure and RNA Processing
RNA Secondary Structure and Translation
RNA Secondary Structure and mRNA Stability/Decay
In eukaryotes, sequences in the coding regions of certain genes such as FOS, MYC, and tubulin are known to regulate mRNA half-life (Ross 1995
Conclusions
RNA Folding Standard methods for RNA secondary structure prediction compute the folding free energy for the most favorable conformation from a vast number of possible structures. The Vienna RNAfold program (Hofacker et al. 1994
Definitions
Codon Adaptation Index
Sequences
mRNA Randomization Procedures
CodonShuffle randomly permutes the set of codons used in a transcript to encode each amino acid, preserving the exact count of each codon and the precise order of encoded amino acids as in the original transcript. This algorithm is identical to that used previously (Seffens and Digby 1999
The CodonShuffle protocol preserves dinucleotide composition at the (1,2) and (2,3) positions of codons (first/second bases and second/third codon bases, respectively) of the native sequence, because it preserves codon usage. However, it does not, in general, preserve the dinucleotide composition at (3,1) positions, that is, dinucleotides formed by the last base of one codon and the first base of the next. The DicodonShuffle algorithm, described below, remedies this limitation, preserving the dinucleotide composition at (3,1), (1,2), and (2,3) positions, as well as the same encoded amino acid sequence and codon usage of the native mRNA. The essential idea of this new algorithm is to make only those synonymous codon swaps which either (1)preserve (3,1) dinucleotide composition by themselves, or (2) which can be paired with another reciprocal synonymous codon swap, such that simultaneous swapping of both codon pairs results in no net change in (3,1) dinucleotide composition. The steps in the DicodonShuffle algorithm are as follows: (1)set the sequence to be randomized equal to the native mRNA; (2) pick an amino acid at random, indicated by the letter i, from the list of amino acids not yet considered (initially, all 20 amino acids); (3) generate a random number j between 1 and Ni, the number of codons for amino acid i that occur in the native mRNA, and consider the consequences of making a swap between the first and jth codons for this amino acid that occur in the native mRNA; (4) if the swap chosen in step 3 would not change the (3,1) dinucleotide composition of the sequence, make the swap; however, if the swap would alter the (3,1) dinucleotide composition, make a list of reciprocal swaps (from among all synonymous codon pairs in the sequencesee below), pick one such swap at random from this list, then make both swaps in the sequence, and mark the codons in the reciprocal pair as having been swapped and therefore unavailable for future swaps (if the list of reciprocal swaps is empty, then do not make any swaps); (5) generate a random number k between 2 and Ni, and consider the swap between codon 2 and codon k, as in steps 3 and 4, and repeat until all codons for amino acid i have been considered; then go to step 2 and repeat the procedure until all 20 amino acids have been considered. By way of example, in the sample mRNA sequence above, AUGCCGUUUCGAUACCCACGGCUGCGU, the net effect of a swap between the two proline codons, CCG and CCA, is to eliminate one AU dinucleotide and one GC dinucleotide and to create one GU dinucleotide and one AC dinucleotide at (3,1) positions. A swap between the first (CGG) and second (CGA) arginine codons in this sequence is reciprocal to this swap, as it eliminates one GU and one AC dinucleotide, while creating one AU and one GC dinucleotide. Therefore, the following sequence could be generated by the DicodonShuffle program: AUGCCAUUUCGGUACCCGCGACUGCGU. (Changed codons are in bold face.) Note that this randomized mRNA has identical encoded amino acid sequence, codon usage, and dinucleotide composition as the native mRNA, as desired. Tests on short mRNAs show that DicodonShuffle samples almost all of the possible sequences that have these properties. For example, 94% of the possible randomized sequences with these properties were generated in 1000 shuffles of a 12-codon mRNA. The DiShuffle program simply derives first-order Markov transition probabilities (conditional probability of nucleotide j at a given position given nucleotide i at the previous position) from the conditional frequencies in the input sequence, and generates a random sequence from this model. Source code for all three of these randomization programsCodonShuffle, DicodonShuffle, and DiShuffleis available upon request to the authors.
Analysis of E. coli and S. typhi operons
On the basis of this alignment, EvolveGene might generate the following synthetic S. typhi sequence:
Statistical Analyses
y = average folding free energy for windows in native sequence; xk = average folding free energy for kth shuffled sequence; N = total number of randomizations (at least 20 for all studies).
We thank Alan P. Jasanoff for suggesting the idea to analyze RNA secondary structure in coding regions and for many helpful discussions about this work. We also thank Phil Green, Dirk Holste, and Uttam RajBhandary for helpful suggestions on the manuscript, and Daniel Herschlag for communication of results prior to publication. This work was supported by a Functional Genomics Innovation Award (C.B.B. and Phillip A. Sharp). L.K. is supported by an NIH postdoctoral fellowship. The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
[Supplementary material available online at www.genome.org.] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1257503.
1 Corresponding author.
Bonen, L. and Vogel, J. 2001. The ins and outs of group II introns. Trends Genet. 17: 322331.[CrossRef][Medline] Brown, A.J., Planta, R.J., Restuhadi, F., Bailey, D.A., Butler, P.R., Cadahia, J.L., Cerdan, M.E., De Jonge, M., Gardner, D.C., Gent, M.E., et al. 2001. Transcript analysis of 1003 novel yeast genes using high-throughput northern hybridizations. EMBO J. 20: 31773186.[CrossRef][Medline]
Campbell, A., Mrazek, J., and Karlin, S. 1999. Genome signature comparisons among prokaryote, plasmid, and mitochondrial DNA. Proc. Natl. Acad. Sci. 96: 91849189. Carrier, T.A. and Keasling, J.D. 1997. Controlling messenger RNA stability in bacteria: Strategies for engineering gene expression. Biotechnol. Prog. 13: 699708.[CrossRef][Medline] Chartrand, P., Meng, X.H., Singer, R.H., and Long, R.M. 1999. Structural elements required for the localization of ASH1 mRNA and of a green fluorescent protein reporter particle in vivo. Curr. Biol. 9: 333336.[CrossRef][Medline] Deutscher, M.P. and Li, Z. 2001. Exoribonucleases and their multiple roles in RNA metabolism. Prog. Nucleic Acid Res. Mol. Biol. 66: 67105.[Medline]
Diwa, A., Bricker, A.L., Jain, C., and Belasco, J.G. 2000. An evolutionarily conserved RNA stem-loop functions as a sensor that directs feedback regulation of RNase E gene expression. Genes & Dev. 14: 12491260.
Fairbrother, W.G., Yeh, R.F., Sharp, P.A., and Burge, C.B. 2002. Predictive identification of exonic splicing enhancers in human genes. Science 297: 10071013.
Freedman, L.P., Zengel, J.M., Archer, R.H., and Lindahl, L. 1987. Autogenous control of the S10 ribosomal protein operon of Escherichia coli: Genetic dissection of transcriptional and posttranscriptional regulation. Proc. Natl. Acad. Sci. 84: 65166520. Fridovich, I. 1974. Evidence for the symbiotic origin of mitochondria. Life Sci. 14: 819826.[CrossRef][Medline] Gray, M.W. 1992. The endosymbiont hypothesis revisited. Int. Rev. Cytol. 141: 233357.[Medline]
Gray, M.W., Burger, G., and Lang, B.F. 1999. Mitochondrial evolution. Science 283: 14761481. Gray, N.K. and Wickens, M. 1998. Control of translation initiation in animals. Annu. Rev. Dev. Biol. 14: 399458.[CrossRef][Medline]
Grover, A., Houlden, H., Baker, M., Adamson, J., Lewis, J., Prihar, G., Pickering-Brown, S., Duff, K., and Hutton, M. 1999. 5' splice site mutations in tau associated with the inherited dementia FTDP-17 affect a stem-loop structure that regulates alternative splicing of exon 10. J. Biol. Chem. 274: 1513415143. Guisez, Y., Robbens, J., Remaut, E., and Fiers, W. 1993. Folding of the MS2 coat protein in Escherichia coli is modulated by translational pauses resulting from mRNA secondary structure and codon usage: A hypothesis. J. Theor. Biol. 162: 243252.[CrossRef][Medline] Hofacker, I.L., Fontana, W., Stadler, P.F., Bonhoeffer, L.S., Tacker, M., and Schuster, P. 1994. Fast folding and comparison of RNA secondary structures. Monatshefte f. Chemie 125: 167188.[CrossRef] Holstege, F.C., Jennings, E.G., Wyrick, J.J., Lee, T.I., Hengartner, C.J., Green, M.R., Golub, T.R., Lander, E.S., and Young, R.A. 1998. Dissecting the regulatory circuitry of a eukaryotic genome. Cell 95: 717728.[CrossRef][Medline]
Howe, K.J. and Ares Jr., M. 1997. Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA. Proc. Natl. Acad. Sci. 94: 1246712472.
Huerta, A.M., Salgado, H., Thieffry, D., and Collado-Vides, J. 1998. RegulonDB: A database on transcriptional regulation in Escherichia coli. Nucleic Acids Res. 26: 5559. Karn, J. 1999. Tackling Tat. J. Mol. Biol. 293: 235254.[CrossRef][Medline]
Klionsky, D.J., Skalnik, D.G., and Simoni, R.D. 1986. Differential translation of the genes encoding the proton-translocating ATPase of Escherichia coli. J. Biol. Chem. 261: 80968099. Komar, A.A., Lesnik, T., and Reiss, C. 1999. Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation. FEBS Lett. 462: 387391.[CrossRef][Medline] Libri, D., Lescure, A., and Rosbash, M. 2000. Splicing enhancement in the yeast rp51b intron. RNA 6: 352368.[Abstract]
Lindahl, L., Archer, R.H., McCormick, J.R., Freedman, L.P., and Zengel, J.M. 1989. Translational coupling of the two proximal genes in the S10 ribosomal protein operon of Escherichia coli. J. Bacteriol. 171: 26392645. Mathews, D.H., Sabina, J., Zuker, M., and Turner, D.H. 1999. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 288: 911940.[CrossRef][Medline]
Mattheakis, L., Vu, L., Sor, F., and Nomura, M. 1989. Retroregulation of the synthesis of ribosomal proteins L14 and L24 by feedback repressor S8 in Escherichia coli. Proc. Natl. Acad. Sci. 86: 448452. McCarthy, J.E. 1990. Post-transcriptional control in the polycistronic operon environment: Studies of the atp operon of Escherichia coli. Mol. Microbiol. 4: 12331240.[CrossRef][Medline] McCarthy, J.E., Gerstel, B., Surin, B., Wiedemann, U., and Ziemke, P. 1991. Differential gene expression from the Escherichia coli atp operon mediated by segmental differences in mRNA stability. Mol. Microbiol. 5: 24472458.[CrossRef][Medline] Meijer, H.A. and Thomas, A.M. 2002. Control of eukaryotic protein synthesis by upstream open reading frames in the 5'-untranslated region of an mRNA. Biochem. J. 367: 111.[CrossRef][Medline] Myers, E.W. and Miller, W. 1989. Optimal alignments in linear space. Comput. Appl. Biosci. 4: 1117. Regnier, P. and Hajnsdorf, E. 1991. Decay of mRNA encoding ribosomal protein S15 of Escherichia coli is initiated by an RNase E-dependent endonucleolytic cleavage that removes the 3' stabilizing stem and loop structure. J. Mol. Biol. 217: 283292.[CrossRef][Medline] Rho, S.B. and Martinis, S.A. 2000. The bI4 group I intron binds directly to both its protein splicing partners, a tRNA synthetase and maturase, to facilitate RNA splicing activity. RNA 6: 18821894.[Abstract]
Rocha, E.P., Danchin, A., and Viari, A. 1999. Translation in Bacillus subtilis: Roles and trends of initiation and termination, insights from a genome analysis. Nucleic Acids Res. 27: 35673576.
Ross, J. 1995. mRNA stability in mammalian cells. Microbiol Rev. 59: 423450.
Schmittgen, T.D., Danenberg, K.D., Horikoshi, T., Lenz, H.J., and Danenberg, P.V. 1994. Effect of 5-fluoro- and 5-bromouracil substitution on the translation of human thymidylate synthase mRNA. J. Biol. Chem. 269: 1626916275.
Seffens, W. and Digby, D. 1999. mRNAs have greater negative folding free energies than shuffled or codon choice randomized sequences. Nucleic Acids Res. 27: 15781584.
Sharp, P.M. and Li, W.H. 1987. The codon Adaptation IndexA measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 15: 12811295. Tanner, N.K. and Sargueil, B. 1995. Dissecting and analyzing the secondary structure domains of group I introns through the use of chimeric intron constructs. J. Mol. Biol. 252: 583595.[CrossRef][Medline] Thanaraj, T.A. and Argos, P. 1996. Ribosome-mediated translational pause and protein domain organization. Protein Sci. 5: 15941612.[Abstract]
Wang, Y., Liu, C.L., Storey, J.D., Tibshirani, R.J., Herschlag, D., and Brown, P.O. 2002. Precision and functional specificity in mRNA decay. Proc. Natl. Acad. Sci. 99: 58605865.
Weik, M., Modrof, J., Klenk, H.D., Becker, S., and Muhlberger, E. 2002. Ebola virus VP30-mediated transcription is regulated by RNA secondary structure formation. J. Virol. 76: 85328539.
Workman, C. and Krogh, A. 1999. No evidence that mRNAs have lower folding free energies than random sequences with the same dinucleotide distribution. Nucleic Acids Res. 27: 48164822.
Zuker, M. and Stiegler, P. 1981. Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Res. 9: 133148.
http://ncbi.nlm.nih.gov; National Center for Biotechnology Information Web site, home of GenBank database. http://www.r-project.org/; home page of the R Projectfree version of the S+ statistical software package.
Received February 10, 2003;
accepted in revised format July 1, 2003.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||