Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Published online before print May 16, 2002, 10.1101/gr.87702
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
GR-877Rv1
12/6/962    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Jordan, I. K.
Right arrow Articles by Koonin, E. V.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Jordan, I. K.
Right arrow Articles by Koonin, E. V.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 6, 962-968, June 2002

LETTER
Essential Genes Are More Evolutionarily Conserved Than Are Nonessential Genes in Bacteria

I. King Jordan, Igor B. Rogozin, Yuri I. Wolf, and Eugene V. Koonin1

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
RESULTS AND DISCUSSION
METHODS
WEB SITE REFERNCES
REFERENCES

The "knockout-rate" prediction holds that essential genes should be more evolutionarily conserved than are nonessential genes. This is because negative (purifying) selection acting on essential genes is expected to be more stringent than that for nonessential genes, which are more functionally dispensable and/or redundant. However, a recent survey of evolutionary distances between Saccharomyces cerevisiae and Caenorhabditis elegans proteins did not reveal any difference between the rates of evolution for essential and nonessential genes. An analysis of mouse and rat orthologous genes also found that essential and nonessential genes evolved at similar rates when genes thought to evolve under directional selection were excluded from the analysis. In the present study, we combine genomic sequence data with experimental knockout data to compare the rates of evolution and the levels of selection for essential versus nonessential bacterial genes. In contrast to the results obtained for eukaryotic genes, essential bacterial genes appear to be more conserved than are nonessential genes over both relatively short (microevolutionary) and longer (macroevolutionary) time scales.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
RESULTS AND DISCUSSION
METHODS
WEB SITE REFERNCES
REFERENCES

Rates of evolution vary tremendously among protein-coding genes. Molecular evolutionary studies have revealed an ~1000-fold range of nonsynonymous substitution rates (Li and Graur 1991). The strength of negative (purifying) selection is thought to be the most important factor in determining the rate of evolution for the protein-coding regions of a gene (Kimura 1983; Ohta 1992; Li 1997). Consistent with this idea, Alan Wilson and colleagues (1997) proposed that essential genes should evolve more slowly than nonessential genes. This is the so-called "knockout-rate" prediction (Hurst and Smith 1999). "Essential" and "nonessential" are classic molecular genetic designations that relate to the functional significance of a gene with respect to its effect on organismic fitness. A gene is considered to be essential if a knock-out results in (conditional) lethality or infertility. On the other hand, nonessential genes are those for which knock-outs yield viable and fertile individuals. It was reasoned that purifying selection should be more intense for essential genes because they are, by definition, less functionally dispensable and/or redundant than are nonessential genes. Given the role of purifying selection in determining evolutionary rates, the greater levels of purifying selection on essential genes should be manifest as a lower rate of evolution relative to that of nonessential genes.

To systematically evaluate the relationship between the fitness effects of genes and their rates of evolution, a combination of a substantial amount of experimental knock-out data and sequence data from numerous genes is required. Only recently has enough data accumulated to allow for tests of the straightforward and seemingly intuitive knock-out rate prediction. However, examinations of sequence data with respect to this prediction have yielded equivocal results. For example, a survey of substitution rates for mouse and rat orthologous genes appeared to indicate a slower rate of evolution for essential genes. But when genes thought to evolve under directional selection were excluded from the analysis, essential and nonessential genes were found to evolve at similar rates (Hurst and Smith 1999). A more recent analysis of the evolutionary distances between Saccharomyces cerevisiae and Caenorhabditis elegans proteins did indicate that the fitness effect of a protein influences its rate of evolution (Hirsh and Fraser 2001). Nevertheless, this study (Hirsh and Fraser 2001) was also unable to reveal any difference between the rates of evolution for essential and nonessential genes.

The results from both of these studies were taken to indicate that the fitness differences between essential and nonessential genes do not influence evolutionary rates to the extent that was expected. However, the studies relied on the analyses of relatively few genes (n = 175 and n = 287, respectively) and comparisons between species that diverged at least tens of millions of years ago. It might be the case that these results reflect a lack of power and sensitivity of the approaches that were used. The recent availability of complete genome sequences from different strains of the same bacterial species provides an opportunity to address the issue with an unprecedented level of resolution. In the present study, to test the knockout-rate prediction, the relationship between the fitness class of genes (essential versus nonessential) and their rate of evolution was assessed for three bacterial species: Escherichia coli, Helicobacter pylori, and Neisseria meningitidis, for each of which at lease two complete genome sequences are available.


    RESULTS AND DISCUSSION
TOP
ABSTRACT
INTRODUCTION
RESULTS AND DISCUSSION
METHODS
WEB SITE REFERNCES
REFERENCES

The Profiling of the E. coli Genome (PEC) database (http://www.shigen.nig.ac.jp/ecoli/pec/) was used to characterize E. coli genes as essential, nonessential, or undetermined. E. coli genes in this database are characterized as essential or nonessential based largely on experimental (null mutations) evidence. In addition to the use of experimental evidence, a much smaller number of genes were designated as essential or nonessential based simply on their known function (see Methods). For example, the genes for ribosomal proteins were assumed to be essential, whereas genes involved in flagellation, motility, and chemotaxis were classified as nonessential. Comparisons between essential and nonessential E. coli genes were performed using all of the data available from the PEC database and with a reduced data set that contained only essential and nonessential genes for which experimental evidence existed. Sets of orthologous protein sequences and the corresponding nucleotide sequences shared by the two completely sequenced E. coli strains were identified and aligned, and nucleotide sequence alignments were used to calculate the synonymous (Ks) and nonsynonymous (Ka) substitution rates (see Methods). Values of Ks and Ka were compared for E. coli genes designated as either essential or nonessential. Analysis of these genes showed that the average Ks and Ka were significantly lower for essential genes than for nonessential genes (Table 1). This is the case for both the entire data set and the reduced set containing only experimentally characterized genes (Table 1). The reduction in Ka for essential genes indicates a reduction in the intraspecific rate of essential protein evolution, and the reduction in Ks is consistent with the positive correlation between Ks and Ka (Table 1). Such a correlation between Ks and Ka values has also been observed for several other species (Wolfe and Sharp 1993; Ohta and Ina 1995; Makalowski and Boguski 1998). This correlation could reflect a mechanistic bias in mutation or indicate that synonymous sites are also subject to some degree of selection (or both). However, the significantly lower value of Ka/Ks for essential genes (Table 1) indicates that the difference between the rates of evolution for the two gene classes is more pronounced for Ka than for Ks and is consistent with more stringent negative selection against amino acid replacements acting on essential genes. The undetermined genes, which were not included in either the essential or the nonessential class, had somewhat greater average Ka and Ka/Ks values than those of the nonessential genes (Table 1).

                              
View this table:
[in this window]
[in a new window]
 
Table 1.   The Rates of Synonymous (Ka) and Nonsynonymous (Ks) Nucleotide Substitutions for Essential Versus Nonessential Bacterial Genes

It is a formal possibility that a relatively few genes with extreme values contributed disproportionately to the average Ka, Ks, and Ka/Ks and were thus largely responsible for the difference between these average values for essential and nonessential E. coli genes. The fact that we observed the significant difference between the essential and nonessential genes after the nonexperimentally assigned genes for ribosomal proteins were removed from the essential set (Table 1) argues against this possibility. Indeed, although the ribosomal protein genes are highly conserved and had substantially lower average values of Ka, Ks, and Ka/Ks than those for the rest of the essential genes (data not shown), these genes alone clearly did not account for the difference between the essential and nonessential set.

To further assess the effect of potential biases in the essential gene set on the observed differences in evolutionary rates, the values of Ka, Ks, and Ka/Ks for essential and nonessential E. coli genes were reanalyzed with a bootstrap test (Methods). The frequency distributions of bootstrapped average Ka, Ks, and Ka/Ks show clear distinctions between essential and nonessential genes (Fig. 1). In addition, for each bootstrap replicate, the average values of essential and nonessential genes were calculated, and the significance of the difference between the essential and nonessential average values was assessed. For Ka and Ks each, all 1000 replicates differed at the P < 0.01 level; for Ka/Ks, 968 replicates differed at the P < 0.01 level. Thus, the results of the bootstrap analysis reject the possibility that the average values of Ka, Ks, and Ka/Ks for essential and/or nonessential genes are greatly influenced by a few genes with extreme values.


View larger version (21K):
[in this window]
[in a new window]
 
Figure 1   Bootstrap test of the average rates of synonymous (Ks), nonsynonymoys (Ka), and Ka/Ks for essential and nonessential Escherichia coli genes. Frequency distributions for the average values of Ka, Ks, and Ka/Ks for 1000 bootstrap replicates are shown.

Evolutionary conservation over longer time scales was measured by calculating the phyletic distribution parameter of essential versus nonessential E. coli genes. This parameter indicates the extent to which orthologs of a gene are distributed among the 26 taxonomic groups in the Clusters of Orthologous Groups (COGs) database (see Methods). Orthologs of essential E. coli genes are more broadly distributed (P < 10-10, Mann-Whitney U test) among bacterial and archaeal species than are orthologs of nonessential E. coli genes (Fig. 2).


View larger version (34K):
[in this window]
[in a new window]
 
Figure 2   Average phyletic distribution parameter values for essential (E) and nonessential (NE) genes in the three bacterial species surveyed. Average values (±SE) are shown above the bars. For the definition of the phyletic distribution parameter (PDP), see Methods.

Orthologs of essential and nonessential genes from E. coli (all genes in each category, without removing nonexperimentally characterized genes) were identified in H. pylori and N. meningitidis (see Methods). The classification of these orthologs as essential or nonessential in E. coli was taken as an approximation of their classification in H. pylori and N. meningitidis, and the same evolutionary comparisons were performed on them. For both species, the rates of Ks and Ka for 98 predicted essential genes were significantly lower than the rates for 130 predicted nonessential genes (Table 1). The differences among Ks and Ka values between all three species surveyed merely reflect the fact that for each species, the time to common ancestry for the two sequenced genomes, in all likelihood, differs significantly. Also like in the case of E. coli, the average Ka/Ks values are lower for the predicted essential genes (Table 1). Orthologs of predicted essential genes in these species were also more broadly phyletically distributed (P = 1.6×10-9 Mann-Whitney U test) than are orthologs of predicted nonessential genes (Fig. 2). An additional aspect of these comparisons was that N. meningitidis and particularly H. pylori had significantly greater values of Ks, Ka, and Ka/Ks than those for E. coli, such that, for example, even essential genes of H. pylori appeared to evolve faster than nonessential genes of E. coli (Table 1). A detailed examination of these differences is beyond the scope of the present work, but it seems interesting to speculate that they might reflect the parasitic lifestyle of N. meningitidis and H. pylori.

Finally, sets of interspecific (E. coli, H. pylori, and N. meningitidis) orthologous proteins were aligned, and the average pair-wise evolutionary distance for each set was calculated (see Methods). Essential proteins show a significantly lower (P = 6.3 × 10-6 Mann-Whitney U test) average level of per site amino acid sequence variation (avgerage ± SE = 4.85 ± 0.25) among these three species than do nonessential genes (average ±SE = 6.83 ± 0.34).

A previous comparison of the rates of evolution for essential versus nonessential genes in mammals initially revealed significantly lower rates of evolution for essential genes (Hurst and Smith 1999). However, when genes involved in the immune system, which are thought to evolve under diversifying (positive) selection, were removed from the analyzed data set, this difference disappeared. This result was attributed to substantial differences in the rates of evolution for different functional classes of genes. We sought to explore the potential contribution of this phenomenon to the observed difference between evolutionary rates of essential versus nonessential bacterial genes by breaking down the E. coli genes into four broad functional categories (see Methods): (1) information processing and storage, (2) cellular processes, (3) metabolism, and (4) poorly characterized. Not unexpectedly, comparison of the numbers of genes of each functional class that were designated as essential or nonessential revealed a nonrandom distribution (data not shown). For example, information storage and processing genes are vastly over-represented among essential genes, whereas there are far fewer poorly characterized genes than would be expected by chance in this same set. Notably, however, there were no significant differences in the evolutionary rates among information processing, cellular processes, and metabolic categories within the essential, nonessential, or undetermined sets; only the poorly characterized genes appeared to evolve significantly faster (Table 2). In three of the four functional classes, the essential genes had significantly lower values of Ka and tended to have significantly lower values of Ks and Ka/Ks than did the nonessential genes. In addition, in each of the classes, the undetermined genes were found to evolve even faster than nonessential ones, and this difference was significant on two occasions (Table 2). The only exception to this pattern was among the poorly characterized genes. Poorly characterized essential genes do have substantially lower rates of evolution than do nonessential genes, but the low number of these essential genes (n = 12) results in a statistical comparison that lacks power. Thus, the slower rate of evolution of essential genes compared with nonessential genes in bacteria appears to be a general phenomenon that is not limited to a particular functional category of genes.

                              
View this table:
[in this window]
[in a new window]
 
Table 2.   The Rates of Synonymous (Ks) and Nonsynonymous (Ka) Nucleotide Substitutions among Different Functional Categories for Essential, Nonessential, and Undetermined Escherichia coli Genes

Despite intense scrutiny of the factors that influence the rate of protein evolution, confirmation of the straightforward prediction that essential genes should be more evolutionarily conserved than are nonessential genes (Wilson et al. 1977) has proven elusive. A recent study of protein variation between S. cerevisiae and C. elegans did reveal a significant linear relationship between the protein's level of dispensability (fitness class as determined in S. cerevisiae) and their rate of evolution (Hirsh and Fraser 2001). This is consistent with the idea that the rate of the evolution of a gene depends on its contribution to the fitness of the organism. However, this same study did not find any difference between the rates of evolution of essential versus nonessential genes. This lack of difference was attributed to the fact that ablation of many of the nonessential genes may have enough of an effect on organismal fitness to render them evolutionarily equivalent to essential genes. However, in the present study, a dense sampling of orthologous genes, facilitated by the completion of multiple bacterial genome sequences, allowed us to show that essential genes in bacteria are more conserved than are nonessential genes. It is unclear whether the difference between the findings for eukaryotic and bacterial genes is merely an effect of the sampling or a reflection of a real distinction between the evolutionary modes of these two domains of life.

A similar survey of mouse and rat orthologous genes initially found a difference between the evolutionary rates of essential and nonessential genes, but when genes thought to evolve under directional selection were excluded from the analysis, this distinction disappeared (Hurst and Smith 1999). This raises the question of whether the differences between essential and nonessential genes reported here is because of positive selection being more prevalent in nonessential genes or to purifying selection being more stringent among essential genes. We did not find any evidence of positive selection (i.e., Ka/Ks > 1) in our comparisons. However, Ka/Ks > 1 is an extremely conservative criterion, which will reveal only cases of strong positive selection acting on large portions of genes. Therefore, it remains a formal possibility that the differences described here could be owing in small part to differences between essential and nonessential genes in the frequency of positive selection. However, purifying selection is clearly the rule in protein evolution, as evidenced by the Ka/Ks values reported here and in numerous other studies. Thus, consistent with the knockout-rate prediction (Wilson et al. 1977), differences in levels of purifying selection have certainly had the decisive role in determining the different rates of evolution of essential and nonessential bacterial genes.

Until this time, attempts to verify the knockout-rate prediction by comparing the evolutionary rate of essential versus nonessential genes have yielded equivocal results. This has led to the speculation that the rate of evolution for a given gene is determined more by the proportion of amino acid residues in the encoded protein that are critical for maintaining function than by the magnitude of the selection coefficient against deleterious mutations in that gene (Brookfield 2000). However, these two factors, in reality, might not be independent. In light of the results reported here, it might be the case that at least for bacterial genes, the rate of protein evolution is determined by the proportion of sites in a protein that has a large selection coefficient against deleterious mutations.


    METHODS
TOP
ABSTRACT
INTRODUCTION
RESULTS AND DISCUSSION
METHODS
WEB SITE REFERNCES
REFERENCES

Classification of E. coli K12 genes as essential, nonessential, or undetermined was taken from the PEC database (http://www.shigen.nig.ac.jp/ecoli/pec/). The PEC database classifies genes as essential or nonessential on the basis of a combination of experimental evidence and general functional considerations. If a strain has a null mutation in a gene and is able to grow, the gene in question is considered to be nonessential. Genes for which conditional lethal mutants have been isolated (Chow and Berg 1988; Harris et al. 1992) are classified as essential. In addition to the experimentally characterized genes, a much smaller subset of E. coli K12 genes were classified as essential or nonessential based on their functional characteristics (in absence of the specific experimental data listed above). For example, ribosomal structural genes and genes encoding unique aminoacyl-tRNA synthetases are classified as essential. Conversely, genes involved in flagellation, chemotaxis, and mobility were classified as nonessential. Essential, nonessential, and undetermined genes were placed into four broad functional categories using the classification scheme used in the COGs database (Tatusov et al. 1997, 2000, 2001).

Proteobacterial species with more than one complete genome sequence available as of GenBank release 123.0 were analyzed here: E. coli K12 (Blattner et al. 1997), E. coli O157:H7 (Perna et al. 2001), H. pylori 26695 (Tomb et al. 1997), H. pylori J99 (Alm et al. 1999), N. meningitidis serogroup B strain MC58 (Tettelin et al. 2000), and N. meningitidis serogroup A strain Z2491 (Parkhill et al. 2000). Nucleotide sequences and protein sequences predicted from complete bacterial genomes were obtained from the National Center for Biotechnology Information (NCBI) FTP server (ftp://ncbi.nlm.nih.gov/genbank/genomes/bacteria).

Orthologous protein sequences encoded by complete intraspecific genomes were identified using an all-against-all BLAST (Altschul et al. 1990, 1997) procedure. The SEALS package (Walker and Koonin 1997) was used to implement multiple BLAST searches and to postprocess the results of these searches. For any two intraspecific genomes, two proteins were considered orthologs if they were symmetrical best hits in each reciprocal all-against-all BLAST search. BLAST searches were run with a bits-per-position cutoff of 0.7. Orthologous proteins were aligned using ClustalW (Thompson et al. 1994) with default options. Orthologous protein-encoding nucleotide sequences were obtained from the NCBI FTP server and aligned to correspond to the protein sequence alignments using SEALS. The number of synonymous nucleotide substitutions per synonymous site (Ks), and the number of nonsynonymous nucleotide substitutions per nonsynonymous site (Ka) were estimated for the resulting orthologous nucleotide sequence alignments using the Pamilo-Bianchi-Li method (Li 1993; Pamilo and Bianchi 1993).

Orthologous protein sequences encoded by interspecific genomes (E. coli, H. pylori, and N. meningitidis) were identified using a similar all-against-all BLAST approach implemented with the SEALS package. Proteins that formed mutually consistent triangles of symmetrical best hits (Tatusov et al. 1997) were considered to be orthologs (hence an identical number of orthologs from each genome in this analysis). Orthologous protein sequences were aligned using ClustalW as described above, and the resulting sequence alignments were used to calculate evolutionary distances between orthologous proteins. Distances were calculated using the 3BRANCH program (Y.I. Wolf, unpubl.; available on request) that implements a distance correction for multiple hits based on the gamma -distribution of site rate variation (Ota and Nei 1994; Grishin 1995) with an alpha -parameter of 1.0. Distances are expressed as the number of substitutions per position.

The phyletic distribution of orthologous proteins was determined using the COGs database (Tatusov et al. 1997, 2000, 2001). The COGs database at the time of this work included species that fell into 26 distinct taxonomic groups. For each orthologous protein, a phyletic distribution parameter, which is equal to the number of taxonomic groups represented in the corresponding COG, was calculated.

The bootstrap analysis was performed by resampling with replacement from the original sets of per gene values of Ka, Ks, and Ka/Ks for essential and nonessential genes (calculated as described above). For each of 1000 bootstrap replicates, a resampled set with the same number of values as the original set was constructed. The average values for these resampled sets were then calculated and the averages of the essential and nonessential sets were compared.

Levels of significance for the difference among average Ks, Ka, Ka/Ks, phyletic distribution parameter, and levels of protein sequence variation were determined using the Mann-Whitney U test.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.


    WEB SITE REFERNCES
TOP
ABSTRACT
INTRODUCTION
RESULTS AND DISCUSSION
METHODS
WEB SITE REFERNCES
REFERENCES

ftp://ncbi.nlm.nih.gov/genbank/genomes/bacteria; Bacterial genomes from the National Center for Biotechnology Information FTP server.

http://www.shigen.nig.ac.jp/ecoli/pec/; Profiling of the E. coli Genome database.


    FOOTNOTES

1 Corresponding author.

E-MAIL koonin{at}ncbi.nlm.nih.gov; FAX (301) 435-7794.

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.87702. Article published online before print in May 2002.


    REFERENCES
TOP
ABSTRACT
INTRODUCTION
RESULTS AND DISCUSSION
METHODS
WEB SITE REFERNCES
REFERENCES

  • Alm, R.A., Ling, L.S., Moir, D.T., King, B.L., Brown, E.D., Doig, P.C., Smith, D.R., Noonan, B., Guild, B.C., deJonge, B.L. 1999. Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori. Nature 397: 176-180[CrossRef][Medline].
  • Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410[CrossRef][Medline].
  • Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402[Abstract/Free Full Text].
  • Blattner, F.R., Plunkett, G., III, Bloch, C.A., Perna, N.T., Burland, V., Riley, M., Collado-Vides, J., Glasner, J.D., Rode, C.K., Mayhew, G.F. 1997. The complete genome sequence of Escherichia coli K-12. Science 277: 1453-1474[Abstract/Free Full Text].
  • Brookfield, J.F. 2000. What determines the rate of sequence evolution? Curr. Biol. 10: R410-R411[CrossRef][Medline].
  • Chow, W.Y. and Berg, D.E. 1988. Tn5tac1, a derivative of transposon Tn5 that generates conditional mutations. Proc. Natl. Acad. Sci. 85: 6468-6472[Abstract/Free Full Text].
  • Grishin, N.V. 1995. Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites. J. Mol. Evol. 41: 675-679.
  • Harris, S.D., Cheng, J., Pugh, T.A., and Pringle, J.R. 1992. Molecular analysis of Saccharomyces cerevisiae chromosome, I: On the number of genes and the identification of essential genes using temperature-sensitive-lethal mutations. J. Mol. Biol. 225: 53-65[CrossRef][Medline].
  • Hirsh, A.E. and Fraser, H.B. 2001. Protein dispensability and rate of evolution. Nature 411: 1046-1049[CrossRef][Medline].
  • Hurst, L.D. and Smith, N.G. 1999. Do essential genes evolve slowly? Curr. Biol. 9: 747-750[CrossRef][Medline].
  • Kimura, M. 1983. The neutral theory of molecular evolution. Cambridge University Press, NewYork, NY.
  • Li, W.H. 1993. Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36: 96-99[CrossRef][Medline].
  • -----. 1997. Molecular evolution. Sinauer Associates, Sunderland, MA.
  • Li, W.H. and Graur, D. 1991. Fundamentals of molecular evolution. Sinauer Associates, Sunderland, MA.
  • Makalowski, W. and Boguski, M.S. 1998. Evolutionary parameters of the transcribed mammalian genome: An analysis of 2,820 orthologous rodent and human sequences. Proc. Natl. Acad. Sci. 95: 9407-9412[Abstract/Free Full Text].
  • Ohta, T. 1992. The nearly neutral theory of molecular evolution. Annu. Rev. Ecol. Syst. 23: 263-286[CrossRef].
  • Ohta, T. and Ina, Y. 1995. Variation in synonymous substitution rates among mammalian genes and the correlation between synonymous and nonsynonymous divergences. J. Mol. Evol. 41: 717-720[Medline].
  • Ota, T. and Nei, M. 1994. Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites. J. Mol. Evol. 38: 642-643.
  • Pamilo, P. and Bianchi, N.O. 1993. Evolution of the Zfx and Zfy genes: Rates and interdependence between the genes. Mol. Biol. Evol. 10: 271-281[Abstract].
  • Parkhill, J., Achtman, M., James, K.D., Bentley, S.D., Churcher, C., Klee, S.R., Morelli, G., Basham, D., Brown, D., Chillingworth, T. 2000. Complete DNA sequence of a serogroup A strain of Neisseria meningitidis Z2491. Nature 404: 502-506[CrossRef][Medline].
  • Perna, N.T., Plunkett, G., III, Burland, V., Mau, B., Glasner, J.D., Rose, D.J., Mayhew, G.F., Evans, P.S., Gregor, J., Kirkpatrick, H.A. 2001. Genome sequence of enterohaemorrhagic Escherichia coli O157:H7. Nature 409: 529-533[CrossRef][Medline].
  • Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637[Abstract/Free Full Text].
  • Tatusov, R.L., Galperin, M.Y., Natale, D.A., and Koonin, E.V. 2000. The COG database: A tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 28: 33-36[Abstract/Free Full Text].
  • Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28[Abstract/Free Full Text].
  • Tettelin, H., Saunders, N.J., Heidelberg, J., Jeffries, A.C., Nelson, K.E., Eisen, J.A., Ketchum, K.A., Hood, D.W., Peden, J.F., Dodson, R.J. 2000. Complete genome sequence of Neisseria meningitidis serogroup B strain MC58. Science 287: 1809-1815[Abstract/Free Full Text].
  • Thompson, J.D., Higgins, D.G., and Gibson, T.J. 1994. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22: 4673-4680[Abstract/Free Full Text].
  • Tomb, J.F., White, O., Kerlavage, A.R., Clayton, R.A., Sutton, G.G., Fleischmann, R.D., Ketchum, K.A., Klenk, H.P., Gill, S., Dougherty, B.A. 1997. The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature 388: 539-547[CrossRef][Medline].
  • Walker, D.R. and Koonin, E.V. 1997. SEALS: A system for easy analysis of lots of sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 5: 333-339[Medline].
  • Wilson, A.C., Carlson, S.S., and White, T.J. 1977. Biochemical evolution. Annu. Rev. Biochem. 46: 573-639[CrossRef][Medline].
  • Wolfe, K.H. and Sharp, P.M. 1993. Mammalian gene evolution: Nucleotide sequence divergence between mouse and rat. J. Mol. Evol. 37: 441-456[Medline].

Received January 23, 2002; accepted in revised form March 25, 2002.


12:962-968 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
M. Michaut, S. Kerrien, L. Montecchi-Palazzi, F. Chauvat, C. Cassier-Chauvat, J.-C. Aude, P. Legrain, and H. Hermjakob
InteroPORC: automated inference of highly conserved protein interaction networks
Bioinformatics, July 15, 2008; 24(14): 1625 - 1631.
[Abstract] [PDF]


Home page
CarcinogenesisHome page
R. Yang, B. Frank, K. Hemminki, C. R. Bartram, B. Wappenschmidt, C. Sutter, M. Kiechle, P. Bugert, R. K. Schmutzler, N. Arnold, et al.
SNPs in ultraconserved elements and familial breast cancer risk
Carcinogenesis, February 1, 2008; 29(2): 351 - 355.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
X. Chen, S. F. Perry, S. Aris-Brosou, C. Selva, and T. W. Moon
Characterization and functional divergence of the {alpha}1-adrenoceptor gene family: insights from rainbow trout (Oncorhynchus mykiss)
Physiol Genomics, December 19, 2007; 32(1): 142 - 153.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. M. Andres, C. de Hemptinne, and J. Bertranpetit
Heterogeneous Rate of Protein Evolution in Serotonin Genes
Mol. Biol. Evol., December 1, 2007; 24(12): 2707 - 2715.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. T. Saunders and P. Green
Insights from Modeling Protein Evolution with Context-Dependent Mutation and Asymmetric Amino Acid Selection
Mol. Biol. Evol., December 1, 2007; 24(12): 2632 - 2647.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
Y. Wang, R. G. Kleespies, A. M. Huger, and J. A. Jehle
The Genome of Gryllus bimaculatus Nudivirus Indicates an Ancient Diversification of Baculovirus-Related Nonoccluded Nudiviruses of Insects
J. Virol., May 15, 2007; 81(10): 5395 - 5406.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
M. A. Croxen, P. B. Ernst, and P. S. Hoffman
Antisense RNA Modulation of Alkyl Hydroperoxide Reductase Levels in Helicobacter pylori Correlates with Organic Peroxide Toxicity but Not Infectivity
J. Bacteriol., May 1, 2007; 189(9): 3359 - 3368.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
N. Georgelis, E. L. Braun, J. R. Shaw, and L. C. Hannah
The Two AGPase Subunits Evolve at Different Rates in Angiosperms, yet They Are Equally Sensitive to Activity-Altering Amino Acid Changes When Expressed in Bacteria
PLANT CELL, May 1, 2007; 19(5): 1458 - 1472.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
D. P. Denning and M. F. Rexach
Rapid Evolution Exposes the Boundaries of Domain Structure and Function in Natively Unfolded FG Nucleoporins
Mol. Cell. Proteomics, February 1, 2007; 6(2): 272 - 282.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. Borenstein, T. Shlomi, E. Ruppin, and R. Sharan
Gene loss rate: a probabilistic measure for the conservation of eukaryotic genes
Nucleic Acids Res., January 12, 2007; 35(1): e7 - e7.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. K. Choi, S. C. Kim, J. Seo, S. Kim, and J. Bhak
Impact of Transcriptional Properties on Essentiality and Evolutionary Rate
Genetics, January 1, 2007; 175(1): 199 - 206.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
L. Rohmer, M. Brittnacher, K. Svensson, D. Buckley, E. Haugen, Y. Zhou, J. Chang, R. Levy, H. Hayden, M. Forsman, et al.
Potential Source of Francisella tularensis Live Vaccine Strain Attenuation Determined by Genome Comparison
Infect. Immun., December 1, 2006; 74(12): 6895 - 6906.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
B. E. Shakhnovich and E. V. Koonin
Origins and impact of constraints in evolution of gene families
Genome Res., December 1, 2006; 16(12): 1529 - 1536.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B.-Y. Liao, N. M. Scott, and J. Zhang
Impacts of Gene Essentiality, Expression Pattern, and Gene Compactness on the Evolutionary Rate of Mammalian Proteins
Mol. Biol. Evol., November 1, 2006; 23(11): 2072 - 2080.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Julenius and A. G. Pedersen
Protein Evolution Is Faster Outside the Cell
Mol. Biol. Evol., November 1, 2006; 23(11): 2039 - 2048.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Bern, D. Goldberg, and E. Lyashenko
Data mining for proteins characteristic of clades
Nucleic Acids Res., September 11, 2006; 34(16): 4342 - 4353.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. Seringhaus, A. Paccanaro, A. Borneman, M. Snyder, and M. Gerstein
Predicting essential genes in fungal genomes
Genome Res., September 1, 2006; 16(9): 1126 - 1135.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. D. Bloom, D. A. Drummond, F. H. Arnold, and C. O. Wilke
Structural Determinants of the Rate of Protein Evolution in Yeast
Mol. Biol. Evol., September 1, 2006; 23(9): 1751 - 1761.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. O. Wilke and D. A. Drummond
Population Genetics of Translational Robustness
Genetics, May 1, 2006; 173(1): 473 - 481.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
S. T. Cardona, C. L. Mueller, and M. A. Valvano
Identification of Essential Operons with a Rhamnose-Inducible Promoter in Burkholderia cenocepacia
Appl. Envir. Microbiol., April 1, 2006; 72(4): 2547 - 2555.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
T. Makino and T. Gojobori
The Evolutionary Rate of a Protein Is Influenced by Features of the Interacting Partners
Mol. Biol. Evol., April 1, 2006; 23(4): 784 - 789.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. C. Chiu, E. K. Lee, M. G. Egan, I. N. Sarkar, G. M. Coruzzi, and R. DeSalle
OrthologID: automation of genome-scale ortholog identification within a parsimony framework
Bioinformatics, March 15, 2006; 22(6): 699 - 707.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
W. Gu, Y. Gibert, T. Wirth, A. Elischer, W. Bloch, A. Meyer, O. K. Steinlein, and G. Begemann
Using Gene-History and Expression Analyses to Assess the Involvement of LGI Genes in Human Disorders
Mol. Biol. Evol., November 1, 2005; 22(11): 2209 - 2216.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
G. Fang, E. Rocha, and A. Danchin
How Essential Are Nonessential Genes?
Mol. Biol. Evol., November 1, 2005; 22(11): 2147 - 2156.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. A. Drummond, J. D. Bloom, C. Adami, C. O. Wilke, and F. H. Arnold
Why highly expressed proteins evolve slowly
PNAS, October 4, 2005; 102(40): 14338 - 14343.
[Abstract] [Full Text] [PDF]