Genome Research cityscape

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mackiewicz, P.
Right arrow Articles by Cebrat, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Mackiewicz, P.
Right arrow Articles by Cebrat, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 9, Issue 5, 409-416, May 1999

LETTER
How Does Replication-Associated Mutational Pressure Influence Amino Acid Composition of Proteins?

Pawel Mackiewicz,1 Agnieszka Gierlik,1 Maria Kowalczuk,2 Miroslaw R. Dudek,1 and Stanislaw Cebrat1,3

1 Institute of Microbiology, Wroclaw University, 51-148 Wroclaw, Poland; 2 Institute of Theoretical Physics, Wroclaw University, 50-204 Wroclaw, Poland

    ABSTRACT
Top
Abstract
Introduction
RESULTS
DISCUSSION
References

We have performed detrended DNA walks on whole prokaryotic genomes, on noncoding sequences and, separately, on each position in codons of coding sequences. Our method enables us to distinguish between the mutational pressure associated with replication and the mutational pressure associated with transcription and other mechanisms that introduce asymmetry into prokaryotic chromosomes. In many prokaryotic genomes, each component of mutational pressure affects coding sequences not only in silent positions but also in positions in which changes cause amino acid substitutions in coded proteins. Asymmetry in the silent positions of codons differentiates the rate of translation of mRNA produced from leading and lagging strands. Asymmetry in the amino acid composition of proteins resulting from replication-associated mutational pressure also corresponds to leading and lagging roles of DNA strands, whereas asymmetry connected with transcription and coding function corresponds to the distance of genes from the origin or terminus of chromosome replication.

    INTRODUCTION
Top
Abstract
Introduction
RESULTS
DISCUSSION
References

There are many mechanisms in nucleic acids metabolism that introduce asymmetry into nucleotide composition of the two DNA strands (for review, see Francino and Ochman 1997; Mrázek and Karlin 1998). DNA asymmetry can be described in terms of relations between numbers of the four different nucleotides in DNA strands, or it can be visualized in diagrams representing different kinds of DNA walks. Filipski (1990) first interpreted the asymmetry in G/C content as a result of asymmetric replication-associated mutational pressure in viruses. Lobry showed asymmetry in nucleotide composition of some prokaryotic genomes by two-dimensional DNA walks (Lobry 1996a) and by the analysis of sliding windows (Lobry 1996b). He pointed out that this asymmetry changes its polarity at the origin and terminus of chromosome replication, where DNA strands change their role from leading to lagging or vice versa. Later, Freeman et al. (1998), Mrázek and Karlin (1998), Grigoriev (1998), and McLean et al. (1998) described DNA strand asymmetry in different numerical ways. Mechanisms that might be responsible for the observed asymmetry have been also discussed many times (Freeman et al. 1998; Grigoriev 1998; Mrázek and Karlin 1998). One of the accepted hypotheses states that the potential cause of asymmetry is deamination of methylated cytosines, which leads to thymines. Some investigators believe that this type of substitution differentiates between sense and antisense strands of coding sequences, and transcription mechanism introduces the asymmetry into DNA strands (Beletskii and Bhagwat 1996; Francino et al. 1996; Francino and Ochman 1997; Freeman et al. 1998).

Intergenic noncoding sequences seem to be the most likely to accumulate substitutions. However asymmetry in the third position in codons, which could be the effect of silent substitutions, is also observed. These substitutions are not necessarily neutral; they can lead to asymmetry in the distribution of genes on the chromosome according to the rate of translation of their products. The average codon adaptation index (CAI; Sharp and Li 1987) for genes on the leading strand is different from that for genes located on the lagging strand. Such effects were observed in the Escherichia coli (Francino and Ochman 1997) and Borrelia burgdorferi (McInerney 1998) genomes. These workers observed preferences for transcription of the DNA strand in the direction of replication rather than in the inverse direction, which was reflected by higher numbers of coding sequences on leading than on lagging strands in many genomes (Brewer 1990; Blattner et al. 1997; Kunst et al. 1997). Conversely, some experiments have proved that the frequency of mutations introduced into the nontranscribed DNA strand is higher than those in the transcribed strand (Francino et al. 1996).

Replication is thought to be another cause of strand compositional asymmetry in genomes. Although it is not clear if replication of only one or both strands is discontinuous (Okazaki et al. 1968; Kornberg and Baker 1992; Wang and Chen 1992, 1994), the topology of the replication fork requires different enzymatic mechanisms for the synthesis of leading and lagging DNA strands with different error rates (Kornberg and Baker 1992; Kunkel 1992; Waga and Stillman 1994). Moreover, some experiments have shown that differences in processivity of leading and lagging DNA strands may be responsible for the unequal fidelity of replication of these two strands (Fijalkowska et al. 1998).

Usually, DNA asymmetry analyses of genomes were performed on sliding windows. We have performed detrended DNA walks for nucleotide composition analysis of coding and noncoding sequences. This enables us to distinguish between the mutational effect of replication and the effect of transcription and/or coding functions (Cebrat et al. 1999). We have performed separate analyses of the two DNA strands: the Watson (W) strand (GenBank), and the Crick (C) strand (complementary to W). The asymmetry introduced by replication-associated mechanisms into open reading frames (ORFs) lying on the leading and lagging DNA strands is of the reciprocal sign. Thus, when detrended DNA walks on ORFs situated on the W strand in the scale of the chromosome are added to DNA walks performed on ORFs from the C strand, the values of asymmetry compensate each other and disappear, leaving the effect of asymmetry introduced by other mechanisms (see Methods for details). In contrast, the asymmetry in ORFs resulting from their coding function or transcription is of the same sign independent of their location on leading or lagging strands. Thus, the addition of DNA walks cumulates asymmetry introduced by mechanisms not related to replication-associated mutational
pressure. Addition on both DNA strands results in asymmetries that are the result of the same, unbalanced composition of linked genes from complementary DNA strands.

In this paper we have shown that replication, as well as other mutational pressure mechanisms, is responsible for introducing nucleotide substitutions into DNA that are not silent and change amino acid composition of coded proteins.

    RESULTS
Top
Abstract
Introduction
RESULTS
DISCUSSION
References

In Figure 1a detrended DNA walks on the Treponema pallidum chromosome have been shown, illustrating a nucleotide composition of ORFs of >150 codons, not counting shorter ORFs from overlapping pairs of ORFs, situated on the W strand (i.e., the coding strand lies on the W strand). In the T. pallidum genome, ~60% of coding sequences are located on the leading DNA strand. Because the walks in Figure 1a are presented in the scale of the new sequence of the spliced ORFs, the measure of asymmetry in coding density is the value of shifts of the extrema at the middle of the x-axis. In these DNA walks numbers on the x-axis correspond to coordinates of the walker on the sequences of spliced conescutive ORFs situated on the W strand, not to their real coordinates on the chromosome. Numbers on the y-axis represent differences between the found number of the analyzed nucleotide and its expected number if ORFs were distributed evenly on the chromosome independently of W or C strands or leading/lagging DNA strands. Figure 1b presents the same DNA walks for T. pallidum but in the scale of the chromosome. In these walks numbers on the x-axis represent the real coordinates of ORFs on the chromosome. These walks lose their information on coding density. The extrema have to be in the middle of the plot, where the replication terminus is situated. Figure 1, c and d, shows analogous walks for the C strand. In Figure 1e the results of subtraction of the walks presented in Figure 1, b and d, (for strands W and C) are presented. Let us assume (after Beletski and Bhagwat 1996; Francino et al. 1996; Freeman et al. 1998) that transcription introduces asymmetry by preferentially high mutation rate in the nontranscribed strand. Then, whether the ORF is on a leading or a lagging strand, asymmetry is of the same sign and subtraction should eliminate it. Because the effect of asymmetry introduced by replication is of the reciprocal sign on leading and lagging strands, subtraction cumulates it. Thus, Figure 1e represents the cumulative effect of replication-associated mutational pressure on ORFs lying on both DNA strands. In contrast, the addition of walks performed on W and C strands eliminates the asymmetry introduced by replication, leaving the cumulative effect of transcription-associated mutational pressure (Fig. 1f). One can also expect to see local asymmetry after addition of DNA walks if genes mapped on chromosomes in the same region, independent of the leading or lagging strand, have specific bias in nucleotide composition. In Figure 2, the results of subtraction of DNA walks (W-C) for six eubacterial genomes are shown. Note that in all plots but Figure 2f, the replication terminus is in the center of the x-axis. Figure 2f represents DNA walks on the linear genome of B. burgdorferi, where the origin of replication is in the middle of the chromosome (the center of x-axis). In all of these genomes the leading strand is relatively richer in G than the lagging strand. This is also true for other eubacterial genomes (supplementary information available at www.genome.org and http://smorfland. microb.uni.wroc.pl).


View larger version (41K):
[in this window]
[in a new window]
 
Figure 1   Detrended DNA walks on T. pallidum ORFs longer than 150 codons. (a) All ORFs of the W strand were spliced, and the walks were performed in the scale of the new sequence (x-axis). The vertical line indicates the middle of the analyzed sequence; numbers on the y-axis indicate the relative cumulative abundance of the nucleotide (see Methods). (b) The same walks but the scale of the x-axis shows the position of ORFs on the chromosome. (c,d) The same kinds of walks done on the C strand. (e) The result of subtraction of walks presented on b and d. (f) The results of the addition of walks presented in b and d. The terminus of replication is at the extrema of the diagrams.


View larger version (42K):
[in this window]
[in a new window]
 
Figure 2   The result of subtraction of the DNA walk on strand C from the DNA walk on strand W for ORFs longer than 150 codons for the genomes of Chlamydia trachomatis (a); Mycobacterium tuberculosis (b); Escherichia coli (c); Helicobacter pylori (d); Haemophilus influenzae (e); and Borrelia burgdorferi (f). In all plots but f the terminus of replication is in the center of the x-axis. In the plot representing the B. burgdorferi genome, the origin of replication is in the center of the diagram.

The analysis of DNA walks on W and C strands done for the third positions in coding sequences and for intergenic sequences of the T. pallidum genome is shown in Figure 3. These results suggest that the asymmetry introduced by replication-associated mutational pressure into the third codon positions resembles that of intergenic sequences. Analyses of the results of addition of DNA walks on intergenic sequences have not shown any traces of asymmetry introduced by mechanisms other than replication-associated ones.


View larger version (14K):
[in this window]
[in a new window]
 
Figure 3   The result of subtraction of the DNA walk on the C strand from the DNA walk on the W strand for the third positions in codons (a) and the DNA walk on the W strand of intergenic sequences (b) of the T. pallidum genome. To make possible a quantitative comparison of a and b we have multiplied the values on the scale of y-axis for intergenic sequences---sequences that do not belong to any ORF longer than 70 codons---by a factor of 1.8 (the number of nucleotides in analyzed intergenic sequences was 1.8 times lower than the number of nucleotides in the third positions.

Some mutations of the third positions in codons, for example, almost all transitions, are silent, but others are not and belong to the class of missenes mutations. If we assume that most of the accumulated mutations are in the fourfold-degenerated codons, in which each mutation in the third position is silent, we should find differences in the accumulation of mutations in codons where transversions in the third positions are missense (twofold-degenerated codons). To check this, we have performed separate walks on twofold- and fourfold-degenerated codons. Both classes of codons accumulate mutations, and some of these mutations (transversions in twofold-degenerated codons) are of the missense class. The DNA walks presenting asymmetric accumulation of mutations in the twofold- and fourfold-degenerated codons in eubacterial genomes are presented at www.genome.org and http://smorfland.microb.uni.wroc.pl.

Because a transversion even in the third positions can change the encoded amino acid, we have performed walks on amino acids coded by ORFs lying on the two DNA strands, and we have subtracted and added the resulting walks to separate the effect of replication-associated mutational pressure from the effect of transcription and/or other effects. In Figure 4 the effect of replication on amino acid composition of proteins coded by genes on leading and lagging strands of T. pallidum, Chlamydia trachomatis, and B. burgdorferi genomes is shown. Analyzing the results of the subtraction of walks, we have found amino acids that prevail on the leading or lagging strand in different genomes. In E. coli, Bacillus subtilis, T. pallidum, B. burgdorferi, and C. trachomatis Gly, Val, and Asp were coded relatively more frequently on the leading strand, whereas Ile, Thr, and His were more prevalent on the lagging strand. Nevertheless, eubacterial genomes differ significantly in prevalence of specific amino acids on leading or lagging strands (supplementary information available at www.genome.org and http://smorfland.microb.uni.wroc.pl). These results prove that the skew found previously in the prevalence of some codons in genes transcribed in the direction of replication (Fraser et al. 1998) is connected to replication-associated mutational pressure.


View larger version (32K):
[in this window]
[in a new window]
 
Figure 4   The effect of subtraction of walks on amino acids for the genomes T. pallidum (a), C. trachomatis (b), and B. burgdorferi (c). Numbers on the y-axis indicate the relative cumulative abundance of amino acids.

In the T. pallidum genome no effects other than those connected with the leading/lagging role of DNA strands on protein composition have been observed. However, in large genomes (E. coli and B. subtilis) addition of DNA walks done for ORFs from W and C strands differentiates between regions proximal and distal to the origin of replication of the chromosome (Fig. 5). Note that replication-associated effects divide chromosomes into two replichores---left and right---with extrema in the center of plots. Other effects that we have observed are connected with proximal/distal parts of chromosomes with extrema near the middle of replichores. The trends at the left and right ends of the plot (Fig. 5) are the same and reciprocal to the trends in the central part of the plots. The central part of the plot corresponds to the region close to the terminus of replication (from both sides), and both ends of plots correspond to regions close to the origin of replication (from both sides).


View larger version (37K):
[in this window]
[in a new window]
 
Figure 5   The effect of addition of walks on amino acids for two groups of amino acids coded by the B. subtilis genome.

Thus, in the region close to the replication terminus of the B. subtilis genome (Fig. 5) different trends are observed for different groups of amino acids. Generally, hydrophobic amino acids are more abundant in proteins coded by the proximal region of the chromosome, whereas hydrophilic amino acids are more abundant in proteins coded by regions close to the terminus of chromosome replication.

Information about the asymmetry in DNA nucleotide composition that reflects the asymmetry in amino acid composition of proteins can be shown in a more degenerated form by analyzing DNA walks done for the first, second, and third positions in codons (Fig. 6, results for the B. subtilis genome). The asymmetry is seen even in the second positions (Fig. 6c), which are crucial for the properties of the amino acids coded. Still, the effect of the replication-associated mutational pressure on the second positions is weaker (Fig. 6d) than that of transcription and/or other coding functions (Fig. 6c). Especially in the second positions, asymmetry is seen in the A/T ratio in proximal versus distal parts of the chromosome. Asymmetry in the third positions is seen for both leading versus lagging strands (Fig. 6f) and proximal versus distal parts of chromosomes (Fig. 6e).


View larger version (42K):
[in this window]
[in a new window]
 
Figure 6   Addition (a,c,e) and subtraction (b,d,f) of DNA walks performed on nucleotides in different positions in codons of ORFs longer than 150 codons of the B. subtilis genome. (a,b) First positions; (c,d) second positions; (e,f) third positions.

    DISCUSSION
Top
Abstract
Introduction
RESULTS
DISCUSSION
References

The effect of replication-associated mutational pressure on nucleotide compositional bias of eubacterial chromosomes can be separated from the effects introduced by other mechanisms. Usually, the transcription itself and transcription-coupled repair are blamed for introducing mutations into coding sequences. In such cases, these substitutions should be similar, independent of the location of the transcribed strand of the gene on the leading or lagging DNA strand. If it is purely an effect of transcription, it should be also independent of the distance from the origin or terminus of replication, unless there is some other correlation between the rate or frequency of transcription and the location of genes on chromosome (in respect of proximal/distal location). In such a case, the observed trends can be created by various nucleotide substitutions resulting from different transcription rate in proximal and distal regions of the chromosome.

We have observed some classes of substitution that seem to be characteristic of ORFs lying in the same region of the chromosome but in opposite directions. For example, after the addition of walks, there is an evident surplus of adenine over thymine in the second positions of codons in the distal region of chromosome in the B. subtilis genome. The bias observed in the three positions of codons is affected very weakly by replication-associated mutational pressure (see subtraction of walks in Fig. 6) and possesses some specific features:
1. There is no correlation between this asymmetry and types of substitutions in any other positions of codons or intergenic sequences.
2. It is not introduced by replication-associated mutational pressure (it has the same sign for ORFs of leading and lagging strands).
3. The A/T relations in the second positions reflect the hydrophilic/hydrophobic amino acid composition in coded proteins.

Thus, it seems reasonable to accept the hypothesis that this effect is caused by nonrandom selection of recombinants with preferential location of genes coding for hydrophobic (transmembrane) proteins near the origin of replication. One can argue that this asymmetry can be generated by insertion of phage genomes or a local grouping of genes with the same compositional bias. This might explain irregularities near the origin of the B. subtilis genome, where >20 ribosomal proteins are coded. Nevertheless, almost half of the B. subtilis genome (halves of replichores from both sides of the terminus) are relatively richer in adenine in the second positions in codons.

On the other hand, the nonrandom topology of microbial genomes can be a mechanism of gene control by discrimination. In fast-dividing cells, the copy number of proximal genes can be up to eight times higher than that of distal genes (Cooper and Helmstetter 1968). This reflects the topology of replication when the cell cycle is shorter than the time needed for replication of the whole chromosome. Nevertheless, it is possible that the composition of the third positions in codons, influencing the rate of translation, is superimposed on other levels of gene control. We have observed different codon usage in proximal and distal regions in relatively large genomes of B. subtilis and E. coli (Fig. 6e), but we have not found such differences in the smaller genomes of Treponema or Borrelia (as observed previously by Karlin et al. 1998). Thus, the relations in abundance of products of proximal versus distal genes can change under different growing conditions.

Whereas the effect seen after the elimination of replication-associated mutational pressure seems to depend on recombinational events, the role of replication-associated pressure itself in amino acid composition of proteins is not so obvious. Amino acid composition of proteins depends on the location of coding genes on leading or lagging DNA strands. In many genomes the effect of position on the leading or lagging strand is seen not only in the third codon position but also in the first position (Fig 6b). It is difficult to assume that all of the accumulated substitutions are neutral. Thus, it seems that the selection for appropriate positions of genes and/or codon usage controlling rate of translation are also responsible for the observed leading/lagging strand nucleotide composition asymmetry. The topology of the B. burgdorferi genome supports this view. There is an extremely strong effect of position on the leading versus the lagging strand in that genome. Codon usage on both strands is quite distinct (McInerney 1998), and coding sequences of the two DNA strands are so different that they form two nonoverlapping sets of genes when their contents are analyzed (data not shown). However, genes are not randomly distributed into these two sets---all genes coding for ribosomal proteins are located on the leading strand---which supports the hypothesis that recombination is also responsible for the specific distribution of the genes generating the nucleotide bias in a genome. This could explain the relative conservation of the general genetic map topology of related bacterial genera, for example, Escherichia and Salmonella (Wilkins 1988).

Because amino acid composition of proteins strongly depends on the positions of their genes on the chromosome, any phylogenetic analysis, as well as algorithms recognizing coding sequences by content sensors, should respect the location of ORFs on the chromosome. Furthermore, for some genomes it is important to know only the leading/lagging positions (i.e., T. pallidum, B. burdorferi, but for other genomes additional information about proximal/distal location might be required (i.e., E. coli, B. subtilis).

Data for Analysis and Methods

The results presented in this paper were obtained by analysis of prokaryotic genomic sequences downloaded from the following: ftp://ftp.pasteur.fr (2/18/98): B. subtilis (Kunst et al. 1997); http://utmmg.med.uth.tmc.edu (2/16/98): T. pallidum (Fraser et al. 1998); http://www.genetics.wisc.edu (11/14/97): E. coli (Blattner et al. 1997); http://www.ncbi.nlm.nih.gov (10/13/97): Haemophilus influenzae (Fleischmann et al. 1995); http://www.ncbi.nlm.nih.gov (10/16/98): Helicobacter pylori (Tomb et al. 1997); http://www.ncbi.nlm.nih.gov (10/30/98): Mycobacterium tuberculosis (Cole et al. 1998); http://www.ncbi.nlm.nih.gov (11/13/98): C. trachomatis (Stephens et al. 1998); and http://www.ncbi.nlm.nih.gov (3/3/98): B. burgdorferi (Fraser et al. 1997). The data have not been updated after the date of retrieval.

To show DNA compositional bias, different DNA walks and their transformations were done. Detailed descriptions of DNA walks, their possible interpretation, and nomenclature are according to Cebrat and Dudek (1998).

To demonstrate local trends independent of coding functions, we performed detrended DNA walks, in which we eliminated strong trends resulting from base composition of coding ORFs (Cebrat et al. 1997; Cebrat and Dudek 1998) because they mask the asymmetry of strands introduced by mutational pressure.

To eliminate these coding trends we counted the following value for a given ORF: J = [N- (F × L), where J = the value of the walker jump for the ORF, N = the number of nucleotides (A, T, G, or C) in the analyzed positions of the ORF, F = the frequency of the given nucleotide at the examined positions in the whole set of analyzed ORFs, and L = the length of the given ORF in codons. When intergenic sequences were analyzed, F was the frequency of the nucleotide in the whole set of intergenic sequences and L was the length of the visited sequence in nucleotides. We applied an analogous procedure to the distribution analysis of amino acids on the chromosome. In this case, in the above equation, we inserted the number of analyzed amino acid residues instead of N and the frequency of the given amino acid in the set of the analyzed ORFs instead of F.

When walks for the two strands were added, the walker visited nonoverlapping ORFs of both strands as they appeared on the chromosome, scanned them in the proper reading frame, and moved according to the result of scanning. When the walks for the C strand were subtracted from the walks for the W strand, the value of the walker jump for each ORF in the C strand was multiplied by -1. Note that detrended walks done in the scale of the chromosome lose their information on asymmetry in the total length of ORFs on leading and lagging strands. That is why addition of these walks done for ORFs of W and C strands eliminates the effect of replication associated with mutational pressure and does not depend on differences in coding density of leading versus lagging strands.

Availability of the Results

The detailed results of coding sequence asymmetry for all eubacterial and archaebacterial genomes completed so far are available on the World Wide Web at the following addresses: www.genome.org and http://smorfland.microb.uni.wroc.pl.

    ACKNOWLEDGMENTS

This work was supported by The State Committee for Scientific Research (6PO4A 030 14).

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.

    FOOTNOTES

3 Corresponding author.

E-MAIL cebrat{at}angband.microb.uni.wroc.pl; FAX 48-71-3252-151.

    REFERENCES
Top
Abstract
Introduction
RESULTS
DISCUSSION
References

  • Beletskii, A. and A.S. Bhagwat. 1996. Transcription-induced mutations: Increase in C to T mutations in the nontranscribed strand during transcription in Escherichia coli. Proc. Natl. Acad. Sci. 93: 13919-13924[Abstract/Free Full Text].
  • Blattner, F.R., G. Plunkett, III, C.A. Bloch, N.T. Perna, V. Burland, M. Riley, J. Coolado-Vides, J.F. Glasner, C.K. Rode, G.F. Mayhew 1997. The complete genome sequence of Escherichia coli K-12. Science 277: 1453-1462[Abstract/Free Full Text].
  • Brewer, B.J. 1988. When polymerases collide: Replication and the transcriptional organization of the E. coli chromosme. Cell 53: 679-686[CrossRef][Medline].
  • Cebrat, S. and M.R. Dudek. 1998. The effect of DNA phase structure on DNA walks. Eur. Phys. J. B 3: 271-276.[CrossRef]
  • Cebrat, S., M.R. Dudek, P. Mackiewicz, M. Kowalczuk, and M. Fita. 1997. Asymmetry of coding versus non-coding strands in coding sequences of different genomes. Microb. Comp. Genomics 2: 259-268.[Medline]
  • Cebrat, S., M.R. Dudek, A. Gierlik, M. Kowalczuk, and P. Mackiewicz. 1999. Effect of replication on the third base of codons. Physica A 265: 78-94.[CrossRef]
  • Cole, S.T., R. Brosch, J. Parkhill, T. Garnier, C. Churcher, D. Harris, S.V. Gordon, K. Eiglmeier, S. Gas, C.E. Barry, III 1998. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393: 537-544[CrossRef][Medline].
  • Cooper, S. and C.E. Helmstetter. 1968. Chromosome replication and the division of E. coli B/r. J. Mol. Biol. 43: 1-12[CrossRef].
  • Fijalkowska, I.J., P. Jonczyk, M. Maliszewska Tkaczyk, M. Bialoskorska, and R.M. Schaaper. 1998. Unequel fidelity of leading strand and lagging strand DNA replication on the Escherichia coli chromosome. Proc. Natl. Acad. Sci. 95: 10020-10025[Abstract/Free Full Text].
  • Filipski, J. 1990. Evolution of DNA sequences. Contributions of mutational bias and selection to the origin of chromosomal compartments. In Advances in mutagenesis research 2 (ed. G. Obe), pp. 1-54. Springer-Verlag, Berlin, Germany.
  • Fleischmann, R.D., M.D. Adams, O. White, R.A. Clayton, E.F. Kirkness, A.R. Kerlavage, C.J. Bult, J.F. Tomb, B.A. Dougherty, J.M. Merrick 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269: 496-512[Abstract/Free Full Text].
  • Francino, M.P. and H. Ochman. 1997. Strand asymmetries in DNA evolution. Trends Genet. 13: 240-245[CrossRef][Medline].
  • Francino, M.P., L. Chao, M.A. Riley, and H. Ochman. 1996. Asymmetries generated by transcription-coupled repair in enterobacterial genes. Science 272: 107-109[Abstract].
  • Fraser, C.M., S. Casjens, W.M. Huang, G.G. Sutton, R. Clayton, R. Lathigra, O. White, K.A. Ketchum, R. Dodson, E.K. Hickey 1997. Genome sequence of a Lyme disease spirochaete, Borrelia burgdorferi. Nature 390: 580-586[CrossRef][Medline].
  • Fraser, C.M., S.J. Norris, G.M. Weinstock, O. White, G.G. Sutton, R. Dodson, M. Gwinn, E.K. Hickey, R. Clayton, K.A. Ketchum 1998. Complete genome sequence of Treponema pallidum, the syphilis spirochete. Science 281: 375-388[Abstract/Free Full Text].
  • Freeman, J.M., T.N. Plasterer, T.F. Smith, and S.C. Mohr. 1998. Patterns of genome organization in bacteria. Science 279: 1827[Free Full Text]. Full text at www.sciencemag.org/cgi/content/full/279/5338/1827a.
  • Grigoriev, A. 1998. Analyzing genomes with cumulative skew diagrams. Nucleic Acids Res. 26: 2286-2290[Abstract/Free Full Text].
  • Karlin, S., A.M. Campbell, and J. Mrázek. 1998. Comparative DNA analysis across diverse genomes. Annu. Rev. Genet. 32: 185-225[CrossRef][Medline].
  • Kornberg, A. and T.A. Baker. 1992. DNA replication. Freeman, New York, NY.
  • Kunkel, T.A. 1992. Biological asymmetries and the fidelity of eukaryotic DNA replication. BioEssays 14: 303-308[CrossRef][Medline].
  • Kunst, F., N. Ogasawara, I. Moszer, A.M. Albertini, G. Alloni, V. Azevedo, M.G. Bertero, P. Bessieres, A. Bolotin, S. Borchert 1997. The complete genome sequence of the gram-positive bacterium Bacillus subtilis. Nature 390: 249-256[CrossRef][Medline].
  • Lobry, J.R. 1996a. A simple vectorial representation of DNA sequence for the detection of replication origins in bacteria. Biochimie 78: 323-326[Medline].
  • -----. 1996b. Asymmetric substitution patterns in the two DNA strands of bacteria. Mol. Biol. Evol. 13: 660-665[Abstract].
  • McInerney, J.O. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc. Natl. Acad. Sci. 95: 106698-10703.
  • McLean, M.J., K.H. Wolfe, and K.M. Devine. 1998. Base composition skews, replication orientation, and gene orientation in 12 prokaryote genomes. J. Mol. Evol. 47: 691-696[CrossRef][Medline].
  • Mrázek, J. and S. Karlin. 1998. Strand compositional asymmetry in bacterial and large viral genomes. Proc. Natl. Acad. Sci. 95: 3720-3725[Abstract/Free Full Text].
  • Okazaki, R., T. Okazaki, K. Sakabe, K. Sugimoto, and A. Sugino. 1968. Mechanism of DNA chain growth. I. Possible discontinuity and unusual secondary structure of newly synthesized chains. Proc. Natl. Acad. Sci. 59: 598-605[Free Full Text].
  • Sharp, P.M. and W.-H. Li. 1987. The codon adaptation index: a measure of directional synonymous codon usage bias and its potential applications. Nucleic Acids Res. 15: 1281-1295[Abstract/Free Full Text].
  • Stephens, R.S., S. Kalman, C. Lammel, J. Fan, R. Marathe, L. Aravind, W. Mitchell, L. Olinger, R.L. Tatusov, Q. Zhao 1998. Genome sequence of an obligate intracellular pathogen of humans: Chlamydia trachomatis. Science 282: 754-759[Abstract/Free Full Text].
  • Tomb, J.F., O. White, A.R. Kerlavage, R.A. Clayton, G.G. Sutton, R.D. Fleischmann, K.A. Ketchum, H.P. Klenk, S. Gill, B.A. Dougherty 1997. The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature 388: 539-547[CrossRef][Medline].
  • Waga, S. and B. Stillman. 1994. Anatomy of a DNA replication fork revealed by reconstitution of SV40 DNA replication in vitro. Nature 369: 207-212[CrossRef][Medline].
  • Wang, T.C. and S.H. Chen. 1992. Similar-sized daughter-strand gaps are produced in the leading and lagging strands of DNA in UV-irradiated E. coli uvrA cells. Biochem. Biophys. Res. Commun. 184: 1496-1503[CrossRef][Medline].
  • -----. 1994. DNA fragments contain equal amounts of lagging-strand and leading-strand sequences. Biochem. Biophys. Res. Commun. 198: 844-849[CrossRef][Medline].
  • Wilkins, B.M. 1988. Organization and plasticity of enterobacterial genomes. Soc. Appl. Bacteriol. Symp. Ser. 17: 51-69.

Received July 10, 1998; accepted in revised form March 17, 1999.


9:409-416 ©1999 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/99 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Genome Res.Home page
E. P.C. Rocha, M. Touchon, and E. J. Feil
Similar compositional biases are caused by very different mutational effects
Genome Res., December 1, 2006; 16(12): 1537 - 1547.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Mackiewicz, J. Zakrzewska-Czerwinska, A. Zawilak, M. R. Dudek, and S. Cebrat
Where does bacterial replication start? Rules for predicting the oriC region
Nucleic Acids Res., July 16, 2004; 32(13): 3781 - 3791.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
E. P. C. Rocha
The replication-related organization of bacterial genomes
Microbiology, June 1, 2004; 150(6): 1609 - 1627.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Palacios and J. J. Wernegreen
A Strong Effect of AT Mutational Bias on Amino Acid Usage in Buchnera is Mitigated at High-Expression Genes
Mol. Biol. Evol., September 1, 2002; 19(9): 1575 - 1584.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
E. P. C. Rocha and A. Danchin
Ongoing Evolution of Strand Composition in Bacterial Genomes
Mol. Biol. Evol., September 1, 2001; 18(9): 1789 - 1799.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. P. Francino and H. Ochman
Deamination as the Basis of Strand-Asymmetric Evolution in Transcribed Escherichia coli Sequences
Mol. Biol. Evol., June 1, 2001; 18(6): 1147 - 1150.
[Full Text]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mackiewicz, P.
Right arrow Articles by Cebrat, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Mackiewicz, P.
Right arrow Articles by Cebrat, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?


Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.