|
|
|
|
Genome Res. 13:2475-2484, 2003 ©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00 Methods Fast Evaluation of Fluctuations in Biochemical Networks With the Linear Noise ApproximationDepartment of Cell & Molecular Biology, Uppsala University, BMC, 751 24 Uppsala, Sweden
Biochemical networks in single cells can display large fluctuations in molecule numbers, making mesoscopic approaches necessary for correct system descriptions. We present a general method that allows rapid characterization of the stochastic properties of intracellular networks. The starting point is a macroscopic description that identifies the system's elementary reactions in terms of rate laws and stoichiometries. From this formulation follows directly the stationary solution of the linear noise approximation (LNA) of the Master equation for all the components in the network. The method complements bifurcation studies of the system's parameter dependence by providing estimates of sizes, correlations, and time scales of stochastic fluctuations. We describe how the LNA can give precise system descriptions also near macroscopic instabilities by suitable variable changes and elimination of fast variables.
A key element in systems biology is the design of mathematical models that faithfully describe the dynamics of intracellular chemical networks. In general, chemical reactions in single cells occur far from thermodynamic equilibrium (Keizer 1987 (Poisson statistics). This would mean that if the average number (n) of molecules of a certain kind in an Escherichia coli cell is 100 or more (corresponding to a concentration >0.2 µM; Donachie and Robinson 1987 /n = 1/ ) would be 10% or less. Accordingly, when there are more than, say, 1000 molecules in the cell, then the relative fluctuations would be only 3% and therefore insignificant in most instances. However, intracellular chemical reactions take place far from equilibrium. Therefore, Poisson statistics may not apply, and the stochastic fluctuations around an average of n molecules can be much larger as well as much smaller than (Paulsson and Ehrenberg 2001
Chemical reactions in the test tube are normally described macroscopically by (1) applying the law of mass action (Cornish-Bowden 1995
To deal with the stochastic nature of these reactions, one has to abandon the macroscopic perspective of chemical reactions, where fluctuations are neglected, in favor of mesoscopic approaches (van Kampen 1997
Large relative fluctuations of copy numbers characterizes many mRNA levels in bacterial cells, but there are also other cases in which noise becomes highly significant, also when the average molecule copy numbers are very large. One example is systems with a continuum of macroscopic stationary states, like a limit cycle attractor (Barkai and Leibler 2000
No simple methods for the mesoscopic analysis of such systems exist today, and numerical approaches are often difficult to carry out because of the high computational demands normally associated with this type of problem. Monte Carlo methods, such as the Gillespie algorithm (Gillespie 1977
Here, we present a general method, based on the linear noise approximation (LNA; van Kampen 1997 All examples are chosen so that analytical approximations of solutions of the Master equation can be obtained, and these approximations are then used to clarify the origin of fluctuations in each case and to trace the effects of different parameters on the stochastic properties of the system. How the present theory can be applied to arbitrarily large biochemical networks is described in the Discussion.
The Linear Noise Approximation The chemical Master equation describes how the joint probability distribution of the copy numbers of different chemical species evolve in time in a spatially homogenous chemical system (van Kampen 1997 -expansion of the Master equation (van Kampen 1997 -expansion means that the Master equation is Taylor-expanded near macroscopic system trajectories or stationary solutions in powers of 1/ , where is the system volume. When the Master equation is approximated near a macroscopically stable stationary solution, terms of first order in 1/ give the macroscopic rate equations, and terms of second order in 1/ give the Linear Noise Approximation (LNA). The rationale behind this approach is that for constant average concentrations, relative fluctuations will tend to decrease with the inverse of the square root of the volume. The LNA is therefore very accurate for systems for which the local linearization of the chemical rate laws is valid for component copy numbers that are frequently reached by fluctuations away from the stationary state. This is true when fluctuations are sufficiently small in relation to the component means. However, we have found that the LNA can give very good estimates of fluctuations in molecule numbers, also when they are larger than the corresponding means, and, furthermore, that the technique of separation of fast variables (see below) can be used to improve further the performance of the LNA.
Here, the LNA is derived for the stationary state of a general system of chemical reactions (Supplemental material available online at www.genome.org; Methods). The correlation matrix, C, containing the variances and covariances for the fluctuations of all the components in the system, is calculated from the linearized macroscopic rate equations near the stationary point as described by the Jacobian matrix, A, and the stoichiometries of the reactions and their rates as described by the diffusion matrix, D. The relation between the matrix C, containing information about the fluctuations, and the macroscopic matrices A and D is given by a Lyapunov matrix equation, equation 32.
The first example is a system that contains only one type of molecule, X. These molecules are synthesized and consumed by two different reactions with rate laws f1(
, the equation for the deviation![]() from steady state is
is the compounded rate constant (in units of sec-1) by which the system relaxes back to steady state after a perturbation. From equation 37 in Methods, the LNA approximation C of the variance X2 for the number of X molecules at the steady state is
X2 normalized to the average number (X) of molecules is often called the Fano-factor (Fano 1947
![]() of the number of X molecules in the steady state. The numerical magnitude of the Fano-factor is primarily determined by the ratio between the rate f1( )/ of turnover of all the molecules in the X-pool and the rate of relaxation . The Fano-factor is also shaped by the creation (n) and elimination (m) stoichiometries. Below we use equation 6 to inspect the role of the stoichiometries (Example 1) and the role of the relaxation rate (Example 2) for the stochastic properties of the scheme in equation 2.
Example 1: The Mesoscopic Importance of Stoichiometries in Chemical Reactions
To illustrate the role of the stoichiometries, we compare two different reaction systems (A and B) with different reaction stoichometries:
Example 2: Zero-Order Sensitivity and the Size of Mesoscopic Fluctuations
We exemplify this by treating the case with a protein that is converted from its unmodified state X1 to its modified state X2 by one enzyme and back to X1 from X2 by another. With a constant total concentration
1) and f2( 1), respectively. The modifying enzymes obey Michaelis-Menten kinetics, and when steady-state relations for their internal states are rapidly established (see discussion about elementary complex reactions in Methods), the rate equations are given by
![]() 1 is the LNA estimate of the average number of X1 molecules. In Figure 1A, this estimate is compared with the correct value as calculated directly from the Master equation, for different values of Vmax2. The sensitivity amplification (logarithmic gain; Savageau 1976
1 in response to the percentage change in Vmax2. The system can be said to be ultrasensitive if |a 1Vmax1| > 1 (Goldbeter and Koshland Jr. 1982 0 >> Km, both enzymes are close to saturation when Vmax1 = Vmax2 and the reaction displays ultrasensitivity (Goldbeter and Koshland Jr. 1981
The Master equation for the probability of having X1 unmodified molecules can be solved analytically in special cases (Berg et al. 2000
0)/(2Km), is obtained when Vmax1 = Vmax2. The LNA of the Fano-factor (equation 15) is plotted in Figure 1B for increasing Vmax2 and is compared with values calculated from the exact Master equation for some parameter sets. The comparison reveals that the LNA is very accurate in these cases.
The Fano-factor are much larger than 1, as seen in Figure 1B. The reason for these large fluctuations and the strong deviation from Poisson statistics (Fano-factor 1), is high turnover rates of the molecules at steady state in combination with slow relaxation back to steady state. This is exactly what is described in equation 15, where f1/
Example 3: Coupled Fluctuations in Multiple Component Systems Although the Master equation is usually impossible to solve for multicomponent systems, the LNA can be used to estimate the covariance matrix directly from the macroscopic rate equations and the stoichiometry matrix as described in Methods.
To illustrate, we consider an anabolic system, where, respectively, molecules X and Y are synthesized from reservoirs A and B at rates k1 and k2. X and Y are assumed to associate irreversibly with association rate constant k5 in the formation of a heterodimer C. The molecules X and Y also decay with first-order rate constants k3 and k4, respectively, as described by the scheme:
1 and 2, respectively. This general system is analyzed in the Supplemental material. Here, we focus on the most interesting parameter regime, where X and Y are synthesized at equal rates (k1 = k2 = ), and decay with equal and low rates (k3 = k4 = µ), that is,
. This, as we shall see, is the condition for anomalously large component number fluctuations.
The macroscopic differential equations for the concentrations and their steady-state values are
1 = ![]() k5. The stochastic reaction events will make the concentrations fluctuate freely (free diffusion) on this curve, such that the variance is infinite.
When 0 < µ <<
The primary source for the anomalously large fluctuations and the highly correlated fluctuations is therefore, again, high turnover of the molecule pools compared with the (decay) rate of relaxation to steady state. This is quantified in the LNA of the system as described in Methods.
In the LNA of the two-component system, given by equation 18, the fluctuations are analyzed in the variables w =
/ ). The variance in w( 2w) as calculated by the LNA, normalized to the number of molecules in one of the pools at steady state, is
/ ) and the relaxation rate µ that causes the very large fluctuations in w. Results from the LNA of the full system (equation 17) over large parameter regions are shown in Figure 2. The LNA is compared with solutions of the Master equation at some points. A major advantage of the LNA is that the stochastic properties of a system can be rapidly scanned over a large parameter space at a very modest computational cost.
Example 4: Elimination of Fast Variables
It should be noted that this technique does not mean that one just assumes rapid equilibration among some of the molecular species. The number of relevant slow variables can, in fact, be lower than such a direct elimination would presuppose. To illustrate the principle, we exemplify with a four-component system, in which the large fluctuations are contained in just one slow variable. The example is an extended version of the system discussed in Example 3 above:
The results in this work follow in a straightforward way from the mesoscopic treatises by van Kampen (1997
We have outlined a direct path (Methods) from macroscopic descriptions of intracellular chemical networks to the linear noise approximations (van Kampen 1997 This type of analysis facilitates screening of any intracellular network with respect to its stochastic properties. The characteristics of the covariance and Jacobian matrices will decide when fluctuations are so small and fast that a system can be analyzed macroscopically. They will reveal the existence of anomalously large fluctuations that require mesoscopic approaches for precise system descriptions.
Our results show how giant fluctuations in multidimensional systems can arise owing to large terms in the diffusion matrix (large turnover rates of component pools that generate internal noise) and small eigenvalues of the Jacobian matrix (weak restoring forces). When there is a single stable, but weakly attracting, stationary state so that diffusion is almost free in its neighborhood, very large relative fluctuations can occur for systems of finite size. A striking example is the behavior of anabolic reactions in which two or more substrates are independently synthesized and then irreversibly joined by an unsaturated enzyme. When the rates of synthesis of all substrates are balanced, all combinations of substrate concentrations that are compatible with the joint exit flow are macroscopically allowed, and there is free diffusion throughout this continuum of states. In growing systems or in systems in which the rates of synthesis are attenuated by feedback or product inhibition there will be a single stationary state, but it is often so weakly constrained that highly significant fluctuations prevail (Elf et al. 2003b
Our discussion of the LNA complements existing numerical tools (Sauro 1993
Our general analysis, exemplified by concrete examples of how anomalous fluctuations arise in intracellular networks, has also been extended beyond direct applications of the LNA to intracellular systems. When fluctuations are large compared with the range of validity of linear rate laws, then the LNA is expected to fail in many cases. We have analyzed systems in which the macroscopic stationary states are almost degenerate. In those cases, the eigenvalues of the Jacobian are very small, although the flows through the system are significant. Here, large fluctuations are associated with slow relaxation modes, and we have used this system property to systematically eliminate fast variables (Gardiner 1985
We have described several cases in which the LNA or the LNA improved by elimination of fast variables gives accurate mesoscopic system descriptions. However, in cases in which fluctuations bring a system far from the region of validity of the local LNA, a more detailed mesoscopic analysis is required. This is, for instance, the case in systems with noise-induced transitions (Horsthemke and Lefever 1984
The linear Fokker-Planck equation used here to discuss the stochastic properties of intracellular chemical networks can be reformulated in terms of stochastic differential equations (Methods). This formulation is mathematically equivalent to the Fokker-Planck equations and allows direct comparisons of the effects of intrinsic noise in intracellular chemical networks, on the one hand, with the consequences of external noise on technical feedback systems, on the other (Glad and Ljung 2000
Macroscopic and Mesoscopic Dynamics Consider an intracellular biochemical system with volume and N different chemical components. The concentrations of the components are summarized in the vector x = (x1,..., xN)T and the numbers of molecules in X = x = (X1, X2,..., XN)T. The state of the system is defined by X (or x), and a state change takes place by one of R elementary reactions. The probability that the elementary reaction number j will occur in a small time interval t is given by ![]() j(x, ) t, where j(x, ) is a transition rate. By such an event the chemical component number i changes from Xi to Xi + Sij molecules. The integers Sij, i = 1, 2,..., N; j = 1, 2,..., R are the elements of the N x R stoichiometric matrix S of the reaction network (Heinrich and Schuster 1996
When the system volume
is the vector of macroscopic concentrations for which the dynamics is determined by N ordinary rate equations (Cornish-Bowden 1995
j in equation 25 is replaced by its macroscopic counterparts fj in equation 24. How the macroscopic equation 24 emerges from the mesoscopic equation 25 when the system volume goes to infinity is shown in the derivation of the linear noise approximation in the Supplemental material (equation A8).
A crucial point in the mesoscopic description is correct identification of the elementary reactions in equations 24 and 25. Elementary means that the transition rates
Linearization of Macroscopic Dynamics and the Linear Noise Approximation
associated with equation 26 leads to a matrix equation for the deviations from according to
Q-1, where is a diagonal matrix with the eigenvalues ( 1, 2,..., N) of A as elements and Q is a matrix with the eigenvectors of A in the columns, the solution to equation 27 is
is the initial condition (Strang 1988 , where , leads to a new set of variables (normal modes) with the property that
The Linear Noise Approximation
is diagonal with the elements ; j = 1, 2,..., R. It depends on the macroscopic rate laws , rather than on the mesoscopic transition rates , because the difference between the two disappears in the LNA (see Supplemental material). The stationary distribution of the Fokker-Planck equation is a multivariate Gaussian with zero mean vector and covariance matrix C = X XT, which is determined by the (Lyapunov) matrix equation
. C contains the second-order moments of the stationary Gaussian distribution for the stochastic vector X, according to
X according to
for the new variables ![]() =Q-1 X, that is, fluctuations as normal modes (compare with equation 30). ![]() is associated with the linearly transformed stoichiometric matrix = Q-1S, and it follows from equation 31 that B=Q . The covariance matrix C for X is defined from C = QCQ* (Horn and Johnson 1985 Q-1. Insertion in equation 32 gives
, that is, the variances , are the summed squares of the transformed stoichiometric coefficients ij (which can be complex valued) multiplied with their corresponding rate laws (flows), divided by the real part of the rate of relaxation for the normal mode in equation 30:
ij| and that fluctuations are attenuated by a large (negative) real part of the eigenvalue of the normal mode in equation 30. When A is symmetric, Q can be made unitary and then = QTS, which simplifies analysis and interpretation (see Example 3 in the Supplemental material).
Single-Component Systems
The Fano-Factor and the Correlation Coefficient
Elimination of Fast Variables Can Make the LNA More Accurate Near Critical Points
are locked to their macroscopic steady-state values, conditional on , by setting the differential equations gj + 1...gN equal to zero and solving the following system of equations:
1,..., j). This is a normal distribution centered at the macroscopic steady state with a covariance matrix 1...j, 1...j. For each value of ( 1,..., j), the fast variables are locked to their conditional stationary values , and the overall stationary distribution function is therefore:
( k - hk( 1,..., j)) is the Dirac function and i = -1 i. ( ) is not a normal distribution any more because the h = (hj + 1,..., hN) functions generally are nonlinear. Finally, we get the probability distribution function in the original variables as
is a normalization constant. For a multidimensional P(X), it is convenient to consider only the marginal distributions, defined as
This work was supported by the National Graduate School of Scientific Computing (NGSSC). We thank Martin Lovmar for comments on the manuscript and Lars Elden for pointing out the Horn and Johnson reference. The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
1 Corresponding authors. E-MAIL Johan.elf{at}icm.uu.se; FAX 46184714262. E-MAIL ehrenberg{at}xray.bmc.uu.se; FAX 46184714262. Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1196503. [Supplemental material is available online at www.genome.org.]
Barkai, N. and Leibler, S. 2000. Circadian clocks limited by noise. Nature 403: 267-268.[Medline] Becskei, A. and Serrano, L. 2000. Engineering stability in gene networks by autoregulation. Nature 405: 590-593.[CrossRef][Medline] Berg, O.G. 1978. A model for the statistical fluctuations of protein numbers in a microbial population. J. Theor. Biol. 71: 587-603.[CrossRef][Medline]
Berg, O.G., Paulsson, J., and Ehrenberg, M. 2000. Fluctuations and quality of control in biological cells: Zero-order ultrasensitivity reinvestigated. Biophys. J. 79: 1228-1236. Cornish-Bowden, A. 1995. Fundamentals of enzyme kinetics. Portland Press, London. Dogterom, M. and Leibler, S. 1993. Physical aspects of the growth and regulation of microtubule structures. Phys. Rev. Lett. 70: 1347-1350.[CrossRef][Medline] Donachie, W.D. and Robinson, A. 1987. Cell division: Parameter values and the process. In Escherichia coli and Salmonella typhimurium: Cellular and molecular biology (eds. F. Neidhardt et al.), pp. 1578-1593. ASM Press, Washington, DC. Elf, J., Doncic, A., and Ehrenberg, M. 2003a. Mesoscopic reaction-diffusion in intracellular signaling. In Fluctuations and noise in biological, biophysical and biomedical systems (eds. S. Bezrukov et al.), pp. 114-124. SPIE, Bellingham, WA.
Elf, J., Paulsson, J., Berg, O.G., and Ehrenberg, M. 2003b. Near-critical phenomena in intracellular metabolite pools. Biophys. J. 84: 154-170.
Elowitz, M.B., Levine, A.J., Siggia, E.D., and Swain, P.S. 2002. Stochastic gene expression in a single cell. Science 297: 1183-1186. Fano, U. 1947. Ionization yield of ratios. II. The fluctuations of the number of ions. Phys. Rev. 72: 26-29.[CrossRef] Fersht, A.R. 1998. Structure and mechanism in protein science. Freeman, New York. Gardiner, C. 1985. Handbook of stochastic methods. Springer Verlag, Berlin. Gibson, M. and Bruck, J. 2000. Efficient exact stochastic simulation of chemical systems with many species and channels. J. Phys. Chem. A 104: 1876-1889.[CrossRef] Gillespie, D. 1976. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comput. Phys. 22: 403-434.[CrossRef] ____. 1977. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81: 2340-2361.[CrossRef] Glad, T. and Ljung, L. 2000. Control theory: Multivariable and nonlinear methods. Taylor & Francis, London.
Goldbeter, A. and Koshland Jr., D.E. 1981. An amplified sensitivity arising from covalent modification in biological systems. Proc. Natl. Acad. Sci. 78: 6840-6844. ____. 1982. Sensitivity amplification in biochemical systems. Q. Rev. Biophys. 15: 555-591.[Medline] Guptasarama, P. 1995. Does replication-induced transcription regulate the myriad of low copy number proteins of Escherichia coli? Bioessays 17: 987-997.[CrossRef][Medline] Haken, H. 1982. Synergetics. Nonequilibrium phase transitions and self-organization in biological systems. In Thermodynamics and kinetics in biological processes (eds. I. Lamprecht and A.I. Zotin). Walter de Gruyter, Berlin. Heinrich, R. and Schuster, S. 1996. The regulation of cellular systems. Chapman and Hall, New York. Horn, R.A. and Johnson, C.R. 1985. Matrix analysis. Cambridge University Press, Cambridge, UK. ____. 1991. Topics in matrix analysis. Cambridge University Press, Cambridge, UK. Horsthemke, W. and Lefever, R. 1984. Noise-induced transitions. Theory and applications in physics, chemistry, and biology. Springer Verlag, Berlin. Keizer, J. 1987. Statistical thermodynamics of nonequilibrium processes. Springer-Verlag, Berlin.
Kepler, T.B. and Elston, T.C. 2001. Stochasticity in transcriptional regulation: Origins, consequences, and mathematical representations. Biophys. J. 81: 3116-3136. Mendes, P. 1997. Biochemistry by numbers: Simulation of biochemical pathways with Gepasi 3. Trends Biochem. Sci. 22: 361-363.[CrossRef][Medline] Ozbudak, E.M., Thattai, M., Kurtser, I., Grossman, A.D., and van Oudenaarden, A. 2002. Regulation of noise in the expression of a single gene. Nat. Genet. 31: 69-73.[CrossRef][Medline] Paulsson, J. and Ehrenberg, M. 2001. Noise in a minimal regulatory network: Plasmid copy number control. Q. Rev. Biophys. 34: 1-59.[CrossRef][Medline]
Qian, H., Saffarian, S., and Elson, E.L. 2002. Concentration fluctuations in a mesoscopic oscillating chemical reaction system. Proc. Natl. Acad. Sci. 99: 10376-10381. Risken, H. 1984. The Fokker-Planck equation. Spinger Verlag, Berlin. Sauro, H.M. 1993. SCAMP: A general-purpose simulator and metabolic control analysis program. CABIOS 9: 441-450. Savageau, M.A. 1976. Biochemical systems analysis: A study of function and design in molecular biology. Addison-Wesley, Reading, MA. Schrödinger, E. 1944. What is life? Cambridge University Press, Cambridge, UK. Strang, G. 1988. Linear algebra and its applications. Harcourt Brace Jovanovich Collage Publishers, Fort Worth, TX. Strogatz, S.H. 1994. Nonlinear dynamics and chaos: With applications to physics, biology, chemistry, and engineering. Addison-Wesley, Cambridge, MA. Stundzia, A.B. and Lumsden, C.J. 1996. Stochastic simulation of coupled reaction-diffusion processes. J. Comput. Phys. 127: 196-207.[CrossRef]
Thattai, M. and van Oudenaarden, A. 2001. Intrinsic noise in gene regulatory networks. Proc. Natl. Acad. Sci. 98: 8614-8619. van Kampen, N.G. 1997. Stochastic processes in physics and chemistry, 2nd ed. Elsevier, Amsterdam.
Vilar, J.M., Kueh, H.Y., Barkai, N., and Leibler, S. 2002. Mechanisms of noise-resistance in genetic oscillators. Proc. Natl. Acad. Sci. 99: 5988-5992. Wax, N. 1954. Selected papers on noise and stochastic processes. Dover, New York.
Received January 20, 2003;
accepted in revised format July 22, 2003.
|