New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
 Agricultural Sciences
 Anthropology
 Applied Biological Sciences
 Biochemistry
 Biophysics and Computational Biology
 Cell Biology
 Developmental Biology
 Ecology
 Environmental Sciences
 Evolution
 Genetics
 Immunology and Inflammation
 Medical Sciences
 Microbiology
 Neuroscience
 Pharmacology
 Physiology
 Plant Biology
 Population Biology
 Psychological and Cognitive Sciences
 Sustainability Science
 Systems Biology
Normal forms for reduced stochastic climate models

Contributed by Andrew J. Majda, January 9, 2009 (received for review December 8, 2008)
Abstract
The systematic development of reduced lowdimensional stochastic climate models from observations or comprehensive highdimensional climate models is an important topic for atmospheric lowfrequency variability, climate sensitivity, and improved extended range forecasting. Here techniques from applied mathematics are utilized to systematically derive normal forms for reduced stochastic climate models for lowfrequency variables. The use of a few Empirical Orthogonal Functions (EOFs) (also known as Principal Component Analysis, Karhunen–Loéve and Proper Orthogonal Decomposition) depending on observational data to span the lowfrequency subspace requires the assessment of dyad interactions besides the more familiar triads in the interaction between the low and highfrequency subspaces of the dynamics. It is shown below that the dyad and multiplicative triad interactions combine with the climatological linear operator interactions to simultaneously produce both strong nonlinear dissipation and Correlated Additive and Multiplicative (CAM) stochastic noise. For a single lowfrequency variable the dyad interactions and climatological linear operator alone produce a normal form with CAM noise from advection of the large scales by the small scales and simultaneously strong cubic damping. These normal forms should prove useful for developing systematic strategies for the estimation of stochastic models from climate data. As an illustrative example the onedimensional normal form is applied below to lowfrequency patterns such as the North Atlantic Oscillation (NAO) in a climate model. The results here also illustrate the short comings of a recent linear scalar CAM noise model proposed elsewhere for lowfrequency variability.
There is recent interest in deriving reduced stochastic models for climate and extendedrange weather prediction. An attractive property of atmospheric lowfrequency variability is that it can be efficiently described by just a few largescale teleconnection patterns. These patterns exert a huge impact on surface climate and seasonal predictability. Thus, such reduced stochastic models are an attractive alternative for extended range prediction and climate sensitivity studies because they are computationally much more efficient than stateoftheart climate models and have been shown to have comparable prediction skill (1). Because such reduced models capture the essence of lowfrequency processes they allow for a better understanding of the climate system. Thus, systematic strategies are essential in successfully developing reduced models. The recently developed stochastic mode reduction strategy provides a systematic procedure for the derivation of reduced stochastic models (2–6). In these studies the functional form of reduced models is systematically derived for geophysical flow models. In refs. 7 and 8 the procedure is refined to allow a seamless way of deriving stochastic loworder models from data of complex geophysical flow models. Of course, the predicted functional form (2–4, 6) can also be used for other procedures to infer stochastic models from data. Reduced models can also be used for climate sensitivity and climate change studies via the fluctuation dissipation theorem (FDT) (4, 9, 10). For example, there is a recent observational link (11) between the Greenland ice sheet collapse and changes in the primary lowfrequency atmospheric teleconnection pattern in the Atlantic sector, the North Atlantic Oscillation (NAO).
The systematic stochastic mode reduction strategy (2–4, 6) predicts the functional form of the deterministic and stochastic terms. In particular, it predicts two types of noises. One is a simple additive noise, which stems from the nonlinear driving of the resolved scales by the unresolved scales, and the other is Correlated Additive and Multiplicative (CAM) noise. CAM noise has the following structural form:
where
In a recent study a linear scalar model with CAM noise is proposed for the description of climate variability (12). The proposed reduced model form from ref. 12 has several implications for scalar stochastic models of climate variables that are at odds with systematic procedures for the derivation of reduced stochastic models of climate variables (3, 6–8) and climate data (8, 13, 14). The linear CAM noise model of ref. 12 predicts a probability density function (PDF) whose tails decay according to a power law. This is certainly at odds with studies that examined a subspace spanned by the first few leading Empirical Orthogonal Functions (EOFs) from data generated by long integrations of general circulation models (8, 14) and reanalysis data (13, 15). These studies find only small, though significant, deviations from Gaussianity. These deviations occur most strongly in the form of excess skewness and to a much lesser extent in the form of excess kurtosis. Thus, there is not much empirical evidence for powerlaw decay of the tails of PDFs of the leading EOFs. Below we show that for a systematically derived scalar reduced stochastic climate model the PDF tails are dominated by a Gaussian decay due to cubic damping. However, this argument applies for lowfrequency data (measured by the leading EOFs) and does not rule out the possible usefulness of the proposed form (12) for other climate variables at high frequencies in station data (16).
In the remainder of this study we will derive normal forms for nonlinear stochastic climate models, discuss the structural form of the PDF in the scalar case and utilize the normal forms for advanced parameter estimation of scalar climate variables for lowfrequency teleconnection patterns such as the NAO.
Structural Implications of Energy Conservation for the Normal Form
The nature of the normal forms proposed here is tied to energy conservation of the nonlinear terms (3, 17, 18) as will be shown next.
The dynamical core of comprehensive largescale models for the climate has the form where F is a constant forcing, L a linear, and B a quadratically nonlinear operator, and u ∈ R^{M} denotes the state vector with M ≫ 1. In Eq. 2 we assume that u denotes deviations from a climatological basic state and that basic state effects are already included in F and L (7). The quadratically nonlinear terms, B(u,u), alone arise from nonlinear advection, conserve energy, and, thus, satisfy the symmetry condition in Eq. 2.
The state vector u can be expanded into an orthonormal basis in the energy metric,
Now we partition the variable
Canonical Stochastic Model for a Scalar LowFrequency Climate Variable
For a scalar reduced model, x ∈ R^{1}, all lowfrequency triad climate interactions vanish since all three variables are required to be distinct. We already know from the stochastic mode reduction procedure (2–4, 6, 7) that the I_{jp}^{A} terms produce additional damping and uncorrelated additive noise. For simplicity, and to concentrate on the new phenomena involving the interactions of multiplicative dyads with the linear operator, we do not discuss the effects arising from I_{jp}^{A} in detail in the following. Repeating the explicit mode elimination procedure for ɛ ≪ 1 (2–4, 6, 20) and coarse graining time as
The scalar stochastic climate model (7) has the associated Fokker–Planck equation with p the probability density function.
The stationary distribution of (7) can be derived analytically by integrating the stationary Fokker–Planck equation Eq. 10 and is given by
with
The factors in 11a and 11b are the type of PDF that arises from linear CAM noise alone (12) and that has unphysical powerlaw tails for any lowfrequency pattern with significant variance; the above theory for dyad interactions automatically produces the factor in 11c with the Gaussian tail due to the cubic dissipation that ameliorates this unphysical effect.
The Normal Form in General Case of N Climate Variables
It is straightforward to generalize the normal form to N dimensions by using the above discussion together with that in refs. (2–4) including triad interactions. The result is given by where the last line in 12 is a vector system of CAM noise. Again, it is clear from 12 that the existence of CAM noise is directly linked to the presence of the deterministic cubic terms. This property is linked to the energy conservation of the nonlinear operator. The cubic nonlinearity from the triad interactions (2, 3) represented by the second summand in 12c is also included since it is needed below.
Constraints on the N Dimensional Normal Forms
In practice (7, 14, 20) the original system has wellbehaved PDFs for the lowfrequency patterns with rapidly decaying tails; thus, the nonlinear operator in 12c should induce an effective damping.
This can be shown for the N dimensional case by multiplying 12 by x_{i}, neglecting the noise terms and all other terms besides the cubic terms, and summing over i leading to the energy identity
with
Inverse Modeling and Parameter Estimation for SingleVariable Model
The functional form predicted by the mode elimination procedure developed above and in refs. 2, 3, 7, and 19, can be used to infer a stochastic climate model from time series, by estimating the model parameters from the data. For a 1dimensional model with an stochastic differential equation (SDE) of the form (7) and associated Fokker–Planck equation of the form (10), there are 7 parameters to estimate as shown earlier. The drift and diffusion functions [A(x) and B(x)] in 10 have the form A(x) = F + ax + bx^{2} − cx^{3} and B(x) = B_{0} + B_{1}x + B_{2}x^{2}. The parameters F,a,b,c,B_{0},B_{1},B_{2} can be inferred by using a parametric estimation procedure, under the constraints c ≥ 0, B_{0} ≥ 0, B_{2} ≥ 0. The last 2 constraints are necessary in order to have B(x) ≥ 0 for all x. In principle, one should also impose B_{1}^{2} ≤ 4B_{0}B_{2} to guarantee B(x) ≥ 0. However, this makes the estimation more complicated; also, the estimated diffusions are all positive on the observed ranges of x. Therefore, we ignore the constraint B_{1}^{2} ≤ 4B_{0}B_{2} here.
We use data from a quasigeostrophic atmospheric circulation model with 3 vertical layers and horizontal triangular truncation at wavenumber 21 (T21) for the Northern Hemisphere with realistic climatological mean state, storm tracks, and lowfrequency variability (8, 21) to estimate the parameters for a scalar reduced stochastic model. We utilize 6h output data for a total of 50,000 days from the quasigeostrophic model. The model is run under perpetual winter conditions. Hence, the parameters of the SDE in 7 are timeindependent. From the data we take scalar time series for the leading Principal Components (PCs) of total energy EOFs (8) as well as a scalar time series resulting from the projection of the model stream function fields onto the observed NAO pattern at 500 hPa (8). The PCs and the NAO have only small deviations from Gaussianity in form of excess skewness (not shown; see refs. 8, 14, and 20). Each time series is rescaled by an overall constant factor so that it has unit variance; this facilitates parameter estimation and makes it easier to compare the relative size of the various terms in the normal form.
Given these (rescaled) scalar time series, we estimate the parameters by using a variant of the procedure developed in ref. 22. This estimation procedure is based on convex quadratic optimization and the use of leading eigenfunction–eigenvalue pairs of the Fokker–Planck operator. For each time series, the estimation is carried out 6 times: once using the entire time series and 5 times using different blocks of 10,000 days (nonoverlapping). For PCs 1 and 2, and the NAO, the resulting drift and diffusion functions are shown in Fig. 1. The functions are plotted over the entire range of x as observed in the time series. All of the three reduced stochastic models clearly have a cubic drift as predicted by the above theory with that for the NAO and PC1 more pronounced than that for PC2. Differences between the drift functions estimated from different blocks of the same time series are mainly visible in the tails of the probability distribution. The diffusion functions estimated from the time series are nearly constant, and PC2 has a weak linear slope. Quadratic terms in the diffusion functions for the PCs are either zero or so small that their contribution is not visible on the observed range of x. The diffusion functions estimated from the NAO time series show significant variance between the different blocks of data; nevertheless, most of the estimated functions are nearly constant.
All these results suggest that the additive noise terms from the climate base state dominate the diffusion process but the dyad interaction terms developed here make a significant nonlinear contribution to the drift for the lowfrequency patterns in the present climate model.
Concluding Discussion
By using systematic principles from applied mathematics unique normal forms for reduced stochastic climate models for lowfrequency teleconnection patterns have been developed here. Even for a scalar variable these normal forms predict a cubic nonlinear drift and a multiplicative correction to constant diffusion through CAM noise. The normal forms were applied in a parameter estimation strategy to fit the lowfrequency patterns, such as the NAO, of a prototype climate model, and the confirmation of the predicted nonlinear cubic drift was evident in these results. Normal forms together with energy constraints were also developed here for reduced stochastic models in several dimensions. Important future directions should involve careful elucidation of the mathematical properties of the normal forms presented here, in conjunction with their implications through model estimation strategies for important issues in climate change response and prediction.
Acknowledgments
A.J.M. is supported in part by National Science Foundation Grant DMS0456713 and Office of Naval Research Grant N00140511064. D.C. is supported by the Netherlands Organisation for Scientific Research through the research cluster Nonlinear Dynamics of Natural Systems.
Footnotes
 ^{1}To whom correspondence should be addressed. Email: jonjon{at}cims.nyu.edu

Author contributions: A.J.M. designed research; A.J.M., C.F., and D.C. performed research; and A.J.M., C.F., and D.C. wrote the paper.

The authors declare no conflict of interest.

Freely available online through the PNAS open access option.
 © 2009 by The National Academy of Sciences of the USA
References
 ↵
 ↵
 Majda AJ,
 Timofeyev I,
 VandenEijnden E
 ↵
 ↵
 Majda AJ,
 Abramov R,
 Grote M
 ↵
 Majda AJ,
 Franzke C,
 Fischer A,
 Crommelin DT
 ↵
 ↵
 ↵
 Franzke C,
 Majda AJ
 ↵
 ↵
 ↵
 ↵
 Sardeshmukh P,
 Sura P
 ↵
 Sura P,
 Newman M,
 Penland C,
 Sardeshmukh P
 ↵
 ↵
 ↵
 ↵
 Salmon R
 ↵
 Timofeyev I
 ↵
 ↵
 ↵
 ↵
 Crommelin DT,
 VandenEijnden E
Citation Manager Formats
Article Classifications
 Physical Sciences
 Applied Mathematics
Sign up for Article Alerts
Jump to section
 Article
 Abstract
 Structural Implications of Energy Conservation for the Normal Form
 Canonical Stochastic Model for a Scalar LowFrequency Climate Variable
 The Normal Form in General Case of N Climate Variables
 Constraints on the N Dimensional Normal Forms
 Inverse Modeling and Parameter Estimation for SingleVariable Model
 Concluding Discussion
 Acknowledgments
 Footnotes
 References
 Figures & SI
 Info & Metrics
You May Also be Interested in
More Articles of This Classification
Physical Sciences
Related Content
 No related articles found.
Cited by...
 Datadriven discovery of partial differential equations
 Discovering governing equations from data by sparse identification of nonlinear dynamical systems
 Statistical energy conservation principle for inhomogeneous turbulent dynamical systems
 Quantifying uncertainty in climate change science through empirical information theory
 High skill in lowfrequency climate response through fluctuation dissipation theorems despite structural instability