Deep learning to represent subgrid processes in climate models
- aMeteorological Institute, Ludwig-Maximilian-University, 80333 Munich, Germany;
- bDepartment of Earth System Science, University of California, Irvine, CA 92697;
- cDepartment of Earth and Environmental Engineering, Earth Institute, Columbia University, New York, NY 10027;
- dData Science Institute, Columbia University, New York, NY 10027
See allHide authors and affiliations
Edited by Isaac M. Held, Geophysical Fluid Dynamics Laboratory, National Oceanic and Atmospheric Administration, Princeton, NJ, and approved August 8, 2018 (received for review June 14, 2018)

Significance
Current climate models are too coarse to resolve many of the atmosphere’s most important processes. Traditionally, these subgrid processes are heuristically approximated in so-called parameterizations. However, imperfections in these parameterizations, especially for clouds, have impeded progress toward more accurate climate predictions for decades. Cloud-resolving models alleviate many of the gravest issues of their coarse counterparts but will remain too computationally demanding for climate change predictions for the foreseeable future. Here we use deep learning to leverage the power of short-term cloud-resolving simulations for climate modeling. Our data-driven model is fast and accurate, thereby showing the potential of machine-learning–based approaches to climate model development.
Abstract
The representation of nonlinear subgrid processes, especially clouds, has been a major source of uncertainty in climate models for decades. Cloud-resolving models better represent many of these processes and can now be run globally but only for short-term simulations of at most a few years because of computational limitations. Here we demonstrate that deep learning can be used to capture many advantages of cloud-resolving modeling at a fraction of the computational cost. We train a deep neural network to represent all atmospheric subgrid processes in a climate model by learning from a multiscale model in which convection is treated explicitly. The trained neural network then replaces the traditional subgrid parameterizations in a global general circulation model in which it freely interacts with the resolved dynamics and the surface-flux scheme. The prognostic multiyear simulations are stable and closely reproduce not only the mean climate of the cloud-resolving simulation but also key aspects of variability, including precipitation extremes and the equatorial wave spectrum. Furthermore, the neural network approximately conserves energy despite not being explicitly instructed to. Finally, we show that the neural network parameterization generalizes to new surface forcing patterns but struggles to cope with temperatures far outside its training manifold. Our results show the feasibility of using deep learning for climate model parameterization. In a broader context, we anticipate that data-driven Earth system model development could play a key role in reducing climate prediction uncertainty in the coming decade.
Footnotes
- ↵1To whom correspondence should be addressed. Email: s.rasp{at}lmu.de.
Author contributions: M.S.P. and P.G. designed research; S.R. and M.S.P. performed research; S.R. analyzed data; and S.R., M.S.P., and P.G. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
Data deposition: All code can be found in the following repositories: https://doi.org/10.5281/zenodo.1402384 and https://gitlab.com/mspritch/spcam3.0-neural-net/tree/nn_fbp_engy_ess.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1810286115/-/DCSupplemental.
- Copyright © 2018 the Author(s). Published by PNAS.
This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).
Citation Manager Formats
Article Classifications
- Physical Sciences
- Earth, Atmospheric, and Planetary Sciences














