Distributional conformal prediction
Edited by Emmanuel J. Candès, Stanford University, Stanford, CA, and approved October 5, 2021 (received for review April 24, 2021)
Significance
Prediction problems are important in many contexts. Examples include cross-sectional prediction, time series forecasting, counterfactual prediction and synthetic controls, and individual treatment effect prediction. We develop a prediction method that works in conjunction with many powerful classical methods (e.g., conventional quantile regression) as well as modern high-dimensional methods for estimating conditional distributions (e.g., quantile neural networks). Unlike many existing prediction approaches, our method is valid conditional on the observed predictors and efficient under some conditions. Importantly, our method is also robust; it exhibits unconditional coverage guarantees under model misspecification, under overfitting, and with time series data.
Abstract
We propose a robust method for constructing conditionally valid prediction intervals based on models for conditional distributions such as quantile and distribution regression. Our approach can be applied to important prediction problems, including cross-sectional prediction, k–step-ahead forecasts, synthetic controls and counterfactual prediction, and individual treatment effects prediction. Our method exploits the probability integral transform and relies on permuting estimated ranks. Unlike regression residuals, ranks are independent of the predictors, allowing us to construct conditionally valid prediction intervals under heteroskedasticity. We establish approximate conditional validity under consistent estimation and provide approximate unconditional validity under model misspecification, under overfitting, and with time series data. We also propose a simple “shape” adjustment of our baseline method that yields optimal prediction intervals.
Data Availability
Data and computer codes to replicate all the results in this paper have been deposited in GitHub (https://github.com/kwuthrich/Replication_DCP). All data are referenced in the main text.
Acknowledgments
We thank the editor, two anonymous referees, Dimitris Politis, and Allan Timmermann for valuable comments. V.C. acknowledges funding from the NSF. All remaining errors are our own.
Supporting Information
Appendix 01 (PDF)
- Download
- 405.02 KB
References
1
R. Koenker, G. Bassett, Regression quantiles. Econometrica 46, 33–50 (1978).
2
S. Foresi, F. Peracchi, The conditional distribution of excess returns: An empirical analysis. J. Am. Stat. Assoc. 90, 451–466 (1995).
3
V. Chernozhukov, I. Fernandez-Val, B. Melly, Inference on counterfactual distributions. Econometrica 81, 2205–2268 (2013).
4
J. W. Taylor, A quantile regression neural network approach to estimating the conditional density of multiperiod returns. J. Forecast. 19, 299–311 (2000).
5
P. Chaudhuri, W. Y. Loh, Nonparametric estimation of conditional quantiles using quantile regression trees. Bernoulli 8, 561–576 (2002).
6
N. Meinshausen, Quantile regression forests. J. Mach. Learn. Res. 7, 983–999 (2006).
7
M. D. Cattaneo, Y. Feng, R. Titiunik, Prediction intervals for synthetic control methods. arXiv [Preprint] (2019). https://arxiv.org/abs/1912.07120 (Accessed 20 August 2021).
8
V. Chernozhukov, K. Wüthrich, Y. Zhu, An exact and robust conformal inference method for counterfactual and synthetic controls. J. Am. Stat. Assoc., (2021).
9
D. Kivaranovic, R. Ristl, M. Posch, H. L. Leeb, Conformal prediction intervals for the individual treatment effect. arXiv [Preprint] (2020). https://arxiv.org/abs/2006.01474 (Accessed 20 August 2020).
10
L. Lei, E. J. Candès, Conformal inference of counterfactuals and individual treatment effects. arXiv [Preprint] (2020). https://arxiv.org/abs/2006.06138 (Accessed 20 August 2021).
11
Y. Romano, R. F. Barber, C. Sabatti, E. J. Candes, With malice towards none: Assessing uncertainty via equalized coverage. arXiv [Preprint] (2019). https://arxiv.org/abs/1908.05428 (Accessed 20 August 2021).
12
R. Foygel Barber, E. J. Candès, A. Ramdas, R. J. Tibshirani, The limits of distribution-free conditional predictive inference. J. IMA 10, 455–482 (2021).
13
J. Lei, L. Wasserman, Distribution-free prediction bands for non-parametric regression. J. Royal Stat. Soc. Ser. B. Stat. Methodol. 76, 71–96 (2014).
14
M. Sesia, E. J. Candes, A comparison of some conformal quantile regression methods. Stat 9, e261 (2020).
15
V. Vovk, “Conditional validity of inductive conformal predictors” in Proceedings of the Asian Conference on Machine Learning, S. C. H. Hoi, W. Buntine, Eds. (PMLR, Singapore Management University, Singapore), vol. 25, pp. 475–490 (2012).
16
Y. Romano, E. Patterson, E. Candes, Conformalized quantile regression. Adv. Neural Inf. Process. Syst., 32, 3543–3553 (2019).
17
V. Vovk, A. Gammerman, G. Shafer, Algorithmic Learning in a Random World (Springer Science & Business Media, 2005).
18
V. Vovk, I. Nouretdinov, A. Gammerman, On-line predictive linear regression. Ann. Stat. 37, 1566–1590 (2009).
19
D. N. Politis, Model-Free Prediction and Regression: A Transformation-Based Approach to Inference (Springer, New York, NY, 2015).
20
J. Lei, M. G. Sell, A. Rinaldo, R. J. Tibshirani, L. Wasserman, Distribution-free predictive inference for regression. J. Am. Stat. Assoc. 113, 1094–1111 (2018).
21
R. Koenker, G. Bassett, Robust tests for heteroscedasticity based on regression quantiles. Econometrica 50, 43–61 (1982).
22
R. Koenker, Quantile Regression, Econometric Society Monographs (Cambridge University Press, 2005).
23
J. Lei, J. Robins, L. Wasserman, Distribution free prediction sets. J. Am. Stat. Assoc. 108, 278–287 (2013).
24
V. Chernozhukov, K. Wüthrich, Z. Yinchu, “Exact and robust conformal inference methods for predictive machine learning with dependent data” in Proceedings of the 31st Conference on Learning Theory, S. Bubeck, V. Perchet, P. Rigollet, Eds. (PMLR, Cambridge, MA, 2018), vol. 75, pp. 732–749.
25
D. N. Politis, Model-free model-fitting and predictive distributions. Test 22, 183–221 (2013).
26
I. Komunjer, “Chapter 17 - Quantile prediction” in Handbook of Economic Forecasting, G. Elliott, A. Timmermann, Eds. (Elsevier, 2013), pp. 961–994.
27
D. Kivaranovic, K. D. Johnson, H. Leeb, “Adaptive, distribution-free prediction intervals for deep networks” in Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, S. Chiappa, R. Calandra, Eds. (PMLR, Cambridge, MA, 2020), vol. 108, pp. 4346–4356.
28
V. Vovk et al., “Conformal calibrators” in Proceedings of the Ninth Symposium on Conformal and Probabilistic Prediction and Applications, A. Gammerman, V. Vovk, Z. Luo, E. Smirnov, G. Cherubin, Eds. (PMLR, Cambridge, MA, 2020), vol. 128, pp. 84–99.
29
D. J. Eck, F. W. Crawford, Efficient and minimal length parametric conformal prediction regions. arXiv [Preprint] (2019). https://arxiv.org/abs/1905.03657 (Accessed 20 August 2021).
30
R. Izbicki, G. T. Shimizu, R. B. Stern, Flexible distribution-free conditional predictive bands using density estimators. arXiv [Preprint] (2019). https://arxiv.org/abs/1910.05575 (Accessed 20 August 2021).
31
R. Izbicki, G. Shimizu, R. B. Stern, CD-split and HPD-split: Efficient conformal regions in high dimensions. arXiv [Preprint] (2020). https://arxiv.org/abs/2007.12778 (Accessed 20 August 2021).
32
L. Gyorfi, H. Walk, “Nearest neighbor based conformal prediction” (Rep. Stuttgarter Mathematische Berichte 2020-002, Universität Stuttgart, Stuttgart, Germany, 2020).
33
M. Sesia, Y. Romano, Conformal histogram regression. arXiv [Preprint] (2021). https://arxiv.org/abs/2105.08747 (Accessed 20 August 2021).
34
W. Chen, Z. Wang, W. Ha, R. F. Barber, Trimmed conformal prediction for high-dimensional models. arXiv [Preprint] (2016). https://arxiv.org/abs/1611.09933 (Accessed 20 August 2021).
35
V. Chernozhukov, I. Fernandez-Val, A. Galichon, Improving point and interval estimators of monotone functions by rearrangement. Biometrika 96, 559–575 (2009).
36
V. Chernozhukov, I. Fernández-Val, A. Galichon, Quantile and probability curves without crossing. Econometrica 78, 1093–1125 (2010).
37
W. Hoeffding, The large-sample power of tests based on permutations of observations. Ann. Math. Stat. 23, 169–192 (1952).
38
E. J. Candès, L. Lei, Z. Ren, Conformalized survival analysis. arXiv [Preprint] (2021). https://arxiv.org/abs/2103.09763 (Accessed 20 August 2021).
39
R Core Team, R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, Austria, 2021).
40
G. Elliott, A. Timmermann, Economic Forecasting (Princeton University Press, 2016).
41
R. K. French, Kenneth French Data Library. Fama/French 3 Factors [Daily] Data (2021). http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html. Accessed 17 August 2021.
42
F. Boussama, “Ergodicité, mélange et estimation dans les modeles GARCH,” PhD thesis, Paris 7, Paris, France (1998).
43
M. Carrasco, X. Chen, Mixing and moment properties of various GARCH and stochastic volatility models. Econom. Theory 18, 17–39 (2002).
44
C. Francq, J. M. Zakoïan, Mixing properties of a general class of GARCH (1,1) models without moment assumptions on the observed process. Econom. Theory 22, 815–834 (2006).
45
V. Chernozhukov, C. Hansen, M. Spindler, hdm: High-dimensional metrics. R J. 8, 185–199 (2016).
Information & Authors
Information
Published in
Classifications
Copyright
© 2021. Published under the PNAS license.
Data Availability
Data and computer codes to replicate all the results in this paper have been deposited in GitHub (https://github.com/kwuthrich/Replication_DCP). All data are referenced in the main text.
Submission history
Accepted: September 24, 2021
Published online: November 23, 2021
Published in issue: November 30, 2021
Keywords
Acknowledgments
We thank the editor, two anonymous referees, Dimitris Politis, and Allan Timmermann for valuable comments. V.C. acknowledges funding from the NSF. All remaining errors are our own.
Notes
This article is a PNAS Direct Submission.
Published under the PNAS license.
Authors
Competing Interests
The authors declare no competing interest.
Metrics & Citations
Metrics
Altmetrics
Citations
Cite this article
Distributional conformal prediction, Proc. Natl. Acad. Sci. U.S.A.
118 (48) e2107794118,
https://doi.org/10.1073/pnas.2107794118
(2021).
Copied!
Copying failed.
Export the article citation data by selecting a format from the list below and clicking Export.
Cited by
Loading...
View Options
View options
Download this article as a PDF file.
PDFeReader
View this article with eReader.
eReaderLogin options
Check if you have access through your login credentials or your institution to get full access on this article.
Personal login Institutional LoginRecommend to a librarian
Recommend PNAS to a LibrarianPurchase options
Purchase this article to access the full text.
