Measuring the Stability of Results From Supervised Statistical Learning

Authors: Michel Philipp, Thomas Rusch, Kurt Hornik, Carolin Strobl

Affiliations: 1. Department of Psychological Methods, Evaluation and Statistics, University of Zurich, Zürich, Switzerland; 2. Competence Center for Empirical Research Methods, WU Vienna University of Economics and Business, Vienna, Austria; 3. Institute for Statistics and Mathematics, WU Vienna University of Economics and Business, Vienna, Austria
Abstract: Stability is a major requirement for drawing reliable conclusions when interpreting results from supervised statistical learning. In this article, we present a general framework for assessing and comparing the stability of results, which can be used in real-world statistical learning applications as well as in simulation and benchmark studies. We use the framework to show that stability is a property of both the algorithm and the data-generating process. In particular, we demonstrate that unstable algorithms (such as recursive partitioning) can produce stable results when the functional form of the relationship between the predictors and the response matches the algorithm. Typical uses of the framework in practical data analysis would be to compare the stability of results generated by different candidate algorithms for a dataset at hand, or to assess the stability of algorithms in a benchmark study. Code to perform the stability analyses is provided in the form of an R package. Supplementary material for this article is available online.
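To illustrate the typical use described in the abstract, the sketch below compares the stability of two candidate tree algorithms fitted to the same dataset. It assumes the stability(), summary(), and boxplot() interface documented for the stablelearner package; argument names such as names and the default resampling scheme may vary across package versions, and the dataset (iris) is chosen purely for illustration.

    library("stablelearner")   # stability assessment for fitted supervised learners
    library("rpart")           # recursive partitioning (CART)
    library("partykit")        # conditional inference trees (ctree)

    set.seed(1)

    ## Two candidate algorithms fitted to the same dataset at hand
    fit_cart  <- rpart(Species ~ ., data = iris)
    fit_ctree <- ctree(Species ~ ., data = iris)

    ## Refit each learner on resampled versions of the data and measure
    ## the pairwise similarity of the resulting predictions
    stab <- stability(fit_cart, fit_ctree, names = c("CART", "ctree"))

    summary(stab)   # numeric summary of the similarity distributions
    boxplot(stab)   # graphical comparison of stability across the learners

In this framework, higher similarity between the predictions of learners refitted on resampled data indicates more stable results, so the summary and boxplot allow the two candidate algorithms to be compared directly.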
Keywords: Recursive partitioning; Resampling; package stablelearner