Model selection and prediction: Normal regression |
| |
Authors: | T P Speed and Bin Yu |
| |
Institution: | (1) Department of Statistics, University of California at Berkeley, 94720, CA, USA;(2) Department of Statistics, University of Wisconsin-Madison, 53706, WI, USA |
| |
Abstract: | This paper discusses the topic of model selection for finite-dimensional normal regression models. We compare model selection criteria according to prediction errors based upon prediction with refitting, and prediction without refitting. We provide a new lower bound for prediction without refitting, while a lower bound for prediction with refitting was given by Rissanen. Moreover, we specify a set of sufficient conditions for a model selection criterion to achieve these bounds. Then the achievability of the two bounds by the following selection rules are addressed: Rissanen's accumulated prediction error criterion (APE), his stochastic complexity criterion, AIC, BIC and the FPE criteria. In particular, we provide upper bounds on overfitting and underfitting probabilities needed for the achievability. Finally, we offer a brief discussion on the issue of finite-dimensional vs. infinite-dimensional model assumptions.Support from the National Science Foundation, grant DMS 8802378 and support from ARO, grant DAAL03-91-G-007 to B. Yu during the revision are gratefully acknowledged. |
| |
Keywords: | Model selection prediction lower bound accumulated prediction error (APE) AIC BIC FPE stochastic complexity overfit and underfit probability |
本文献已被 SpringerLink 等数据库收录! |
|