PRIM analysis |
| |
Authors: | Wolfgang Polonik Zailong Wang |
| |
Affiliation: | a Department of Statistics, University of California, One Shields Ave., Davis, CA 95616-8705, United States b Novartis Pharmaceuticals Corporation, One Heath Plaza, East Hanover, NJ 07936-1080, United States |
| |
Abstract: | This paper analyzes a data mining/bump hunting technique known as PRIM [1]. PRIM finds regions in high-dimensional input space with large values of a real output variable. This paper provides the first thorough study of statistical properties of PRIM. Amongst others, we characterize the output regions PRIM produces, and derive rates of convergence for these regions. Since the dimension of the input variables is allowed to grow with the sample size, the presented results provide some insight about the qualitative behavior of PRIM in very high dimensions. Our investigations also reveal some shortcomings of PRIM, resulting in some proposals for modifications. |
| |
Keywords: | primary 62G20 62G05 62H12 |
本文献已被 ScienceDirect 等数据库收录! |
|