首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A sliced inverse regression approach for data stream
Authors:Marie Chavent  Stéphane Girard  Vanessa Kuentz-Simonet  Benoit Liquet  Thi Mong Ngoc Nguyen  Jérôme Saracco
Institution:1. Institut de Mathématiques de Bordeaux, UMR CNRS 5251, Université de Bordeaux, 351 cours de la libération, 33405?, Talence Cedex, France
2. CQFD Team, Inria Bordeaux Sud-Ouest, Talence Cedex, France
3. LJK, MISTIS Team, Inria Grenoble Rh?ne-Alpes, Inovallée, 655, av. de l’Europe, Montbonnot, 38334?, Saint-Ismier Cedex, France
4. Unité ADBX “Aménités et Dynamiques des Espaces Ruraux”, IRSTEA, 50 Avenue de Verdun, Gazinet, 33612?, Cestas Cedex, France
5. ISPED, Centre INSERM U-897-Epidémiologie-Biostatistique, Université de Bordeaux, Bordeaux?, 33000, France
6. ISPED, Centre INSERM U-897-Epidémiologie-Biostatistique, INSERM, Bordeaux?, 33000, France
7. IRMA, UMR 7501, Université de Strasbourg, 7 rue René Descartes, 67084?, Strasbourg Cedex, France
Abstract:In this article, we focus on data arriving sequentially by blocks in a stream. A semiparametric regression model involving a common effective dimension reduction (EDR) direction \(\beta \) is assumed in each block. Our goal is to estimate this direction at each arrival of a new block. A simple direct approach consists of pooling all the observed blocks and estimating the EDR direction by the sliced inverse regression (SIR) method. But in practice, some disadvantages appear such as the storage of the blocks and the running time for large sample sizes. To overcome these drawbacks, we propose an adaptive SIR estimator of \(\beta \) based on the optimization of a quality measure. The corresponding approach is faster both in terms of computational complexity and running time, and provides data storage benefits. The consistency of our estimator is established and its asymptotic distribution is given. An extension to multiple indices model is proposed. A graphical tool is also provided in order to detect changes in the underlying model, i.e., drift in the EDR direction or aberrant blocks in the data stream. A simulation study illustrates the numerical behavior of our estimator. Finally, an application to real data concerning the estimation of physical properties of the Mars surface is presented.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号