首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 109 毫秒
1.
稳健统计(Ⅲ)   总被引:1,自引:1,他引:0  
1.10稳健化和M估计 我们看到正态方法(最小二乘法)不能抵抗(阻尼)离群值的破坏性大影响,它的破坏点为0,影响函数是无界的,总之,它是不稳健的.它可以被改造成稳健方法,这种修正的过程称为稳健化;或者找到另一种代用的稳健方法,称为稳健代用品. 由主观判断识别出离群值;或者用离群值检验诊断的办法,识别出离群值,然后剔除离群值,对于剩余样本用传统的正态方法。这样,实际上已实现了稳健化.L估计中有许多稳健方法,例如,样本中位数、切尾均值都是稳健位置估计.另外,可以从非参数统计中找到稳健代用品。例如R估计就是由(不依赖于总体分布的)秩…  相似文献   

2.
线性回归模型多个离群点的向前逐步诊断方法   总被引:3,自引:0,他引:3  
当线性回归模型中存在多个离群点时,经典的诊断方法常常因掩盖和淹没现象而失效,导致模型误用。针对此问题,本文在回顾有关文献的基础上,将稳健回归技术与经典诊断量相结合,提出一种向前逐步诊断方法。通过对模拟数据的分析,说明该法可有效地识别回归数据中潜在的离群点,并作正式的统计检验。  相似文献   

3.
时间序列自回归AR模型在建模过程中易受离群值的影响,导致计算结果与实际不相符.针对这一现象,将Hampel权函数运用于自相关函数中,从而构建出自回归AR模型的稳健估计算法,以克服离群值的影响.并对此方法进行了模拟和实证分析,模拟和实证分析均表明:当时序数据中不存在离群值时,传统估计方法与稳健估计方法得到的结果基本保持一致;当数据中存在离群值时,运用传统估计方法得到的结果出现较大变化,而运用稳健估计方法得到的结果基本不变.这说明相对于传统估计方法,稳健估计方法能有效抵抗离群值的影响,具有良好的抗干扰性和高抗差性.  相似文献   

4.
传统自回归与滑动平均(ARMA)残差控制图往往对离群值比较敏感,容易导致监控失效.为了解决这一问题,本文利用稳健统计的思想对传统ARMA残差控制图进行修正,构建出稳健ARMA残差控制图算法,以克服离群值对模型的影响.模拟和实证结果表明:当数据中不存在离群值时,由传统和稳健ARMA残差控制图得到的监控结果基本一致;当数据中存在离群值时,相对于传统ARMA残差控制图而言,稳健ARMA残差控制图能更有效地抵抗离群值的影响,具有较好的抗干扰性和高抗差性.  相似文献   

5.
本文在文献的基础上,给出残差为AR(P)序列并联混合回归模型参数的一种稳健估计——两步M估计,并证明了估计的相容性与渐近正态性.  相似文献   

6.
《数理统计与管理》2015,(4):636-646
由于非参数回归模型复杂灵活,被广泛应用。在众多估计方法中,最小二乘法最为常用,一般情况下具有良好的性质,但在处理厚尾分布及异常点时表现的不够稳健。本文针对此,提出了基于Walsh平均的稳健样条估计。我们理论地推导了估计结果的相合性和渐近正态性;并与多项式样条回归做比较。计算得Walsh平均的样条估计相对于多项式样条回归的渐近相对效率与Wilcoxon符号秩检验相对于t-检验的渐近相对效率是一样的。在正态情形下我们的方法与多项式样条回归差不多,在非正态情形下,我们的方法表现更为稳健,效率明显优于多项式样条回归。  相似文献   

7.
企业将资产运用于生产经营活动,并由此赚取更多的资产,即产生公司的收入.因此企业资产与收入之间必定存在一定的相关关系.在对上市公司总资产与营业收入进行一般线性拟合的基础之上,采用分位回归模型对上市公司的总资产与营业收入的关系进行深入剖析.结果表明,传统的线性模型只能揭示出总资产与营业收入呈正相关关系,而分位回归方法能更好地看出,高分位点营业收入的企业在提高一定总资产时,会更能促进营业收入的增长.由于收集到的数据中存在离群点,在第5节讨论了线性分位回归模型的统计诊断,类比于一般线性模型的R square得到不同分位点上的R square.通过删除离群点的处理,得出分位回归模型比一般线性模型更加稳健,数据在高分位点的拟合效果更好一些.  相似文献   

8.
随机设计下非参数回归模型方差变点Ratio检验   总被引:1,自引:1,他引:0  
研究随机设计下非参数回归模型方差变点Ratio检验.首先用局部多项式方法估计回归曲线得到残差序列,其次基于残差的平方序列构造Ratio检验统计量并推导检验统计量的极限分布.最后数值模拟与实例分析结果表明方法的有效性.  相似文献   

9.
本文考虑纵向数据下线性回归模型的稳健估计问题.通过结合模态回归(modal regression)方法和二次推断函数(quadratic inference functions)技术,提出了一种基于模态回归的估计过程.证明了回归系数的估计是相合的,并给出了其渐近分布.数据模拟结果表明所提出的估计方法具有较好的稳健性和有效性.  相似文献   

10.
本文提出一种针对纵向数据回归模型下的均值和协方差矩阵同时进行的有效稳健估计.基于对协方差矩阵的Cholesky分解和对模型的改写,我们提出一个加权最小二乘估计,其中权重是通过广义经验似然方法估计出来的.所提估计的有效性得益于经验似然方法的优势,稳健性则是通过限制残差平方和的上界来达到.模拟研究表明,和已有的针对纵向数据的稳健估计相比,所提估计具有更高的效率和可比的稳健性.最后,我们把所提估计方法用来分析一组实际数据.  相似文献   

11.
In multiple linear regression model, we have presupposed assumptions (independence, normality, variance homogeneity and so on) on error term. When case weights are given because of variance heterogeneity, we can estimate efficiently regression parameter using weighted least squares estimator. Unfortunately, this estimator is sensitive to outliers like ordinary least squares estimator. Thus, in this paper, we proposed some statistics for detection of outliers in weighted least squares regression.  相似文献   

12.
线性模型参数的稳健化有偏估计   总被引:1,自引:1,他引:0  
本文讨论复共线性和粗差同时存在时线性模型的参数估计问题,基于等价权原理提出了一个稳健有偏估计类(稳健压缩估计),并且建立了稳健压缩估计的计算方法,为了满足实际问题的需要,构造了许多很有意义的稳健有偏估计,例如稳健岭估计、稳健主成分估计,稳健组合主成估计、稳健单参数主成分估计、稳健根方估计等等,最后通过一个算例表明,本文提出的稳健有偏估计具有既可克服复共线性影响又可抵抗粗差干扰的良好性质。  相似文献   

13.
This study considers rank estimation of the regression coefficients of the single index regression model. Conditions needed for the consistency and asymptotic normality of the proposed estimator are established. Monte Carlo simulation experiments demonstrate the robustness and efficiency of the proposed estimator compared to the semiparametric least squares estimator. A real-life example illustrates that the rank regression procedure effectively corrects model nonlinearity even in the presence of outliers in the response space.  相似文献   

14.
To achieve robustness against the outliers or heavy-tailed sampling distribution, we consider an Ivanov regularized empirical risk minimization scheme associated with a modified Huber's loss for nonparametric regression in reproducing kernel Hilbert space. By tuning the scaling and regularization parameters in accordance with the sample size, we develop nonasymptotic concentration results for such an adaptive estimator. Specifically, we establish the best convergence rates for prediction error when the conditional distribution satisfies a weak moment condition.  相似文献   

15.
Due to the complicated mathematical and nonlinear nature of ridge regression estimator, Liu (Linear-Unified) estimator has been received much attention as a useful method to overcome the weakness of the least square estimator, in the presence of multicollinearity. In situations where in the linear model, errors are far away from normal or the data contain some outliers, the construction of Liu estimator can be revisited using a rank-based score test, in the line of robust regression. In this paper, we define the Liu-type rank-based and restricted Liu-type rank-based estimators when a sub-space restriction on the parameter of interest holds. Accordingly, some improved estimators are defined and their asymptotic distributional properties are investigated. The conditions of superiority of the proposed estimators for the biasing parameter are given. Some numerical computations support the findings of the paper.  相似文献   

16.
Modal regression based on nonparametric quantile estimator is given. Unlike the traditional mean and median regression, modal regression uses mode but not mean or median to represent the center of a conditional distribution, which helps the model to be more robust for outliers, asymmetric or heavy-taileddistribution. Most of solutions for modal regression are based on kernel estimation of density. This paper studies a new solution for modal regression by means of nonparametric quantile estimator. This method builds on the fact that the distribution function is the inverse of the quantile function, then the flexibility of nonparametric quantile estimator is utilized to improve the estimation of modal function. The simulations and application show that the new model outperforms the modal regression model via linear quantile function estimation.  相似文献   

17.
The kernel-based regression (KBR) method, such as support vector machine for regression (SVR) is a well-established methodology for estimating the nonlinear functional relationship between the response variable and predictor variables. KBR methods can be very sensitive to influential observations that in turn have a noticeable impact on the model coefficients. The robustness of KBR methods has recently been the subject of wide-scale investigations with the aim of obtaining a regression estimator insensitive to outlying observations. However, existing robust KBR (RKBR) methods only consider Y-space outliers and, consequently, are sensitive to X-space outliers. As a result, even a single anomalous outlying observation in X-space may greatly affect the estimator. In order to resolve this issue, we propose a new RKBR method that gives reliable result even if a training data set is contaminated with both Y-space and X-space outliers. The proposed method utilizes a weighting scheme based on the hat matrix that resembles the generalized M-estimator (GM-estimator) of conventional robust linear analysis. The diagonal elements of hat matrix in kernel-induced feature space are used as leverage measures to downweight the effects of potential X-space outliers. We show that the kernelized hat diagonal elements can be obtained via eigen decomposition of the kernel matrix. The regularized version of kernelized hat diagonal elements is also proposed to deal with the case of the kernel matrix having full rank where the kernelized hat diagonal elements are not suitable for leverage. We have shown that two kernelized leverage measures, namely, the kernel hat diagonal element and the regularized one, are related to statistical distance measures in the feature space. We also develop an efficiently kernelized training algorithm for the parameter estimation based on iteratively reweighted least squares (IRLS) method. The experimental results from simulated examples and real data sets demonstrate the robustness of our proposed method compared with conventional approaches.  相似文献   

18.
The estimation of correlation dimension of continuous and discreet deterministic chaotic processes corrupted by an additive noise and outliers observations is investigated. In this paper we propose a new estimator of correlation dimension based on similarity between the evolution of Gaussian kernel correlation sum (Gkcs) and that of modified Boltzmann sigmoidal function (mBsf), this estimator is given by the maximum value of the first derivative of logarithmic transform of Gkcs against logarithmic transform of bandwidth, so the proposed estimator is independent of the choice of regression region like other regression estimators of correlation dimension. Simulation study indicates the robustness of proposed estimator to the presence of different types of noise such us independent Gaussian noise, non independent Gaussian noise and uniform noise for high noise level, moreover, this estimator is also robust to presence of 60% of outliers observations. Application of this new estimator with determination of their confidence interval using the moving block bootstrap method to adjusted closed price of S&P500 index daily time series revels the stochastic behavior of such financial time series.  相似文献   

19.
We consider the problem of deleting bad influential observations (outliers) in linear regression models. The problem is formulated as a Quadratic Mixed Integer Programming (QMIP) problem, where penalty costs for discarding outliers are used into the objective function. The optimum solution defines a robust regression estimator called penalized trimmed squares (PTS). Due to the high computational complexity of the resulting QMIP problem, the proposed robust procedure is computationally suitable for small sample data. The computational performance and the effectiveness of the new procedure are improved significantly by using the idea of ε-Insensitive loss function from support vectors machine regression. Small errors are ignored, and the mathematical formula gains the sparseness property. The good performance of the ε-Insensitive PTS (IPTS) estimator allows identification of multiple outliers avoiding masking or swamping effects. The computational effectiveness and successful outlier detection of the proposed method is demonstrated via simulated experiments. This research has been partially funded by the Greek Ministry of Education under the program Pythagoras II.  相似文献   

20.
Deleting Outliers in Robust Regression with Mixed Integer Programming   总被引:1,自引:0,他引:1  
In robust regression we often have to decide how many are the unusual observations, which should be removed from the sample in order to obtain better fitting for the rest of the observations. Generally, we use the basic principle of LTS, which is to fit the majority of the data, identifying as outliers those points that cause the biggest damage to the robust fit. However, in the LTS regression method the choice of default values for high break down-point affects seriously the efficiency of the estimator. In the proposed approach we introduce penalty cost for discarding an outlier, consequently, the best fit for the majority of the data is obtained by discarding only catastrophic observations. This penalty cost is based on robust design weights and high break down-point residual scale taken from the LTS estimator. The robust estimation is obtained by solving a convex quadratic mixed integer programming problem, where in the objective function the sum of the squared residuals and penalties for discarding observations is minimized. The proposed mathematical programming formula is suitable for small-sample data. Moreover, we conduct a simulation study to compare other robust estimators with our approach in terms of their efficiency and robustness.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号