首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
We propose a number of diagnostic methods that can be used whenever multiple outliers are identified by robust estimates for multivariate location and scatter. Their main purpose is visualization of the multivariate data to help determine whether the detected outliers (a) form separate clusters or (b) are isolated or randomly scattered (such as heavy tails compared with Gaussian). We make use of Mahalanobis distances and linear projections, to check for separation and to reveal additional aspects of the data structure. Several real data examples are analyzed, and artificial examples are used to illustrate the diagnostic power of the proposed plots.

Code to perform the diagnostics, datasets used as examples in the article and documention are available in the online supplements.  相似文献   

2.
Abstract

The existence of outliers in a data set and how to deal with them is an important problem in statistics. The minimum volume ellipsoid (MVE) estimator is a robust estimator of location and covariate structure; however its use has been limited because there are few computationally attractive methods. Determining the MVE consists of two parts—finding the subset of points to be used in the estimate and finding the ellipsoid that covers this set. This article addresses the first problem. Our method will also allow us to compute the minimum covariance determinant (MCD) estimator. The proposed method of subset selection is called the effective independence distribution (EID) method, which chooses the subset by minimizing determinants of matrices containing the data. This method is deterministic, yielding reproducible estimates of location and scatter for a given data set. The EID method of finding the MVE is applied to several regression data sets where the true estimate is known. Results show that the EID method, when applied to these data sets, produces the subset of data more quickly than conventional procedures and that there is less than 6% relative error in the estimates. We also give timing results illustrating the feasibility of our method for larger data sets. For the case of 10,000 points in 10 dimensions, the compute time is under 25 minutes.  相似文献   

3.
Multivariate analysis of variance (MANOVA) extends the ideas and methods of univariate ANOVA in simple and straightforward ways. But the familiar graphical methods typically used for univariate ANOVA are inadequate for showing how measures in a multivariate response vary with each other, and how their means vary with explanatory factors. Similarly, the graphical methods commonly used in multiple regression are not widely available or used in multivariate multiple regression (MMRA). We describe a variety of graphical methods for multiple-response (MANOVA and MMRA) data aimed at understanding what is being tested in a multivariate test, and how factor/predictor effects are expressed across multiple response measures.

In particular, we describe and illustrate: (a) Data ellipses and biplots for multivariate data; (b) HE plots, showing the hypothesis and error covariance matrices for a given pair of responses, and a given effect; (c) HE plot matrices, showing all pairwise HE plots; and (d) reduced-rank analogs of HE plots, showing all observations, group means, and their relations to the response variables. All of these methods are implemented in a collection of easily used SAS macro programs.  相似文献   

4.
This article studies M-type estimators for fitting robust generalized additive models in the presence of anomalous data. A new theoretical construct is developed to connect the costly M-type estimation with least-squares type calculations. Its asymptotic properties are studied and used to motivate a computational algorithm. The main idea is to decompose the overall M-type estimation problem into a sequence of well-studied conventional additive model fittings. The resulting algorithm is fast and stable, can be paired with different nonparametric smoothers, and can also be applied to cases with multiple covariates. As another contribution of this article, automatic methods for smoothing parameter selection are proposed. These methods are designed to be resistant to outliers. The empirical performance of the proposed methodology is illustrated via both simulation experiments and real data analysis. Supplementary materials are available online.  相似文献   

5.
本文讨论了具多非线性模不确定的一般Lurie系统的鲁棒问题.用Lyapunov函数方法,得到了一些新的鲁棒绝对稳定代数准则,这些准则减少了现有的结果的保守性.  相似文献   

6.
经验似然方法己经被广泛应用于许多模型的统计推断.本文基于经验似然对部分线性模型进行统计诊断.首先给出模型的估计方程,进而得到模型参数的极大经验似然估计;其次,基于经验似然研究了三种不同的影响曲率;最后通过随机模拟和实例分析,说明了统计诊断方法的有效性.  相似文献   

7.
运用Lyapunov函数法分析并建立了脉冲差分系统结构扰动下的R obust稳定性准则.  相似文献   

8.
This paper considers a stochastic facility location problem in which multiple capacitated facilities serve customers with a single product, and a stockout probabilistic requirement is stated as a chance constraint. Customer demand is assumed to be uncertain and to follow either a normal or an ambiguous distribution. We study robust approximations to the problem in order to incorporate information about the random demand distribution in the best possible, computationally tractable way. We also discuss how a decision maker’s risk preferences can be incorporated in the problem through robust optimization. Finally, we present numerical experiments that illustrate the performance of the different robust formulations. Robust optimization strategies for facility location appear to have better worst-case performance than nonrobust strategies. They also outperform nonrobust strategies in terms of realized average total cost when the actual demand distributions have higher expected values than the expected values used as input to the optimization models.  相似文献   

9.
This paper gives some new results on multi-time first-order PDE constrained control optimization problem in the face of data uncertainty (MCOPU). We obtain the robust sufficient optimality conditions for (MCOPU). Further, we construct an unconstrained multi-time control optimization problem (MCOPU)? corresponding to (MCOPU) via absolute value penalty function method. Then, we show that the robust optimal solution to the constrained problem and a robust minimizer to the unconstrained problem are equivalent under suitable hypotheses. Moreover, we give some non-trivial examples to validate the results established in this paper.  相似文献   

10.
We present a new method called UTAGMSINT for ranking a finite set of alternatives evaluated on multiple criteria. It belongs to the family of Robust Ordinal Regression (ROR) methods which build a set of preference models compatible with preference information elicited by the Decision Maker (DM). The preference model used by UTAGMSINT is a general additive value function augmented by two types of components corresponding to “bonus” or “penalty” values for positively or negatively interacting pairs of criteria, respectively. When calculating value of a particular alternative, a bonus is added to the additive component of the value function if a given pair of criteria is in a positive synergy for performances of this alternative on the two criteria. Similarly, a penalty is subtracted from the additive component of the value function if a given pair of criteria is in a negative synergy for performances of the considered alternative on the two criteria. The preference information elicited by the DM is composed of pairwise comparisons of some reference alternatives, as well as of comparisons of some pairs of reference alternatives with respect to intensity of preference, either comprehensively or on a particular criterion. In UTAGMSINT, ROR starts with identification of pairs of interacting criteria for given preference information by solving a mixed-integer linear program. Once the interacting pairs are validated by the DM, ROR continues calculations with the whole set of compatible value functions handling the interacting criteria, to get necessary and possible preference relations in the considered set of alternatives. A single representative value function can be calculated to attribute specific scores to alternatives. It also gives values to bonuses and penalties. UTAGMSINT handles quite general interactions among criteria and provides an interesting alternative to the Choquet integral.  相似文献   

11.
Most algorithms for highly robust estimators of multivariate location and scatter start by drawing a large number of random subsets. For instance, the FASTMCD algorithm of Rousseeuw and Van Driessen starts in this way, and then takes so-called concentration steps to obtain a more accurate approximation to the MCD. The FASTMCD algorithm is affine equivariant but not permutation invariant. In this article, we present a deterministic algorithm, denoted as DetMCD, which does not use random subsets and is even faster. It computes a small number of deterministic initial estimators, followed by concentration steps. DetMCD is permutation invariant and very close to affine equivariant. We compare it to FASTMCD and to the OGK estimator of Maronna and Zamar. We also illustrate it on real and simulated datasets, with applications involving principal component analysis, classification, and time series analysis. Supplemental material (Matlab code of the DetMCD algorithm and the datasets) is available online.  相似文献   

12.
A new algorithm to solve exact least trimmed squares (LTS) regression is presented. The adding row algorithm (ARA) extends existing methods that compute the LTS estimator for a given coverage. It employs a tree-based strategy to compute a set of LTS regressors for a range of coverage values. Thus, prior knowledge of the optimal coverage is not required. New nodes in the regression tree are generated by updating the QR decomposition of the data matrix after adding one observation to the regression model. The ARA is enhanced by employing a branch and bound strategy. The branch and bound algorithm is an exhaustive algorithm that uses a cutting test to prune nonoptimal subtrees. It significantly improves over the ARA in computational performance. Observation preordering throughout the traversal of the regression tree is investigated. A computationally efficient and numerically stable calculation of the bounds using Givens rotations is designed around the QR decomposition, avoiding the need to explicitly update the triangular factor when an observation is added. This reduces the overall computational load of the preordering device by approximately half. A solution is proposed to allow preordering when the model is underdetermined. It employs pseudo-orthogonal rotations to downdate the QR decomposition. The strategies are illustrated by example. Experimental results confirm the computational efficiency of the proposed algorithms. Supplemental materials (R package and formal proofs) are available online.  相似文献   

13.
线性回归模型的误差项不服从正态分布或存在多个离群点时,可以将残差秩次的某些函数作为权重引入估计模型来减少离群点的不良影响。本文从参数估计、稳健性质、回归诊断等方面对基于残差秩次的一类稳健回归方法进行介绍.通过模拟研究和实例分析表明,R和GR估计是一种估计效率较高的稳健回归方法,其中GR估计可同时避免X与Y空间离群点,而高失效点HBR估计可通过控制某个参数在稳健性与估计效率之间进行折衷.  相似文献   

14.
一类不确定双线性系统的状态反馈Robust控制*   总被引:2,自引:0,他引:2  
本文针对一类含不确定性的双线性系统设计了一种线性状态反馈控制.在一定的条件下,利用改进的李雅普诺夫第二方法关于稳定性的理论,证明了系统的稳定性.并给出例子说明.  相似文献   

15.
In the framework of generalized linear models, the nonrobustness of classical estimators and tests for the parameters is a well known problem, and alternative methods have been proposed in the literature. These methods are robust and can cope with deviations from the assumed distribution. However, they are based on first order asymptotic theory, and their accuracy in moderate to small samples is still an open question. In this paper, we propose a test statistic which combines robustness and good accuracy for moderate to small sample sizes. We combine results from Cantoni and Ronchetti [E. Cantoni, E. Ronchetti, Robust inference for generalized linear models, Journal of the American Statistical Association 96 (2001) 1022–1030] and Robinson, Ronchetti and Young [J. Robinson, E. Ronchetti, G.A. Young, Saddlepoint approximations and tests based on multivariate M-estimators, The Annals of Statistics 31 (2003) 1154–1169] to obtain a robust test statistic for hypothesis testing and variable selection, which is asymptotically χ2-distributed as the three classical tests but with a relative error of order O(n−1). This leads to reliable inference in the presence of small deviations from the assumed model distribution, and to accurate testing and variable selection, even in moderate to small samples.  相似文献   

16.
在有异常值的数据中,Bootstrap样本可能比原有样本含有更高的“污染”,这会降低所要做的统计推断的有效性.本文讨论在非参数回归N-W估计中,如何利用影响函数得到重新抽样的概率,使用倾斜的Bootstrap方法得到曲线的拟合,从而达到有效地抵制异常值对回归函数影响的目的,数值模拟的结果表明这种处理方式的有效性.  相似文献   

17.
In this article, we study some robust control problems associated with the multilayer quasi-geostrophic equations of the ocean and related to data assimilation in oceanography. We consider higher norms (compared to [T. Tachim Medjo, Robust control problems associated with the multilayer quasi-geostrophic equations of the ocean, Appl. Math. Optim. 51(3) (2005) 333–360]) in the definition of the cost functionals. We prove the existence and uniqueness of solutions. The result relies on better a priori estimates on the solutions to the multilayer quasi-geostrophic system obtained using a new formulation that we introduce for the multilayer quasi-geostrophic equation of the ocean. The new formulation replaces the non-homogenous boundary conditions (and the non-local constraint) on the stream-function by a simple homogenous Dirichlet boundary condition.  相似文献   

18.
不确定时滞Lurie系统的鲁棒绝对稳定性分析   总被引:1,自引:0,他引:1  
研究了一类不确定Lurie时滞系统的鲁棒绝对稳定性问题,不确定项范数有界。通过构造特殊的李亚普诺夫函数,分别考虑了时滞相关和时滞无关两情况,得到了系统鲁棒绝对稳定的充分条件。  相似文献   

19.
本文首先给出了一种新的判定多项式稳定的充要条件(引理2.1).然后,在此基础上,研究了区间多项式的鲁棒稳定性,得到了若干判别区间多项式的充分条件(定理2.2-定理2.3).由于所得的摄动界完全可由原末被扰动的多项式的系数所决定,这使得本文的方法比现有的结果简单好用.文末的例子说明了本文方法的有效性.  相似文献   

20.
区间动力系统的鲁棒稳定性分析   总被引:1,自引:0,他引:1  
高利新  汪治华 《应用数学》2004,17(4):497-502
本文研究区间系统的鲁棒稳定性问题 ,把连续区间系统的鲁棒Hurwtiz稳定和离散区间系统的鲁棒Schur稳定的问题等价转换于一参数扰动矩阵集的鲁棒非奇异问题 ,然后给出鲁棒Hurwtiz稳定和鲁棒Schur稳定的基于 μ 分析的充分必要条件 .  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号