首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Understanding and modeling dependence structures for multivariate extreme values are of interest in a number of application areas. One of the well-known approaches is to investigate the Pickands dependence function. In the bivariate setting, there exist several estimators for estimating the Pickands dependence function which assume known marginal distributions [J. Pickands, Multivariate extreme value distributions, Bull. Internat. Statist. Inst., 49 (1981) 859-878; P. Deheuvels, On the limiting behavior of the Pickands estimator for bivariate extreme-value distributions, Statist. Probab. Lett. 12 (1991) 429-439; P. Hall, N. Tajvidi, Distribution and dependence-function estimation for bivariate extreme-value distributions, Bernoulli 6 (2000) 835-844; P. Capéraà, A.-L. Fougères, C. Genest, A nonparametric estimation procedure for bivariate extreme value copulas, Biometrika 84 (1997) 567-577]. In this paper, we generalize the bivariate results to p-variate multivariate extreme value distributions with p?2. We demonstrate that the proposed estimators are consistent and asymptotically normal as well as have excellent small sample behavior.  相似文献   

2.
In this article, the Stein-Haff identity is established for a singular Wishart distribution with a positive definite mean matrix but with the dimension larger than the degrees of freedom. This identity is then used to obtain estimators of the precision matrix improving on the estimator based on the Moore-Penrose inverse of the Wishart matrix under the Efron-Morris loss function and its variants. Ridge-type empirical Bayes estimators of the precision matrix are also given and their dominance properties over the usual one are shown using this identity. Finally, these precision estimators are used in a quadratic discriminant rule, and it is shown through simulation that discriminant methods based on the ridge-type empirical Bayes estimators provide higher correct classification rates.  相似文献   

3.
A weighted multivariate signed-rank test is introduced for an analysis of multivariate clustered data. Observations in different clusters may then get different weights. The test provides a robust and efficient alternative to normal theory based methods. Asymptotic theory is developed to find the approximate p-value as well as to calculate the limiting Pitman efficiency of the test. A conditionally distribution-free version of the test is also discussed. The finite-sample behavior of different versions of the test statistic is explored by simulations and the new test is compared to the unweighted and weighted versions of Hotelling’s T2 test and the multivariate spatial sign test introduced in [D. Larocque, J. Nevalainen, H. Oja, A weighted multivariate sign test for cluster-correlated data, Biometrika 94 (2007) 267-283]. Finally, a real data example is used to illustrate the theory.  相似文献   

4.
The varying coefficient partially linear model is considered in this paper. When the plug-in estimators of coefficient functions are used, the resulting smoothing score function becomes biased due to the slow convergence rate of nonparametric estimations. To reduce the bias of the resulting smoothing score function, a profile-type smoothed score function is proposed to draw inferences on the parameters of interest without using the quasi-likelihood framework, the least favorable curve, a higher order kernel or under-smoothing. The resulting profile-type statistic is still asymptotically Chi-squared under some regularity conditions. The results are then used to construct confidence regions for the parameters of interest. A simulation study is carried out to assess the performance of the proposed method and to compare it with the profile least-squares method. A real dataset is analyzed for illustration.  相似文献   

5.
It is well-known that the univariate generalized Pareto distributions (GPD) are characterized by their peaks-over-threshold (POT) stability. We extend this result to multivariate GPDs.It is also shown that this POT stability is asymptotically shared by distributions which are in a certain neighborhood of a multivariate GPD. A multivariate extreme value distribution is a typical example.The usefulness of the results is demonstrated by various applications. We immediately obtain, for example, that the excess distribution of a linear portfolio with positive weights ai, id, is independent of the weights, if (U1,…,Ud) follows a multivariate GPD with identical univariate polynomial or Pareto margins, which was established by Macke [On the distribution of linear combinations of multivariate EVD and GPD distributed random vectors with an application to the expected shortfall of portfolios, Diploma Thesis, University of Würzburg, 2004, (in German)] and Falk and Michel [Testing for tail independence in extreme value models. Ann. Inst. Statist. Math. 58 (2006) 261-290]. This implies, for instance, that the expected shortfall as a measure of risk fails in this case.  相似文献   

6.
We consider a panel data semiparametric partially linear regression model with an unknown parameter vector for the linear parametric component, an unknown nonparametric function for the nonlinear component, and a one-way error component structure which allows unequal error variances (referred to as heteroscedasticity). We develop procedures to detect heteroscedasticity and one-way error component structure, and propose a weighted semiparametric least squares estimator (WSLSE) of the parametric component in the presence of heteroscedasticity and/or one-way error component structure. This WSLSE is asymptotically more efficient than the usual semiparametric least squares estimator considered in the literature. The asymptotic properties of the WSLSE are derived. The nonparametric component of the model is estimated by the local polynomial method. Some simulations are conducted to demonstrate the finite sample performances of the proposed testing and estimation procedures. An example of application on a set of panel data of medical expenditures in Australia is also illustrated.  相似文献   

7.
A finite sample performance measure of multivariate location estimators is introduced based on “tail behavior”. The tail performance of multivariate “monotone” location estimators and the halfspace depth based “non-monotone” location estimators including the Tukey halfspace median and multivariate L-estimators is investigated. The connections among the finite sample performance measure, the finite sample breakdown point, and the halfspace depth are revealed. It turns out that estimators with high breakdown point or halfspace depth have “appealing” tail performance. The tail performance of the halfspace median is very appealing and also robust against underlying population distributions, while the tail performance of the sample mean is very sensitive to underlying population distributions. These findings provide new insights into the notions of the halfspace depth and breakdown point and identify the important role of tail behavior as a quantitative measure of robustness in the multivariate location setting.  相似文献   

8.
In this paper, a fixed design regression model where the errors follow a strictly stationary process is considered. In this model the conditional mean function and the conditional variance function are unknown curves. Correlated errors when observations are missing in the response variable are assumed. Four nonparametric estimators of the conditional variance function based on local polynomial fitting are proposed. Expressions of the asymptotic bias and variance of these estimators are obtained. A simulation study illustrates the behavior of the proposed estimators.  相似文献   

9.
Parallel to Cox's [JRSS B34 (1972) 187-230] proportional hazards model, generalized logistic models have been discussed by Anderson [Bull. Int. Statist. Inst. 48 (1979) 35-53] and others. The essential assumption is that the two densities ratio has a known parametric form. A nice property of this model is that it naturally relates to the logistic regression model for categorical data. In astronomic, demographic, epidemiological, and other studies the variable of interest is often truncated by an associated variable. This paper studies generalized logistic models for the two-sample truncated data problem, where the two lifetime densities ratio is assumed to have the form exp{α+φ(x;β)}. Here φ is a known function of x and β, and the baseline density is unspecified. We develop a semiparametric maximum likelihood method for the case where the two samples have a common truncation distribution. It is shown that inferences for β do not depend the nonparametric components. We also derive an iterative algorithm to maximize the semiparametric likelihood for the general case where different truncation distributions are allowed. We further discuss how to check goodness of fit of the generalized logistic model. The developed methods are illustrated and evaluated using both simulated and real data.  相似文献   

10.
This paper develops estimation approaches for nonparametric regression analysis with surrogate data and validation sampling when response variables are measured with errors. Without assuming any error model structure between the true responses and the surrogate variables, a regression calibration kernel regression estimate is defined with the help of validation data. The proposed estimator is proved to be asymptotically normal and the convergence rate is also derived. A simulation study is conducted to compare the proposed estimators with the standard Nadaraya-Watson estimators with the true observations in the validation data set and the complete observations, respectively. The Nadaraya-Watson estimator with the complete observations can serve as a gold standard, even though it is practically unachievable because of the measurement errors.  相似文献   

11.
In this paper the concept of near-exact approximation to a distribution is introduced. Based on this concept it is shown how a random variable whose exponential has a Beta distribution may be closely approximated by a sum of independent Gamma random variables, giving rise to the generalized near-integer (GNI) Gamma distribution. A particular near-exact approximation to the distribution of the logarithm of the product of an odd number of independent Beta random variables is shown to be a GNI Gamma distribution. As an application, a near-exact approximation to the distribution of the generalized Wilks Λ statistic is obtained for cases where two or more sets of variables have an odd number of variables. This near-exact approximation gives the exact distribution when there is at most one set with an odd number of variables. In the other cases a near-exact approximation to the distribution of the logarithm of the Wilks Lambda statistic is found to be either a particular generalized integer Gamma distribution or a particular GNI Gamma distribution.  相似文献   

12.
For statistical inference connected to the scalar skew-normal distribution, it is known that the so-called centred parametrization provides a more convenient parametrization than the one commonly employed for writing the density function. We extend the definition of the centred parametrization to the multivariate case, and study the corresponding information matrix.  相似文献   

13.
Variance function estimation in multivariate nonparametric regression is considered and the minimax rate of convergence is established in the iid Gaussian case. Our work uses the approach that generalizes the one used in [A. Munk, Bissantz, T. Wagner, G. Freitag, On difference based variance estimation in nonparametric regression when the covariate is high dimensional, J. R. Stat. Soc. B 67 (Part 1) (2005) 19-41] for the constant variance case. As is the case when the number of dimensions d=1, and very much contrary to standard thinking, it is often not desirable to base the estimator of the variance function on the residuals from an optimal estimator of the mean. Instead it is desirable to use estimators of the mean with minimal bias. Another important conclusion is that the first order difference based estimator that achieves minimax rate of convergence in the one-dimensional case does not do the same in the high dimensional case. Instead, the optimal order of differences depends on the number of dimensions.  相似文献   

14.
In this paper, a new measure of dependence is proposed. Our approach is based on transforming univariate data to the space where the marginal distributions are normally distributed and then, using the inverse transformation to obtain the distribution function in the original space. The pseudo-maximum likelihood method and the two-stage maximum likelihood approach are used to estimate the unknown parameters. It is shown that the estimated parameters are asymptotical normally distributed in both cases. Inference procedures for testing the independence are also studied.  相似文献   

15.
A general notion of trimmed regions for empirical distributions in d-space is introduced. The regions are called weighted-mean trimmed regions. They are continuous in the data as well as in the trimming parameter. Further, these trimmed regions have many other attractive properties. In particular they are subadditive and monotone which makes it possible to construct multivariate measures of risk based on these regions. Special cases include the zonoid trimming and the ECH (expected convex hull) trimming. These regions can be exactly calculated for any dimension. Finally, the notion of weighted-mean trimmed regions extends to probability distributions in d-space, and a law of large numbers applies.  相似文献   

16.
In this paper we introduce a family of symmetrised M-estimators of multivariate scatter. These are defined to be M-estimators only computed on pairwise differences of the observed multivariate data. Symmetrised Huber's M-estimator and Dümbgen's estimator serve as our examples. The influence functions of the symmetrised M-functionals are derived and the limiting distributions of the estimators are discussed in the multivariate elliptical case to consider the robustness and efficiency properties of estimators. The symmetrised M-estimators have the important independence property; they can therefore be used to find the independent components in the independent component analysis (ICA).  相似文献   

17.
A new nonparametric approach to the problem of testing the joint independence of two or more random vectors in arbitrary dimension is developed based on a measure of association determined by interpoint distances. The population independence coefficient takes values between 0 and 1, and equals zero if and only if the vectors are independent. We show that the corresponding statistic has a finite limit distribution if and only if the two random vectors are independent; thus we have a consistent test for independence. The coefficient is an increasing function of the absolute value of product moment correlation in the bivariate normal case, and coincides with the absolute value of correlation in the Bernoulli case. A simple modification of the statistic is affine invariant. The independence coefficient and the proposed statistic both have a natural extension to testing the independence of several random vectors. Empirical performance of the test is illustrated via a comparative Monte Carlo study.  相似文献   

18.
A set of n-principal points of a distribution is defined as a set of n points that optimally represent the distribution in terms of mean squared distance. It provides an optimal n-point-approximation of the distribution. However, it is in general difficult to find a set of principal points of a multivariate distribution. Tarpey et al. [T. Tarpey, L. Li, B. Flury, Principal points and self-consistent points of elliptical distributions, Ann. Statist. 23 (1995) 103-112] established a theorem which states that any set of n-principal points of an elliptically symmetric distribution is in the linear subspace spanned by some principal eigenvectors of the covariance matrix. This theorem, called a “principal subspace theorem”, is a strong tool for the calculation of principal points. In practice, we often come across distributions consisting of several subgroups. Hence it is of interest to know whether the principal subspace theorem remains valid even under such complex distributions. In this paper, we define a multivariate location mixture model. A theorem is established that clarifies a linear subspace in which n-principal points exist.  相似文献   

19.
A generalized Mahalanobis distance for mixed data   总被引:1,自引:0,他引:1  
A distance for mixed nominal, ordinal and continuous data is developed by applying the Kullback-Leibler divergence to the general mixed-data model, an extension of the general location model that allows for ordinal variables to be incorporated in the model. The distance obtained can be considered as a generalization of the Mahalanobis distance to data with a mixture of nominal, ordinal and continuous variables. Moreover, it includes as special cases previous Mahalanobis-type distances developed by Bedrick et al. (Biometrics 56 (2000) 394) and Bar-Hen and Daudin (J. Multivariate Anal. 53 (1995) 332). Asymptotic results regarding the maximum likelihood estimator of the distance are discussed. The results of a simulation study on the level and power of the tests are reported and a real-data example illustrates the method.  相似文献   

20.
High dimensional data routinely arises in image analysis, genetic experiments, network analysis, and various other research areas. Many such datasets do not correspond to well-studied probability distributions, and in several applications the data-cloud prominently displays non-symmetric and non-convex shape features. We propose using spatial quantiles and their generalizations, in particular, the projection quantile, for describing, analyzing and conducting inference with multivariate data. Minimal assumptions are made about the nature and shape characteristics of the underlying probability distribution, and we do not require the sample size to be as high as the data-dimension. We present theoretical properties of the generalized spatial quantiles, and an algorithm to compute them quickly. Our quantiles may be used to obtain multidimensional confidence or credible regions that are not required to conform to a pre-determined shape. We also propose a new notion of multidimensional order statistics, which may be used to obtain multidimensional outliers. Many of the features revealed using a generalized spatial quantile-based analysis would be missed if the data was shoehorned into a well-known probabilistic configuration.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号