Similar literature
 Found 20 similar articles (search time: 46 ms)
1.
We propose a new definition of the Neyman chi-square divergence between distributions. Based on convexity properties and duality, this version of the χ² is well suited both to the classical applications of the χ² in the analysis of contingency tables and to statistical tests in parametric models, for which it is advocated as robust against outliers. We present two applications in testing. In the first, we deal with goodness-of-fit tests for finite and infinite numbers of linear constraints; in the second, we apply χ² methodology to parametric testing against contamination.

2.
The usual assumption in multivariate hypothesis testing is that the sample consists of n independent, identically distributed Gaussian m-vectors. In this paper this assumption is weakened by considering a class of distributions for which the vector observations are not necessarily either Gaussian or independent. This class contains the elliptically symmetric laws with densities of the form f(X) = ψ[tr (X − M)′(X − M)Σ⁻¹], where X is the n × m data matrix. For testing the equality of k scale matrices and for the sphericity hypothesis it is shown, by using the structure of the underlying distribution rather than any specific form of the density, that the usual invariant normal-theory tests are exactly robust, for both the null and non-null cases, under this wider class.

3.
A general class of optimal and distribution-free rank tests for the two-sample modal directions problem on (hyper-)spheres is proposed, along with an asymptotic distribution theory for such spherical rank tests. The asymptotic optimality of the spherical rank tests in terms of power-equivalence to the spherical likelihood ratio tests is studied, while the spherical Wilcoxon rank test, an important case for the class of spherical rank tests, is further investigated. A data set is reanalyzed and some errors made in previous studies are corrected. On the usual sphere, a lower bound on the asymptotic Pitman relative efficiency relative to Hotelling's T²-type test is established, and a new distribution for which the spherical Wilcoxon rank test is optimal is also introduced.

4.
In protein threading, one is given a protein sequence, together with a database of protein core structures that may contain the natural structure of the sequence. The object of protein threading is to correctly identify the structure(s) corresponding to the sequence. Since the core structures are already associated with specific biological functions, threading has the potential to provide biologists with useful insights about the function of a newly discovered protein sequence. Statistical tests for threading results based on the theory of extreme values suggest several combinatorial problems. For example, what is the number of ways m′ = #_t{L_i > x_i}, i = 0, …, n, of choosing a sequence {X_i}, i = 1, …, n, from the set {1, 2, ..., t}, subject to the difference constraints L_i = X_{i+1} − X_i > x_i for i = 0, …, n, where X_0 = 0, X_{n+1} = t + 1, and {x_i}, i = 0, …, n, is an arbitrary sequence of integers? The quantity m′ has many attractive combinatorial interpretations and reduces in special continuous limits to a probabilistic formula discovered by de Finetti. Just as many important probabilities can be derived from de Finetti's formula, many interesting combinatorial quantities can be derived from m′. Empirical results presented here show that the combinatorial approach to threading statistics appears promising, but that structural periodicities in proteins and energetically unimportant structure elements probably introduce statistical correlations that must be better understood.
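
The counting problem posed in this abstract can be checked by brute force for small cases. The sketch below is only an illustration and is not taken from the paper: the function names are hypothetical, and the closed form C(t − Σx_i, n) is a standard stars-and-bars count that applies only under the added assumption that every x_i is nonnegative (the abstract allows arbitrary integers, for which only the brute-force count is used here).

```python
from itertools import product
from math import comb


def count_sequences_brute(t, x):
    """Count sequences (X_1, ..., X_n) drawn from {1, ..., t} such that
    X_{i+1} - X_i > x_i for i = 0..n, with X_0 = 0 and X_{n+1} = t + 1.
    `x` holds the n + 1 integers x_0, ..., x_n (hypothetical argument layout)."""
    n = len(x) - 1
    count = 0
    for xs in product(range(1, t + 1), repeat=n):
        full = (0, *xs, t + 1)  # pad with the fixed endpoints
        if all(full[i + 1] - full[i] > x[i] for i in range(n + 1)):
            count += 1
    return count


def count_sequences_closed(t, x):
    """Stars-and-bars count, valid only when every x_i >= 0 (an assumption
    added for this sketch, not a restriction stated in the abstract)."""
    if any(xi < 0 for xi in x):
        raise ValueError("closed form in this sketch requires all x_i >= 0")
    n = len(x) - 1
    top = t - sum(x)
    return comb(top, n) if top >= 0 else 0


if __name__ == "__main__":
    for t, x in [(5, (0, 0, 0)), (5, (1, 0, 1)), (6, (2, 1, 0, 1))]:
        print(t, x, count_sequences_brute(t, x), count_sequences_closed(t, x))
```

With all x_i = 0 the constraints simply force a strictly increasing sequence, so the count reduces to the number of n-element subsets of {1, ..., t}.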

5.
Let θ^(n) denote the maximum likelihood estimator of a vector parameter, based on an i.i.d. sample of size n. The class of estimators θ^(n) + n⁻¹q(θ^(n)), with q running through a class of sufficiently smooth functions, is essentially complete in the following sense: For any estimator T^(n) there exists q such that the risk of θ^(n) + n⁻¹q(θ^(n)) exceeds the risk of T^(n) by an amount of order o(n⁻¹) at most, simultaneously for all loss functions which are bounded, symmetric, and neg-unimodal. If q₁ is chosen such that θ^(n) + n⁻¹q₁(θ^(n)) is unbiased up to o(n^(−1/2)), then this estimator minimizes the risk up to an amount of order o(n⁻¹) in the class of all estimators which are unbiased up to o(n^(−1/2)). The results are obtained under the assumption that T^(n) admits a stochastic expansion, and that either the distributions have, roughly speaking, densities with respect to the Lebesgue measure, or the loss functions are sufficiently smooth.

6.
For multinormal distributions, testing against a global shift alternative, the Hotelling T²-test is uniformly most powerful invariant, and hence admissible. For testing against restricted alternatives this feature may no longer be true. It is shown that whenever the dispersion matrix is an M-matrix, Hotelling's T²-test is inadmissible, though some union-intersection tests may not be so.

7.
We propose different nonparametric tests for multivariate data and derive their asymptotic distribution for unbalanced designs in which the number of factor levels tends to infinity (large a, small n_i case). Quasi gratis, some new parametric multivariate tests suitable for the large a asymptotic case are also obtained. Finite sample performances are investigated and compared in a simulation study. The nonparametric tests are based on separate rankings for the different variables. In the presence of outliers, the proposed nonparametric methods have better power than their parametric counterparts. Application of the new tests is demonstrated using data from plant pathology.

8.
In this paper we consider the estimation of the error distribution in a heteroscedastic nonparametric regression model with multivariate covariates. As estimator we consider the empirical distribution function of residuals, which are obtained from multivariate local polynomial fits of the regression and variance functions, respectively. Weak convergence of the empirical residual process to a Gaussian process is proved. We also consider various applications for testing model assumptions in nonparametric multiple regression. The model tests obtained are able to detect local alternatives that converge to zero at an n^(−1/2) rate, independent of the covariate dimension. We consider in detail a test for additivity of the regression function.

9.
The so-called independent component (IC) model states that the observed p-vector X is generated via X = ΛZ + μ, where μ is a p-vector, Λ is a full-rank matrix, and the centered random vector Z has independent marginals. We consider the problem of testing the null hypothesis H0: μ = 0 on the basis of i.i.d. observations X1, …, Xn generated by the symmetric version of the IC model above (for which all ICs have a symmetric distribution about the origin). In the spirit of [M. Hallin, D. Paindaveine, Optimal tests for multivariate location based on interdirections and pseudo-Mahalanobis ranks, Annals of Statistics, 30 (2002), 1103-1133], we develop nonparametric (signed-rank) tests, which are valid without any moment assumption and are, for adequately chosen scores, locally and asymptotically optimal (in the Le Cam sense) at given densities. Our tests are measurable with respect to the marginal signed ranks computed in the collection of null residuals Λ̂⁻¹Xi, where Λ̂ is a suitable estimate of Λ. Provided that Λ̂ is affine-equivariant, the proposed tests, unlike the standard marginal signed-rank tests developed in [M.L. Puri, P.K. Sen, Nonparametric Methods in Multivariate Analysis, Wiley & Sons, New York, 1971] or any of their obvious generalizations, are affine-invariant. Local powers and asymptotic relative efficiencies (AREs) with respect to Hotelling's T² test are derived. Quite remarkably, when Gaussian scores are used, these AREs are always greater than or equal to one, with equality in the multinormal model only. Finite-sample efficiencies and robustness properties are investigated through a Monte Carlo study.

10.
The first problem considered is that of testing for the reality of the covariance matrix of a p-dimensional complex normal distribution, while the second is that of testing that a 2p-dimensional real normal distribution has a p-dimensional complex structure. Both problems are reduced by invariance to their maximal invariant statistics, and the null and non-null distributions of these are obtained. Complete classes of unbiased, invariant tests are described for both problems, the locally most powerful invariant tests are obtained, and the admissibility of the likelihood ratio tests is established.

11.
In this paper we develop an econometric method for consistent variable selection in the context of a linear factor model with observable factors for panels of large dimensions. The subset of factors that best fits the data is determined sequentially. First, a partial R² rule is used to show the existence of an optimal ordering of the candidate variables. Second, we show that for a given order of the regressors, the number of factors can be consistently estimated using the Bayes information criterion. The Akaike information criterion will asymptotically lead to overfitting of the model. The theory is established under an approximate factor structure which allows for limited cross-section and serial dependence in the idiosyncratic term. Simulations show that the proposed two-step selection technique has good finite sample properties. The likelihood of selecting the correct specification increases with the number of cross-sections both asymptotically and in small samples. Moreover, the proposed variable selection method is computationally attractive. For K potential candidate factors, the search requires only 2K regressions compared to 2^K for an exhaustive search.
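
To make the computational claim at the end of this abstract concrete, the sketch below compares the two search sizes. It is only an arithmetic illustration: the function names are hypothetical, the figure of 2K regressions is taken as stated in the abstract rather than derived, and the exhaustive search is assumed to fit one model per subset of the K candidate factors (including the empty subset).

```python
from itertools import combinations


def exhaustive_models(K):
    """Enumerate every subset of K candidate factors, including the empty one."""
    factors = range(K)
    return [subset for r in range(K + 1) for subset in combinations(factors, r)]


def sequential_regression_count(K):
    """Regression count quoted in the abstract for the two-step search."""
    return 2 * K


if __name__ == "__main__":
    for K in (5, 10, 20):
        # 2 ** K equals len(exhaustive_models(K)); computed directly to avoid
        # building a very large list for bigger K.
        print(K, sequential_regression_count(K), 2 ** K)
```

The gap grows quickly: the sequential count is linear in K, while the number of subsets doubles with every additional candidate factor.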

12.
13.
A multivariate dispersion ordering based on random simplices is proposed in this paper. Given an R^d-valued random vector, we consider two random simplices determined by the convex hulls of two independent random samples of size d+1 of the vector. By means of the stochastic comparison of the Hausdorff distances between such simplices, a multivariate dispersion ordering is introduced. The main properties of the new ordering are studied. Relationships with other dispersion orderings are considered, placing emphasis on the univariate version. Some statistical tests for the new order are proposed. An application of this ordering to the clinical evaluation of human corneal endothelia is provided. Different analyses are included using an image database of human corneal endothelia.

14.
Let X and Y be two nonnegative and dependent random variables following a generalized Farlie-Gumbel-Morgenstern distribution. In this short note, we study the impact of the dependence structure of X and Y on the tail behavior of XY. We quantify the impact as the limit, as x → ∞, of the quotient of Pr(XY > x) and Pr(X*Y* > x), where X* and Y* are independent random variables identically distributed as X and Y, respectively. We obtain an explicit expression for this limit when X has a regularly varying or rapidly varying tail.

15.
Homogeneity tests based on several progressively Type-II censored samples   Total citations: 2 (self-citations: 0; citations by others: 2)
In this paper, we discuss the problem of testing the homogeneity of several populations when the available data are progressively Type-II censored. Defining for each sample a univariate counting process, we can modify all the methods that were developed during the last two decades (see e.g. [P.K. Andersen, Ø. Borgan, R. Gill, N. Keiding, Statistical Models Based on Counting Processes, Springer, New York, 1993]) and apply them to this problem. An important aspect of these tests is that they are based on either linear or non-linear functionals of a discrepancy process (DP) based on the comparison of the cumulative hazard rate (chr) estimated from each sample with the chr estimated from the whole sample (viz., the aggregation of all the samples), leading to either linear tests or non-linear tests. Both these kinds of tests suffer from some serious drawbacks. For example, it is difficult to extend non-linear tests to the K-sample situation when K ≥ 3. For this reason, we propose here a new class of non-linear tests, based on a chi-square type functional of the DP, that can be applied to the K-sample problem for any K ≥ 2.

16.
Goodness-of-fit tests allow one to conclude that k possible outcomes are not equally likely. In this paper, we develop an exact equivalence test that allows one to conclude that k possible outcomes are approximately equally likely. We show that the power properties of the test compare favorably to those of possible alternative tests, and we develop an associated simultaneous confidence interval procedure. We apply the test to data sets on the digits of π, winning roulette numbers, and winning numbers from the Pennsylvania Lottery.

17.
Model identification and discrimination are two major statistical challenges. In this paper we consider a set of models M_k for factorial experiments with the parameters representing the general mean, main effects, and only k out of all two-factor interactions. We consider the class D of all fractional factorial plans with the same number of runs that are able to identify all the models in M_k, i.e., that have full estimation capacity. The fractional factorial plans in D with full estimation capacity for k ≥ 2 are able to discriminate between models in M_u for u ≤ k*, where k* = k/2 when k is even and k* = (k−1)/2 when k is odd. We obtain fractional factorial plans in D satisfying the six optimality criterion functions AD, AT, AMCR, GD, GT, and GMCR for 2^m factorial experiments when m = 4 and 5. Both single-stage and multi-stage (hierarchical) designs are given. Some results on the estimation capacity of a fractional factorial plan for identifying models in M_k are also given. Our designs D4.1 and D10 stand out in their performance relative to the designs given in Li and Nachtsheim [Model-robust factorial designs, Technometrics 42(4) (2000) 345-352] for m = 4 and 5 with respect to the criterion functions AD, AT, AMCR, GD, GT, and GMCR. Our design D4.2 stands out in its performance relative to the Li-Nachtsheim design for m = 4 with respect to the four criterion functions AT, AMCR, GT, and GMCR. However, the Li-Nachtsheim design for m = 4 stands out in its performance relative to our design D4.2 with respect to the criterion functions AD and GD. Our design D14 does have full estimation capacity for k = 5, but the twelve-run Li-Nachtsheim design does not.

18.
We study non-parametric tests for checking parametric hypotheses about a multivariate density f of independent identically distributed random vectors Z1, Z2, … which are observed under additional noise with density ψ. The tests we propose are an extension of the test due to Bickel and Rosenblatt [On some global measures of the deviations of density function estimates, Ann. Statist. 1 (1973) 1071-1095] and are based on a comparison of a nonparametric deconvolution estimator and the smoothed version of a parametric fit of the density f of the variables of interest Zi. In an example, the loss of efficiency is highlighted when the test is based on the convolved (but observable) density g = f*ψ instead of on the initial density of interest f.

19.
For normally distributed data from k populations with m×m covariance matrices Σ1, …, Σk, we test the hypothesis H: Σ1 = ⋯ = Σk vs the alternative A ≠ H when the numbers of observations Ni, i = 1, …, k, from each population are less than or equal to the dimension m, that is, Ni ≤ m, i = 1, …, k. Two tests are proposed and compared with two other tests proposed in the literature. These tests, however, do not require that Ni ≤ m, and thus can be used in all situations, including when the likelihood ratio test is available. The asymptotic distributions of the test statistics are given, and the power is compared by simulations with other test statistics proposed in the literature. The proposed tests perform well, and better in several cases than the other two tests available in the literature.

20.
This article analyzes whether some existing tests for the p×p covariance matrix Σ of N independent identically distributed observation vectors work under non-normality. We focus on three hypothesis testing problems: (1) testing for sphericity, that is, the covariance matrix Σ is proportional to an identity matrix Ip; (2) the covariance matrix Σ is an identity matrix Ip; and (3) the covariance matrix is a diagonal matrix. It is shown that the tests proposed by Srivastava (2005) for the above three problems are robust under the non-normality assumption made in this article irrespective of whether N ≤ p or N ≥ p, but (N, p) → ∞, and N/p may go to zero or infinity. Results are asymptotic and it may be noted that they may not hold for finite (N, p).
