首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper, we use directed acyclic graphs (DAGs) with temporal structure to describe models of nonignorable nonresponse mechanisms for binary outcomes in longitudinal studies, and we discuss identification of these models under an assumption that the sequence of variables has the first-order Markov dependence, that is, the future variables are independent of the past variables conditional on the present variables. We give a stepwise approach for checking identifiability of DAG models. For an unidentifiable model, we propose adding completely observed variables such that this model becomes identifiable.  相似文献   

2.
In this paper we consider exact tests of a multiple logistic regression with categorical covariates via Markov bases. In many applications of multiple logistic regression, the sample size is positive for each combination of levels of the covariates. In this case we do not need a whole Markov basis, which guarantees connectivity of all fibers. We first give an explicit Markov basis for multiple Poisson regression. By the Lawrence lifting of this basis, in the case of bivariate logistic regression, we show a simple subset of the Markov basis which connects all fibers with a positive sample size for each combination of levels of covariates.  相似文献   

3.
The ratio of the largest eigenvalue divided by the trace of a p×p random Wishart matrix with n degrees of freedom and an identity covariance matrix plays an important role in various hypothesis testing problems, both in statistics and in signal processing. In this paper we derive an approximate explicit expression for the distribution of this ratio, by considering the joint limit as both p,n with p/nc. Our analysis reveals that even though asymptotically in this limit the ratio follows a Tracy-Widom (TW) distribution, one of the leading error terms depends on the second derivative of the TW distribution, and is non-negligible for practical values of p, in particular for determining tail probabilities. We thus propose to explicitly include this term in the approximate distribution for the ratio. We illustrate empirically using simulations that adding this term to the TW distribution yields a quite accurate expression to the empirical distribution of the ratio, even for small values of p,n.  相似文献   

4.
The estimation problem of the parameters in a symmetry model for categorical data has been considered for many authors in the statistical literature (for example, Bowker (1948) [1], Ireland et al. (1969) [2], Quade and Salama (1975) [3], Cressie and Read (1988) [4], Menéndez et al. (2005) [5]) without using uncertain prior information. It is well known that many new and interesting estimators, using uncertain prior information, have been studied by a host of researchers in different statistical models, and many papers have been published on this topic (see Saleh (2006) [9] and references therein). In this paper, we consider the symmetry model of categorical data and we study, for the first time, some new estimators when non-sample information about the symmetry of the probabilities is considered. The decision to use a “restricted” estimator or an “unrestricted” estimator is based on the outcome of a preliminary test, and then a shrinkage technique is used. It is interesting to note that we present a unified study in the sense that we consider not only the maximum likelihood estimator and likelihood ratio test or chi-square test statistic but we consider minimum phi-divergence estimators and phi-divergence test statistics. Families of minimum phi-divergence estimators and phi-divergence test statistics are wide classes of estimators and test statistics that contain as a particular case the maximum likelihood estimator, likelihood ratio test and chi-square test statistic. In an asymptotic set-up, the biases and the risk under the squared loss function for the proposed estimators are derived and compared. A numerical example clarifies the content of the paper.  相似文献   

5.
Principal component analysis (PCA) is a widely used tool for data analysis and dimension reduction in applications throughout science and engineering. However, the principal components (PCs) can sometimes be difficult to interpret, because they are linear combinations of all the original variables. To facilitate interpretation, sparse PCA produces modified PCs with sparse loadings, i.e. loadings with very few non-zero elements. In this paper, we propose a new sparse PCA method, namely sparse PCA via regularized SVD (sPCA-rSVD). We use the connection of PCA with singular value decomposition (SVD) of the data matrix and extract the PCs through solving a low rank matrix approximation problem. Regularization penalties are introduced to the corresponding minimization problem to promote sparsity in PC loadings. An efficient iterative algorithm is proposed for computation. Two tuning parameter selection methods are discussed. Some theoretical results are established to justify the use of sPCA-rSVD when only the data covariance matrix is available. In addition, we give a modified definition of variance explained by the sparse PCs. The sPCA-rSVD provides a uniform treatment of both classical multivariate data and high-dimension-low-sample-size (HDLSS) data. Further understanding of sPCA-rSVD and some existing alternatives is gained through simulation studies and real data examples, which suggests that sPCA-rSVD provides competitive results.  相似文献   

6.
We consider the problem of deriving the asymptotic distribution of the three commonly used multivariate test statistics, namely likelihood ratio, Lawley-Hotelling and Bartlett-Nanda-Pillai statistics, for testing hypotheses on the various effects (main, nested or interaction) in multivariate mixed models. We derive the distributions of these statistics, both in the null as well as non-null cases, as the number of levels of one of the main effects (random or fixed) goes to infinity. The robustness of these statistics against departure from normality will be assessed.Essentially, in the asymptotic spirit of this paper, both the hypothesis and error degrees of freedom tend to infinity at a fixed rate. It is intuitively appealing to consider asymptotics of this type because, for example, in random or mixed effects models, the levels of the main random factors are assumed to be a random sample from a large population of levels.For the asymptotic results of this paper to hold, we do not require any distributional assumption on the errors. That means the results can be used in real-life applications where normality assumption is not tenable.As it happens, the asymptotic distributions of the three statistics are normal. The statistics have been found to be asymptotically null robust against the departure from normality in the balanced designs. The expressions for the asymptotic means and variances are fairly simple. That makes the results an attractive alternative to the standard asymptotic results. These statements are favorably supported by the numerical results.  相似文献   

7.
In this paper, we use the kernel method to estimate sliced average variance estimation (SAVE) and prove that this estimator is both asymptotically normal and root n consistent. We use this kernel estimator to provide more insight about the differences between slicing estimation and other sophisticated local smoothing methods. Finally, we suggest a Bayes information criterion (BIC) to estimate the dimensionality of SAVE. Examples and real data are presented for illustrating our method.  相似文献   

8.
The maximum asymptotic bias of an estimator is a global robustness measure of its performance. The projection median estimator for multivariate location shows a remarkable behavior regarding asymptotic bias. In this paper we consider a modification of the projection median estimator which renders an estimate with better bias performance for point mass contaminations (the worst situation for the projection median estimator). Moreover, it achieves the lowest bound for an equivariant estimate for point mass contaminations.  相似文献   

9.
In this paper, we propose auto-associative (AA) models to generalize Principal component analysis (PCA). AA models have been introduced in data analysis from a geometrical point of view. They are based on the approximation of the observations scatter-plot by a differentiable manifold. In this paper, they are interpreted as Projection pursuit models adapted to the auto-associative case. Their theoretical properties are established and are shown to extend the PCA ones. An iterative algorithm of construction is proposed and its principle is illustrated both on simulated and real data from image analysis.  相似文献   

10.
Ledford and Tawn (1997) introduced a flexible bivariate tail model based on the coefficient of tail dependence and on the dependence of the extreme values of the random variables. In this paper, we extend the concept by specifying the slowly varying part of the model as done by Hall (1982) with the univariate case. Based on Beirlant et al. (2009), we propose a bias-reduced estimator for the coefficient of tail dependence and for the estimation of small tail probabilities. We discuss the properties of these estimators via simulations and a real-life example. Furthermore, we discuss some theoretical asymptotic aspects of this approach.  相似文献   

11.
Characterizations of multivariate life distributions   总被引:1,自引:0,他引:1  
Characterizations of multivariate distributions has been a topic of great interest in applied statistics literature for the last three decades. In this paper, we develop characterizations of multivariate lifetime distributions by relationship between multivariate failure rates (reversed failure rates) and the left (right) truncated expectations of functions of random variables. We, then, discuss the application of the results to derive a multivariate Stein type identity.  相似文献   

12.
For continuous random variables, many dependence concepts and measures of association can be expressed in terms of the corresponding copula only and are thus independent of the marginal distributions. This interrelationship generally fails as soon as there are discontinuities in the marginal distribution functions. In this paper, we consider an alternative transformation of an arbitrary random variable to a uniformly distributed one. Using this technique, the class of all possible copulas in the general case is investigated. In particular, we show that one of its members—the standard extension copula introduced by Schweizer and Sklar—captures the dependence structures in an analogous way the unique copula does in the continuous case. Furthermore, we consider measures of concordance between arbitrary random variables and obtain generalizations of Kendall's tau and Spearman's rho that correspond to the sample version of these quantities for empirical distributions.  相似文献   

13.
We consider models for the covariance between two blocks of variables. Such models are often used in situations where latent variables are believed to present. In this paper we characterize exactly the set of distributions given by a class of models with one-dimensional latent variables. These models relate two blocks of observed variables, modeling only the cross-covariance matrix. We describe the relation of this model to the singular value decomposition of the cross-covariance matrix. We show that, although the model is underidentified, useful information may be extracted. We further consider an alternative parameterization in which one latent variable is associated with each block, and we extend the result to models with r-dimensional latent variables.  相似文献   

14.
In this paper we show how, based on a decomposition of the likelihood ratio test for sphericity into two independent tests and a suitably developed decomposition of the characteristic function of the logarithm of the likelihood ratio test statistic to test independence in a set of variates, we may obtain extremely well-fitting near-exact distributions for both test statistics. Since both test statistics have the distribution of the product of independent Beta random variables, it is possible to obtain near-exact distributions for both statistics in the form of Generalized Near-Integer Gamma distributions or mixtures of these distributions. For the independence test statistic, numerical studies and comparisons with asymptotic distributions proposed by other authors show the extremely high accuracy of the near-exact distributions developed as approximations to the exact distribution. Concerning the sphericity test statistic, comparisons with formerly developed near-exact distributions show the advantages of these new near-exact distributions.  相似文献   

15.
An exhaustive search as required for traditional variable selection methods is impractical in high dimensional statistical modeling. Thus, to conduct variable selection, various forms of penalized estimators with good statistical and computational properties, have been proposed during the past two decades. The attractive properties of these shrinkage and selection estimators, however, depend critically on the size of regularization which controls model complexity. In this paper, we consider the problem of consistent tuning parameter selection in high dimensional sparse linear regression where the dimension of the predictor vector is larger than the size of the sample. First, we propose a family of high dimensional Bayesian Information Criteria (HBIC), and then investigate the selection consistency, extending the results of the extended Bayesian Information Criterion (EBIC), in Chen and Chen (2008) to ultra-high dimensional situations. Second, we develop a two-step procedure, the SIS+AENET, to conduct variable selection in p>n situations. The consistency of tuning parameter selection is established under fairly mild technical conditions. Simulation studies are presented to confirm theoretical findings, and an empirical example is given to illustrate the use in the internet advertising data.  相似文献   

16.
This paper is primarily concerned with the open problem of minimizing the lower tail of the multinomial distribution. During the study of that specific problem, we have developed an approach which reveals itself useful for solving a general class of problems involving multinomial probabilities. Concerning the main problem, we provide a self-contained proof that the minimum of the multinomial lower tail is reached, as conjectured by Gupta and Nagel (Sankhya Ser. B 29 (1967) 1) (within the framework of subset-selection problems) at the equal probability configuration, i.e., when the cell probabilities are equal to one another. We also point out some novel inequalities and general properties involving multinomial probabilities and multinomial coefficients.  相似文献   

17.
De Finetti style theorems characterize models (predictive distributions) as mixtures of the likelihood function and the prior distribution, beginning from some judgment of invariance about observable quantities. The likelihood function generally has its functional form identified from invariance assumptions only. However, we need additional conditions on observable quantities (typically, assumptions on conditional expectations) to identify the prior distribution. In this paper, we consider some well-known invariance assumptions and establish additional conditions on observable quantities in order to obtain a predictivistic characterization of the multivariate and matrix-variate Student-t distributions as well as for the Student-t linear model. As a byproduct, a characterization for the Pearson type II distribution is provided.  相似文献   

18.
The canonical correlation (CANCOR) method for dimension reduction in a regression setting is based on the classical estimates of the first and second moments of the data, and therefore sensitive to outliers. In this paper, we study a weighted canonical correlation (WCANCOR) method, which captures a subspace of the central dimension reduction subspace, as well as its asymptotic properties. In the proposed WCANCOR method, each observation is weighted based on its Mahalanobis distance to the location of the predictor distribution. Robust estimates of the location and scatter, such as the minimum covariance determinant (MCD) estimator of Rousseeuw [P.J. Rousseeuw, Multivariate estimation with high breakdown point, Mathematical Statistics and Applications B (1985) 283-297], can be used to compute the Mahalanobis distance. To determine the number of significant dimensions in WCANCOR, a weighted permutation test is considered. A comparison of SIR, CANCOR and WCANCOR is also made through simulation studies to show the robustness of WCANCOR to outlying observations. As an example, the Boston housing data is analyzed using the proposed WCANCOR method.  相似文献   

19.
In this paper, we consider the general growth curve model with multivariate random effects covariance structure and provide a new simple estimator for the parameters of interest. This estimator is not only convenient for testing the hypothesis on the corresponding parameters, but also has higher efficiency than the least-square estimator and the improved two-stage estimator obtained by Rao under certain conditions. Moreover, we obtain the necessary and sufficient condition for the new estimator to be identical to the best linear unbiased estimator. Examples of its application are given.  相似文献   

20.
This paper presents a statistic for testing the hypothesis of elliptical symmetry. The statistic also provides a specialized test of multivariate normality. We obtain the asymptotic distribution of this statistic under the null hypothesis of multivariate normality, and give a bootstrapping procedure for approximating the null distribution of the statistic under an arbitrary elliptically symmetric distribution. We present simulation results to examine the accuracy of the asymptotic distribution and the performance of the bootstrapping procedure. Finally, for selected alternatives, we compare the power of our test statistic with that of recently proposed tests for elliptical symmetry given by Manzotti et al. [A statistic for testing the null hypothesis of elliptical symmetry, J. Multivariate Anal. 81 (2002) 274-285] and Schott [Testing for elliptical symmetry in covariance-matrix-based analyses, Statist. Probab. Lett. 60 (2002) 395-404], and with that of the well known tests for multivariate normality of Mardia [Measures of multivariate skewness and kurtosis with applications, Biometrika 57 (1970) 519-530] and Baringhaus and Henze [A consistent test for multivariate normality based on the empirical characteristic function, Metrika 35 (1988) 339-348].  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号