首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper we aim to estimate the direction in general single-index models and to select important variables simultaneously when a diverging number of predictors are involved in regressions. Towards this end, we propose the nonconcave penalized inverse regression method. Specifically, the resulting estimation with the SCAD penalty enjoys an oracle property in semi-parametric models even when the dimension, pn, of predictors goes to infinity. Under regularity conditions we also achieve the asymptotic normality when the dimension of predictor vector goes to infinity at the rate of pn=o(n1/3) where n is sample size, which enables us to construct confidence interval/region for the estimated index. The asymptotic results are augmented by simulations, and illustrated by analysis of an air pollution dataset.  相似文献   

2.
There are hypothesis testing problems for (nonlinear) functions of parameters against functional ordered alternatives for which a reduction to a conventional order-restricted hypothesis testing problem may not be feasible. While such problems can be handled in an asymptotic setup, among the available choices, it is shown that the union-intersection principle may have certain advantages over the likelihood principle or its ramifications. An application to a genomic model is also considered.  相似文献   

3.
4.
Multiple hypotheses testing is concerned with appropriately controlling the rate of false positives, false negatives or both when testing several hypotheses simultaneously. Nowadays, the common approach to testing multiple hypotheses calls for controlling the expected proportion of falsely rejected null hypotheses referred to as the false discovery rate (FDR) or suitable measures based on the positive false discovery rate (pFDR). In this paper, we consider the problem of determining levels that both false positives and false negatives can be controlled simultaneously. As our risk function, we use the expected value of the maximum between the proportions of false positives and false negatives, with the expectation being taken conditional on the event that at least one hypothesis is rejected and one is accepted, referred to as hybrid error rate (HER). We then develop, based on HER, an analog of p-value termed as h-value to test the individual hypotheses. The use of the new procedure is illustrated using the well-known public data set by Golub et al. [Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science 386 (1999) 531-537] with Affymetrix arrays of patients with acute lymphoic leukemia and acute myeloid leukemia.  相似文献   

5.
An exhaustive search as required for traditional variable selection methods is impractical in high dimensional statistical modeling. Thus, to conduct variable selection, various forms of penalized estimators with good statistical and computational properties, have been proposed during the past two decades. The attractive properties of these shrinkage and selection estimators, however, depend critically on the size of regularization which controls model complexity. In this paper, we consider the problem of consistent tuning parameter selection in high dimensional sparse linear regression where the dimension of the predictor vector is larger than the size of the sample. First, we propose a family of high dimensional Bayesian Information Criteria (HBIC), and then investigate the selection consistency, extending the results of the extended Bayesian Information Criterion (EBIC), in Chen and Chen (2008) to ultra-high dimensional situations. Second, we develop a two-step procedure, the SIS+AENET, to conduct variable selection in p>n situations. The consistency of tuning parameter selection is established under fairly mild technical conditions. Simulation studies are presented to confirm theoretical findings, and an empirical example is given to illustrate the use in the internet advertising data.  相似文献   

6.
Asymptotic multivariate normal approximations to the joint distributions of edge exclusion test statistics for saturated graphical log-linear models, with all variables binary, are derived. Non-signed and signed square-root versions of the likelihood ratio, Wald and score test statistics are considered. Non-central chi-squared approximations are also considered for the non-signed versions of the test statistics. Simulation results are used to assess the quality of the proposed approximations. These approximations are used to estimate the overall power of edge exclusion tests. Power calculations are illustrated using data on university admissions.  相似文献   

7.
A class of test statistics are proposed for sparse tables with ordered categories. It is shown that for different testing situations these test statistics asymptotically more powerful tests than Pearson's chi-square.  相似文献   

8.
Diagnostic checking for multivariate parametric models is investigated in this article. A nonparametric Monte Carlo Test (NMCT) procedure is proposed. This Monte Carlo approximation is easy to implement and can automatically make any test procedure scale-invariant even when the test statistic is not scale-invariant. With it we do not need plug-in estimation of the asymptotic covariance matrix that is used to normalize test statistic and then the power performance can be enhanced. The consistency of NMCT approximation is proved. For comparison, we also extend the score type test to one-dimensional cases. NMCT can also be applied to diverse problems such as a classical problem for which we test whether or not certain covariables in linear model has significant impact for response. Although the Wilks lambda, a likelihood ratio test, is a proven powerful test, NMCT outperforms it especially in non-normal cases. Simulations are carried out and an application to a real data set is illustrated.  相似文献   

9.
In many real world classification problems, class-conditional classification noise (CCC-Noise) frequently deteriorates the performance of a classifier that is naively built by ignoring it. In this paper, we investigate the impact of CCC-Noise on the quality of a popular generative classifier, normal discriminant analysis (NDA), and its corresponding discriminative classifier, logistic regression (LR). We consider the problem of two multivariate normal populations having a common covariance matrix. We compare the asymptotic distribution of the misclassification error rate of these two classifiers under CCC-Noise. We show that when the noise level is low, the asymptotic error rates of both procedures are only slightly affected. We also show that LR is less deteriorated by CCC-Noise compared to NDA. Under CCC-Noise contexts, the Mahalanobis distance between the populations plays a vital role in determining the relative performance of these two procedures. In particular, when this distance is small, LR tends to be more tolerable to CCC-Noise compared to NDA.  相似文献   

10.
“Kriging” is the name of a parametric regression method used by hydrologists and mining engineers, among others. Features of the kriging approach are that it also provides an error estimate and that it can conveniently be employed also to estimate the integral of the regression function. In the present work, the kriging method is described and some of its statistical characteristics are explored. Also, some extensions of the nonparametric regression approach are made so that it too displays the kriging features. In particular, a “data driven” estimator of the expected square error is derived. Theoretical and computational comparisons of the kriging and nonparametric regressors are offered.  相似文献   

11.
A trace test for the mean parameters of the growth curve model is proposed. It is constructed using the restricted maximum likelihood followed by an estimated likelihood ratio approach. The statistic reduces to the Lawley-Hotelling trace test for the Multivariate Analysis of Variance (MANOVA) models. Our test statistic is, therefore, a natural extension of the classical trace test to GMANOVA models. We show that the distribution of the test under the null hypothesis does not depend on the unknown covariance matrix Σ. We also show that the distributions under the null and alternative hypotheses can be represented as sums of weighted central and non-central chi-square random variables, respectively. Under the null hypothesis, the Satterthwaite approximation is used to get an approximate critical point. A novel Satterthwaite type approximation is proposed to obtain an approximate power. A simulation study is performed to evaluate the performance of our proposed test and numerical examples are provided as illustrations.  相似文献   

12.
In this paper, we consider a semiparametric modeling with multi-indices when neither the response nor the predictors can be directly observed and there are distortions from some multiplicative factors. In contrast to the existing methods in which the response distortion deteriorates estimation efficacy even for a simple linear model, the dimension reduction technique presented in this paper interestingly does not have to account for distortion of the response variable. The observed response can be used directly whether distortion is present or not. The resulting dimension reduction estimators are shown to be consistent and asymptotically normal. The results can be employed to test whether the central dimension reduction subspace has been estimated appropriately and whether the components in the basis directions in the space are significant. Thus, the method provides an alternative for determining the structural dimension of the subspace and for variable selection. A simulation study is carried out to assess the performance of the proposed method. The analysis of a real dataset demonstrates the potential usefulness of distortion removal.  相似文献   

13.
In this paper, we consider simultaneous confidence intervals for all contrasts in the means when the observations are missing at random in the intraclass correlation model. An exact test statistic for the equality of the means and Scheffé, Bonferroni and Tukey types of simultaneous confidence intervals are given by an extension of Bhargava and Srivastava [On Tukey's confidence intervals for the contrasts in the means of the intraclass correlation model, J. Royal Statist. Soc. B35 (1973) 147-152] when the missing observations are of the monotone type. Finally, numerical results of simultaneous confidence intervals are presented.  相似文献   

14.
Goodness-of-fit tests allow one to conclude that k possible outcomes are not equally likely. In this paper, we develop an exact equivalence test that allows one to conclude that k possible outcomes are approximately equally likely. We show that the power properties of the test compare favorably to those of possible alternative tests, and we develop an associated simultaneous confidence interval procedure. We apply the test to data sets on the digits of π, winning roulette numbers, and winning numbers from the Pennsylvania Lottery.  相似文献   

15.
We consider dependence structures in multivariate time series that are characterized by deterministic trends. Results from spectral analysis for stationary processes are extended to deterministic trend functions. A regression cross covariance and spectrum are defined. Estimation of these quantities is based on wavelet thresholding. The method is illustrated by a simulated example and a three-dimensional time series consisting of ECG, blood pressure and cardiac stroke volume measurements.  相似文献   

16.
This paper studies the exact distributions of the MLEs of the regression coefficient matrices in a GMANOVA-MANOVA model with normal error. The unique conditions for linear functions of the MLEs of regression coefficient matrices are presented, and the exact density functions or characteristic functions for these linear functions are derived.  相似文献   

17.
Let (X,Y) be a Rd×N0-valued random vector where the conditional distribution of Y given X=x is a Poisson distribution with mean m(x). We estimate m by a local polynomial kernel estimate defined by maximizing a localized log-likelihood function. We use this estimate of m(x) to estimate the conditional distribution of Y given X=x by a corresponding Poisson distribution and to construct confidence intervals of level α of Y given X=x. Under mild regularity conditions on m(x) and on the distribution of X we show strong convergence of the integrated L1 distance between Poisson distribution and its estimate. We also demonstrate that the corresponding confidence interval has asymptotically (i.e., for sample size tending to infinity) level α, and that the probability that the length of this confidence interval deviates from the optimal length by more than one converges to zero with the number of samples tending to infinity.  相似文献   

18.
For multinormal distributions, testing against a global shift alternative, the Hotelling T2-test is uniformly most powerful invariant, and hence admissible. For testing against restricted alternatives this feature may no longer be true. It is shown that whenever the dispersion matrix is an M-matrix, Hotelling's T2-test is inadmissible, though some union-intersection tests may not be so.  相似文献   

19.
In this paper, we parameterize non-negative matrices of sum one and rank at most two using the least possible number of parameters. We also show how this parameterization relates to a class of statistical models, known in Probability and Statistics as mixture models for contingency tables. In particular, we show how to use this parameterization to make some optimization problems computationally easier.  相似文献   

20.
We explore simultaneous modeling of several covariance matrices across groups using the spectral (eigenvalue) decomposition and modified Cholesky decomposition. We introduce several models for covariance matrices under different assumptions about the mean structure. We consider ‘dependence’ matrices, which tend to have many parameters, as constant across groups and/or parsimoniously modeled via a regression formulation. For ‘variances’, we consider both unrestricted across groups and more parsimoniously modeled via log-linear models. In all these models, we explore the propriety of the posterior when improper priors are used on the mean and ‘variance’ parameters (and in some cases, on components of the ‘dependence’ matrices). The models examined include several common Bayesian regression models, whose propriety has not been previously explored, as special cases. We propose a simple approach to weaken the assumption of constant dependence matrices in an automated fashion and describe how to compute Bayes factors to test the hypothesis of constant ‘dependence’ across groups. The models are applied to data from two longitudinal clinical studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号