Similar literature
20 similar documents found.
1.
In many applications, some covariates may be missing for various reasons. Regression quantiles can be either biased or under-powered when the missing data are ignored. Multiple imputation (MI) and an EM-based augmentation approach have been proposed to fully utilize the data with missing covariates in quantile regression. Both methods, however, are computationally expensive. We propose a fast imputation algorithm (FI) to handle missing covariates in quantile regression, which is an extension of fractional imputation in likelihood-based regression. FI and modified imputation algorithms (FIIPW and MIIPW) are compared with the existing MI and inverse probability weighting (IPW) approaches in simulation studies, and applied to part of the National Collaborative Perinatal Project study.
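As background for the IPW baseline named in this abstract, the following is a minimal sketch of an inverse-probability-weighted quantile regression objective. It is not the authors' FI algorithm. The function names, the single-covariate linear model, and the assumption that the observation probabilities are known are all illustrative choices rather than details from the abstract.

```python
def check_loss(u, tau):
    """Quantile (check) loss rho_tau(u) = u * (tau - 1{u < 0})."""
    return u * (tau - (1.0 if u < 0 else 0.0))


def ipw_objective(beta, xs, ys, observed, probs, tau):
    """IPW-weighted check loss for the model y = beta[0] + beta[1] * x.

    observed[i] is True when the covariate xs[i] was observed, and
    probs[i] is the (assumed known) probability of observing it.
    Complete cases are up-weighted by 1 / probs[i]; incomplete cases
    are skipped.
    """
    total = 0.0
    for x, y, r, p in zip(xs, ys, observed, probs):
        if r:
            residual = y - beta[0] - beta[1] * x
            total += check_loss(residual, tau) / p
    return total
```

Minimizing `ipw_objective` over `beta` (for example with a generic optimizer) would give an IPW quantile regression estimate under these assumptions. In practice the observation probabilities usually have to be estimated.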

2.
Tracking the correct directions of monotonicity in multi-dimensional modeling plays an important role in interpreting functional associations. In the presence of multiple predictors, we provide empirical evidence that the monotone directions observed via parametric, nonparametric or semiparametric fits of commonly used multi-dimensional models may entirely violate the actual directions of monotonicity. This breakdown is caused primarily by the dependence structure of the covariates, with negligible influence from the bias of function estimation. To examine the linkage between dependent covariates and monotone directions, we first generalize Stein’s Lemma for mutually independent Gaussian random variables to two important cases: dependent Gaussian, and independent non-Gaussian. We show that in both cases, there is an explicit one-to-one correspondence between the monotone directions of a multi-dimensional function and the signs of a deterministic surrogate vector. Moreover, we demonstrate that the second case can be extended to accommodate a class of dependent covariates. This generalization further enables us to develop a de-correlation transform for arbitrarily dependent covariates. The transformed covariates preserve modeling interpretability with little loss in modeling efficiency. The simplicity and effectiveness of the proposed method are illustrated via simulation studies and a real data application.
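For reference, the classical form of Stein's Lemma that this abstract takes as its starting point can be written as follows. This is the standard textbook statement for independent Gaussian covariates, not the paper's generalized result; the symbols $g$, $\mu_j$ and $\sigma_j^2$ are notation introduced here for illustration.

```latex
% Classical Stein's Lemma, independent Gaussian case.
% Assume X = (X_1, ..., X_d) with independent X_j ~ N(mu_j, sigma_j^2),
% and g differentiable with E|\partial g / \partial x_j (X)| finite.
\[
  \operatorname{Cov}\bigl(X_j,\, g(X)\bigr)
  = \mathbb{E}\bigl[(X_j - \mu_j)\, g(X)\bigr]
  = \sigma_j^2\, \mathbb{E}\!\left[\frac{\partial g}{\partial x_j}(X)\right],
  \qquad j = 1, \dots, d .
\]
```

When $g$ is monotone in $x_j$, the partial derivative has a constant sign, so the sign of $\operatorname{Cov}(X_j, g(X))$ recovers the monotone direction in this independent Gaussian setting.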

3.
Many statistical models, e.g. regression models, can be viewed as conditional moment restrictions when no distributional assumptions are made on the error term. For such models, several estimators that achieve the semiparametric efficiency bound have been proposed. However, in many studies, auxiliary information is available in the form of unconditional moment restrictions. We also allow for missing responses. We propose the combined empirical likelihood (CEL) estimator to incorporate such auxiliary information and improve the estimation efficiency of conditional moment restriction models. We show that, when the responses are assumed to be strongly ignorable missing at random, the CEL estimator achieves better efficiency than previous estimators because it uses the auxiliary information. Based on the asymptotic properties of the CEL estimator, we also develop Wilks’ type tests and corresponding confidence regions for the model parameter and the mean response. Since kernel smoothing is used, the CEL method may have difficulty with problems involving high-dimensional covariates. For such situations, we propose an instrumental variable-based empirical likelihood (IVEL) method. The merits of the CEL and IVEL methods are further illustrated through simulation studies.

4.
Empirical Likelihood of Quantile Difference with Missing Response When High-dimensional Covariates Are Present
Cui Juan KONG, Han Ying LIANG
In this paper, we investigate the two-sample quantile difference by the empirical likelihood method when the responses, with high-dimensional covariates, of the two populations are missing at random. In particular, based on a sufficient dimension reduction technique, we construct three empirical log-likelihood ratios for the quantile difference between the two samples by using inverse probability weighting imputation, regression imputation and augmented inverse probability weighting imputation, respectively, and prove their asymptotic distributions.

5.
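Consider two linear models with incomplete sample data, in which the covariates are fully observed and the responses are missing at random. Random regression imputation is used to fill in the missing responses, yielding "complete" sample data for the two linear regression models. Under certain conditions, the limiting distribution of the log empirical likelihood ratio statistic for the quantile difference between the two responses is shown to be a weighted chi-square distribution with one degree of freedom, and this result is used to construct empirical likelihood confidence intervals for the quantile difference. Simulation results show that the confidence intervals obtained under random imputation have high coverage accuracy.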

6.
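We study how to carry out parameter estimation and variable selection simultaneously in varying-coefficient partially linear models when the response is subject to missingness and some of the covariates are measured with error. We handle the missing data by imputation, and combine a corrected profile least-squares estimator with the SCAD penalty for parameter estimation and variable selection. The resulting estimators are shown to be asymptotically normal and to possess the oracle property. Numerical simulations are used to further examine their finite-sample properties.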

7.
In this paper we consider exact tests of a multiple logistic regression with categorical covariates via Markov bases. In many applications of multiple logistic regression, the sample size is positive for each combination of levels of the covariates. In this case we do not need a whole Markov basis, which guarantees connectivity of all fibers. We first give an explicit Markov basis for multiple Poisson regression. By the Lawrence lifting of this basis, in the case of bivariate logistic regression, we show a simple subset of the Markov basis which connects all fibers with a positive sample size for each combination of levels of covariates.

8.
We consider, in the presence of covariates, non-independent competing risks that are subject to right censoring. We define a nonparametric estimator of the incident regression function through the generalized product-limit estimator of the conditional censorship distribution function. Under suitable conditions, we establish the almost sure uniform convergence of those estimators with an appropriate rate.

9.
In this paper, we carry out an in-depth theoretical investigation of inference with missing response and covariate data for general regression models. We assume that the missing data are missing at random (MAR) or missing completely at random (MCAR) throughout. Previous theoretical investigations in the literature have focused only on missing covariates or missing responses, but not both. Here, we consider theoretical properties of the estimates under three different estimation settings: complete case (CC) analysis, a complete response (CR) analysis that involves an analysis of those subjects with only completely observed responses, and the all case (AC) analysis, which is an analysis based on all of the cases. Under each scenario, we derive general expressions for the likelihood and devise estimation schemes based on the EM algorithm. We carry out a theoretical investigation of the three estimation methods in the normal linear model and analytically characterize the loss of information for each method, as well as derive and compare the asymptotic variances for each method assuming the missing data are MAR or MCAR. In addition, a theoretical investigation of bias for the CC method is also carried out. A simulation study and a real dataset are used to illustrate the methodology.

10.
In this article we study a semiparametric generalized partially linear model when the covariates are missing at random. We propose combining local linear regression with the local quasilikelihood technique and a weighted estimating equation to estimate the parametric and nonparametric components when the missing probability is known or unknown. We establish asymptotic normality of the estimators of the parametric part and an asymptotic expansion for the estimators of the nonparametric part. We apply the proposed models and methods to a study of the relation between virologic and immunologic responses in AIDS clinical trials, in which the virologic response is classified into binary variables. We also give simulation results to illustrate our approach.

11.
Missing covariate data are very common in regression analysis. In this paper, the weighted estimating equation method (Qi et al., 2005) [25] is used to extend the so-called unified estimation procedure (Chen et al., 2002) [4] for linear transformation models to the case of missing covariates. The non-missingness probability is estimated nonparametrically by the kernel smoothing technique. Under missing at random, the proposed estimators are shown to be consistent and asymptotically normal, with the asymptotic variance estimated consistently by the usual plug-in method. Moreover, the proposed estimators are more efficient than the weighted estimators with the inverse of true non-missingness probability as weight. Finite sample performance of the estimators is examined via simulation and a real dataset is analyzed to illustrate the proposed methods.

12.
In the competing risks model, several failure times may potentially arise. Only the smallest failure time and its index are observed. Without specific assumptions, the joint or even the marginal distribution functions of the underlying failure times are not identifiable (A. Tsiatis, Proc. Natl. Acad. Sci. USA 72 (1975) 20). Nonetheless, if each individual is characterized by a “sufficiently informative” set of covariates, these distributions are identifiable under some regularity conditions (J.J. Heckman and B. Honoré, Biometrika 76 (1989) 325). In this paper, nonparametric kernel estimators of the joint distribution function of the failure times conditional on the covariates are proposed. Their weak and strong consistency is discussed.

13.
In this article, we propose and explore a multivariate logistic regression model for analyzing multiple binary outcomes with incomplete covariate data where auxiliary information is available. The auxiliary data are extraneous to the regression model of interest but predictive of the covariate with missing data. Horton and Laird [N.J. Horton, N.M. Laird, Maximum likelihood analysis of logistic regression models with incomplete covariate data and auxiliary information, Biometrics 57 (2001) 34–42] describe how the auxiliary information can be incorporated into a regression model for a single binary outcome with missing covariates, and hence the efficiency of the regression estimators can be improved. We consider extending the method of [9] to the case of a multivariate logistic regression model for multiple correlated outcomes, and with missing covariates and completely observed auxiliary information. We demonstrate that in the case of moderate to strong associations among the multiple outcomes, one can achieve considerable gains in efficiency from estimators in a multivariate model as compared to the marginal estimators of the same parameters.

14.
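We consider estimation in linear models when the covariates are measured with error and the responses are subject to missingness. For the regression coefficients, we propose two classes of least-squares-based estimators: one uses only the completely observed data, while the other uses a completed data set constructed by simple imputation. Both classes of estimators are shown to be asymptotically normal.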

15.
The purpose of this article is to review the findings of Professor Fujikoshi, which are primarily in multivariate analysis. He derived many asymptotic expansions for multivariate statistics, including MANOVA tests, dimensionality tests and latent roots under normality and nonnormality. He has made a large contribution to the study of the theoretical accuracy of asymptotic expansions by deriving explicit error bounds. He has also made a large contribution to the important problem of variable selection, by introducing “no additional information hypotheses” in some multivariate models and by applying model selection criteria. Recently he has been tackling high-dimensional statistical problems. He has also been involved in other topics in multivariate analysis, such as power comparison of a class of tests, monotone transformations with improved approximations, etc.

16.
The nonparametric estimator of the conditional survival function proposed by Beran is a useful tool for evaluating the effects of covariates in the presence of random right censoring. However, in many applications the censoring indicators of right-censored data may be missing for various reasons. We propose estimators of the conditional cumulative hazard and survival functions that can handle this situation. We also construct likelihood ratio confidence bands for them and obtain their asymptotic properties. Simulation studies are used to evaluate the performance of the estimators and their confidence bands.

17.
For the test of sphericity, Ledoit and Wolf [Ann. Statist. 30 (2002) 1081-1102] proposed a statistic which is robust against high dimensionality. In this paper, we consider a natural generalization of their statistic for the test that the smallest eigenvalues of a covariance matrix are equal. Some inequalities are obtained for sums of eigenvalues and sums of squared eigenvalues. These bounds permit us to obtain the asymptotic null distribution of our statistic, as the dimensionality and sample size go to infinity together, by using distributional results obtained by Ledoit and Wolf [Ann. Statist. 30 (2002) 1081-1102]. Some empirical results comparing our test with the likelihood ratio test are also given.
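To make the notion of a sphericity statistic concrete, here is a minimal sketch of the form U = (1/p) tr[(S / ((1/p) tr S) - I)^2], which is commonly attributed to John and is, to the best of our understanding, the statistic analysed by Ledoit and Wolf. It is not the generalized statistic proposed in this abstract. The function name and the list-of-lists matrix representation are illustrative assumptions.

```python
def sphericity_statistic(S):
    """Return U = (1/p) * tr[(S / mean_eig - I)^2] for a symmetric p x p matrix S.

    mean_eig = tr(S) / p is the average eigenvalue. Because
    A = S / mean_eig - I is symmetric, tr(A^2) equals the sum of the
    squared entries of A, so no eigen-decomposition is needed.
    U is zero exactly when S is proportional to the identity.
    """
    p = len(S)
    mean_eig = sum(S[i][i] for i in range(p)) / p
    total = 0.0
    for i in range(p):
        for j in range(p):
            a = S[i][j] / mean_eig - (1.0 if i == j else 0.0)
            total += a * a
    return total / p
```

Here `S` would typically be a sample covariance matrix. Larger values of U indicate stronger departure from sphericity.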

18.
Prediction of Euclidean distances with discrete and continuous outcomes
The objective of this paper is first to predict generalized Euclidean distances in the context of discrete and quantitative variables and then to derive their statistical properties. We first consider the simultaneous modelling of discrete and continuous random variables with covariates and obtain the likelihood. We derive an important property useful for its practical maximization. We then study the prediction of any Euclidean distance and its statistical properties, especially for the Mahalanobis distance. The quality of distance estimation is analyzed through simulations. These results are applied to our motivating example: the official distinction procedure of rapeseed varieties.
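As a reminder of the quantity singled out in this abstract, the sketch below computes the standard Mahalanobis distance d(x, m) = sqrt((x - m)^T C^{-1} (x - m)) for the two-dimensional case. It illustrates the definition only and is not the paper's prediction procedure. The function name and the restriction to a 2 x 2 covariance matrix are assumptions made to keep the example self-contained.

```python
import math


def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance between a 2-D point x and mean under covariance cov.

    cov is a 2 x 2 matrix given as ((a, b), (c, d)); its inverse is
    (1 / det) * ((d, -b), (-c, a)) with det = a * d - b * c.
    """
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det <= 0:
        raise ValueError("covariance matrix must be positive definite")
    dx = x[0] - mean[0]
    dy = x[1] - mean[1]
    # Quadratic form (dx, dy) * cov^{-1} * (dx, dy)^T written out explicitly.
    q = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.sqrt(q)
```

With an identity covariance matrix the result reduces to the ordinary Euclidean distance, which is one way to sanity-check an implementation.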

19.
New imputation methods for missing data using quantiles
The problem of missing values commonly arises in data sets, and imputation is usually employed to compensate for non-response. We propose a novel imputation method based on quantiles, which can be implemented with or without the presence of auxiliary information. The proposed method is extended to unequal sampling designs and non-uniform response mechanisms. Iterative algorithms to compute the proposed imputation methods are presented. Monte Carlo simulations are conducted to assess the performance of the proposed imputation methods with respect to alternative imputation methods. Simulation results indicate that the proposed methods perform competitively in terms of relative bias and relative root mean square error.

20.
Multivariate failure time data often arise in biomedical studies due to natural or artificial clustering. With appropriate adjustment for the underlying correlation, the marginal additive hazards model characterizes the hazard difference via a linear link function between the hazard and covariates. We propose a class of graphical and numerical methods to assess the overall fitting adequacy of the marginal additive hazards model. The test statistics are based on the supremum of the stochastic processes derived from the cumulative sum of the martingale-based residuals over time and/or covariates. The distribution of the stochastic process can be approximated through a simulation technique. The proposed tests examine how unusual the observed stochastic process is, compared to a large number of realizations from the approximated process. This class of tests is very general and suitable for various purposes of model fitting evaluation. Simulation studies are conducted to examine the finite sample performance, and the model-checking methods are illustrated with data from an otitis media study.
