1.

On sparse estimation for semiparametric linear transformation models

Hao Helen Zhang Wenbin Lu 《Journal of multivariate analysis》2010,101(7):1594-1606

Semiparametric linear transformation models have received much attention due to their high flexibility in modeling survival data. A useful estimating equation procedure was recently proposed by Chen et al. (2002) [21] for linear transformation models to jointly estimate parametric and nonparametric terms. They showed that this procedure can yield a consistent and robust estimator. However, the problem of variable selection for linear transformation models has been less studied, partially because a convenient loss function is not readily available in this context. In this paper, we propose a simple yet powerful approach to achieve both sparse and consistent estimation for linear transformation models. The main idea is to derive a profiled score from the estimating equation of Chen et al. [21], construct a loss function based on the profiled score and its variance, and then minimize the loss subject to some shrinkage penalty. Under regularity conditions, we show that the resulting estimator is consistent for both model estimation and variable selection. Furthermore, the estimated parametric terms are asymptotically normal and can achieve higher efficiency than that yielded by the estimating equations. For computation, we suggest a one-step approximation algorithm that can take advantage of LARS and build the entire solution path efficiently. Performance of the new procedure is illustrated through numerous simulations and real examples, including one microarray data set.

2.

Semiparametric inference for transformation models via empirical likelihood

Yichuan Zhao 《Journal of multivariate analysis》2010,101(8):1846-1858

Recent advances in the transformation model have made it possible to use this model for analyzing a variety of censored survival data. For inference on the regression parameters, there are semiparametric procedures based on the normal approximation. However, the accuracy of such procedures can be quite low when censoring is heavy. In this paper, we apply an empirical likelihood ratio method and derive its limiting distribution via U-statistics. We obtain confidence regions for the regression parameters and compare the proposed method with the normal approximation based method in terms of coverage probability. The simulation results demonstrate that the proposed empirical likelihood method overcomes the under-coverage problem substantially and outperforms the normal approximation based method. The proposed method is illustrated with a real data example. Finally, our method can be applied to general U-statistic type estimating equations.

3.

Semiparametric estimation based on parametric modeling of the cause-specific hazard ratios in competing risks

A. Suzukawa N. Taneichi 《Journal of multivariate analysis》2003,87(1):80-100

This paper investigates the estimation of cause-specific cumulative hazard and cumulative incidence functions in a competing risks model. The proportional model, in which ratios of the cause-specific hazards to the overall hazard are assumed to be constant (independent of time), is a well-known semiparametric model. We are here concerned with relaxation of the proportionality assumption. The set C of all causes is decomposed into two disjoint subsets of causes as C=C₁∪C₂. The relative risk of cause A in the sub-causes C₁ can be represented as a function defined by the ratio of the cause-specific hazard of cause A to the sum of cause-specific hazards in the sub-causes C₁. We call this function the risk pattern function of cause A in C₁, and consider a semiparametric model in which risk pattern functions in C₁ are not constant (independent of time) but whose functional forms, except for finite-dimensional parameters, are known. Based on this model, semiparametric estimators are obtained, and their estimated variances are derived by the delta method. We investigate asymptotic properties of the semiparametric estimators and compare them with the nonparametric estimators. The semiparametric procedure is illustrated with the radiation-exposed mice data set, which represents lifetimes and causes of death of mice exposed to radiation in two different environments.

4.

Generalized partially linear models with missing covariates

Hua Liang 《Journal of multivariate analysis》2008,99(5):880-895

In this article we study a semiparametric generalized partially linear model when the covariates are missing at random. We propose combining local linear regression with the local quasilikelihood technique and weighted estimating equations to estimate the parametric and nonparametric components when the missing probability is known or unknown. We establish normality of the estimators of the parameter and an asymptotic expansion for the estimators of the nonparametric part. We apply the proposed models and methods to a study of the relation between virologic and immunologic responses in AIDS clinical trials, in which virologic response is classified into binary variables. We also give simulation results to illustrate our approach.

5.

Estimation and confidence bands of a conditional survival function with censoring indicators missing at random

Qihua Wang Junshan Shen 《Journal of multivariate analysis》2008,99(5):928-948

The nonparametric estimator of the conditional survival function proposed by Beran is a useful tool to evaluate the effects of covariates in the presence of random right censoring. However, in many applications the censoring indicators of right-censored data may be missing for various reasons. We propose estimators of the conditional cumulative hazard and survival functions that can handle this situation. We also construct likelihood ratio confidence bands for them and obtain their asymptotic properties. Simulation studies are used to evaluate the performance of the estimators and their confidence bands.

6.

Multiple imputations and the missing censoring indicator model

Sundarraman Subramanian 《Journal of multivariate analysis》2011,102(1):105-117

Semiparametric random censorship (SRC) models (Dikta, 1998) provide an attractive framework for estimating survival functions when censoring indicators are fully or partially available. When there are missing censoring indicators (MCIs), the SRC approach employs a model-based estimate of the conditional expectation of the censoring indicator given the observed time, where the model parameters are estimated using only the complete cases. The multiple imputations approach, on the other hand, utilizes this model-based estimate to impute the missing censoring indicators and form several completed data sets. The Kaplan-Meier and SRC estimators based on the several completed data sets are averaged to arrive at the multiple imputations Kaplan-Meier (MIKM) and the multiple imputations SRC (MISRC) estimators. While the MIKM estimator is asymptotically as efficient as or less efficient than the standard SRC-based estimator that involves no imputations, here we investigate the performance of the MISRC estimator and prove that it attains the benchmark variance set by the SRC-based estimator. We also present numerical results comparing the performance of the estimators under several misspecified models for the above-mentioned conditional expectation.
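
For readers unfamiliar with the Kaplan-Meier estimator that this abstract builds on, the following is a minimal sketch of the standard product-limit estimator for the case where every censoring indicator is observed. It is not the MIKM or MISRC estimator of the paper; the function name `kaplan_meier` and its argument layout are assumptions made here for illustration only.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Product-limit estimate of the survival function.

    times  : observed times (failure or censoring, whichever came first)
    events : 1 if the time is an observed failure, 0 if right-censored
    Returns a list of (t, S(t)) pairs at each distinct failure time.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    deaths = Counter(t for t, d in zip(times, events) if d == 1)
    exits = Counter(times)  # every subject leaves the risk set at its own time
    at_risk = len(times)
    surv = 1.0
    curve = []
    for t in sorted(exits):
        if deaths[t] > 0:
            # Multiply by the conditional probability of surviving past t.
            surv *= 1.0 - deaths[t] / at_risk
            curve.append((t, surv))
        at_risk -= exits[t]
    return curve
```

With times `[1, 2, 3, 4, 5]` and indicators `[1, 0, 1, 1, 0]`, the curve drops only at the failure times 1, 3 and 4, and the first drop is to 0.8 because one of five subjects at risk fails. A multiple imputation scheme of the kind described in the abstract would apply such an estimator to each completed data set and average the results.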

7.

Nonparametric estimation of competing risks models with covariates

Jean-David Fermanian 《Journal of multivariate analysis》2003,85(1):156-191

In a competing risks model, several failure times potentially arise, but only the smallest failure time and its index are observed. Without specific assumptions, the joint or even the marginal distribution functions of the underlying failure times are not identifiable (A. Tsiatis, Proc. Natl. Acad. Sci. USA 72 (1975) 20). Nonetheless, if each individual is characterized by a “sufficiently informative” set of covariates, these distributions are identifiable under some regularity conditions (J.J. Heckman and B. Honoré, Biometrika 76 (1989) 325). In this paper, nonparametric kernel estimators of the joint distribution function of failure times conditional on the covariates are proposed. Their weak and strong consistency are discussed.

8.

A class of weighted estimating equations for additive hazards models with covariates missing at random

Jin Jin Peng Ye Liuquan Sun 《中国科学数学(英文版)》2022,(3)

Missing covariate data arise frequently in biomedical studies. In this article, we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at random. Time-specific and subject-specific weights are incorporated into the formulation of weighted estimating equations. Unified results are established for estimating selection probabilities that cover both parametric and non-parametric modelling schemes. The resulting estimators have closed forms and are shown to be consistent and asymptotically normal. Simulation studies indicate that the proposed estimators perform well for practical settings. An application to a mouse leukemia study is illustrated.

9.

A class of weighted estimating equations for additive hazard models with covariates missing at random

Jin Jin Ye Peng Sun Liuquan 《中国科学数学(英文版)》2022,65(3):583-602

Missing covariate data arise frequently in biomedical studies. In this article, we propose a class of weighted estimating equations for the additive hazard regression model when some of the covariates are missing at random. Time-specific and subject-specific weights are incorporated into the formulation of weighted estimating equations. Unified results are established for estimating selection probabilities that cover both parametric and non-parametric modeling schemes. The resulting estimators have closed forms and are shown to be consistent and asymptotically normal. Simulation studies indicate that the proposed estimators perform well for practical settings. An application to a mouse leukemia study is illustrated.

10.

Multivariate Failure Times Regression with a Continuous Auxiliary Covariate

Liu Y Wu Y Zhou H 《Journal of multivariate analysis》2010,101(3):679-691

How to take advantage of the available auxiliary covariate information when the primary covariate of interest is not measured is a frequently encountered question in biomedical studies. In this paper, we consider multivariate failure time regression analysis in which the primary covariate is assessed only in a validation set, but a continuous auxiliary covariate for it is available for all subjects in the study cohort. Within the framework of the marginal hazard model, we propose to estimate the induced relative risk function in the non-validation set through a kernel smoothing method and then obtain an estimated pseudo-partial likelihood function. The proposed estimator, which maximizes the estimated pseudo-partial likelihood, is shown to be consistent and asymptotically normal. We also give an estimator of the marginal cumulative baseline hazard function. Simulation studies are conducted to evaluate the finite sample performance of our proposed estimator. The proposed method is illustrated by analyzing a heart disease data set from the Study of Left Ventricular Dysfunction (SOLVD).

11.

Approximate martingale estimating functions for stochastic differential equations with small noises

Masayuki Uchida 《Stochastic Processes and their Applications》2008

An approximate martingale estimating function with an eigenfunction is proposed for an estimation problem about an unknown drift parameter for a one-dimensional diffusion process with small perturbed parameter ε from discrete time observations at n regularly spaced time points k/n, k = 0, 1, …, n. We show asymptotic efficiency of an M-estimator derived from the approximate martingale estimating function as ε → 0 and n → ∞ simultaneously.

12.

Statistical inference using a weighted difference-based series approach for partially linear regression models

Chunrong Ai Jinhong You Yong Zhou 《Journal of multivariate analysis》2011,102(3):601-618

Partially linear regression models with fixed effects are useful tools for econometric analysis and for normalizing microarray data. Baltagi and Li (2002) [7] proposed a computationally friendly difference-based series estimation (DSE) for them. We show that the DSE is not asymptotically efficient in most cases and further propose a weighted difference-based series estimation (WDSE). Its weights do not involve any unknown parameters. The asymptotic properties of the resulting estimators are established for both balanced and unbalanced cases, and it is shown that they achieve the semiparametric efficiency bound. Additionally, we propose a variable selection procedure for identifying significant covariates in the parametric part of the semiparametric fixed-effects regression model. The method is based on a combination of the nonconcave penalization (Fan and Li, 2001 [13]) and weighted difference-based series estimation techniques. The resulting estimators have the oracle property; that is, they can correctly identify the true model as if the true model (the subset of variables with nonvanishing coefficients) were known in advance. Simulation studies are conducted and an application is given to demonstrate the finite sample performance of the proposed procedures.

13.

Empirical likelihood for semiparametric regression model with missing response data

Liugen Xue Dong Xue 《Journal of multivariate analysis》2011,102(4):723-740

A bias-corrected technique for constructing the empirical likelihood ratio is used to study a semiparametric regression model with missing response data. We are interested in inference for the regression coefficients, the baseline function and the response mean. A class of empirical likelihood ratio functions for the parameters of interest is defined so that undersmoothing for estimating the baseline function is avoided. The existing data-driven algorithm is also valid for selecting an optimal bandwidth. Our approach is to directly calibrate the empirical log-likelihood ratio so that the resulting ratio is asymptotically chi-squared. Also, a class of estimators for the parameters of interest is constructed, their asymptotic distributions are obtained, and consistent estimators of asymptotic bias and variance are provided. Our results can be used to construct confidence intervals and bands for the parameters of interest. A simulation study is undertaken to compare the empirical likelihood with the normal approximation-based method in terms of coverage accuracies and average lengths of confidence intervals. An AIDS clinical trial data set is used to illustrate our methods.

14.

Probability density estimation for survival data with censoring indicators missing at random

Qihua Wang Wei Liu 《Journal of multivariate analysis》2009,100(5):835-850

In this paper, nonparametric approaches to density function estimation are developed for the case where censoring indicators are missing at random. A conditional mean score based estimator and a mean score estimator are suggested. The two estimators are proved to be asymptotically normal and uniformly strongly consistent. The bandwidth selection problem is also discussed. A simulation study is conducted to compare the finite-sample behavior of the proposed estimators.

15.

Statistical inference of minimum BD estimators and classifiers for varying-dimensional models

Chunming Zhang 《Journal of multivariate analysis》2010,101(7):1574-1593

Stochastic modeling for large-scale datasets usually involves a varying-dimensional model space. This paper investigates the asymptotic properties, when the number of parameters grows with the available sample size, of the minimum-BD estimators and classifiers under a broad and important class of Bregman divergence (BD), which encompasses nearly all of the commonly used loss functions in the regression analysis, classification procedures and machine learning literature. Unlike the maximum likelihood estimators, which require the joint likelihood of observations, the minimum-BD estimators are useful for a range of models where the joint likelihood is unavailable or incomplete. Statistical inference tools developed for the class of large dimensional minimum-BD estimators and related classifiers are evaluated via simulation studies, and are illustrated by analysis of a real dataset.

16.

Hazard rate estimation on random fields

Jiexiang Li 《Journal of multivariate analysis》2007,98(7):1337-1355

Consider observations (representing lifelengths) taken on a random field indexed by lattice points. Our purpose is to estimate the hazard rate r(x), which is the rate of failure at time x for the survivors up to time x. We estimate r(x) by the nonparametric estimator constructed in terms of a kernel-type estimator for f(x) and the natural estimator for the survival function. Under some general mixing assumptions, the limiting distribution of the estimator at multiple points is shown to be multivariate normal. The result is useful in establishing confidence bands for r(x) with x in an interval.
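
As a rough illustration of the ratio form of a hazard rate estimate, the sketch below divides a kernel density estimate by an empirical survival fraction. It treats the data as a plain one-dimensional sample and ignores the random-field indexing and mixing dependence studied in the paper. The Gaussian kernel, the bandwidth `h`, and all function names are assumptions introduced here, not details from the abstract.

```python
import math


def kernel_density(x, data, h):
    """Gaussian kernel density estimate at x with bandwidth h (assumed kernel)."""
    norm = len(data) * h * math.sqrt(2.0 * math.pi)
    return sum(math.exp(-0.5 * ((x - xi) / h) ** 2) for xi in data) / norm


def empirical_survival(x, data):
    """Fraction of observations strictly greater than x."""
    return sum(1 for xi in data if xi > x) / len(data)


def hazard_rate(x, data, h):
    """Plug-in hazard estimate: density estimate over empirical survival."""
    surv = empirical_survival(x, data)
    if surv == 0.0:
        raise ValueError("no observations survive past x")
    return kernel_density(x, data, h) / surv
```

The estimate is undefined beyond the largest observation, which is why the sketch raises an error there rather than returning a value.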

17.

Theory and inference for regression models with missing responses and covariates

Qingxia Chen Ming-Hui Chen 《Journal of multivariate analysis》2008,99(6):1302-1331

In this paper, we carry out an in-depth theoretical investigation for inference with missing response and covariate data for general regression models. We assume that the missing data are missing at random (MAR) or missing completely at random (MCAR) throughout. Previous theoretical investigations in the literature have focused only on missing covariates or missing responses, but not both. Here, we consider theoretical properties of the estimates under three different estimation settings: complete case (CC) analysis, a complete response (CR) analysis that involves an analysis of those subjects with only completely observed responses, and the all case (AC) analysis, which is an analysis based on all of the cases. Under each scenario, we derive general expressions for the likelihood and devise estimation schemes based on the EM algorithm. We carry out a theoretical investigation of the three estimation methods in the normal linear model and analytically characterize the loss of information for each method, as well as derive and compare the asymptotic variances for each method assuming the missing data are MAR or MCAR. In addition, a theoretical investigation of bias for the CC method is also carried out. A simulation study and real dataset are given to illustrate the methodology.

18.

Estimation in partially linear models with missing responses at random

Qihua Wang Zhihua Sun 《Journal of multivariate analysis》2007,98(7):1470-1493

A partially linear model is considered when the responses are missing at random. Imputation, semiparametric regression surrogate and inverse marginal probability weighted approaches are developed to estimate the regression coefficients and the nonparametric function, respectively. All the proposed estimators for the regression coefficients are shown to be asymptotically normal, and the estimators for the nonparametric function are proved to converge at an optimal rate. A simulation study is conducted to compare the finite sample behavior of the proposed estimators.

19.

A weighted multivariate signed-rank test for cluster-correlated data

Riina Haataja Denis Larocque Hannu Oja 《Journal of multivariate analysis》2009,100(6):1107-1124

A weighted multivariate signed-rank test is introduced for an analysis of multivariate clustered data. Observations in different clusters may then get different weights. The test provides a robust and efficient alternative to normal theory based methods. Asymptotic theory is developed to find the approximate p-value as well as to calculate the limiting Pitman efficiency of the test. A conditionally distribution-free version of the test is also discussed. The finite-sample behavior of different versions of the test statistic is explored by simulations and the new test is compared to the unweighted and weighted versions of Hotelling’s T² test and the multivariate spatial sign test introduced in [D. Larocque, J. Nevalainen, H. Oja, A weighted multivariate sign test for cluster-correlated data, Biometrika 94 (2007) 267-283]. Finally, a real data example is used to illustrate the theory.

20.

Confidence intervals for marginal parameters under fractional linear regression imputation for missing data

Yongsong Qin Qunshu Ren 《Journal of multivariate analysis》2008,99(6):1232-1259

Item nonresponse occurs frequently in sample surveys and other applications. Imputation is commonly used to fill in the missing item values in a random sample {Y_i; i=1,…,n}. Fractional linear regression imputation, based on a model with independent zero-mean errors ε_i, is used to create one or more imputed values in the data file for each missing item Y_i, where {X_i; i=1,…,n} is observed completely. Asymptotic normality of the imputed estimators of the mean μ=E(Y), the distribution function θ=F(y) for a given y, and the qth quantile θ_q=F⁻¹(q), 0<q<1, is established, assuming that Y is missing at random (MAR) given X. This result is used to obtain normal approximation (NA)-based confidence intervals on μ, θ and θ_q. In the case of θ_q, a Bahadur-type representation and Woodruff-type confidence intervals are also obtained. Empirical likelihood (EL) ratios are also obtained and shown to be asymptotically scaled chi-squared variables. This result is used to obtain asymptotically correct EL-based confidence intervals on μ, θ and θ_q. Results of a simulation study on the finite sample performance of NA-based and EL-based confidence intervals are reported.
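
To make the idea of regression imputation concrete, here is a minimal sketch of the simpler deterministic variant: fit a least-squares line on the respondents, fill each missing Y_i with its fitted value, and average the completed sample to estimate μ. This is not the fractional scheme of the paper, which creates one or more imputed values per missing item. The scalar covariate, the use of `None` to mark missing responses, and the function name are all assumptions made for illustration.

```python
def regression_impute_mean(xs, ys):
    """Estimate E(Y) after filling missing ys (None) with OLS fitted values.

    xs : fully observed scalar covariates
    ys : responses, with None marking a missing item
    (Hypothetical helper; deterministic imputation, not fractional.)
    """
    observed = [(x, y) for x, y in zip(xs, ys) if y is not None]
    n_obs = len(observed)
    x_bar = sum(x for x, _ in observed) / n_obs
    y_bar = sum(y for _, y in observed) / n_obs
    # Least-squares slope and intercept from the complete cases only.
    sxx = sum((x - x_bar) ** 2 for x, _ in observed)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in observed)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    completed = [y if y is not None else intercept + slope * x
                 for x, y in zip(xs, ys)]
    return sum(completed) / len(completed)
```

Using only the complete cases to fit the line is reasonable under the MAR-given-X assumption stated in the abstract; the sketch says nothing about the variance of the resulting estimator, which is the subject of the paper.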