期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Bayesian binary regression involving two explanatory variables 总被引：1，自引：0，他引：1

Yosiyuki Sakamoto Makio Ishiguro 《Annals of the Institute of Statistical Mathematics》1985,37(1):369-387

Summary The purpose of the present paper is to propose a practical Bayesian procedure for the estimation of binary response probability where the explanatory variable is bivariate. The procedure is an extension of the procedure for univariate case which was proposed by the present authors [2] and is based on a model which approximates the logistic transformation of response probability by a quadratic orthogonal spline function on the two-dimensional space of explanatory variable. The flexibility of the model is guaranteed by assuming a spline function on sufficiently fine mesh. To obtain stable estimates we introduce a prior distribution of the parameters of the model. The prior distribution has several parameters (hyper-parameters) which are chosen to minimize an Bayesian information criterion ABIC. The procedure is applicable cable to cases where each explanatory variable takes continuous values provided that the probability of the occurrence changes smoothly. The practical utility of the procedure is demonstrated by examples of applications to five sets of data. The Institute of Statistical Mathematics 相似文献

2.

Bayesian lasso binary quantile regression

Dries F. Benoit Rahim Alhamzawi Keming Yu 《Computational Statistics》2013,28(6):2861-2873

In this paper, a Bayesian hierarchical model for variable selection and estimation in the context of binary quantile regression is proposed. Existing approaches to variable selection in a binary classification context are sensitive to outliers, heteroskedasticity or other anomalies of the latent response. The method proposed in this study overcomes these problems in an attractive and straightforward way. A Laplace likelihood and Laplace priors for the regression parameters are proposed and estimated with Bayesian Markov Chain Monte Carlo. The resulting model is equivalent to the frequentist lasso procedure. A conceptional result is that by doing so, the binary regression model is moved from a Gaussian to a full Laplacian framework without sacrificing much computational efficiency. In addition, an efficient Gibbs sampler to estimate the model parameters is proposed that is superior to the Metropolis algorithm that is used in previous studies on Bayesian binary quantile regression. Both the simulation studies and the real data analysis indicate that the proposed method performs well in comparison to the other methods. Moreover, as the base model is binary quantile regression, a much more detailed insight in the effects of the covariates is provided by the approach. An implementation of the lasso procedure for binary quantile regression models is available in the R-package bayesQR. 相似文献

3.

分位数变系数模型基于核光滑的变量选择

下载免费PDF全文

赵为华张日权刘吉彩《应用概率统计》2014,30(5):537-560

分位数变系数模型是一种稳健的非参数建模方法.使用变系数模型分析数据时,一个自然的问题是如何同时选择重要变量和从重要变量中识别常数效应变量.本文基于分位数方法研究具有稳健和有效性的估计和变量选择程序.利用局部光滑和自适应组变量选择方法,并对分位数损失函数施加双惩罚,我们获得了惩罚估计.通过BIC准则合适地选择调节参数,提出的变量选择方法具有oracle理论性质,并通过模拟研究和脂肪实例数据分析来说明新方法的有用性.数值结果表明,在不需要知道关于变量和误差分布的任何信息前提下,本文提出的方法能够识别不重要变量同时能区分出常数效应变量. 相似文献

4.

A bayesian approach to the probability density estimation

Makio Ishiguro Yosiyuki Sakamoto 《Annals of the Institute of Statistical Mathematics》1984,36(1):523-538

Summary A Bayesian procedure for the probability density estimation is proposed. The procedure is based on the multinomial logit transformations of the parameters of a finely segmented histogram model. The smoothness of the estimated density is guaranteed by the introduction of a prior distribution of the parameters. The estimates of the parameters are defined as the mode of the posterior distribution. The prior distribution has several adjustable parameters (hyper-parameters), whose values are chosen so that ABIC (Akaike's Bayesian Information Criterion) is minimized. The basic procedure is developed under the assumption that the density is defined on a bounded interval. The handling of the general case where the support of the density function is not necessarily bounded is also discussed. The practical usefulness of the procedure is demonstrated by numerical examples. The Institute of Statistical Mathematics 相似文献

5.

Nonparametric estimation of a conditional distribution from length-biased data

Jacobo de Uña-Álvarez M. Carmen Iglesias-Pérez 《Annals of the Institute of Statistical Mathematics》2010,62(2):323-341

In this paper we consider the problem of estimating a conditional distribution function in a nonparametric way, when the response variable is nonnegative, and the observational procedure is length-biased. We propose a proper adaptation of the estimate to right-censoring provoked by limitation in following-up. Large sample analysis of the introduced estimator is given, including rates of convergence, limiting distribution, and efficiency results. We show that the length-bias model results in less variance in estimation, when compared to methods based on observed truncation times. Practical performance of the proposed estimator is explored through simulations. Application to unemployment data analysis is provided. 相似文献

6.

Efficient estimation of a varying-coefficient partially linear binary regression model

Tao Hu Heng Jian Cui 《数学学报(英文版)》2010,26(11):2179-2190

This article considers a semiparametric varying-coefficient partially linear binary regression model. The semiparametric varying-coefficient partially linear regression binary model which is a generalization of binary regression model and varying-coefficient regression model that allows one to explore the possibly nonlinear effect of a certain covariate on the response variable. A Sieve maximum likelihood estimation method is proposed and the asymptotic properties of the proposed estimators are discussed. One of our main objects is to estimate nonparametric component and the unknowen parameters simultaneously. It is easier to compute, and the required computation burden is much less than that of the existing two-stage estimation method. Under some mild conditions, the estimators are shown to be strongly consistent. The convergence rate of the estimator for the unknown smooth function is obtained, and the estimator for the unknown parameter is shown to be asymptotically efficient and normally distributed. Simulation studies are carried out to investigate the performance of the proposed method. 相似文献

7.

Reliability analysis for Weibull distribution with homogeneous heavily censored data based on Bayesian and least-squares methods

《Applied Mathematical Modelling》2020

The reliability for Weibull distribution with homogeneous heavily censored data is analyzed in this study. The universal model of heavily censored data and existing methods, including maximum likelihood, least-squares, E-Bayesian estimation, and hierarchical Bayesian methods, are introduced. An improved method is proposed based on Bayesian inference and least-squares method. In this method, the Bayes estimations of failure probabilities are focused on for all the samples. The conjugate prior distribution of failure probability is set, and an optimization model is developed by maximizing the information entropy of prior distribution to determine the hyper-parameters. By integrating the likelihood function, the posterior distribution of failure probability is then derived to yield the Bayes estimation of failure probability. The estimations of reliability parameters are obtained by fitting distribution curve using least-squares method. The four existing methods are compared with the proposed method in terms of applicability, precision, efficiency, robustness, and simplicity. Specifically, the closed form expressions concerning E-Bayesian estimation and hierarchical Bayesian methods are derived and used. The comparisons demonstrate that the improved method is superior. Finally, three illustrative examples are presented to show the application of the proposed method. 相似文献

8.

A quasi Bayesian approach to outlier detection

Genshiro Kitagawa Hirotugu Akaike 《Annals of the Institute of Statistical Mathematics》1982,34(1):389-398

Summary A quasi Bayesian procedure is developed for the detection of outliers. A particular Gaussian distribution with ordered means is assumed as the basic model of the data distribution. By introducing a definition of the likelihood of a model whose parameters are determined by the method of maximum likelihood, the posterior probability of the model is obtained for a particular choice of the prior probability distribution. Numerical examples are given to illustrate the practical utility of the procedure. The Institute of Statistical Mathematics 相似文献

9.

Bayesian Variable Selection for Median Regression

HU Danqing GU Yongquan ZHAO Weihua 《应用概率统计》2019,35(6):594-610

When the data has heavy tail feature or contains outliers, conventional variable selection methods based on penalized least squares or likelihood functions perform poorly. Based on Bayesian inference method, we study the Bayesian variable selection problem for median linear models. The Bayesian estimation method is proposed by using Bayesian model selection theory and Bayesian estimation method through selecting the Spike and Slab prior for regression coefficients, and the effective posterior Gibbs sampling procedure is also given. Extensive numerical simulations and Boston house price data analysis are used to illustrate the effectiveness of the proposed method. 相似文献

10.

��λ��ع�ı�Ҷ˹��ѡ�񷽷�

�� Ȫ ��Ϊ�� 《应用概率统计》2006,35(6):594-610

??When the data has heavy tail feature or contains outliers, conventional variable selection methods based on penalized least squares or likelihood functions perform poorly. Based on Bayesian inference method, we study the Bayesian variable selection problem for median linear models. The Bayesian estimation method is proposed by using Bayesian model selection theory and Bayesian estimation method through selecting the Spike and Slab prior for regression coefficients, and the effective posterior Gibbs sampling procedure is also given. Extensive numerical simulations and Boston house price data analysis are used to illustrate the effectiveness of the proposed method. 相似文献

11.

Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis

Aijun Yang Xuejun Jiang Lianjie Shu Jinguan Lin 《Computational Statistics》2017,32(1):127-143

The main challenge in working with gene expression microarrays is that the sample size is small compared to the large number of variables (genes). In many studies, the main focus is on finding a small subset of the genes, which are the most important ones for differentiating between different types of cancer, for simpler and cheaper diagnostic arrays. In this paper, a sparse Bayesian variable selection method in probit model is proposed for gene selection and classification. We assign a sparse prior for regression parameters and perform variable selection by indexing the covariates of the model with a binary vector. The correlation prior for the binary vector assigned in this paper is able to distinguish models with the same size. The performance of the proposed method is demonstrated with one simulated data and two well known real data sets, and the results show that our method is comparable with other existing methods in variable selection and classification. 相似文献

12.

A Bayesian approach for quantile and response probability estimation with applications to reliability

Moshe Shaked Nozer D. Singpurwalla 《Annals of the Institute of Statistical Mathematics》1990,42(1):1-19

In this paper we propose a Bayesian approach for the estimation of a potency curve which is assumed to be nondecreasing and concave or convex. This is done by assigning the Dirichlet as a prior distribution for transformations of some unknown parameters. We motivate our choice of the prior and investigate several aspects of the problem, including the numerical implementation of the suggested scheme. An approach for estimating the quantiles is also given. By casting the problem in a more general context, we argue that distributions which are IHR or IHRA can also be estimated via the suggested procedure. A problem from a government laboratory serves as an example to illustrate the use of our procedure in a realistic scenario. 相似文献

13.

Semiparametric Modeling and Estimation of Instrumental Variable Models

《Journal of computational and graphical statistics》2013,22(1):86-114

We apply Bayesian methods to a model involving a binary nonrandom treatment intake variable and an instrumental variable in which the functional forms of some of the covariates in both the treatment intake and outcome distributions are unknown. Continuous and binary response variables are considered. Under the assumption that the functional form is additive in the covariates, we develop efficient Markov chain Monte Carlo-based approaches for summarizing the posterior distribution and for comparing various alternative models via marginal likelihoods and Bayes factors. We show in a simulation experiment that the methods are capable of recovering the unknown functions and are sensitive neither to the sample size nor to the degree of confounding as measured by the correlation between the errors in the treatment and response equations. In the binary response case, however, estimation of the average treatment effect requires larger sample sizes, especially when the degree of confounding is high. The methods are applied to an example dealing with the effect on wages of more than 12 years of education. 相似文献

14.

A fusion of least squares and empirical likelihood for regression models with a missing binary covariate

XiaoGang Duan Zhi Wang 《中国科学数学(英文版)》2016,59(10):2027-2036

Multiply robust inference has attracted much attention recently in the context of missing response data. An estimation procedure is multiply robust, if it can incorporate information from multiple candidate models, and meanwhile the resulting estimator is consistent as long as one of the candidate models is correctly specified. This property is appealing, since it provides the user a flexible modeling strategy with better protection against model misspecification. We explore this attractive property for the regression models with a binary covariate that is missing at random. We start from a reformulation of the celebrated augmented inverse probability weighted estimating equation, and based on this reformulation, we propose a novel combination of the least squares and empirical likelihood to separately handling each of the two types of multiple candidate models, one for the missing variable regression and the other for the missingness mechanism. Due to the separation, all the working models are fused concisely and effectively. The asymptotic normality of our estimator is established through the theory of estimating function with plugged-in nuisance parameter estimates. The finite-sample performance of our procedure is illustrated both through the simulation studies and the analysis of a dementia data collected by the national Alzheimer’s coordinating center. 相似文献

15.

A generalization of the growth curve model which allows missing data

David G Kleinbaum 《Journal of multivariate analysis》1973,3(1):117-124

This study presents methods for estimating and testing hypotheses about linear functions of the unknown parameters in a generalization of the growth curve model which allows missing data. The estimators proposed are best asymptotically normal (BAN). A testing method for large samples is described which uses a test criterion given in general form by Wald. The asymptotic null distribution of the test statistic is a central chi-square variable. A BAN estimator of a linear vector function of the unknown parameters of the expectation model and consistent estimators of the variance-covariance parameters are required for computation. 相似文献

16.

响应变量删失时函数型部分线性分位数回归模型的估计

史功明张忠占谢田法《数学的实践与认识》2021,(3):152-166

最近几年,函数型数据分析的理论和应用飞速发展.在许多实际应用里,响应变量往往存在随机右删失的情况.考虑利用函数型部分线性分位数回归模型来刻画函数型和标量预测量与右删失响应变量之间的关系.基于函数型主成分基函数来逼近未知的斜率函数,通过极小化逆概率加权分位数损失函数得到未知系数的估计量.文章的估计方法容易通过加权分位数回归程序实现.在一定的假设条件下,给出了有限维参数估计量的渐近正态性与斜率函数估计量的收敛速度.最后,通过模拟计算与应用实例证明了所提方法的有效性. 相似文献

17.

A non-parametric test for composite hypotheses in survival analysis

H. Dennis Tolley 《Annals of the Institute of Statistical Mathematics》1978,30(1):281-295

Summary For survival data with several concomitant (regressor) variables a large sample non-parametric procedure is presented which provides significance tests of hypotheses about a subset of the concomitant variables. This non-iterative procedure resembles linear model methodology in simplicity and form. The method is useful to eliminate unimportant concomitant variables prior to estimation of model parameters. 相似文献

18.

�ݺ��ģ��º��ķǲ��Ҷ˹��

�� ʦ�� ̾� �� 《应用概率统计》2016,32(6):617-631

??The linear accelerated model is often used to the statistical analysis of constant stress accelerated life test, whereas it does not relate well with the facts. By adopting the power functional accelerated model, the relationship of sample quantiles among different constant stress levels is obtained, which can lead to the estimations of the parameters in accelerated model and the characteristic coefficient vectors by virtue of the least square method, then the life-time data transformation between different stress levels can be operated. For complete data and censoring data, a Dirichlet process prior is introduced to gain the posterior distribution and the nonparametric Bayesian estimation of the reliability function, meanwhile, the consistency of the posterior estimators is proved. Finally, a real life example of Metal-Oxide-Semiconductor capacitors is analyzed to illustrate the effect of our model. 相似文献

19.

Bayesian estimation of generalized gamma shared frailty model

Sukhmani Sidhu Kanchan Jain Suresh Kumar Sharma 《Computational Statistics》2018,33(1):277-297

Multivariate survival analysis comprises of event times that are generally grouped together in clusters. Observations in each of these clusters relate to data belonging to the same individual or individuals with a common factor. Frailty models can be used when there is unaccounted association between survival times of a cluster. The frailty variable describes the heterogeneity in the data caused by unknown covariates or randomness in the data. In this article, we use the generalized gamma distribution to describe the frailty variable and discuss the Bayesian method of estimation for the parameters of the model. The baseline hazard function is assumed to follow the two parameter Weibull distribution. Data is simulated from the given model and the Metropolis–Hastings MCMC algorithm is used to obtain parameter estimates. It is shown that increasing the size of the dataset improves estimates. It is also shown that high heterogeneity within clusters does not affect the estimates of treatment effects significantly. The model is also applied to a real life dataset. 相似文献

20.

A　Class　of　Linear　Biased　Estimators　of　Regression　ParameterMatrix　in　the　Growth　Curve　Model

归庆明《数学季刊》1995,(2)

ＡＣｌａｓｓｏｆＬｉｎｅａｒＢｉａｓｅｄＥｓｔｉｍａｔｏｒｓｏｆＲｅｇｒｅｓｓｉｏｎＰａｒａｍｅｔｅｒＭａｔｒｉｘｉｎｔｈｅＧｒｏｗｔｈＣｕｒｖｅＭｏｄｅｌ￥ＧｕｉＱｉｎｇｍｉｎｇ（ＺｈｅｎｇｚｈｏｕＩｎｓｔｉｔｕｔｅｏｆＳｕｒｖｅｙｉｎｇａｎｄＭａ... 相似文献