期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

徐昕袁卫孟生旺《数学的实践与认识》2009,39(24)

在保险产品的分类费率厘定中,最常使用的模型之一是泊松回归模型.当损失数据存在零膨胀(zero-in flated)特征时,通常会采用零膨胀泊松回归模型.在零膨胀泊松回归模型中,一般假设结构零的比例参数φ为常数,不受费率因子的影响,这有可能背离实际情况.假设参数φ与费率因子之间存在一定关系,并在此基础上建立了零膨胀广义泊松回归模型,即Z IGP(τ)回归模型.通过对一组汽车保险损失数据的拟合表明,Z IGP(τ)回归模型可以有效地改善对实际数据的拟合效果,从而提高费率厘定结果的合理性. 相似文献

2.

0-1膨胀几何分布回归模型及其应用

《系统科学与数学》2019,(9)

在制造缺陷、专利申请、道路安全和公共卫生等应用领域,经常会出现较多的零观测值和一观测值.采用传统的泊松回归或负二项回归模型往往会过低地估计零观测值和一观测值出现的概率,数据拟合的效果欠佳.文章提出了0-1膨胀几何分布回归模型,巧妙地引入隐变量并进行极大似然估计和贝叶斯估计,基于数据扩充策略分别采用最大期望(EM)算法和Metropolis-Hastings抽样算法对回归参数向量进行估计.在不同的样本容量下进行数值模拟,并对两种估计方法的性能进行评价.研究表明,对于博士研究生发表论文数量的数据集,0-1膨胀几何分布回归模型能够达到更好的拟合效果. 相似文献

3.

零—膨胀泊松回归模型的非参数统计分析及其应用

《数理统计与管理》2019,(2):235-246

零膨胀计数数据是当今数据分析的热点问题之一,该类数据的特点是零点过多,目前对这类数据的研究已经比较全面。另外还有些计数数据不仅会出现零点过多的现象,也会同时存在零、一点都过多的情形,如果再用零膨胀计数数据的统计方法去研究,产生的误差较大。目前国内外对零和一都膨胀的数据的研究还比较少,针对这种现象,本文引入零一膨胀泊松回归模型,并用局部多项式核回归法这种非参数统计分析方法对零一膨胀泊松回归模型进行参数估计,这是本文的创新点也是难点,并在求解参数的过程中引进了EM算法和Newton-Raphson迭代对参数近似求解。通过模拟结果可以得出此方法的可行性,最后通过对糖尿病患者数据的实例分析,可以验证此方法的有效性。相似文献

4.

散度偏大计数数据分布的贝叶斯估计

下载免费PDF全文

陈雪东《应用概率统计》2007,23(4):361-368

本文针对索赔次数数据的特点, 讨论了两类可导致散度偏大特征数据的分布类型: 零点膨胀分布与膨胀参数分布, 并根据Bayes理论与MCMC方法, 利用WinBUGS对其进行建模和抽样\bd 经过比较,给出了实现分布拟合的途径, 最后通过两个数值例子加以展示. 相似文献

5.

两种ZIP模型的比较及其在保费厘定中的应用

项丽雅朱仲义《数理统计与管理》2013,(5):854-862

本文研究了两种ZIP模型,比较了一般ZIP回归模型和ZIP(τ)模型对相同的零膨胀(zero-inflated)计数数据的拟合效果,发现前者在拟合时有较好的稳定性,并在此基础上得出实际数据中两种模型应用的技巧以及需要注意的问题。同时研究了ZIP模型在保险费率厘定中的应用,将这两种模型分别应用于保险数据,发现在拟合和预测两方面,ZIP回归模型都明显优于ZIP(τ)模型。相似文献

6.

零调整分位数回归模型在车辆保险索赔额中的研究与应用

《数学的实践与认识》2013,(20)

车辆保险产品的定价一般会考虑保单持有人的索赔概率和期望索赔额等两个因素,零调整逆高斯回归模型作为解决这类问题的一个有力工具,由于变量分布的限定,从而具有一定的局限性.针对该问题,本文基于零调整逆高斯回归模型和分位数回归模型的思想,提出零调整分位数回归模型,并结合实际数据进行了拟合分析.与零调整逆高斯回归模型拟合的结果比较表明,零调整分位数回归模型可以作为研究车辆保险中索赔额的一个有力工具. 相似文献

7.

含右删失数据的ZIP模型及其应用

金晶亮吕效国曹张华季晓敏王金华《数学的实践与认识》2012,42(1):102-107

零膨胀Poisson回归(ZIP)是处理零频数过多计数资料的有效模型,而计数数据一般含有删失或不精密的特点.本文将删失数据引入到ZIP模型中来,分别建立含右删失数据的固定效应ZIP模型,随机效应ZIP模型,通过极大边际似然函数估计法对模型进行参数估计.最后,利用实例分析验证了上述模型的可行性. 相似文献

8.

零截断及零调整离散模型的计数数据分析方法的新进展之综述研究

张弛田国梁刘寅《应用概率统计》2021,(3):303-330

在社会学、心理学、生态学、保险学、医学、流行病学等领域,人们经常收集到各种各样的计数资料以研究它们的规律和特征.往往会出现计数数据不包含零观测值或零观测值过多的情形.一系列零截断和零膨胀离散模型也由此提出用于分析这一类数据,如零截断/零膨胀泊松分布、零截断/零膨胀负二项分布等.在利用这一类模型进行拟合时,对未知参数进行... 相似文献

9.

零膨胀泊松混合回归中的变量选择

王思洋高铭《应用数学学报》2023,(1):88-100

来源于不同总体的数据异质性较大,数据“零取值”较多且离散度大,可利用零膨胀泊松(ZIP)混合回归模型建模分析,然而混合模型中自变量较多.为了筛选出重要变量,本文利用自适应LASSO对ZIP混合回归模型进行变量选择,即在似然函数中加入惩罚项,再利用EM算法估计参数.通过模拟,验证了该方法在变量选择和参数估计中的有效性.同时,将ZIP混合回归模型应用于预测借贷失败次数的实际数据分析,筛选出对借贷失败有重要影响的因素.最后,通过比较各模型的预测效果,得到ZIP混合回归模型优于泊松(Poisson),负二项(NB)和ZIP回归模型. 相似文献

10.

纵向数据半参数回归模型的估计方法

王芬玲樊明智《数理统计与管理》2007,26(6):992-998

本文考虑纵向数据半参数回归模型,通过考虑纵向数据的协方差结构,基于Profile最小二乘法和局部线性拟合的方法建立了模型中参数分量、回归函数和误差方差的估计量,来提高估计的有效性,在适当条件下给出了这些估计量的相合性.并通过模拟研究将该方法与最小二乘局部线性拟合估计方法进行了比较,表明了Profile最小二乘局部线性拟合方法在有限样本情况下具有良好的性质. 相似文献

11.

A Bayesian model for repeated measures zero-inflated count data with application to outpatient psychiatric service use 总被引：1，自引：0，他引：1

Neelon BH O'Malley AJ Normand SL 《Statistical modelling》2010,10(4):421-439

In applications involving count data, it is common to encounter an excess number of zeros. In the study of outpatient service utilization, for example, the number of utilization days will take on integer values, with many subjects having no utilization (zero values). Mixed-distribution models, such as the zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB), are often used to fit such data. A more general class of mixture models, called hurdle models, can be used to model zero-deflation as well as zero-inflation. Several authors have proposed frequentist approaches to fitting zero-inflated models for repeated measures. We describe a practical Bayesian approach which incorporates prior information, has optimal small-sample properties, and allows for tractable inference. The approach can be easily implemented using standard Bayesian software. A study of psychiatric outpatient service use illustrates the methods. 相似文献

12.

��Ҷ˹��Ϸ�λ�ع��Gibbs��㷨

�� Ǭ ��ï�� 《应用概率统计》2019,35(2):178-192

Most regression modeling is based on traditional mean regression which results in non-robust estimation results for non-normal errors. Compared to conventional mean regression, composite quantile regression (CQR) may produce more robust parameters estimation. Based on a composite asymmetric Laplace distribution (CALD), we build a Bayesian hierarchical model for the weighted CQR (WCQR). The Gibbs sampler algorithm of Bayesian WCQR is developed to implement posterior inference. Finally, the proposed method are illustrated by some simulation studies and a real data analysis. 相似文献

13.

带有不可忽略缺失数据的半参数再生散度模型的贝叶斯分析

陈雪东唐年胜《系统科学与数学》2010,10(10):1334-1350

半参数再生散度模型是再生散度模型和半参数回归模型的推广,包括了半参数广义线性模型和广义部分线性模型等特殊类型.讨论的是该模型在响应变量和协变量均存在非随机缺失数据情形下参数的Bayes估计和基于Bayes因子的模型选择问题.在分析中,采用了惩罚样条来估计模型中的非参数成分,并建立了Bayes层次模型;为了解决Gibbs抽样过程中因参数高度相关带来的混合性差以及因维数增加导致出现不稳定性的问题,引入了潜变量做为添加数据并应用了压缩Gibbs抽样方法,改进了收敛性;同时,为了避免计算多重积分,利用了M-H算法估计边缘密度函数后计算Bayes因子,为模型的选择比较提供了一种准则.最后,通过模拟和实例验证了所给方法的有效性. 相似文献

14.

Bayesian Variable Selection for Median Regression

HU Danqing GU Yongquan ZHAO Weihua 《应用概率统计》2019,35(6):594-610

When the data has heavy tail feature or contains outliers, conventional variable selection methods based on penalized least squares or likelihood functions perform poorly. Based on Bayesian inference method, we study the Bayesian variable selection problem for median linear models. The Bayesian estimation method is proposed by using Bayesian model selection theory and Bayesian estimation method through selecting the Spike and Slab prior for regression coefficients, and the effective posterior Gibbs sampling procedure is also given. Extensive numerical simulations and Boston house price data analysis are used to illustrate the effectiveness of the proposed method. 相似文献

15.

Bayesian lasso binary quantile regression

Dries F. Benoit Rahim Alhamzawi Keming Yu 《Computational Statistics》2013,28(6):2861-2873

In this paper, a Bayesian hierarchical model for variable selection and estimation in the context of binary quantile regression is proposed. Existing approaches to variable selection in a binary classification context are sensitive to outliers, heteroskedasticity or other anomalies of the latent response. The method proposed in this study overcomes these problems in an attractive and straightforward way. A Laplace likelihood and Laplace priors for the regression parameters are proposed and estimated with Bayesian Markov Chain Monte Carlo. The resulting model is equivalent to the frequentist lasso procedure. A conceptional result is that by doing so, the binary regression model is moved from a Gaussian to a full Laplacian framework without sacrificing much computational efficiency. In addition, an efficient Gibbs sampler to estimate the model parameters is proposed that is superior to the Metropolis algorithm that is used in previous studies on Bayesian binary quantile regression. Both the simulation studies and the real data analysis indicate that the proposed method performs well in comparison to the other methods. Moreover, as the base model is binary quantile regression, a much more detailed insight in the effects of the covariates is provided by the approach. An implementation of the lasso procedure for binary quantile regression models is available in the R-package bayesQR. 相似文献

16.

A Gaussian Mixed Model for Learning Discrete Bayesian Networks

Balov N 《Statistics & probability letters》2011,81(2):220-230

In this paper, we address the problem of learning discrete Bayesian networks from noisy data. A graphical model based on a mixture of Gaussian distributions with categorical mixing structure coming from a discrete Bayesian network is considered. The network learning is formulated as a maximum likelihood estimation problem and performed by employing an EM algorithm. The proposed approach is relevant to a variety of statistical problems for which Bayesian network models are suitable—from simple regression analysis to learning gene/protein regulatory networks from microarray data. 相似文献

17.

��λ��ع�ı�Ҷ˹��ѡ�񷽷�

�� Ȫ ��Ϊ�� 《应用概率统计》2006,35(6):594-610

??When the data has heavy tail feature or contains outliers, conventional variable selection methods based on penalized least squares or likelihood functions perform poorly. Based on Bayesian inference method, we study the Bayesian variable selection problem for median linear models. The Bayesian estimation method is proposed by using Bayesian model selection theory and Bayesian estimation method through selecting the Spike and Slab prior for regression coefficients, and the effective posterior Gibbs sampling procedure is also given. Extensive numerical simulations and Boston house price data analysis are used to illustrate the effectiveness of the proposed method. 相似文献

18.

MARS: selecting basis functions and knots with an empirical Bayes method

Wataru Sakamoto 《Computational Statistics》2007,22(4):583-597

An empirical Bayes method to select basis functions and knots in multivariate adaptive regression spline (MARS) is proposed, which takes both advantages of frequentist model selection approaches and Bayesian approaches. A penalized likelihood is maximized to estimate regression coefficients for selected basis functions, and an approximated marginal likelihood is maximized to select knots and variables involved in basis functions. Moreover, the Akaike Bayes information criterion (ABIC) is used to determine the number of basis functions. It is shown that the proposed method gives estimation of regression structure that is relatively parsimonious and more stable for some example data sets. 相似文献

19.

Data analysis for accelerated life tests via Weibull-gamma frailty regression models

Xiao-Dong Zhou Yun-Juan Wang Lin Wu Rong-Xian Yue 《商业与工业应用随机模型》2023,39(4):567-583

In this article, we study data analysis methods for accelerated life test (ALT) with blocking. Unlike the previous assumption of normal distribution for random block effects, we advocate the use of Weibull regression model with gamma random effects for making statistical inference of ALT data. To estimate the unknown parameters in the proposed model, maximum likelihood estimation and Bayesian estimation methods are provided. We illustrate the proposed methods using real data examples and simulation examples. Numerical results suggest that distribution of random effects has minimal impact on the estimation of fixed effects in the Weibull regression models. Furthermore, to demonstrate the advantage of our proposed model, we also provide methods to compare ALT plans and thus identify the optimal ALT plans. 相似文献

20.

Approximate Bayesian shrinkage estimation

Wang-Shu Lu 《Annals of the Institute of Statistical Mathematics》1994,46(3):497-507

A Bayesian shrinkage estimate for the mean in the generalized linear empirical Bayes model is proposed. The posterior mean under the empirical Bayes model has a shrinkage pattern. The shrinkage factor is estimated by using a Bayesian method with the regression coefficients to be fixed at the maximum extended quasi-likelihood estimates. This approach develops a Bayesian shrinkage estimate of the mean which is numerically quite tractable. The method is illustrated with a data set, and the estimate is compared with an earlier one based on an empirical Bayes method. In a special case of the homogeneous model with exchangeable priors, the performance of the Bayesian estimate is illustrated by computer simulations. The simulation result shows as improvement of the Bayesian estimate over the empirical Bayes estimate in some situations. 相似文献