期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

GPU-Powered Shotgun Stochastic Search for Dirichlet Process Mixtures of Gaussian Graphical Models

Chiranjit Mukherjee Abel Rodriguez 《Journal of computational and graphical statistics》2016,25(3):762-788

Gaussian graphical models (GGMs) are popular for modeling high-dimensional multivariate data with sparse conditional dependencies. A mixture of GGMs extends this model to the more realistic scenario where observations come from a heterogenous population composed of a small number of homogeneous subgroups. In this article, we present a novel stochastic search algorithm for finding the posterior mode of high-dimensional Dirichlet process mixtures of decomposable GGMs. Further, we investigate how to harness the massive thread-parallelization capabilities of graphical processing units to accelerate computation. The computational advantages of our algorithms are demonstrated with various simulated data examples in which we compare our stochastic search with a Markov chain Monte Carlo (MCMC) algorithm in moderate dimensional data examples. These experiments show that our stochastic search largely outperforms the MCMC algorithm in terms of computing-times and in terms of the quality of the posterior mode discovered. Finally, we analyze a gene expression dataset in which MCMC algorithms are too slow to be practically useful. 相似文献

2.

Bayesian and Maximum Likelihood Estimation for Gaussian Processes on an Incomplete Lattice

Jonathan R. Stroud Michael L. Stein Shaun Lysen 《Journal of computational and graphical statistics》2017,26(1):108-120

This article proposes a new approach for Bayesian and maximum likelihood parameter estimation for stationary Gaussian processes observed on a large lattice with missing values. We propose a Markov chain Monte Carlo approach for Bayesian inference, and a Monte Carlo expectation-maximization algorithm for maximum likelihood inference. Our approach uses data augmentation and circulant embedding of the covariance matrix, and provides likelihood-based inference for the parameters and the missing data. Using simulated data and an application to satellite sea surface temperatures in the Pacific Ocean, we show that our method provides accurate inference on lattices of sizes up to 512 × 512, and is competitive with two popular methods: composite likelihood and spectral approximations. 相似文献

3.

Bayesian Inference for the One-Factor Copula Model

Ban Kheng Tan Anastasios Panagiotelis George Athanasopoulos 《Journal of computational and graphical statistics》2019,28(1):155-173

We develop efficient Bayesian inference for the one-factor copula model with two significant contributions over existing methodologies. First, our approach leads to straightforward inference on dependence parameters and the latent factor; only inference on the former is available under frequentist alternatives. Second, we develop a reversible jump Markov chain Monte Carlo algorithm that averages over models constructed from different bivariate copula building blocks. Our approach accommodates any combination of discrete and continuous margins. Through extensive simulations, we compare the computational and Monte Carlo efficiency of alternative proposed sampling schemes. The preferred algorithm provides reliable inference on parameters, the latent factor, and model space. The potential of the methodology is highlighted in an empirical study of 10 binary measures of socio-economic deprivation collected for 11,463 East Timorese households. The importance of conducting inference on the latent factor is motivated by constructing a poverty index using estimates of the factor. Compared to a linear Gaussian factor model, our model average improves out-of-sample fit. The relationships between the poverty index and observed variables uncovered by our approach are diverse and allow for a richer and more precise understanding of the dependence between overall deprivation and individual measures of well-being. 相似文献

4.

Reparameterized and Marginalized Posterior and Predictive Sampling for Complex Bayesian Geostatistical Models

《Journal of computational and graphical statistics》2013,22(2):262-282

This article proposes a four-pronged approach to efficient Bayesian estimation and prediction for complex Bayesian hierarchical Gaussian models for spatial and spatiotemporal data. The method involves reparameterizing the covariance structure of the model, reformulating the means structure, marginalizing the joint posterior distribution, and applying a simplex-based slice sampling algorithm. The approach permits fusion of point-source data and areal data measured at different resolutions and accommodates nonspatial correlation and variance heterogeneity as well as spatial and/or temporal correlation. The method produces Markov chain Monte Carlo samplers with low autocorrelation in the output, so that fewer iterations are needed for Bayesian inference than would be the case with other sampling algorithms. Supplemental materials are available online. 相似文献

5.

һ��Ķ�̬��

�� άά �� 《应用概率统计》2006,35(4):397-407

??Dynamic complex network has become a popular topic in the many fields, such as population ecology, social ecology, biology and Internet. Meanwhile cluster analysis is a common tool to extract network structure. Previous articles on network clustering mostly supposed that observations are conditionally independent. However, we construct novel model which combines the stochastic block model, the hidden structure in Markov process and the autoregressive model to relax this assumption. We also propose relative statistical inference and VEM algorithm. Finally, the Monte Carlo simulations are performed well, which shows the consistency and robustness of the work. 相似文献

6.

Dynamic Cluster Analysis of Dependent Networks

ZHANG Qi LI Xinmin WANG Weiwei WANG Liang 《应用概率统计》2019,35(4):397-407

Dynamic complex network has become a popular topic in the many fields, such as population ecology, social ecology, biology and Internet. Meanwhile cluster analysis is a common tool to extract network structure. Previous articles on network clustering mostly supposed that observations are conditionally independent. However, we construct novel model which combines the stochastic block model, the hidden structure in Markov process and the autoregressive model to relax this assumption. We also propose relative statistical inference and VEM algorithm. Finally, the Monte Carlo simulations are performed well, which shows the consistency and robustness of the work. 相似文献

7.

MCMC using Markov bases for computing p-values in decomposable log-linear models

Masahiro Kuroda Hiroki Hashiguchi Shigekazu Nakagawa Zhi Geng 《Computational Statistics》2013,28(2):831-850

相似文献

8.

A New Algorithm for Simulating a Correlation Matrix Based on Parameter Expansion and Reparameterization

《Journal of computational and graphical statistics》2013,22(4):897-914

The correlation matrix (denoted by R) plays an important role in many statistical models. Unfortunately, sampling the correlation matrix in Markov chain Monte Carlo (MCMC) algorithms can be problematic. In addition to the positive definite constraint of covariance matrices, correlation matrices have diagonal elements fixed at one. In this article, we propose an efficient two-stage parameter expanded reparameterization and Metropolis-Hastings (PX-RPMH) algorithm for simulating R. Using this algorithm, we draw all elements of R simultaneously by first drawing a covariance matrix from an inverse Wishart distribution, and then translating it back to a correlation matrix through a reduction function and accepting it based on a Metropolis-Hastings acceptance probability. This algorithm is illustrated using multivariate probit (MVP) models and multivariate regression (MVR) models with a common correlation matrix across groups. Via both a simulation study and a real data example, the performance of the PX-RPMH algorithm is compared with those of other common algorithms. The results show that the PX-RPMH algorithm is more efficient than other methods for sampling a correlation matrix. 相似文献

9.

Bayesian analysis of multivariate t linear mixed models using a combination of IBF and Gibbs samplers

Wan-Lun Wang Tsai-Hung Fan 《Journal of multivariate analysis》2012,105(1):300-310

The multivariate linear mixed model (MLMM) has become the most widely used tool for analyzing multi-outcome longitudinal data. Although it offers great flexibility for modeling the between- and within-subject correlation among multi-outcome repeated measures, the underlying normality assumption is vulnerable to potential atypical observations. We present a fully Bayesian approach to the multivariate t linear mixed model (MtLMM), which is a robust extension of MLMM with the random effects and errors jointly distributed as a multivariate t distribution. Owing to the introduction of too many hidden variables in the model, the conventional Markov chain Monte Carlo (MCMC) method may converge painfully slowly and thus fails to provide valid inference. To alleviate this problem, a computationally efficient inverse Bayes formulas (IBF) sampler coupled with the Gibbs scheme, called the IBF-Gibbs sampler, is developed and shown to be effective in drawing samples from the target distributions. The issues related to model determination and Bayesian predictive inference for future values are also investigated. The proposed methodologies are illustrated with a real example from an AIDS clinical trial and a careful simulation study. 相似文献

10.

A spatio-temporal dynamic regression model for extreme wind speeds

Behzad Mahmoudian Mohsen Mohammadzadeh 《Extremes》2014,17(2):221-245

This paper presents a hierarchical regression type model for analyzing the dependency of sample extremes on time, space and a covariate effect. The model is based on the assumption that the observations follow independently a generalized extreme value distribution given location, scale and shape parameters. Then a multivariate spatial process is considered to accommodate the association and spatial correlation in the distribution parameters. The mean of the process incorporates the underlying dynamics which is elaborated on the lower stage of hierarchy. Finally, three spatio-temporal dynamic linear models drive independently this mean function to take the variations in the parameters separately into account. In a Bayesian setting, the model structure leads to parallel implementation of the Markov chain Monte Carlo algorithm in a sense that it is less time consuming. Our methodology is applied to the monthly maxima of wind speed with temperature as a covariate for which the relationship is expressed in terms of a penalized spline regression model. The comparison of the proposed model with several simpler ones suggests considerable improvements in wind speed analysis. 相似文献

11.

Additive Positive Stable Frailty Models

Madhuja Mallick Nalini Ravishanker 《Methodology and Computing in Applied Probability》2006,8(4):541-558

In this article, we describe an additive stable frailty model for multivariate times to events data using a flexible baseline hazard, and assuming that the frailty component for each individual is described by additive functions of independent positive stable random variables with possibly different stability indices. Dependence properties of this frailty model are investigated. To carry out inference, the likelihood function is derived by replacing high-dimensional integration by Monte Carlo simulation. Markov chain Monte Carlo algorithms enable estimation and model checking in the Bayesian framework. 相似文献

12.

Efficient Simulation of Markov Chains Using Segmentation

Tim Brereton Ole Stenzel Björn Baumeier Denis Andrienko Volker Schmidt Dirk Kroese 《Methodology and Computing in Applied Probability》2014,16(2):465-484

A methodology is proposed that is suitable for efficient simulation of continuous-time Markov chains that are nearly-completely decomposable. For such Markov chains the effort to adequately explore the state space via Crude Monte Carlo (CMC) simulation can be extremely large. The purpose of this paper is to provide a fast alternative to the standard CMC algorithm, which we call Aggregate Monte Carlo (AMC). The idea of the AMC algorithm is to reduce the jumping back and forth of the Markov chain in small subregions of the state space. We accomplish this by aggregating such problem regions into single states. We discuss two methods to identify collections of states where the Markov chain may become ‘trapped’: the stochastic watershed segmentation from image analysis, and a graph-theoretic decomposition method. As a motivating application, we consider the problem of estimating the charge carrier mobility of disordered organic semiconductors, which contain low-energy regions in which the charge carrier can quickly become stuck. It is shown that the AMC estimator for the charge carrier mobility reduces computational costs by several orders of magnitude compared to the CMC estimator. 相似文献

13.

Modelling non-stationary dynamic gene regulatory processes with the BGM model

Marco Grzegorczyk Dirk Husmeier J?rg Rahnenf��hrer 《Computational Statistics》2011,26(2):199-218

Recently, a Bayesian network model for inferring non-stationary regulatory processes from gene expression time series has been proposed. The Bayesian Gaussian Mixture (BGM) Bayesian network model divides the data into disjunct compartments (data subsets) by a free allocation model, and infers network structures, which are kept fixed for all compartments. Fixing the network structure allows for some information sharing among compartments, and each compartment is modelled separately and independently with the Gaussian BGe scoring metric for Bayesian networks. The BGM model can equally be applied to both static (steady-state) and dynamic (time series) gene expression data. However, it is this flexibility that renders its application to time series data suboptimal. To improve the performance of the BGM model on time series data we propose a revised approach in which the free allocation of data points is replaced by a changepoint process so as to take the temporal structure into account. The practical inference follows the Bayesian paradigm and approximately samples the network, the number of compartments and the changepoint locations from the posterior distribution with Markov chain Monte Carlo (MCMC). Our empirical results show that the proposed modification leads to a more efficient inference tool for analysing gene expression time series. 相似文献

14.

Properties of Prior and Posterior Distributions for Multivariate Categorical Response Data Models

Ming-Hui Chen Qi-Man Shao 《Journal of multivariate analysis》1999,71(2):97

In this article, we model multivariate categorical (binary and ordinal) response data using a very rich class of scale mixture of multivariate normal (SMMVN) link functions to accommodate heavy tailed distributions. We consider both noninformative as well as informative prior distributions for SMMVN-link models. The notation of informative prior elicitation is based on available similar historical studies. The main objectives of this article are (i) to derive theoretical properties of noninformative and informative priors as well as the resulting posteriors and (ii) to develop an efficient Markov chain Monte Carlo algorithm to sample from the resulting posterior distribution. A real data example from prostate cancer studies is used to illustrate the proposed methodologies. 相似文献

15.

Implementing random scan Gibbs samplers

Richard A. Levine Zhaoxia Yu William G. Hanley John J. Nitao 《Computational Statistics》2005,20(1):177-196

Summary The Gibbs sampler, being a popular routine amongst Markov chain Monte Carlo sampling methodologies, has revolutionized the application of Monte Carlo methods in statistical computing practice. The performance of the Gibbs sampler relies heavily on the choice of sweep strategy, that is, the means by which the components or blocks of the random vector X of interest are visited and updated. We develop an automated, adaptive algorithm for implementing the optimal sweep strategy as the Gibbs sampler traverses the sample space. The decision rules through which this strategy is chosen are based on convergence properties of the induced chain and precision of statistical inferences drawn from the generated Monte Carlo samples. As part of the development, we analytically derive closed form expressions for the decision criteria of interest and present computationally feasible implementations of the adaptive random scan Gibbs sampler via a Gaussian approximation to the target distribution. We illustrate the results and algorithms presented by using the adaptive random scan Gibbs sampler developed to sample multivariate Gaussian target distributions, and screening test and image data. Research by RL and ZY supported in part by a US National Science Foundation FRG grant 0139948 and a grant from Lawrence Livermore National Laboratory, Livermore, California, USA. 相似文献

16.

A comparison of some criteria for states selection in the latent Markov model for longitudinal data

S. Bacci S. Pandolfi F. Pennoni 《Advances in Data Analysis and Classification》2014,8(2):125-145

We compare different selection criteria to choose the number of latent states of a multivariate latent Markov model for longitudinal data. This model is based on an underlying Markov chain to represent the evolution of a latent characteristic of a group of individuals over time. Then, the response variables observed at different occasions are assumed to be conditionally independent given this chain. Maximum likelihood estimation of the model is carried out through an Expectation–Maximization algorithm based on forward–backward recursions which are well known in the hidden Markov literature for time series. The selection criteria we consider are based on penalized versions of the maximum log-likelihood or on the posterior probabilities of belonging to each latent state, that is, the conditional probability of the latent state given the observed data. Among the latter criteria, we propose an appropriate entropy measure tailored for the latent Markov models. We show the results of a Monte Carlo simulation study aimed at comparing the performance of the above states selection criteria on the basis of a wide set of model specifications. 相似文献

17.

Bayesian modeling of financial returns: A relationship between volatility and trading volume

Carlos A. Abanto‐Valle Helio S. Migon Hedibert F. Lopes 《商业与工业应用随机模型》2010,26(2):172-193

The modified mixture model with Markov switching volatility specification is introduced to analyze the relationship between stock return volatility and trading volume. We propose to construct an algorithm based on Markov chain Monte Carlo simulation methods to estimate all the parameters in the model using a Bayesian approach. The series of returns and trading volume of the British Petroleum stock will be analyzed. Copyright © 2009 John Wiley & Sons, Ltd. 相似文献

18.

Bayesian Inference with Wavelets: Density Estimation

Peter Müller Brani Vidakovic 《Journal of computational and graphical statistics》2013,22(4):456-468

Abstract

We propose a prior probability model in the wavelet coefficient space. The proposed model implements wavelet coefficient thresholding by full posterior inference in a coherent probability model. We introduce a prior probability model with mixture priors for the wavelet coefficients. The prior includes a positive prior probability mass at zero which leads to a posteriori thresholding and generally to a posteriori shrinkage on the coefficients. We discuss an efficient posterior simulation scheme to implement inference in the proposed model. The discussion is focused on the density estimation problem. However, the introduced prior probability model on the wavelet coefficient space and the Markov chain Monte Carlo scheme are general. 相似文献

19.

Multivariate Survival Models with a Mixture of Positive Stable Frailties

Ravishanker Nalini Dey Dipak K. 《Methodology and Computing in Applied Probability》2000,2(3):293-308

In this paper, we describe models for dependent multivariate survival data using finite mixtures of positive stable frailty distributions. We investigate the cross-ratio function as a local measure of association. We estimate the parameters in the stable mixture together with the parameters of the (conditional) proportional hazards model in a Bayesian framework using Markov chain Monte Carlo algorithms. We illustrate the methodology using data on kidney infections. 相似文献

20.

Robust Gaussian graphical modeling

Masashi Miyamura Yutaka Kano 《Journal of multivariate analysis》2006,97(7):1525-1550

A new Gaussian graphical modeling that is robustified against possible outliers is proposed. The likelihood function is weighted according to how the observation is deviated, where the deviation of the observation is measured based on its likelihood. Test statistics associated with the robustified estimators are developed. These include statistics for goodness of fit of a model. An outlying score, similar to but more robust than the Mahalanobis distance, is also proposed. The new scores make it easier to identify outlying observations. A Monte Carlo simulation and an analysis of a real data set show that the proposed method works better than ordinary Gaussian graphical modeling and some other robustified multivariate estimators. 相似文献