首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Due to their high flexibility, yet simple structure, pair-copula constructions (PCCs) are becoming increasingly popular for constructing continuous multivariate distributions. However, inference requires the simplifying assumption that all the pair-copulae depend on the conditioning variables merely through the two conditional distribution functions that constitute their arguments, and not directly. In terms of standard measures of dependence, we express conditions under which a specific pair-copula decomposition of a multivariate distribution is of this simplified form. Moreover, we show that the simplified PCC in fact is a rather good approximation, even when the simplifying assumption is far from being fulfilled by the actual model.  相似文献   

2.
Pair-copula Bayesian networks (PCBNs) are a novel class of multivariate statistical models, which combine the distributional flexibility of pair-copula constructions (PCCs) with the parsimony of conditional independence models associated with directed acyclic graphs (DAGs). We are first to provide generic algorithms for random sampling and likelihood inference in arbitrary PCBNs as well as for selecting orderings of the parents of the vertices in the underlying graphs. Model selection of the DAG is facilitated using a version of the well-known PC algorithm that is based on a novel test for conditional independence of random variables tailored to the PCC framework. A simulation study shows the PC algorithm’s high aptitude for structure estimation in non-Gaussian PCBNs. The proposed methods are finally applied to modeling financial return data. Supplementary materials for this article are available online.  相似文献   

3.
The general multivariate analysis of variance model has been extensively studied in the statistical literature and successfully applied in many different fields for analyzing longitudinal data. In this article, we consider the extension of this model having two sets of regressors constituting a growth curve portion and a multivariate analysis of variance portion, respectively. Nowadays, the data collected in empirical studies have relatively complex structures though often demanding a parsimonious modeling. This can be achieved for example through imposing rank constraints on the regression coefficient matrices. The reduced rank regression structure also provides a theoretical interpretation in terms of latent variables. We derive likelihood based estimators for the mean parameters and covariance matrix in this type of models. A numerical example is provided to illustrate the obtained results.  相似文献   

4.
A data analysis method is proposed to derive a latent structure matrix from a sample covariance matrix. The matrix can be used to explore the linear latent effect between two sets of observed variables. Procedures with which to estimate a set of dependent variables from a set of explanatory variables by using latent structure matrix are also proposed. The proposed method can assist the researchers in improving the effectiveness of the SEM models by exploring the latent structure between two sets of variables. In addition, a structure residual matrix can also be derived as a by-product of the proposed method, with which researchers can conduct experimental procedures for variables combinations and selections to build various models for hypotheses testing. These capabilities of data analysis method can improve the effectiveness of traditional SEM methods in data property characterization and models hypotheses testing. Case studies are provided to demonstrate the procedure of deriving latent structure matrix step by step, and the latent structure estimation results are quite close to the results of PLS regression. A structure coefficient index is suggested to explore the relationships among various combinations of variables and their effects on the variance of the latent structure.  相似文献   

5.
An extension of univariate quantiles in the multivariate set-up has been proposed and studied. The proposed approach is affine equivariant, and it is based on an adaptive transformation retransformation procedure. Behadur type linear representations of the proposed quantiles are established and consequently asymptotic distributions are also derived. As applications of these multivariate quantiles, we develop some affine equivariant quantile contour plots which can be used to study the geometry of the data cloud as well as the underlying probability distribution and to detect outliers. These quantiles can also be used to construct affine invariant versions of multivariate Q-Q plots which are useful in checking how well a given multivariate probability distribution fits the data and for comparing the distributions of two data sets. We illustrate these applications with some simulated and real data sets. We also indicate a way of extending the notion of univariate L-estimates and trimmed means in the multivariate set-up using these affine equivariant quantiles.  相似文献   

6.
Geometric stable laws constitute a class of limiting distributions of appropriately normalized random sums of i.i.d. random variables. We consider the problem of estimation of the parameters of univariate and multivariate geometric stable laws. Our estimation technique is based on the method of moments and yields consistent and asymptotically normal estimators. We apply our estimators to a currency exchange data and show that the geometric stable dominates Paretian stable and normal models.  相似文献   

7.
By introducing auxiliary variables, the traditional Markov chain Monte Carlo method can be improved in certain cases by implementing a “slice sampler.” In the current literature, this sampling technique is used to sample from multivariate distributions with both single and multiple auxiliary variables. When the latter is employed, it generally updates one component at a time.

In this article, we propose two variations of a new multivariate normal slice sampling method that uses multiple auxiliary variables to perform multivariate updating. These methods are flexible enough to allow for truncation to a rectangular region and/or exclusion of any n-dimensional hyper-quadrant. We present results of our methods and existing state-of-the-art slice samplers by comparing efficiency and accuracy. We find that we can generate approximately iid samples at a rate that is more efficient than other methods that update all dimensions at once. Supplemental materials are available online.  相似文献   

8.
用pair-copula构建高维相依结构,将n维联合密度函数转化为若干个pair-copula密度函数相乘。在pair-copula的选择上,本文构造了能描述非对称尾部相关性的混合copula函数——M-copula。并在实证分析部分用该方法探索了上证市场上四个板块的相依关系,得到了比较理想的结果。  相似文献   

9.
Methods are given for simulating from symmetric and asymmetric versions of the multivariate logistic distribution, and from other multivariate extreme value distributions based on the well known logistic model. We consider two general approaches. The first approach uses transformations to derive random variables with a joint distribution function from which it is easy to simulate. The second approach derives from a specification of conditionally independent marginal components, conditioning on positive stable random variables. This specification extends to models of nested or hierarchical type and leads to an efficient way of incorporating marginal censoring. The algorithms presented in Sections 2 and 3 are available on request from the author. They are also included in the R (Ihaka and Gentleman, 1996) package evd (Stephenson, 2002), which is available from http://www.maths.lancs.ac.uk/~stephena/.  相似文献   

10.
Analyses of multivariate ordinal probit models typically use data augmentation to link the observed (discrete) data to latent (continuous) data via a censoring mechanism defined by a collection of “cutpoints.” Most standard models, for which effective Markov chain Monte Carlo (MCMC) sampling algorithms have been developed, use a separate (and independent) set of cutpoints for each element of the multivariate response. Motivated by the analysis of ratings data, we describe a particular class of multivariate ordinal probit models where it is desirable to use a common set of cutpoints. While this approach is attractive from a data-analytic perspective, we show that the existing efficient MCMC algorithms can no longer be accurately applied. Moreover, we show that attempts to implement these algorithms by numerically approximating required multivariate normal integrals over high-dimensional rectangular regions can result in severely degraded estimates of the posterior distribution. We propose a new data augmentation that is based on a covariance decomposition and that admits a simple and accurate MCMC algorithm. Our data augmentation requires only that univariate normal integrals be evaluated, which can be done quickly and with high accuracy. We provide theoretical results that suggest optimal decompositions within this class of data augmentations, and, based on the theory, recommend default decompositions that we demonstrate work well in practice. This article has supplementary material online.  相似文献   

11.
M. Falk  R. Michel 《Extremes》2009,12(1):33-51
It has recently been shown by Rootzén and Tajvidi (Bernoulli, 12:917–930, 2006) that modelling exceedances of a random variable over a high threshold (peaks-over-threshold approach [POT]) can also in the multivariate setup be done rationally only by a multivariate generalized Pareto distribution (GPD). The selection of a proper threshold is, however, a crucial problem. The contribution of this paper is twofold: We develop first a non asymptotic and exact level-α test based on the single-sample t-test, which checks whether multivariate data are actually generated by a multivariate GPD. Secondly, this procedure is utilized for the derivation of a t-test based threshold selection rule in multivariate peaks-over-threshold models. The application to a hydrological data set illustrates this approach.   相似文献   

12.
The connectives ‘and’ and ‘or’ are potentially useful in multivariate analysis and theory construction. They are simple, logical ways to connect two or more variables together. However, until recently there has been no framework for operationalizing these connectives for continuous variables, and this lack has severely limited their use. Using fuzzy set theory as a basis for such a framework, this paper lays out the necessary tools and models to permit the use of ‘and’ and ‘or’ in multivariate analysis.Section 1 introduces conventional operators for ‘and’ and ‘or’, and Section 2 provides suitable extensions and generalizations of them. Section 3 sets out the required least-squares techniques for fitting these generalized operators to data, first in the context of ANOVA problems and then in regression contexts, for single-connective (three-variable) models. The theoretical developments and examples from real data-sets demonstrate the utility of ‘and’ and ‘or’ as a means to cell-specific interpretations of interaction effects which can also readily be translated into English. Section 4 extends these developments to multivariate, multiple-connective models and discusses issues of generalizability. The paper concludes (Section 5) with a brief discussion of remaining unsolved problems, future prospects for more sophisticated models, and computer programs.  相似文献   

13.
Random sets are set-valued random variables. They have been applied in various fields like stochastic geometry, statistics, economics, engineering or computer science, and are often used for modeling uncertainty. In an earlier paper the author has defined joint capacity and joint containment functionals which are multivariate set functions describing the joint distribution of random sets. This paper is concerned with the question how copulas can be used to describe or model the dependence of random sets. It is demonstrated that a joint containment functional can be related to its margins by a family of copulas. Furthermore, the paper provides a first insight how copulas can be used to define joint containment functionals.  相似文献   

14.
Tree-structured models have been widely used because they function as interpretable prediction models that offer easy data visualization. A number of tree algorithms have been developed for univariate response data and can be extended to analyze multivariate response data. We propose a tree algorithm by combining the merits of a tree-based model and a mixed-effects model for longitudinal data. We alleviate variable selection bias through residual analysis, which is used to solve problems that exhaustive search approaches suffer from, such as undue preference to split variables with more possible splits, expensive computational cost, and end-cut preference. Most importantly, our tree algorithm discovers trends over time on each of the subspaces from recursive partitioning, while other tree algorithms predict responses. We investigate the performance of our algorithm with both simulation and real data studies. We also develop an R package melt that can be used conveniently and freely. Additional results are provided as online supplementary material.  相似文献   

15.
We introduce graphical time series models for the analysis of dynamic relationships among variables in multivariate time series. The modelling approach is based on the notion of strong Granger causality and can be applied to time series with non-linear dependences. The models are derived from ordinary time series models by imposing constraints that are encoded by mixed graphs. In these graphs each component series is represented by a single vertex and directed edges indicate possible Granger-causal relationships between variables while undirected edges are used to map the contemporaneous dependence structure. We introduce various notions of Granger-causal Markov properties and discuss the relationships among them and to other Markov properties that can be applied in this context. Examples for graphical time series models include nonlinear autoregressive models and multivariate ARCH models.  相似文献   

16.
Penrose has given asymptotic results for the distribution of the longest edge of the minimal spanning tree and nearest neighbour graph for sets of multivariate uniformly or normally distributed points. We investigate the applicability of these results to samples of up to 100 points, in up to 10 dimensions. We conclude that the asymptotic results provide an acceptable approximation only in the uniform case. Their inaccuracy for the multivariate normal case means that they cannot be applied to improve Rohlf's gap test for an outlier in a set of multivariate data points, which depends on the longest edge of the minimal spanning tree of the set.  相似文献   

17.
18.
In this paper, we describe models for dependent multivariate survival data using finite mixtures of positive stable frailty distributions. We investigate the cross-ratio function as a local measure of association. We estimate the parameters in the stable mixture together with the parameters of the (conditional) proportional hazards model in a Bayesian framework using Markov chain Monte Carlo algorithms. We illustrate the methodology using data on kidney infections.  相似文献   

19.
This article compares methods for the numerical computation of multivariate t probabilities for hyper-rectangular integration regions. Methods based on acceptance-rejection, spherical-radial transformations, and separation-of-variables transformations are considered. Tests using randomly chosen problems show that the most efficient numerical methods use a transformation developed by Genz for multivariate normal probabilities. These methods allow moderately accurate multivariate t probabilities to be quickly computed for problems with as many as 20 variables. Methods for the noncentral multivariate t distribution are also described.  相似文献   

20.
Parameters of Gaussian multivariate models are often estimated using the maximum likelihood approach. In spite of its merits, this methodology is not practical when the sample size is very large, as, for example, in the case of massive georeferenced data sets. In this paper, we study the asymptotic properties of the estimators that minimize three alternatives to the likelihood function, designed to increase the computational efficiency. This is achieved by applying the information sandwich technique to expansions of the pseudo-likelihood functions as quadratic forms of independent normal random variables. Theoretical calculations are given for a first-order autoregressive time series and then extended to a two-dimensional autoregressive process on a lattice. We compare the efficiency of the three estimators to that of the maximum likelihood estimator as well as among themselves, using numerical calculations of the theoretical results and simulations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号