共查询到20条相似文献,搜索用时 0 毫秒
1.
Abstract When estimating a regression function or its derivatives, local polynomials are an attractive choice due to their flexibility and asymptotic performance. Seifert and Gasser proposed ridging of local polynomials to overcome problems with variance for random design while retaining their advantages. In this article we present a data-independent rule of thumb and a data-adaptive spatial choice of the ridge parameter in local linear regression. In a framework of penalized local least squares regression, the methods are generalized to higher order polynomials, to estimation of derivatives, and to multivariate designs. The main message is that ridging is a powerful tool for improving the performance of local polynomials. A rule of thumb offers drastic improvements; data-adaptive ridging brings further but modest gains in mean square error. 相似文献
2.
3.
《Journal of computational and graphical statistics》2013,22(3):524-541
A variety of very useful methods of statistical shape analysis are available for landmark data. In particular, standard methods of multivariate analysis can often be applied after suitable alignment and transformation of the data. An important example is the use of principal components analysis to provide a convenient route to graphical exploration of the main modes of variation in a sample. Where there are many landmarks or shape information is extracted in the form of curves or surfaces, the dimensionality of the resulting data can be very high and it is unlikely that substantial proportions of variability will be captured in one or two principal components. Issues of graphical exploration are explored in this setting, including random tours of a suitable low-dimensional subspace, the comparison of different groups of data, longitudinal changes and the identification of the features which distinguish individual cases from a group of controls. A suitable software environment for handling these methods with three-dimensional data is outlined. Issues of comparing principal components across time are also tackled through appropriately constructed permutation tests. All of these techniques are illustrated on a longitudinal study of facial development in young children, with particular interest in the identification of differences in facial shape between control children and those who have undergone surgical repair of a cleft lip and/or palate. 相似文献
4.
《Journal of computational and graphical statistics》2013,22(2):372-397
This article introduces graphical tools for visualizing multivariate functions, specializing to the case of visualizing multivariate density estimates. We visualize a density estimate by visualizing a series of its level sets. From each connected part of a level set a shape tree is formed. A shape tree is a tree whose nodes are associated with regions of the level set. With the help of a shape tree we define a transformation of a multivariate set to a univariate function. The shape trees are visualized with the shape plots and the location plot. By studying these plots one may identify the regions of the Euclidean space where the probability mass is concentrated. An application of shape trees to visualize the distribution of stock index returns is presented. 相似文献
5.
关于数据分析方法及SAS软件教学的探索 总被引:1,自引:0,他引:1
陈方樱 《数学的实践与认识》2004,34(1):168-172
论述了在大学本科阶段开设数据分析方法及其应用软件选修课的必要性及可能性 ,并介绍了笔者自 1 996年以来给本校信息与计算科学系三年级本科生开设该课程的情况及收获和体会 . 相似文献
6.
Abstract The theory of wavelets has recently undergone a period of rapid development. We introduce a software package called wavethresh that works within the statistical language S to perform one- and two-dimensional discrete wavelet transforms. The transforms and their inverses can be computed using any particular wavelet selected from a range of different families of wavelets. Pictures can be drawn of any of the one- or two-dimensional wavelets available in the package. The wavelet coefficients can be presented in a variety of ways to aid in the interpretation of data. The package's wavelet transform “engine” is written in C for speed and the object-oriented functionality of S makes wavethresh easy to use. We provide a tutorial introduction to wavelets and the wavethresh software. We also discuss how the software may be used to carry out nonlinear regression and image compression. In particular, thresholding of wavelet coefficients is a method for attempting to extract signal from noise and wavethresh includes functions to perform thresholding according to methods in the literature. 相似文献
7.
8.
本文考虑纵向数据半参数回归模型,通过考虑纵向数据的协方差结构,基于Profile最小二乘法和局部线性拟合的方法建立了模型中参数分量、回归函数和误差方差的估计量,来提高估计的有效性,在适当条件下给出了这些估计量的相合性.并通过模拟研究将该方法与最小二乘局部线性拟合估计方法进行了比较,表明了Profile最小二乘局部线性拟合方法在有限样本情况下具有良好的性质. 相似文献
9.
本文提出一种针对纵向数据回归模型下的均值和协方差矩阵同时进行的有效稳健估计.基于对协方差矩阵的Cholesky分解和对模型的改写,我们提出一个加权最小二乘估计,其中权重是通过广义经验似然方法估计出来的.所提估计的有效性得益于经验似然方法的优势,稳健性则是通过限制残差平方和的上界来达到.模拟研究表明,和已有的针对纵向数据的稳健估计相比,所提估计具有更高的效率和可比的稳健性.最后,我们把所提估计方法用来分析一组实际数据. 相似文献
10.
11.
In this article, we develop efficient robust method for estimation of mean and covariance simultaneously for longitudinal data in regression model. Based on Cholesky decomposition for the covariance matrix and rewriting the regression model, we propose a weighted least square estimator, in which the weights are estimated under generalized empirical likelihood framework. The proposed estimator obtains high efficiency from the close connection to empirical likelihood
method, and achieves robustness by bounding the weighted sum of squared residuals. Simulation study shows that, compared to existing robust estimation methods for longitudinal data, the proposed estimator has relatively high efficiency and comparable robustness. In the end, the proposed method is used to analyse a real data set. 相似文献
12.
José Camacho Rafael A. Rodríguez-Gómez Edoardo Saccenti 《Journal of computational and graphical statistics》2017,26(3):501-512
In this article, we propose a new framework for matrix factorization based on principal component analysis (PCA) where sparsity is imposed. The structure to impose sparsity is defined in terms of groups of correlated variables found in correlation matrices or maps. The framework is based on three new contributions: an algorithm to identify the groups of variables in correlation maps, a visualization for the resulting groups, and a matrix factorization. Together with a method to compute correlation maps with minimum noise level, referred to as missing-data for exploratory data analysis (MEDA), these three contributions constitute a complete matrix factorization framework. Two real examples are used to illustrate the approach and compare it with PCA, sparse PCA, and structured sparse PCA. Supplementary materials for this article are available online. 相似文献
13.
李望月 《数学的实践与认识》2014,(9)
基于2008年经济普查的数据,从描述统计分析和回归分析两方面分别对微观数据和宏观汇总数据在统计分析上的差异进行了实证分析.在描述统计分析中发现,宏观汇总数据比微观数据更接近正态分布,但对数化处理后的数据并非如此;在回归分析中发现,基于微观数据和宏观汇总数据估计的生产函数,在消除异方差和多重共线性之前,无论是在生产函数的规模效应、生产要素的贡献率以及生产要素对产出的解释力度上均存在着差异,但是在消除异方差和多重共线性之后,在要素对产出的解释力度上仍存在很大差异. 相似文献
14.
15.
Cédric Heuchenne Ingrid Van Keilegom 《Annals of the Institute of Statistical Mathematics》2007,59(2):273-297
Consider the polynomial regression model
, where σ2(X)=Var(Y|X) is unknown, and ε is independent of X and has zero mean. Suppose that Y is subject to random right censoring. A new estimation procedure for the parameters β0,...,β
p
is proposed, which extends the classical least squares procedure to censored data. The proposed method is inspired by the
method of Buckley and James (1979, Biometrika, 66, 429–436), but is, unlike the latter method, a noniterative procedure due to nonparametric preliminary estimation of the
conditional regression function. The asymptotic normality of the estimators is established. Simulations are carried out for
both methods and they show that the proposed estimators have usually smaller variance and smaller mean squared error than
the Buckley–James estimators. The two estimation procedures are also applied to a medical and an astronomical data set. 相似文献
16.
Heping Zhang 《Journal of computational and graphical statistics》2013,22(1):74-91
Abstract An essential feature of longitudinal data is the existence of autocorrelation among the observations from the same unit or subject. Two-stage random-effects linear models are commonly used to analyze longitudinal data. These models are not flexible enough, however, for exploring the underlying data structures and, especially, for describing time trends. Semi-parametric models have been proposed recently to accommodate general time trends. But these semi-parametric models do not provide a convenient way to explore interactions among time and other covariates although such interactions exist in many applications. Moreover, semi-parametric models require specifying the design matrix of the covariates (time excluded). We propose nonparametric models to resolve these issues. To fit nonparametric models, we use the novel technique of the multivariate adaptive regression splines for the estimation of mean curve and then apply an EM-like iterative procedure for covariance estimation. After giving a general algorithm of model building, we show how to design a fast algorithm. We use both simulated and published data to illustrate the use of our proposed method. 相似文献
17.
本文考虑如下纵向数据半参数回归模型:y_(ij)=x′_(ij)β+g(x_(ij))+e_(ij).结合最小二乘法和非参数权函数估计方法得到模型中参数β,回归函数g(·)的估计,并在适当条件下证明了估计量的强相合性. 相似文献
18.
19.
Michelle Carey Eugene G. Gath Kevin Hayes 《Journal of computational and graphical statistics》2017,26(3):671-681
Ordinary differential equations (ODEs) are equalities involving a function and its derivatives that define the evolution of the function over a prespecified domain. The applications of ODEs range from simulation and prediction to control and diagnosis in diverse fields such as engineering, physics, medicine, and finance. Parameter estimation is often required to calibrate these theoretical models to data. While there are many methods for estimating ODE parameters from partially observed data, they are invariably subject to several problems including high computational cost, complex estimation procedures, biased estimates, and large sampling variance. We propose a method that overcomes these issues and produces estimates of the ODE parameters that have less bias, a smaller sampling variance, and a 10-fold improvement in computational efficiency. The package GenPen containing the Matlab code to perform the methods described in this article is available online. 相似文献
20.
考虑纵向数据下半参数回归模型:yij=x′ijβ+g(tij)+eij,i=1,…,n,j=1,…,mi.基于最小二乘法和一般的非参数权函数方法给出了模型中参数β和回归函数g(·)的估计,并在适当条件下证明了参数分量β的估计量的强收敛速度和未知函数g(·)的估计量的一致强收敛速度. 相似文献