期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Data Adaptive Ridging in Local Polynomial Regression

Burkhardt Seifert Theo Gasser 《Journal of computational and graphical statistics》2013,22(2):338-360

Abstract

When estimating a regression function or its derivatives, local polynomials are an attractive choice due to their flexibility and asymptotic performance. Seifert and Gasser proposed ridging of local polynomials to overcome problems with variance for random design while retaining their advantages. In this article we present a data-independent rule of thumb and a data-adaptive spatial choice of the ridge parameter in local linear regression. In a framework of penalized local least squares regression, the methods are generalized to higher order polynomials, to estimation of derivatives, and to multivariate designs. The main message is that ridging is a powerful tool for improving the performance of local polynomials. A rule of thumb offers drastic improvements; data-adaptive ridging brings further but modest gains in mean square error. 相似文献

2.

删失数据回归误差项的非参数密度估计的一致相合性(英文)

下载免费PDF全文

周秀轻史宁中赵进《应用概率统计》2010,26(6):623-636

回归误差项是不可观测的. 由于回归误差项的密度函数在实际中有许多应用, 故使用非参数方法对其进行估计就成为回归分析中的一个基本问题. 针对完全观测数据回归模型, 曾有作者对此问题进行了研究. 然而在实际应用中, 经常会有数据被删失的情况发生, 在此情况下, 可以利用删失回归残差, 并使用核估计的方法对回归误差项的密度函数进行估计. 本文研究了该估计的大样本性质, 并证明了估计量的一致相合性. 相似文献

3.

Exploring Variation in Three-Dimensional Shape Data

《Journal of computational and graphical statistics》2013,22(3):524-541

A variety of very useful methods of statistical shape analysis are available for landmark data. In particular, standard methods of multivariate analysis can often be applied after suitable alignment and transformation of the data. An important example is the use of principal components analysis to provide a convenient route to graphical exploration of the main modes of variation in a sample. Where there are many landmarks or shape information is extracted in the form of curves or surfaces, the dimensionality of the resulting data can be very high and it is unlikely that substantial proportions of variability will be captured in one or two principal components. Issues of graphical exploration are explored in this setting, including random tours of a suitable low-dimensional subspace, the comparison of different groups of data, longitudinal changes and the identification of the features which distinguish individual cases from a group of controls. A suitable software environment for handling these methods with three-dimensional data is outlined. Issues of comparing principal components across time are also tackled through appropriately constructed permutation tests. All of these techniques are illustrated on a longitudinal study of facial development in young children, with particular interest in the identification of differences in facial shape between control children and those who have undergone surgical repair of a cleft lip and/or palate. 相似文献

4.

Visualization of Multivariate Density Estimates With Shape Trees

《Journal of computational and graphical statistics》2013,22(2):372-397

This article introduces graphical tools for visualizing multivariate functions, specializing to the case of visualizing multivariate density estimates. We visualize a density estimate by visualizing a series of its level sets. From each connected part of a level set a shape tree is formed. A shape tree is a tree whose nodes are associated with regions of the level set. With the help of a shape tree we define a transformation of a multivariate set to a univariate function. The shape trees are visualized with the shape plots and the location plot. By studying these plots one may identify the regions of the Euclidean space where the probability mass is concentrated. An application of shape trees to visualize the distribution of stock index returns is presented. 相似文献

5.

关于数据分析方法及SAS软件教学的探索 总被引：1，自引：0，他引：1

陈方樱《数学的实践与认识》2004,34(1):168-172

论述了在大学本科阶段开设数据分析方法及其应用软件选修课的必要性及可能性 ,并介绍了笔者自 1 996年以来给本校信息与计算科学系三年级本科生开设该课程的情况及收获和体会 . 相似文献

6.

The Discrete Wavelet Transform in S

G. P. Nason B. W. Silverman 《Journal of computational and graphical statistics》2013,22(2):163-191

Abstract

The theory of wavelets has recently undergone a period of rapid development. We introduce a software package called wavethresh that works within the statistical language S to perform one- and two-dimensional discrete wavelet transforms. The transforms and their inverses can be computed using any particular wavelet selected from a range of different families of wavelets. Pictures can be drawn of any of the one- or two-dimensional wavelets available in the package. The wavelet coefficients can be presented in a variety of ways to aid in the interpretation of data. The package's wavelet transform “engine” is written in C for speed and the object-oriented functionality of S makes wavethresh easy to use. We provide a tutorial introduction to wavelets and the wavethresh software. We also discuss how the software may be used to carry out nonlinear regression and image compression. In particular, thresholding of wavelet coefficients is a method for attempting to extract signal from noise and wavethresh includes functions to perform thresholding according to methods in the literature. 相似文献

7.

纵向数据半参数Beta回归模型的影响分析

赵为华李泽安徐相建《应用数学学报》2012,35(4):677-692

本将随机效应当作是缺失数据,基于Q函数和EM算法并利用P-样条拟合非参数部分,得到了纵向数据半参数Beta回归模型估计方法.基于数据删除模型,我们得到了模型参数部分的广义Cook距离以及非参数部分的广义DFIT.此外,本文还研究了在四种不同扰动情形下模型的局部影响分析,得到了相应的影响矩阵.最后,我们通过两个数值实例验证了所得诊断统计量的有效性. 相似文献

8.

纵向数据半参数回归模型的估计方法

王芬玲樊明智《数理统计与管理》2007,26(6):992-998

本文考虑纵向数据半参数回归模型,通过考虑纵向数据的协方差结构,基于Profile最小二乘法和局部线性拟合的方法建立了模型中参数分量、回归函数和误差方差的估计量,来提高估计的有效性,在适当条件下给出了这些估计量的相合性.并通过模拟研究将该方法与最小二乘局部线性拟合估计方法进行了比较,表明了Profile最小二乘局部线性拟合方法在有限样本情况下具有良好的性质. 相似文献

9.

纵向数据均值和协方差矩阵的有效稳健估计（英文）

樊亚莉徐孝琳《应用概率统计》2018,(6):598-612

本文提出一种针对纵向数据回归模型下的均值和协方差矩阵同时进行的有效稳健估计.基于对协方差矩阵的Cholesky分解和对模型的改写,我们提出一个加权最小二乘估计,其中权重是通过广义经验似然方法估计出来的.所提估计的有效性得益于经验似然方法的优势,稳健性则是通过限制残差平方和的上界来达到.模拟研究表明,和已有的针对纵向数据的稳健估计相比,所提估计具有更高的效率和可比的稳健性.最后,我们把所提估计方法用来分析一组实际数据. 相似文献

10.

回归分析中参数估计的一种新方法 总被引：1，自引：0，他引：1

王玉杰张大克《应用数学与计算数学学报》2001,15(2):85-89

本文根据矩阵的广义逆理论,建立了回归分析中进行参数估计的一种新方法,该方法更简单更精确。相似文献

11.

��ݾ�ֵ��Э��Ч�Ƚ��

�� Т�� 《应用概率统计》2018,34(6):598-612

In this article, we develop efficient robust method for estimation of mean and covariance simultaneously for longitudinal data in regression model. Based on Cholesky decomposition for the covariance matrix and rewriting the regression model, we propose a weighted least square estimator, in which the weights are estimated under generalized empirical likelihood framework. The proposed estimator obtains high efficiency from the close connection to empirical likelihood method, and achieves robustness by bounding the weighted sum of squared residuals. Simulation study shows that, compared to existing robust estimation methods for longitudinal data, the proposed estimator has relatively high efficiency and comparable robustness. In the end, the proposed method is used to analyse a real data set. 相似文献

12.

Group-Wise Principal Component Analysis for Exploratory Data Analysis

José Camacho Rafael A. Rodríguez-Gómez Edoardo Saccenti 《Journal of computational and graphical statistics》2017,26(3):501-512

In this article, we propose a new framework for matrix factorization based on principal component analysis (PCA) where sparsity is imposed. The structure to impose sparsity is defined in terms of groups of correlated variables found in correlation matrices or maps. The framework is based on three new contributions: an algorithm to identify the groups of variables in correlation maps, a visualization for the resulting groups, and a matrix factorization. Together with a method to compute correlation maps with minimum noise level, referred to as missing-data for exploratory data analysis (MEDA), these three contributions constitute a complete matrix factorization framework. Two real examples are used to illustrate the approach and compare it with PCA, sparse PCA, and structured sparse PCA. Supplementary materials for this article are available online. 相似文献

13.

微观数据和宏观汇总数据在统计分析上的差异——以C-D生产函数为例

李望月《数学的实践与认识》2014,(9)

基于2008年经济普查的数据,从描述统计分析和回归分析两方面分别对微观数据和宏观汇总数据在统计分析上的差异进行了实证分析.在描述统计分析中发现,宏观汇总数据比微观数据更接近正态分布,但对数化处理后的数据并非如此;在回归分析中发现,基于微观数据和宏观汇总数据估计的生产函数,在消除异方差和多重共线性之前,无论是在生产函数的规模效应、生产要素的贡献率以及生产要素对产出的解释力度上均存在着差异,但是在消除异方差和多重共线性之后,在要素对产出的解释力度上仍存在很大差异. 相似文献

14.

数据规律分析法在曲线优化拟合中的应用

卢玉成唐荣桂卢曦王一帆《高等数学研究》2016,(1):67-70

本文介绍了数据规律分析法所包含的公式体系和系数计算方法;在曲线拟合中,实际算例结果表明该方法优于常规方法,同时验证了降价法与回归分析法具有同等的计算精确度;作为一种数学方法,可为高等教材数值分析增添新的内容。相似文献

15.

Polynomial Regression with Censored Data based on Preliminary Nonparametric Estimation

Cédric Heuchenne Ingrid Van Keilegom 《Annals of the Institute of Statistical Mathematics》2007,59(2):273-297

Consider the polynomial regression model , where σ²(X)=Var(Y|X) is unknown, and ε is independent of X and has zero mean. Suppose that Y is subject to random right censoring. A new estimation procedure for the parameters β₀,...,β_p is proposed, which extends the classical least squares procedure to censored data. The proposed method is inspired by the method of Buckley and James (1979, Biometrika, 66, 429–436), but is, unlike the latter method, a noniterative procedure due to nonparametric preliminary estimation of the conditional regression function. The asymptotic normality of the estimators is established. Simulations are carried out for both methods and they show that the proposed estimators have usually smaller variance and smaller mean squared error than the Buckley–James estimators. The two estimation procedures are also applied to a medical and an astronomical data set. 相似文献

16.

Multivariate Adaptive Splines for Analysis of Longitudinal Data

Heping Zhang 《Journal of computational and graphical statistics》2013,22(1):74-91

Abstract

An essential feature of longitudinal data is the existence of autocorrelation among the observations from the same unit or subject. Two-stage random-effects linear models are commonly used to analyze longitudinal data. These models are not flexible enough, however, for exploring the underlying data structures and, especially, for describing time trends. Semi-parametric models have been proposed recently to accommodate general time trends. But these semi-parametric models do not provide a convenient way to explore interactions among time and other covariates although such interactions exist in many applications. Moreover, semi-parametric models require specifying the design matrix of the covariates (time excluded). We propose nonparametric models to resolve these issues. To fit nonparametric models, we use the novel technique of the multivariate adaptive regression splines for the estimation of mean curve and then apply an EM-like iterative procedure for covariance estimation. After giving a general algorithm of model building, we show how to design a fast algorithm. We use both simulated and published data to illustrate the use of our proposed method. 相似文献

17.

一类纵向数据半参数模型中的强相合估计

田萍马国锋《数理统计与管理》2008,27(5)

本文考虑如下纵向数据半参数回归模型:y_(ij)=x′_(ij)β+g(x_(ij))+e_(ij).结合最小二乘法和非参数权函数估计方法得到模型中参数β,回归函数g(·)的估计,并在适当条件下证明了估计量的强相合性. 相似文献

18.

基于似然函数的纵向数据线性混合模型影响分析

费宇潘建新王力宾《系统科学与数学》2009,29(2):271-279

线性混合模型在纵向数据分析中有广泛应用,应用点删除法研究纵向数据线性混合模型影响分析问题.采用的影响度量是基于似然函数构造的广义Cook距离,推导出两种广义Cook距离并用于实际数据分析和统计模拟分析,实际数据分析和统计模拟分析结果都说明该方法是行之有效的. 相似文献

19.

A Generalized Smoother for Linear Ordinary Differential Equations

Michelle Carey Eugene G. Gath Kevin Hayes 《Journal of computational and graphical statistics》2017,26(3):671-681

Ordinary differential equations (ODEs) are equalities involving a function and its derivatives that define the evolution of the function over a prespecified domain. The applications of ODEs range from simulation and prediction to control and diagnosis in diverse fields such as engineering, physics, medicine, and finance. Parameter estimation is often required to calibrate these theoretical models to data. While there are many methods for estimating ODE parameters from partially observed data, they are invariably subject to several problems including high computational cost, complex estimation procedures, biased estimates, and large sampling variance. We propose a method that overcomes these issues and produces estimates of the ODE parameters that have less bias, a smaller sampling variance, and a 10-fold improvement in computational efficiency. The package GenPen containing the Matlab code to perform the methods described in this article is available online. 相似文献

20.

纵向数据下半参数回归模型估计的收敛速度

田萍苗宝军薛留根《数学的实践与认识》2008,38(11):108-114

考虑纵向数据下半参数回归模型:yij=x′ijβ+g(tij)+eij,i=1,…,n,j=1,…,mi.基于最小二乘法和一般的非参数权函数方法给出了模型中参数β和回归函数g(·)的估计,并在适当条件下证明了参数分量β的估计量的强收敛速度和未知函数g(·)的估计量的一致强收敛速度. 相似文献