首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
在协变量和反映变量都缺失下,构造了线性模型中反映变量均值的经验似然置信区间,数据模拟表明调整的经验似然置信区间有较好的覆盖率和精度,进一步完善了缺失数据下对线性模型的研究.  相似文献   

2.
The aggregation of financial and economic time series occurs in a number of ways. Temporal aggregation or systematic sampling is the commonly used approach. In this paper, we investigate the time interval effect of multiple regression models in which the variables are additive or systematically sampled. The correlation coefficient changes with the selected time interval when one is additive and the other is systematically sampled. It is shown that the squared correlation coefficient decreases monotonically as the differencing interval increases, approaching zero in the limit. When two random variables are both added or systematically sampled, the correlation coefficient is invariant with time and equal to the one-period values. We find that the partial regression and correlation coefficients between two additive or systematically sampled variables approach one-period values as n increases. When one of the variables is systematically sampled, they will approach zero in the limit. The time interval for the association analyses between variables is not selected arbitrarily or the statistical results are likely affected.  相似文献   

3.
In a Bayesian setup, we consider the problem of predicting a dependent variable given an independent variable and past observations on the two variables. An asymptotic formula for the relevant posterior predictive density is worked out. Considering posterior quantiles and highest predictive density regions, we then characterize priors that ensure approximate frequentist validity of Bayesian prediction in the above setting. Application to regression models is also discussed.  相似文献   

4.
Treed Regression     
Abstract

Given a data set consisting of n observations on p independent variables and a single dependent variable, treed regression creates a binary tree with a simple linear regression function at each of the leaves. Each node of the tree consists of an inequality condition on one of the independent variables. The tree is generated from the training data by a recursive partitioning algorithm. Treed regression models are more parsimonious than CART models because there are fewer splits. Additionally, monotonicity in some or all of the variables can be imposed.  相似文献   

5.
《Journal of Complexity》2003,19(4):474-510
In this paper we address the complexity of solving linear programming problems with a set of differential equations that converge to a fixed point that represents the optimal solution. Assuming a probabilistic model, where the inputs are i.i.d. Gaussian variables, we compute the distribution of the convergence rate to the attracting fixed point. Using the framework of Random Matrix Theory, we derive a simple expression for this distribution in the asymptotic limit of large problem size. In this limit, we find the surprising result that the distribution of the convergence rate is a scaling function of a single variable. This scaling variable combines the convergence rate with the problem size (i.e., the number of variables and the number of constraints). We also estimate numerically the distribution of the computation time to an approximate solution, which is the time required to reach a vicinity of the attracting fixed point. We find that it is also a scaling function. Using the problem size dependence of the distribution functions, we derive high probability bounds on the convergence rates and on the computation times to the approximate solution.  相似文献   

6.
The problem of producing medium- to long-term forecasts of the market for business telephones is examined. Growth curves are generally appropriate for forecasting developing markets. However, this market is particularly sensitive to the state of business confidence and the feasibility of incorporating explanatory economic variables into the forecasting model is investigated. Three different model types are compared: growth curves with a fixed saturation level, multivariate linear models and growth curves with saturation levels determined by explanatory variables. The initial promise of models using explanatory variables is considerably diminished, once forecast rather than actual values of these variables are used. The market development model implicit in the growth curve is shown to be more robust than the linear model. Although the variable saturation level growth curve grants more insight into the maturity of the market, it does not produce significantly better forecasts than that with the fixed saturation level.  相似文献   

7.
A type-2 fuzzy variable is a map from a fuzzy possibility space to the real number space; it is an appropriate tool for describing type-2 fuzziness. This paper first presents three kinds of critical values (CVs) for a regular fuzzy variable (RFV), and proposes three novel methods of reduction for a type-2 fuzzy variable. Secondly, this paper applies the reduction methods to data envelopment analysis (DEA) models with type-2 fuzzy inputs and outputs, and develops a new class of generalized credibility DEA models. According to the properties of generalized credibility, when the inputs and outputs are mutually independent type-2 triangular fuzzy variables, we can turn the proposed fuzzy DEA model into its equivalent parametric programming problem, in which the parameters can be used to characterize the degree of uncertainty about type-2 fuzziness. For any given parameters, the parametric programming model becomes a linear programming one that can be solved using standard optimization solvers. Finally, one numerical example is provided to illustrate the modeling idea and the efficiency of the proposed DEA model.  相似文献   

8.
The dependent variable in a regular linear regression is a numerical variable, and in a logistic regression it is a binary or categorical variable. In these models the dependent variable has varying values. However, there are problems yielding an identity output of a constant value which can also be modelled in a linear or logistic regression with this constant in place of a numerical or binary response. In a linear model with a positive response, dividing by its values yields a regression of constant output by the relative shares of individual predictors into the total response. Chemical reaction models use the agents' concentration, summing to a constant 100%. Another example can be found in priority modelling by Thurstone scaling for ranked or paired comparison data. The Thurstone scale can be estimated by probit or logit models with identical output across all the responses. Models with a unitary output can be constructed by software for regular regressions, but they give a different interpretation of results. For instance, the coefficient of multiple determination is not an estimate of the explained variance in the total response variance (which is zero), but a measure of the fitting quality of the constant approximated by an aggregate of predictors.  相似文献   

9.
In this paper we investigate the time interval effect of multiple regression models in which some of the variables are additive and some are multiplicative. The effect on the partial regression and correlation coefficients is influenced by the selected time interval. We find that the partial regression and correlation coefficients between two additive variables approach one-period values as n increases. When one of the variables is multiplicative, they will approach zero in the limit. We also show that the decreasing speed of the n-period correlation coefficients between both multiplicative variables is faster than others, except that a one-period correlation has a higher positive value. The results of this paper can be widely applied in various fields where regression or correlation analyses are employed.  相似文献   

10.
This paper considers the problem of interval scale data in the most widely used models of data envelopment analysis (DEA), the CCR and BCC models. Radial models require inputs and outputs measured on the ratio scale. Our focus is on how to deal with interval scale variables especially when the interval scale variable is a difference of two ratio scale variables like profit or the decrease/increase in bank accounts. We suggest the use of these ratio scale variables in a radial DEA model.  相似文献   

11.
This article presents a method for the construction of a simultaneous confidence band for the normal-error multiple linear regression model. The confidence bands considered have their width proportional to the standard error of the estimated regression function, and the predictor variables are allowed to be constrained in intervals. Past articles in this area gave exact bands only for the simple regression model. When there is more than one predictor variable, only conservative bands are proposed in the statistics literature. This article advances this methodology by providing simulation-based confidence bands for regression models with any number of predictor variables. Additionally, a criterion is proposed to assess the sensitivity of a simultaneous confidence band. This criterion is defined to be the probability that a false linear regression model is excluded from the band at least at one point and hence this false linear regression model is correctly declared as a false model by the band. Finally, the article considers and compares several computational algorithms for obtaining the confidence band.  相似文献   

12.
选择合适的变量是建立多元线性回归方程的重要问题.以减弱诸多变量之间的复共线性为目标,采用条件数分析方法选择多元回归模型的自变量.最后以西北太平洋海域2001-2011年5-7月的台风强度为研究对象,利用条件数方法建立预报方程进行预报试验,并进一步将该预报方法与逐步回归方法进行对比分析.  相似文献   

13.
The coverage probability of an approximate confidence interval on the among-group variance component, σ α 2 , in a one-way random model is modeled using generalized linear models techniques. The purpose of the proposed modeling is to derive an empirical relationship between the coverage probability on one hand, andk, n., ?, and the model’s variance components on the other hand, wherek is the number of groups, n. is the total number of observations, and ? is a measure of imbalance for the design used. The latter quantities serve as control variables in the derived model, and the coverage probability is treated as the response variable. Contour plots generated from this model can be easily used to depict the effects of the control variables on the coverage probability. In particular, the plots are utilized to compare four methods for constructing approximate confidence intervals on σ α 2 . Additional advantages of the derived model include prediction of the coverage probability for a given method using only specified values of the control variables, and the determination of operating conditions that result in improved coverage probability within the region of interest.  相似文献   

14.
《Optimization》2012,61(11):2441-2454
Inverse data envelopment analysis (InDEA) is a well-known approach for short-term forecasting of a given decision-making unit (DMU). The conventional InDEA models use the production possibility set (PPS) that is composed of an evaluated DMU with current inputs and outputs. In this paper, we replace the fluctuated DMU with a modified DMU involving renewal inputs and outputs in the PPS since the DMU with current data cannot be allowed to establish the new PPS. Besides, the classical DEA models such as InDEA are assumed to consider perfect knowledge of the input and output values but in numerous situations, this assumption may not be realistic. The observed values of the data in these situations can sometimes be defined as interval numbers instead of crisp numbers. Here, we extend the InDEA model to interval data for evaluating the relative efficiency of DMUs. The proposed models determine the lower and upper bounds of the inputs of a given DMU separately when its interval outputs are changed in the performance analysis process. We aim to remain the current interval efficiency of a considered DMU and the interval efficiencies of the remaining DMUs fixed or even improve compared with the current interval efficiencies.  相似文献   

15.
The problem is considered of modeling simultaneous confidence intervals for the mean values of multiple responses in a linear multivariate normal regression model with predictor variables defined in intervals. To solve it, a numerical way of calculating the critical value that determines the simultaneous confidence interval of a given level is used. Simultaneous confidence intervals are numerically modelled and analyzed by comparison for regression, the mean value of multiple responses, and individual observation.  相似文献   

16.
A statistical model for the analysis of ordinal level dependent variables   总被引:1,自引:0,他引:1  
This paper develops a model, with assumptions similar to those of the linear model, for use when the observed dependent variable is ordinal. This model is an extension of the dichotomous probit model, and assumes that the ordinal nature of the observed dependent variable is due to methodological limitations in collecting the data, which force the researcher to lump together and identify various portions of an (otherwise) interval level variable. The model assumes a linear eflect of each independent variable as well as a series of break points between categories for the dependent variable. Maximum likelihood estimators are found for these parameters, along with their asymptotic sampling distributions, and an analogue of R 2 (the coefficient of determination in regression analysis) is defined to measure goodness of fit. The use of the model is illustrated with an analysis of Congressional voting on the 1965 Medicare Bill.  相似文献   

17.
In this paper we present a new method of confidence interval identification for Takagi–Sugeno fuzzy models in the case of the data with regionally changeable variance. The method combines a fuzzy identification methodology with some ideas from applied statistics. The idea is to find, on a finite set of measured data, the confidence interval defined by the lower and upper bounds. The confidence interval which defines the band that contains the measurement values with certain confidence. The method can be used when describing a family of uncertain nonlinear functions or when the systems with uncertain physical parameters are observed. In our example the proposed method is applied to model the pH-titration curve.  相似文献   

18.
左截尾双参数指数分布的可靠寿命的广义置信下限   总被引:1,自引:0,他引:1       下载免费PDF全文
本文基于左截尾双参数指数分布定数截尾数据,利用Weerahandi给出的广义枢轴量和广义置信区间的概念,通过两种不同的方法建立了可靠寿命的广义置信下限.第1种方法利用位置参数无限制时可靠寿命的广义置信下限来定义左截尾情形下可靠寿命的限制广义置信下限,第2种方法基于广义枢轴量在限制参数空间上的条件分布给出可靠寿命的条件广义置信下限.我们分别研究了这两种置信下限的性质,给出了简单易行的数值计算方法.模拟比较表明限制广义置信下限具有好的覆盖率性质,条件广义置信下限的覆盖率与参数取值有关,但它有时比限制广义置信下限具有更大均值和更小标准差.  相似文献   

19.
In this paper we introduce an appealing nonparametric method for estimating variance and conditional variance functions in generalized linear models (GLMs), when designs are fixed points and random variables respectively, Bias-corrected confidence bands are proposed for the (conditional) variance by local linear smoothers. Nonparametric techniques are developed in deriving the bias-corrected confidence intervals of the (conditional) variance. The asymptotic distribution of the proposed estimator is established and show that the bias-corrected confidence bands asymptotically have the correct coverage properties. A small simulation is performed when unknown regression parameter is estimated by nonparametric quasi-likelihood. The results are also applicable to nonparamctric autoregressive times series model with heteroscedastic conditional variance.  相似文献   

20.
In this study, we investigate the factors that influence the object-oriented (OO) component size and source code documentation. For multiple inputs and multiple outputs, we use data envelopment analysis to illustrate that non-linear variable returns to scale (VRS) economies exist for OO component size and source code documentation. The existence of non-linear variable returns to scale economies indicates that non-linear regression models will perform better than linear regression models. Using empirical data, we compare the performance of non-linear artificial neural network (ANN) forecasting model and linear regression model. Our results indicate that the ANN model performs well when VRS economies exist between multiple inputs and multiple outputs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号