首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Let (X, Y) be an d × -valued random vector and let (X1, Y1),…,(XN, YN) be a random sample drawn from its distribution. Divide the data sequence into disjoint blocks of length l1, …, ln, find the nearest neighbor to X in each block and call the corresponding couple (Xi*, Yi*). It is shown that the estimate mn(X) = Σi = 1n wniYi*i = 1n wni of m(X) = E{Y|X} satisfies E{|mn(X) − m(X)|p} 0 (p ≥ 1) whenever E{|Y|p} < ∞, ln ∞, and the triangular array of positive weights {wni} satisfies supinwnii = 1n wni 0. No other restrictions are put on the distribution of (X, Y). Also, some distribution-free results for the strong convergence of E{|mn(X) − m(X)|p|X1, Y1,…, XN, YN} to zero are included. Finally, an application to the discrimination problem is considered, and a discrimination rule is exhibited and shown to be strongly Bayes risk consistent for all distributions.  相似文献   

2.
Let (X, Y), (X1, Y1), …, (Xn, Yn) be i.d.d. Rr × R-valued random vectors with E|Y| < ∞, and let Qn(x) be a kernel estimate of the regression function Q(x) = E(Y|X = x). In this paper, we establish an exponential bound of the mean deviation between Qn(x) and Q(x) given the training sample Zn = (X1, Y1, …, Xn, Yn), under conditions as weak as possible.  相似文献   

3.
For X one observation on a p-dimensional (p ≥ 4) spherically symmetric (s.s.) distribution about θ, minimax estimators whose risks dominate the risk of X (the best invariant procedure) are found with respect to general quadratic loss, L(δ, θ) = (δ − θ)′ D(δ − θ) where D is a known p × p positive definite matrix. For C a p × p known positive definite matrix, conditions are given under which estimators of the form δa,r,C,D(X) = (I − (ar(|X|2)) D−1/2CD1/2 |X|−2)X are minimax with smaller risk than X. For the problem of estimating the mean when n observations X1, X2, …, Xn are taken on a p-dimensional s.s. distribution about θ, any spherically symmetric translation invariant estimator, δ(X1, X2, …, Xn), with have a s.s. distribution about θ. Among the estimators which have these properties are best invariant estimators, sample means and maximum likelihood estimators. Moreover, under certain conditions, improved robust estimators can be found.  相似文献   

4.
Let (X, Y) be a random vector such that X is d-dimensional, Y is real valued, and θ(X) is the conditional αth quantile of Y given X, where α is a fixed number such that 0 < α < 1. Assume that θ is a smooth function with order of smoothness p > 0, and set r = (pm)/(2p + d), where m is a nonnegative integer smaller than p. Let T(θ) denote a derivative of θ of order m. It is proved that there exists estimate of T(θ), based on a set of i.i.d. observations (X1, Y1), …, (Xn, Yn), that achieves the optimal nonparametric rate of convergence nr in Lq-norms (1 ≤ q < ∞) restricted to compacts under appropriate regularity conditions. Further, it has been shown that there exists estimate of T(θ) that achieves the optimal rate (n/log n)r in L-norm restricted to compacts.  相似文献   

5.
Let X1, X2, …, Xn be random vectors that take values in a compact set in Rd, d ≥ 1. Let Y1, Y2, …, Yn be random variables (“the responses”) which conditionally on X1 = x1, …, Xn = xn are independent with densities f(y | xi, θ(xi)), i = 1, …, n. Assuming that θ lives in a sup-norm compact space Θq,d of real valued functions, an optimal L1-consistent estimator of θ is constructed via empirical measures. The rate of convergence of the estimator to the true parameter θ depends on Kolmogorov's entropy of Θq,d.  相似文献   

6.
Let (X1Y1), (X2Y2), …, be two-dimensional random vectors which are independent and distributed as (XY). For 0<p<1, letξ(px) be the conditionalpth quantile ofYgivenX=x; that is,ξ(px)=inf{y : P(YyX=x)p}. We consider the problem of estimatingξ(px) from the data (X1Y1), (X2Y2), …, (XnYn). In this paper, a new kernel estimator ofξ(px) is proposed. The asymptotic normality and a law of the iterated logarithm are obtained.  相似文献   

7.
We consider the problem of estimating a continuous bounded probability density function when independent data X1, ..., Xn from the density are partially contaminated by measurement error. In particular, the observations Y1, ..., Yn are such that P(Yj = Xj) = p and P(Yj = Xj + εj) = 1 − p, where the errors εj are independent (of each other and of the Xj) and identically distributed from a known distribution. When p = 0 it is well known that deconvolution via kernel density estimators suffers from notoriously slow rates of convergence. For normally distributed εj the best possible rates are of logarithmic order pointwise and in mean square error. In this paper we demonstrate that for merely partially(0 < p <1) contaminated observations (where of course it is unknown which observations are contaminated and which are not) under mild conditions almost sure rates of order O(((log h−1)/nh)1/2) with h = h(n) = const(log n/n)1/5 are achieved for convergence in L-norm. This is equal to the optimal rate available in ordinary density estimation from direct uncontaminated observations (p = 1). A corresponding result is obtained for the mean integrated squared error.  相似文献   

8.
Let (X1, X2,…, Xk, Y1, Y2,…, Yk) be multivariate normal and define a matrix C by Cij = cov(Xi, Yj). If (i) (X1,…, Xk) = (Y1,…, Yk) and (ii) C is symmetric positive definite, then 0 < varf(X1,…, Xk) < ∞ corr(f(X1,…, Xk),f(Y1,…, Yk)) > 0. Condition (i) is necessary for the conclusion. The sufficiency of (i) and (ii) follows from an infinite-dimensional version, which can also be applied to a pair of jointly normal Brownian motions.  相似文献   

9.
The behavior of the posterior for a large observation is considered. Two basic situations are discussed; location vectors and natural parameters.Let X = (X1, X2, …, Xn) be an observation from a multivariate exponential distribution with that natural parameter Θ = (Θ1, Θ2, …, Θn). Let θx* be the posterior mode. Sufficient conditions are presented for the distribution of Θ − θx* given X = x to converge to a multivariate normal with mean vector 0 as |x| tends to infinity. These same conditions imply that E(Θ | X = x) − θx* converges to the zero vector as |x| tends to infinity.The posterior for an observation X = (X1, X2, …, Xn is considered for a location vector Θ = (Θ1, Θ2, …, Θn) as x gets large along a path, γ, in Rn. Sufficient conditions are given for the distribution of γ(t) − Θ given X = γ(t) to converge in law as t → ∞. Slightly stronger conditions ensure that γ(t) − E(Θ | X = γ(t)) converges to the mean of the limiting distribution.These basic results about the posterior mean are extended to cover other estimators. Loss functions which are convex functions of absolute error are considered. Let δ be a Bayes estimator for a loss function of this type. Generally, if the distribution of Θ − E(Θ | X = γ(t)) given X = γ(t) converges in law to a symmetric distribution as t → ∞, it is shown that δ(γ(t)) − E(Θ | X = γ(t)) → 0 as t → ∞.  相似文献   

10.
Exact comparisons are made relating E|Y0|p, E|Yn−1|p, and E(maxjn−1 |Yj|p), valid for all martingales Y0,…,Yn−1, for each p ≥ 1. Specifically, for p > 1, the set of ordered triples {(x, y, z) : X = E|Y0|p, Y = E |Yn−1|p, and Z = E(maxjn−1 |Yj|p) for some martingale Y0,…,Yn−1} is precisely the set {(x, y, z) : 0≤xyz≤Ψn,p(x, y)}, where Ψn,p(x, y) = xψn,p(y/x) if x > 0, and = an−1,py if x = 0; here ψn,p is a specific recursively defined function. The result yields families of sharp inequalities, such as E(maxjn−1 |Yj|p) + ψn,p*(a) E |Y0|paE |Yn−1|p, valid for all martingales Y0,…,Yn−1, where ψn,p* is the concave conjugate function of ψn,p. Both the finite sequence and infinite sequence cases are developed. Proofs utilize moment theory, induction, conjugate function theory, and functional equation analysis.  相似文献   

11.
Starting from a real-valued Markov chain X0,X1,…,Xn with stationary transition probabilities, a random element {Y(t);t[0, 1]} of the function space D[0, 1] is constructed by letting Y(k/n)=Xk, k= 0,1,…,n, and assuming Y (t) constant in between. Sample tightness criteria for sequences {Y(t);t[0,1]};n of such random elements in D[0, 1] are then given in terms of the one-step transition probabilities of the underlying Markov chains. Applications are made to Galton-Watson branching processes.  相似文献   

12.
Let X,X1,…,Xm,…, Y,Y1,…,Yn,… be independent d-dimensional random vectors, where the Xj are i.i.d. copies of X, and the Yk are i.i.d. copies of Y. We study a class of consistent tests for the hypothesis that Y has the same distribution as X+μ for some unspecified . The test statistic L is a weighted integral of the squared modulus of the difference of the empirical characteristic functions of and Y1,…,Yn, where is an estimator of μ. An alternative representation of L is given in terms of an L2-distance between two nonparametric density estimators. The finite-sample and asymptotic null distribution of L is independent of μ. Carried out as a bootstrap or permutation procedure, the test is asymptotically of a given size, irrespective of the unknown underlying distribution. A large-scale simulation study shows that the permutation procedure performs better than the bootstrap.  相似文献   

13.
Treated in this paper is the problem of estimating with squared error loss the generalized variance | Σ | from a Wishart random matrix S: p × p Wp(n, Σ) and an independent normal random matrix X: p × k N(ξ, Σ Ik) with ξ(p × k) unknown. Denote the columns of X by X(1) ,…, X(k) and set ψ(0)(S, X) = {(np + 2)!/(n + 2)!} | S |, ψ(i)(X, X) = min[ψ(i−1)(S, X), {(np + i + 2)!/(n + i + 2)!} | S + X(1) X(1) + + X(i) X(i) |] and Ψ(i)(S, X) = min[ψ(0)(S, X), {(np + i + 2)!/(n + i + 2)!}| S + X(1) X(1) + + X(i) X(i) |], i = 1,…,k. Our result is that the minimax, best affine equivariant estimator ψ(0)(S, X) is dominated by each of Ψ(i)(S, X), i = 1,…,k and for every i, ψ(i)(S, X) is better than ψ(i−1)(S, X). In particular, ψ(k)(S, X) = min[{(np + 2)!/(n + 2)!} | S |, {(np + 2)!/(n + 2)!} | S + X(1)X(1)|,…,| {(np + k + 2)!/(n + k + 2)!} | S + X(1)X(1) + + X(k)X(k)|] dominates all other ψ's. It is obtained by considering a multivariate extension of Stein's result (Ann. Inst. Statist. Math. 16, 155–160 (1964)) on the estimation of the normal variance.  相似文献   

14.
Let (X, Y) be a pair of random variables such that X = (X1,…, Xd) ranges over a nondegenerate compact d-dimensional interval C and Y is real-valued. Let the conditional distribution of Y given X have mean θ(X) and satisfy an appropriate moment condition. It is assumed that the distribution of X is absolutely continuous and its density is bounded away from zero and infinity on C. Without loss of generality let C be the unit cube. Consider an estimator of θ having the form of a piecewise polynomial of degree kn based on mnd cubes of length 1/mn, where the mnd(dkn+d) coefficients are chosen by the method of least squares based on a random sample of size n from the distribution of (X, Y). Let (kn, mn) be chosen by the FPE procedure. It is shown that the indicated estimator has an asymptotically minimal squared error of prediction if θ is not of the form of piecewise polynomial.  相似文献   

15.
The basic result of the paper states: Let F1, …, Fn, F1,…, Fn have proportional hazard functions with λ1 ,…, λn , λ1 ,…, λn as the constants of proportionality. Let X(1) ≤ … ≤ X(n) (X(1) ≤ … ≤ X(n)) be the order statistics in a sample of size n from the heterogeneous populations {F1 ,…, Fn}({F1 ,…, Fn}). Then (λ1 ,…, λn) majorizes (λ1 ,…, λn) implies that (X(1) ,…, X(n)) is stochastically larger than (X(1) ,…, X(n)). Earlier results stochastically comparing individual order statistics are shown to be special cases. Applications of the main result are made in the study of the robustness of standard estimates of the failure rate of the exponential distribution, when observations actually come from a set of heterogeneous exponential distributions. Further applications are made to the comparisons of linear combinations of Weibull random variables and of binomial random variables.  相似文献   

16.
For Xi, …, Xn a random sample and K(·, ·) a symmetric kernel this paper considers large sample properties of location estimator satisfying , . Asymptotic normality of is obtained and two forms of interval estimators for parameter θ satisfying EK(X1 − θ, X2 − θ) = 0, are discussed. Consistent estimation of the variance parameters is obtained which permits the construction of asymptotically distribution free procedures. The p-variate and multigroup extension is accomplished to provide generalized one-way MANOVA. Monte Carlo results are included.  相似文献   

17.
Let X1, X2,… be idd random vectors with a multivariate normal distribution N(μ, Σ). A sequence of subsets {Rn(a1, a2,…, an), nm} of the space of μ is said to be a (1 − α)-level sequence of confidence sets for μ if PRn(X1, X2,…, Xn) for every nm) ≥ 1 − α. In this note we use the ideas of Robbins Ann. Math. Statist. 41 (1970) to construct confidence sequences for the mean vector μ when Σ is either known or unknown. The constructed sequence Rn(X1, X2, …, Xn) depends on Mahalanobis' or Hotelling's according as Σ is known or unknown. Confidence sequences for the vector-valued parameter in the general linear model are also given.  相似文献   

18.
Suppose that {Xi; I = 1, 2, …,} is a sequence of p-dimensional random vectors forming a stochastic process. Let pn, θ(Xn), Xn np, be the probability density function of Xn = (X1, …, Xn) depending on θ Θ, where Θ is an open set of 1. We consider to test a simple hypothesis H : θ = θ0 against the alternative A : θ ≠ θ0. For this testing problem we introduce a class of tests , which contains the likelihood ratio, Wald, modified Wald, and Rao tests as special cases. Then we derive the third-order asymptotic expansion of the distribution of T under a sequence of local alternatives. Using this result we elucidate various third-order asymptotic properties of T (e.g., Bartlett's adjustments, third-order asymptotically most powerful properties). Our results are very general, and can be applied to the i.i.d. case, multivariate analysis, and time series analysis. Two concrete examples will be given. One is a Gaussian ARMA process (dependent case), and the other is a nonlinear regression model (non-identically distributed case).  相似文献   

19.
Let Y1,…, Yn be independent identically distributed random variables with distribution function F(x, θ), θ = (θ′1, θ′2), where θi (i = 1, 2) is a vector of pi components, p = p1 + p2 and for θI, an open interval in p, F(x, θ) is continuous. In the present paper the author shows that the asymptotic distribution of modified Cramér-Smirnov statistic under Hn: θ1 = θ10 + n−1/2γ, θ2 unspecified, where γ is a given vector independent of n, is the distribution of a sum of weighted noncentral χ12 variables whose weights are eigenvalues of a covariance function of a Gaussian process and noncentrality parameters are Fourier coefficients of the mean function of the Gaussian process. Further, the author exploits the special form of the covariance function by using perturbation theory to obtain the noncentrality parameters and the weights. The technique is applicable to other goodness-of-fit statistics such as U2 [G. S. Watson, Biometrika 48 (1961), 109–114].  相似文献   

20.
Some new results are obtained on stochastic orderings between random vectors of spacings from heterogeneous exponential distributions and homogeneous ones. LetD1, …, Dnbe the normalized spacings associated with independent exponential random variablesX1, …,Xn, whereXihas hazard rateλi,i=1, 2, …, n. LetD*1, …, D*nbe the normalized spacings of a random sampleY1, …, Ynof sizenfrom an exponential distribution with hazard rateλ=∑ni=1 λi/n. It is shown that for anyn2, the random vector (D1, …, Dn) is greater than the random vector (D*1, …, D*n) in the sense of multivariate likelihood ratio ordering. It also follows from the results proved in this paper that for anyjbetween 2 andn, the survival function ofXj:nX1:nis Schur convex.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号