fbpx
Wikipedia

Empirical process

In probability theory, an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function from its expectation. In mean field theory, limit theorems (as the number of objects becomes large) are considered and generalise the central limit theorem for empirical measures. Applications of the theory of empirical processes arise in non-parametric statistics.[1]

Definition edit

For X1, X2, ... Xn independent and identically-distributed random variables in R with common cumulative distribution function F(x), the empirical distribution function is defined by

 

where IC is the indicator function of the set C.

For every (fixed) x, Fn(x) is a sequence of random variables which converge to F(x) almost surely by the strong law of large numbers. That is, Fn converges to F pointwise. Glivenko and Cantelli strengthened this result by proving uniform convergence of Fn to F by the Glivenko–Cantelli theorem.[2]

A centered and scaled version of the empirical measure is the signed measure

 

It induces a map on measurable functions f given by

 

By the central limit theorem,   converges in distribution to a normal random variable N(0, P(A)(1 − P(A))) for fixed measurable set A. Similarly, for a fixed function f,   converges in distribution to a normal random variable  , provided that   and   exist.

Definition

  is called an empirical process indexed by  , a collection of measurable subsets of S.
  is called an empirical process indexed by  , a collection of measurable functions from S to  .

A significant result in the area of empirical processes is Donsker's theorem. It has led to a study of Donsker classes: sets of functions with the useful property that empirical processes indexed by these classes converge weakly to a certain Gaussian process. While it can be shown that Donsker classes are Glivenko–Cantelli classes, the converse is not true in general.

Example edit

As an example, consider empirical distribution functions. For real-valued iid random variables X1, X2, ..., Xn they are given by

 

In this case, empirical processes are indexed by a class   It has been shown that   is a Donsker class, in particular,

  converges weakly in   to a Brownian bridge B(F(x)) .

See also edit

References edit

  1. ^ Mojirsheibani, M. (2007). "Nonparametric curve estimation with missing data: A general empirical process approach". Journal of Statistical Planning and Inference. 137 (9): 2733–2758. doi:10.1016/j.jspi.2006.02.016.
  2. ^ Wolfowitz, J. (1954). "Generalization of the Theorem of Glivenko-Cantelli". The Annals of Mathematical Statistics. 25: 131–138. doi:10.1214/aoms/1177728852.

Further reading edit

  • Billingsley, P. (1995). Probability and Measure (Third ed.). New York: John Wiley and Sons. ISBN 0471007102.
  • Donsker, M. D. (1952). "Justification and Extension of Doob's Heuristic Approach to the Kolmogorov- Smirnov Theorems". The Annals of Mathematical Statistics. 23 (2): 277–281. doi:10.1214/aoms/1177729445.
  • Dudley, R. M. (1978). "Central Limit Theorems for Empirical Measures". The Annals of Probability. 6 (6): 899–929. doi:10.1214/aop/1176995384.
  • Dudley, R. M. (1999). Uniform Central Limit Theorems. Cambridge Studies in Advanced Mathematics. Vol. 63. Cambridge, UK: Cambridge University Press.
  • Kosorok, M. R. (2008). Introduction to Empirical Processes and Semiparametric Inference. Springer Series in Statistics. doi:10.1007/978-0-387-74978-5. ISBN 978-0-387-74977-8.
  • Shorack, G. R.; Wellner, J. A. (2009). Empirical Processes with Applications to Statistics. doi:10.1137/1.9780898719017. ISBN 978-0-89871-684-9.
  • van der Vaart, Aad W.; Wellner, Jon A. (2000). Weak Convergence and Empirical Processes: With Applications to Statistics (2nd ed.). Springer. ISBN 978-0-387-94640-5.
  • Dzhaparidze, K. O.; Nikulin, M. S. (1982). "Probability distributions of the Kolmogorov and omega-square statistics for continuous distributions with shift and scale parameters". Journal of Soviet Mathematics. 20 (3): 2147. doi:10.1007/BF01239992. S2CID 123206522.

External links edit

  • Empirical Processes: Theory and Applications, by David Pollard, a textbook available online.
  • Introduction to Empirical Processes and Semiparametric Inference, by Michael Kosorok, another textbook available online.

empirical, process, process, control, topic, process, control, control, model, probability, theory, empirical, process, stochastic, process, that, characterizes, deviation, empirical, distribution, function, from, expectation, mean, field, theory, limit, theor. For the process control topic see Process control Control model In probability theory an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function from its expectation In mean field theory limit theorems as the number of objects becomes large are considered and generalise the central limit theorem for empirical measures Applications of the theory of empirical processes arise in non parametric statistics 1 Contents 1 Definition 2 Example 3 See also 4 References 5 Further reading 6 External linksDefinition editFor X1 X2 Xn independent and identically distributed random variables in R with common cumulative distribution function F x the empirical distribution function is defined by F n x 1 n i 1 n I x X i displaystyle F n x frac 1 n sum i 1 n I infty x X i nbsp where IC is the indicator function of the set C For every fixed x Fn x is a sequence of random variables which converge to F x almost surely by the strong law of large numbers That is Fn converges to F pointwise Glivenko and Cantelli strengthened this result by proving uniform convergence of Fn to F by the Glivenko Cantelli theorem 2 A centered and scaled version of the empirical measure is the signed measure G n A n P n A P A displaystyle G n A sqrt n P n A P A nbsp It induces a map on measurable functions f given by f G n f n P n P f n 1 n i 1 n f X i E f displaystyle f mapsto G n f sqrt n P n P f sqrt n left frac 1 n sum i 1 n f X i mathbb E f right nbsp By the central limit theorem G n A displaystyle G n A nbsp converges in distribution to a normal random variable N 0 P A 1 P A for fixed measurable set A Similarly for a fixed function f G n f displaystyle G n f nbsp converges in distribution to a normal random variable N 0 E f E f 2 displaystyle N 0 mathbb E f mathbb E f 2 nbsp provided that E f displaystyle mathbb E f nbsp and E f 2 displaystyle mathbb E f 2 nbsp exist Definition G n c c C displaystyle bigl G n c bigr c in mathcal C nbsp is called an empirical process indexed by C displaystyle mathcal C nbsp a collection of measurable subsets of S G n f f F displaystyle bigl G n f bigr f in mathcal F nbsp is called an empirical process indexed by F displaystyle mathcal F nbsp a collection of measurable functions from S to R displaystyle mathbb R nbsp A significant result in the area of empirical processes is Donsker s theorem It has led to a study of Donsker classes sets of functions with the useful property that empirical processes indexed by these classes converge weakly to a certain Gaussian process While it can be shown that Donsker classes are Glivenko Cantelli classes the converse is not true in general Example editAs an example consider empirical distribution functions For real valued iid random variables X1 X2 Xn they are given by F n x P n x P n I x displaystyle F n x P n infty x P n I infty x nbsp In this case empirical processes are indexed by a class C x x R displaystyle mathcal C infty x x in mathbb R nbsp It has been shown that C displaystyle mathcal C nbsp is a Donsker class in particular n F n x F x displaystyle sqrt n F n x F x nbsp converges weakly in ℓ R displaystyle ell infty mathbb R nbsp to a Brownian bridge B F x See also editKhmaladze transformation Weak convergence of measures Glivenko Cantelli theoremReferences edit Mojirsheibani M 2007 Nonparametric curve estimation with missing data A general empirical process approach Journal of Statistical Planning and Inference 137 9 2733 2758 doi 10 1016 j jspi 2006 02 016 Wolfowitz J 1954 Generalization of the Theorem of Glivenko Cantelli The Annals of Mathematical Statistics 25 131 138 doi 10 1214 aoms 1177728852 Further reading editBillingsley P 1995 Probability and Measure Third ed New York John Wiley and Sons ISBN 0471007102 Donsker M D 1952 Justification and Extension of Doob s Heuristic Approach to the Kolmogorov Smirnov Theorems The Annals of Mathematical Statistics 23 2 277 281 doi 10 1214 aoms 1177729445 Dudley R M 1978 Central Limit Theorems for Empirical Measures The Annals of Probability 6 6 899 929 doi 10 1214 aop 1176995384 Dudley R M 1999 Uniform Central Limit Theorems Cambridge Studies in Advanced Mathematics Vol 63 Cambridge UK Cambridge University Press Kosorok M R 2008 Introduction to Empirical Processes and Semiparametric Inference Springer Series in Statistics doi 10 1007 978 0 387 74978 5 ISBN 978 0 387 74977 8 Shorack G R Wellner J A 2009 Empirical Processes with Applications to Statistics doi 10 1137 1 9780898719017 ISBN 978 0 89871 684 9 van der Vaart Aad W Wellner Jon A 2000 Weak Convergence and Empirical Processes With Applications to Statistics 2nd ed Springer ISBN 978 0 387 94640 5 Dzhaparidze K O Nikulin M S 1982 Probability distributions of the Kolmogorov and omega square statistics for continuous distributions with shift and scale parameters Journal of Soviet Mathematics 20 3 2147 doi 10 1007 BF01239992 S2CID 123206522 External links editEmpirical Processes Theory and Applications by David Pollard a textbook available online Introduction to Empirical Processes and Semiparametric Inference by Michael Kosorok another textbook available online Retrieved from https en wikipedia org w index php title Empirical process amp oldid 1208525194, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.