fbpx
Wikipedia

Well-behaved statistic

Although the term well-behaved statistic often seems to be used in the scientific literature in somewhat the same way as is well-behaved in mathematics (that is, to mean "non-pathological"[1][2]) it can also be assigned precise mathematical meaning, and in more than one way. In the former case, the meaning of this term will vary from context to context. In the latter case, the mathematical conditions can be used to derive classes of combinations of distributions with statistics which are well-behaved in each sense.

First Definition: The variance of a well-behaved statistical estimator is finite and one condition on its mean is that it is differentiable in the parameter being estimated.[3]

Second Definition: The statistic is monotonic, well-defined, and locally sufficient.[4]

Conditions for a Well-Behaved Statistic: First Definition edit

More formally the conditions can be expressed in this way.   is a statistic for   that is a function of the sample,  . For   to be well-behaved we require:

 : Condition 1

  differentiable in  , and the derivative satisfies:

 : Condition 2

Conditions for a Well-Behaved Statistic: Second Definition edit

In order to derive the distribution law of the parameter T, compatible with  , the statistic must obey some technical properties. Namely, a statistic s is said to be well-behaved if it satisfies the following three statements:

  1. monotonicity. A uniformly monotone relation exists between s and ? for any fixed seed   – so as to have a unique solution of (1);
  2. well-defined. On each observed s the statistic is well defined for every value of ?, i.e. any sample specification   such that   has a probability density different from 0 – so as to avoid considering a non-surjective mapping from   to  , i.e. associating via   to a sample   a ? that could not generate the sample itself;
  3. local sufficiency.   constitutes a true T sample for the observed s, so that the same probability distribution can be attributed to each sampled value. Now,   is a solution of (1) with the seed  . Since the seeds are equally distributed, the sole caveat comes from their independence or, conversely from their dependence on ? itself. This check can be restricted to seeds involved by s, i.e. this drawback can be avoided by requiring that the distribution of   is independent of ?. An easy way to check this property is by mapping seed specifications into  s specifications. The mapping of course depends on ?, but the distribution of   will not depend on ?, if the above seed independence holds – a condition that looks like a local sufficiency of the statistic S.

The remainder of the present article is mainly concerned with the context of data mining procedures applied to statistical inference and, in particular, to the group of computationally intensive procedure that have been called algorithmic inference.

Algorithmic inference edit

In algorithmic inference, the property of a statistic that is of most relevance is the pivoting step which allows to transference of probability-considerations from the sample distribution to the distribution of the parameters representing the population distribution in such a way that the conclusion of this statistical inference step is compatible with the sample actually observed.

By default, capital letters (such as U, X) will denote random variables and small letters (u, x) their corresponding realizations and with gothic letters (such as  ) the domain where the variable takes specifications. Facing a sample  , given a sampling mechanism  , with   scalar, for the random variable X, we have

 

The sampling mechanism  , of the statistic s, as a function ? of   with specifications in   , has an explaining function defined by the master equation:

 

for suitable seeds   and parameter ?

Example edit

For instance, for both the Bernoulli distribution with parameter p and the exponential distribution with parameter ? the statistic   is well-behaved. The satisfaction of the above three properties is straightforward when looking at both explaining functions:   if  , 0 otherwise in the case of the Bernoulli random variable, and   for the Exponential random variable, giving rise to statistics

 

and

 

Vice versa, in the case of X following a continuous uniform distribution on   the same statistics do not meet the second requirement. For instance, the observed sample   gives  . But the explaining function of this X is  . Hence a master equation   would produce with a U sample   and a solution  . This conflicts with the observed sample since the first observed value should result greater than the right extreme of the X range. The statistic   is well-behaved in this case.

Analogously, for a random variable X following the Pareto distribution with parameters K and A (see Pareto example for more detail of this case),

 

and

 

can be used as joint statistics for these parameters.

As a general statement that holds under weak conditions, sufficient statistics are well-behaved with respect to the related parameters. The table below gives sufficient / Well-behaved statistics for the parameters of some of the most commonly used probability distributions.

Common distribution laws together with related sufficient and well-behaved statistics.
Distribution Definition of density function Sufficient/Well-behaved statistic
Uniform discrete    
Bernoulli    
Binomial    
Geometric    
Poisson    
Uniform continuous    
Negative exponential    
Pareto    
Gaussian    
Gamma    

References edit

  1. ^ Dawn Iacobucci. "Mediation analysis and categorical variables: The final frontier" (PDF). Retrieved 7 February 2017.
  2. ^ John DiNardo; Jason Winfree. "The Law of Genius and Home Runs Refuted" (PDF). Retrieved 7 February 2017.
  3. ^ A DasGupta. "(no title)" (PDF). Retrieved 7 February 2017. {{cite web}}: Cite uses generic title (help)
  4. ^ Apolloni, B; Bassis, S.; Malchiodi, D.; Witold, P. (2008). The Puzzle of Granular Computing. Studies in Computational Intelligence. Vol. 138. Berlin: Springer.
  • Bahadur, R. R.; Lehmann, E. L. (1955). "Two comments on Sufficiency and Statistical Decision Functions". Annals of Mathematical Statistics. 26: 139–142. doi:10.1214/aoms/1177728604.

well, behaved, statistic, this, article, written, like, personal, reflection, personal, essay, argumentative, essay, that, states, wikipedia, editor, personal, feelings, presents, original, argument, about, topic, please, help, improve, rewriting, encyclopedic. This article is written like a personal reflection personal essay or argumentative essay that states a Wikipedia editor s personal feelings or presents an original argument about a topic Please help improve it by rewriting it in an encyclopedic style September 2009 Learn how and when to remove this template message This article includes a list of general references but it lacks sufficient corresponding inline citations Please help to improve this article by introducing more precise citations November 2010 Learn how and when to remove this template message Although the term well behaved statistic often seems to be used in the scientific literature in somewhat the same way as is well behaved in mathematics that is to mean non pathological 1 2 it can also be assigned precise mathematical meaning and in more than one way In the former case the meaning of this term will vary from context to context In the latter case the mathematical conditions can be used to derive classes of combinations of distributions with statistics which are well behaved in each sense First Definition The variance of a well behaved statistical estimator is finite and one condition on its mean is that it is differentiable in the parameter being estimated 3 Second Definition The statistic is monotonic well defined and locally sufficient 4 Contents 1 Conditions for a Well Behaved Statistic First Definition 2 Conditions for a Well Behaved Statistic Second Definition 3 Algorithmic inference 3 1 Example 4 ReferencesConditions for a Well Behaved Statistic First Definition editMore formally the conditions can be expressed in this way T textstyle T nbsp is a statistic for 8 textstyle theta nbsp that is a function of the sample X 1 X n textstyle X 1 X n nbsp For T textstyle T nbsp to be well behaved we require V a r 8 T X 1 X n lt 8 8 textstyle Var theta left T left X 1 X n right right lt infty quad forall quad theta in Theta nbsp Condition 1E 8 T textstyle E theta left T right nbsp differentiable in 8 8 8 textstyle theta quad forall quad theta in Theta nbsp and the derivative satisfies d d 8 T X 1 X n i 1 n f x i 8 d x 1 d x n T X 1 X n 8 i 1 n f x i 8 d x 1 d x n textstyle frac d d theta int T left X 1 X n right prod i 1 n f left x i theta right d x 1 d x n int T left X 1 X n right left frac partial partial theta prod i 1 n f left x i theta right right d x 1 d x n nbsp Condition 2Conditions for a Well Behaved Statistic Second Definition editIn order to derive the distribution law of the parameter T compatible with x displaystyle boldsymbol x nbsp the statistic must obey some technical properties Namely a statistic s is said to be well behaved if it satisfies the following three statements monotonicity A uniformly monotone relation exists between s and for any fixed seed z 1 z m displaystyle z 1 ldots z m nbsp so as to have a unique solution of 1 well defined On each observed s the statistic is well defined for every value of i e any sample specification x 1 x m X m displaystyle x 1 ldots x m in mathfrak X m nbsp such that r x 1 x m s displaystyle rho x 1 ldots x m s nbsp has a probability density different from 0 so as to avoid considering a non surjective mapping from X m displaystyle mathfrak X m nbsp to S displaystyle mathfrak S nbsp i e associating via s displaystyle s nbsp to a sample x 1 x m displaystyle x 1 ldots x m nbsp a that could not generate the sample itself local sufficiency 8 1 8 N displaystyle breve theta 1 ldots breve theta N nbsp constitutes a true T sample for the observed s so that the same probability distribution can be attributed to each sampled value Now 8 j h 1 s z 1 j z m j displaystyle breve theta j h 1 s breve z 1 j ldots breve z m j nbsp is a solution of 1 with the seed z 1 j z m j displaystyle breve z 1 j ldots breve z m j nbsp Since the seeds are equally distributed the sole caveat comes from their independence or conversely from their dependence on itself This check can be restricted to seeds involved by s i e this drawback can be avoided by requiring that the distribution of Z 1 Z m S s displaystyle Z 1 ldots Z m S s nbsp is independent of An easy way to check this property is by mapping seed specifications into x i displaystyle x i nbsp s specifications The mapping of course depends on but the distribution of X 1 X m S s displaystyle X 1 ldots X m S s nbsp will not depend on if the above seed independence holds a condition that looks like a local sufficiency of the statistic S The remainder of the present article is mainly concerned with the context of data mining procedures applied to statistical inference and in particular to the group of computationally intensive procedure that have been called algorithmic inference Algorithmic inference editMain article Algorithmic inference In algorithmic inference the property of a statistic that is of most relevance is the pivoting step which allows to transference of probability considerations from the sample distribution to the distribution of the parameters representing the population distribution in such a way that the conclusion of this statistical inference step is compatible with the sample actually observed By default capital letters such as U X will denote random variables and small letters u x their corresponding realizations and with gothic letters such as U X displaystyle mathfrak U mathfrak X nbsp the domain where the variable takes specifications Facing a sample x x 1 x m displaystyle boldsymbol x x 1 ldots x m nbsp given a sampling mechanism g 8 Z displaystyle g theta Z nbsp with 8 displaystyle theta nbsp scalar for the random variable X we have x g 8 z 1 g 8 z m displaystyle boldsymbol x g theta z 1 ldots g theta z m nbsp The sampling mechanism g 8 z displaystyle g theta boldsymbol z nbsp of the statistic s as a function of x 1 x m displaystyle x 1 ldots x m nbsp with specifications in S displaystyle mathfrak S nbsp has an explaining function defined by the master equation s r x 1 x m r g 8 z 1 g 8 z m h 8 z 1 z m 1 displaystyle s rho x 1 ldots x m rho g theta z 1 ldots g theta z m h theta z 1 ldots z m qquad qquad qquad 1 nbsp for suitable seeds z z 1 z m displaystyle boldsymbol z z 1 ldots z m nbsp and parameter Example edit For instance for both the Bernoulli distribution with parameter p and the exponential distribution with parameter the statistic i 1 m x i displaystyle sum i 1 m x i nbsp is well behaved The satisfaction of the above three properties is straightforward when looking at both explaining functions g p u 1 displaystyle g p u 1 nbsp if u p displaystyle u leq p nbsp 0 otherwise in the case of the Bernoulli random variable and g l u log u l displaystyle g lambda u log u lambda nbsp for the Exponential random variable giving rise to statistics s p i 1 m I 0 p u i displaystyle s p sum i 1 m I 0 p u i nbsp and s l 1 l i 1 m log u i displaystyle s lambda frac 1 lambda sum i 1 m log u i nbsp Vice versa in the case of X following a continuous uniform distribution on 0 A displaystyle 0 A nbsp the same statistics do not meet the second requirement For instance the observed sample c c 2 c 3 displaystyle c c 2 c 3 nbsp gives s A 11 6 c displaystyle s A 11 6c nbsp But the explaining function of this X is g a u u a displaystyle g a u ua nbsp Hence a master equation s A i 1 m u i a displaystyle s A sum i 1 m u i a nbsp would produce with a U sample 0 8 0 8 0 8 displaystyle 0 8 0 8 0 8 nbsp and a solution a 0 76 c displaystyle breve a 0 76c nbsp This conflicts with the observed sample since the first observed value should result greater than the right extreme of the X range The statistic s A max x 1 x m displaystyle s A max x 1 ldots x m nbsp is well behaved in this case Analogously for a random variable X following the Pareto distribution with parameters K and A see Pareto example for more detail of this case s 1 i 1 m log x i displaystyle s 1 sum i 1 m log x i nbsp and s 2 min i 1 m x i displaystyle s 2 min i 1 ldots m x i nbsp can be used as joint statistics for these parameters As a general statement that holds under weak conditions sufficient statistics are well behaved with respect to the related parameters The table below gives sufficient Well behaved statistics for the parameters of some of the most commonly used probability distributions Common distribution laws together with related sufficient and well behaved statistics Distribution Definition of density function Sufficient Well behaved statisticUniform discrete f x n 1 n I 1 2 n x displaystyle f x n 1 nI 1 2 ldots n x nbsp s n max i x i displaystyle s n max i x i nbsp Bernoulli f x p p x 1 p 1 x I 0 1 x displaystyle f x p p x 1 p 1 x I 0 1 x nbsp s P i 1 m x i displaystyle s P sum i 1 m x i nbsp Binomial f x n p n x p x 1 p n x I 0 1 n x displaystyle f x n p binom n x p x 1 p n x I 0 1 ldots n x nbsp s P i 1 m x i displaystyle s P sum i 1 m x i nbsp Geometric f x p p 1 p x I 0 1 x displaystyle f x p p 1 p x I 0 1 ldots x nbsp s P i 1 m x i displaystyle s P sum i 1 m x i nbsp Poisson f x m e m x m x x I 0 1 x displaystyle f x mu mathrm e mu x mu x x I 0 1 ldots x nbsp s M i 1 m x i displaystyle s M sum i 1 m x i nbsp Uniform continuous f x a b 1 b a I a b x displaystyle f x a b 1 b a I a b x nbsp s A min i x i s B max i x i displaystyle s A min i x i s B max i x i nbsp Negative exponential f x l l e l x I 0 x displaystyle f x lambda lambda mathrm e lambda x I 0 infty x nbsp s L i 1 m x i displaystyle s Lambda sum i 1 m x i nbsp Pareto f x a k a k x k a 1 I k x displaystyle f x a k frac a k left frac x k right a 1 I k infty x nbsp s A i 1 m log x i s K min i x i displaystyle s A sum i 1 m log x i s K min i x i nbsp Gaussian f x m s 1 2 p s e x m 2 2 s 2 displaystyle f x mu sigma 1 sqrt 2 pi sigma mathrm e x mu 2 2 sigma 2 nbsp s M i 1 m x i s S i 1 m x i x 2 displaystyle s M sum i 1 m x i s Sigma sqrt sum i 1 m x i bar x 2 nbsp Gamma f x r l l G r l x r 1 e l x I 0 x displaystyle f x r lambda lambda Gamma r lambda x r 1 mathrm e lambda x I 0 infty x nbsp s L i 1 m x i s K i 1 m x i displaystyle s Lambda sum i 1 m x i s K prod i 1 m x i nbsp References edit Dawn Iacobucci Mediation analysis and categorical variables The final frontier PDF Retrieved 7 February 2017 John DiNardo Jason Winfree The Law of Genius and Home Runs Refuted PDF Retrieved 7 February 2017 A DasGupta no title PDF Retrieved 7 February 2017 a href Template Cite web html title Template Cite web cite web a Cite uses generic title help Apolloni B Bassis S Malchiodi D Witold P 2008 The Puzzle of Granular Computing Studies in Computational Intelligence Vol 138 Berlin Springer Bahadur R R Lehmann E L 1955 Two comments on Sufficiency and Statistical Decision Functions Annals of Mathematical Statistics 26 139 142 doi 10 1214 aoms 1177728604 Retrieved from https en wikipedia org w index php title Well behaved statistic amp oldid 1202548637, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.