fbpx
Wikipedia

Mean absolute difference

The mean absolute difference (univariate) is a measure of statistical dispersion equal to the average absolute difference of two independent values drawn from a probability distribution. A related statistic is the relative mean absolute difference, which is the mean absolute difference divided by the arithmetic mean, and equal to twice the Gini coefficient. The mean absolute difference is also known as the absolute mean difference (not to be confused with the absolute value of the mean signed difference) and the Gini mean difference (GMD).[1] The mean absolute difference is sometimes denoted by Δ or as MD.

Definition edit

The mean absolute difference is defined as the "average" or "mean", formally the expected value, of the absolute difference of two random variables X and Y independently and identically distributed with the same (unknown) distribution henceforth called Q.

 

Calculation edit

Specifically, in the discrete case,

  • For a random sample of size n of a population distributed uniformly according to Q, by the law of total expectation the (empirical) mean absolute difference of the sequence of sample values yi, i = 1 to n can be calculated as the arithmetic mean of the absolute value of all possible differences:
 
 

In the continuous case,

 

An alternative form of the equation is given by:

 
 

Relative mean absolute difference edit

When the probability distribution has a finite and nonzero arithmetic mean AM, the relative mean absolute difference, sometimes denoted by Δ or RMD, is defined by

 

The relative mean absolute difference quantifies the mean absolute difference in comparison to the size of the mean and is a dimensionless quantity. The relative mean absolute difference is equal to twice the Gini coefficient which is defined in terms of the Lorenz curve. This relationship gives complementary perspectives to both the relative mean absolute difference and the Gini coefficient, including alternative ways of calculating their values.

Properties edit

The mean absolute difference is invariant to translations and negation, and varies proportionally to positive scaling. That is to say, if X is a random variable and c is a constant:

  • MD(X + c) = MD(X),
  • MD(−X) = MD(X), and
  • MD(c X) = |c| MD(X).

The relative mean absolute difference is invariant to positive scaling, commutes with negation, and varies under translation in proportion to the ratio of the original and translated arithmetic means. That is to say, if X is a random variable and c is a constant:

  • RMD(X + c) = RMD(X) · mean(X)/(mean(X) + c) = RMD(X) / (1 + c / mean(X)) for c ≠ −mean(X),
  • RMD(−X) = −RMD(X), and
  • RMD(c X) = RMD(X) for c > 0.

If a random variable has a positive mean, then its relative mean absolute difference will always be greater than or equal to zero. If, additionally, the random variable can only take on values that are greater than or equal to zero, then its relative mean absolute difference will be less than 2.

Compared to standard deviation edit

The mean absolute difference is twice the L-scale (the second L-moment), while the standard deviation is the square root of the variance about the mean (the second conventional central moment). The differences between L-moments and conventional moments are first seen in comparing the mean absolute difference and the standard deviation (the first L-moment and first conventional moment are both the mean).

Both the standard deviation and the mean absolute difference measure dispersion—how spread out are the values of a population or the probabilities of a distribution. The mean absolute difference is not defined in terms of a specific measure of central tendency, whereas the standard deviation is defined in terms of the deviation from the arithmetic mean. Because the standard deviation squares its differences, it tends to give more weight to larger differences and less weight to smaller differences compared to the mean absolute difference. When the arithmetic mean is finite, the mean absolute difference will also be finite, even when the standard deviation is infinite. See the examples for some specific comparisons.

The recently introduced distance standard deviation plays similar role to the mean absolute difference but the distance standard deviation works with centered distances. See also E-statistics.

Sample estimators edit

For a random sample S from a random variable X, consisting of n values yi, the statistic

 

is a consistent and unbiased estimator of MD(X). The statistic:

 

is a consistent estimator of RMD(X), but is not, in general, unbiased.

Confidence intervals for RMD(X) can be calculated using bootstrap sampling techniques.

There does not exist, in general, an unbiased estimator for RMD(X), in part because of the difficulty of finding an unbiased estimation for multiplying by the inverse of the mean. For example, even where the sample is known to be taken from a random variable X(p) for an unknown p, and X(p) − 1 has the Bernoulli distribution, so that Pr(X(p) = 1) = 1 − p and Pr(X(p) = 2) = p, then

RMD(X(p)) = 2p(1 − p)/(1 + p).

But the expected value of any estimator R(S) of RMD(X(p)) will be of the form:[citation needed]

 

where the r i are constants. So E(R(S)) can never equal RMD(X(p)) for all p between 0 and 1.

Examples edit

Examples of mean absolute difference and relative mean absolute difference
Distribution Parameters Mean Standard deviation Mean absolute difference Relative mean absolute difference
Continuous uniform          
Normal  ;         undefined
Exponential          
Pareto  ;          
Gamma  ;          
Gamma  ;          
Gamma  ;          
Gamma  ;          
Gamma  ;          
Bernoulli          
Student's t, 2 d.f.         undefined
  is the Beta function

See also edit

References edit

  1. ^ Yitzhaki, Shlomo (2003). "Gini's Mean Difference: A Superior Measure of Variability for Non-Normal Distributions" (PDF). Metron International Journal of Statistics. Springer Verlag. 61 (2): 285–316.

Sources edit

  • Xu, Kuan (January 2004). "How Has the Literature on Gini's Index Evolved in the Past 80 Years?" (PDF). Department of Economics, Dalhousie University. Retrieved 2006-06-01. {{cite journal}}: Cite journal requires |journal= (help)
  • Gini, Corrado (1912). Variabilità e Mutabilità. Bologna: Tipografia di Paolo Cuppini. Bibcode:1912vamu.book.....G.
  • Gini, Corrado (1921). "Measurement of Inequality and Incomes". The Economic Journal. 31 (121): 124–126. doi:10.2307/2223319. JSTOR 2223319.
  • Chakravarty, S. R. (1990). Ethical Social Index Numbers. New York: Springer-Verlag.
  • Mills, Jeffrey A.; Zandvakili, Sourushe (1997). "Statistical Inference via Bootstrapping for Measures of Inequality". Journal of Applied Econometrics. 12 (2): 133–150. CiteSeerX 10.1.1.172.5003. doi:10.1002/(SICI)1099-1255(199703)12:2<133::AID-JAE433>3.0.CO;2-H.
  • Lomnicki, Z. A. (1952). "The Standard Error of Gini's Mean Difference". Annals of Mathematical Statistics. 23 (4): 635–637. doi:10.1214/aoms/1177729346.
  • Nair, U. S. (1936). "Standard Error of Gini's Mean Difference". Biometrika. 28 (3–4): 428–436. doi:10.1093/biomet/28.3-4.428.
  • Yitzhaki, Shlomo (2003). "Gini's Mean difference: a superior measure of variability for non-normal distributions" (PDF). Metron – International Journal of Statistics. 61: 285–316.

mean, absolute, difference, differences, with, respect, central, point, mean, absolute, deviation, paired, differences, mean, absolute, error, this, article, includes, list, general, references, lacks, sufficient, corresponding, inline, citations, please, help. For differences with respect to a central point see Mean absolute deviation For paired differences see Mean absolute error This article includes a list of general references but it lacks sufficient corresponding inline citations Please help to improve this article by introducing more precise citations November 2010 Learn how and when to remove this template message The mean absolute difference univariate is a measure of statistical dispersion equal to the average absolute difference of two independent values drawn from a probability distribution A related statistic is the relative mean absolute difference which is the mean absolute difference divided by the arithmetic mean and equal to twice the Gini coefficient The mean absolute difference is also known as the absolute mean difference not to be confused with the absolute value of the mean signed difference and the Gini mean difference GMD 1 The mean absolute difference is sometimes denoted by D or as MD Contents 1 Definition 2 Calculation 3 Relative mean absolute difference 4 Properties 5 Compared to standard deviation 6 Sample estimators 7 Examples 8 See also 9 References 10 SourcesDefinition editThe mean absolute difference is defined as the average or mean formally the expected value of the absolute difference of two random variables X and Y independently and identically distributed with the same unknown distribution henceforth called Q M D E X Y displaystyle mathrm MD E X Y nbsp Calculation editSpecifically in the discrete case For a random sample of size n of a population distributed uniformly according to Q by the law of total expectation the empirical mean absolute difference of the sequence of sample values yi i 1 to n can be calculated as the arithmetic mean of the absolute value of all possible differences M D E X Y E X E Y X X Y 1 n 2 i 1 n j 1 n x i y j displaystyle mathrm MD E X Y E X E Y X X Y frac 1 n 2 sum i 1 n sum j 1 n x i y j nbsp if Q has a discrete probability function f y where yi i 1 to n are the values with nonzero probabilities M D i 1 n j 1 n f y i f y j y i y j displaystyle mathrm MD sum i 1 n sum j 1 n f y i f y j y i y j nbsp dd In the continuous case if Q has a probability density function f x M D f x f y x y d x d y displaystyle mathrm MD int infty infty int infty infty f x f y x y dx dy nbsp dd An alternative form of the equation is given by M D 0 2 f x f x d d d x d d displaystyle mathrm MD int 0 infty int infty infty 2 f x f x delta delta dx d delta nbsp dd if Q has a cumulative distribution function F x with quantile function Q F then since f x dF x dx and Q F x x it follows that M D 0 1 0 1 Q F 1 Q F 2 d F 1 d F 2 displaystyle mathrm MD int 0 1 int 0 1 Q F 1 Q F 2 dF 1 dF 2 nbsp dd Relative mean absolute difference editWhen the probability distribution has a finite and nonzero arithmetic mean AM the relative mean absolute difference sometimes denoted by D or RMD is defined by R M D M D A M displaystyle mathrm RMD frac mathrm MD mathrm AM nbsp The relative mean absolute difference quantifies the mean absolute difference in comparison to the size of the mean and is a dimensionless quantity The relative mean absolute difference is equal to twice the Gini coefficient which is defined in terms of the Lorenz curve This relationship gives complementary perspectives to both the relative mean absolute difference and the Gini coefficient including alternative ways of calculating their values Properties editThe mean absolute difference is invariant to translations and negation and varies proportionally to positive scaling That is to say if X is a random variable and c is a constant MD X c MD X MD X MD X and MD c X c MD X The relative mean absolute difference is invariant to positive scaling commutes with negation and varies under translation in proportion to the ratio of the original and translated arithmetic means That is to say if X is a random variable and c is a constant RMD X c RMD X mean X mean X c RMD X 1 c mean X for c mean X RMD X RMD X and RMD c X RMD X for c gt 0 If a random variable has a positive mean then its relative mean absolute difference will always be greater than or equal to zero If additionally the random variable can only take on values that are greater than or equal to zero then its relative mean absolute difference will be less than 2 Compared to standard deviation editThe mean absolute difference is twice the L scale the second L moment while the standard deviation is the square root of the variance about the mean the second conventional central moment The differences between L moments and conventional moments are first seen in comparing the mean absolute difference and the standard deviation the first L moment and first conventional moment are both the mean Both the standard deviation and the mean absolute difference measure dispersion how spread out are the values of a population or the probabilities of a distribution The mean absolute difference is not defined in terms of a specific measure of central tendency whereas the standard deviation is defined in terms of the deviation from the arithmetic mean Because the standard deviation squares its differences it tends to give more weight to larger differences and less weight to smaller differences compared to the mean absolute difference When the arithmetic mean is finite the mean absolute difference will also be finite even when the standard deviation is infinite See the examples for some specific comparisons The recently introduced distance standard deviation plays similar role to the mean absolute difference but the distance standard deviation works with centered distances See also E statistics Sample estimators editFor a random sample S from a random variable X consisting of n values yi the statistic M D S i 1 n j 1 n y i y j n n 1 displaystyle mathrm MD S frac sum i 1 n sum j 1 n y i y j n n 1 nbsp is a consistent and unbiased estimator of MD X The statistic R M D S i 1 n j 1 n y i y j n 1 i 1 n y i displaystyle mathrm RMD S frac sum i 1 n sum j 1 n y i y j n 1 sum i 1 n y i nbsp is a consistent estimator of RMD X but is not in general unbiased Confidence intervals for RMD X can be calculated using bootstrap sampling techniques There does not exist in general an unbiased estimator for RMD X in part because of the difficulty of finding an unbiased estimation for multiplying by the inverse of the mean For example even where the sample is known to be taken from a random variable X p for an unknown p and X p 1 has the Bernoulli distribution so that Pr X p 1 1 p and Pr X p 2 p then RMD X p 2p 1 p 1 p But the expected value of any estimator R S of RMD X p will be of the form citation needed E R S i 0 n p i 1 p n i r i displaystyle operatorname E R S sum i 0 n p i 1 p n i r i nbsp where the r i are constants So E R S can never equal RMD X p for all p between 0 and 1 Examples editExamples of mean absolute difference and relative mean absolute difference Distribution Parameters Mean Standard deviation Mean absolute difference Relative mean absolute differenceContinuous uniform a 0 b 1 displaystyle a 0 b 1 nbsp 1 2 0 5 displaystyle 1 2 0 5 nbsp 1 12 0 2887 displaystyle frac 1 sqrt 12 approx 0 2887 nbsp 1 3 0 3333 displaystyle frac 1 3 approx 0 3333 nbsp 2 3 0 6667 displaystyle frac 2 3 approx 0 6667 nbsp Normal m 0 displaystyle mu 0 nbsp s 1 displaystyle sigma 1 nbsp 0 displaystyle 0 nbsp 1 displaystyle 1 nbsp 2 p 1 1284 displaystyle frac 2 sqrt pi approx 1 1284 nbsp undefinedExponential l 1 displaystyle lambda 1 nbsp 1 displaystyle 1 nbsp 1 displaystyle 1 nbsp 1 displaystyle 1 nbsp 1 displaystyle 1 nbsp Pareto k gt 1 displaystyle k gt 1 nbsp x m 1 displaystyle x m 1 nbsp k k 1 displaystyle frac k k 1 nbsp 1 k 1 k k 2 for k gt 2 displaystyle frac 1 k 1 sqrt frac k k 2 text for k gt 2 nbsp 2 k k 1 2 k 1 displaystyle frac 2k k 1 2k 1 nbsp 2 2 k 1 displaystyle frac 2 2k 1 nbsp Gamma k displaystyle k nbsp 8 displaystyle theta nbsp k 8 displaystyle k theta nbsp k 8 displaystyle sqrt k theta nbsp 2 8 B 0 5 k displaystyle frac 2 theta mathrm B 0 5 k nbsp 2 k B 0 5 k displaystyle frac 2 k mathrm B 0 5 k nbsp Gamma k 1 displaystyle k 1 nbsp 8 1 displaystyle theta 1 nbsp 1 displaystyle 1 nbsp 1 displaystyle 1 nbsp 1 displaystyle 1 nbsp 1 displaystyle 1 nbsp Gamma k 2 displaystyle k 2 nbsp 8 1 displaystyle theta 1 nbsp 2 displaystyle 2 nbsp 2 1 4142 displaystyle sqrt 2 approx 1 4142 nbsp 3 2 1 5 displaystyle 3 2 1 5 nbsp 3 4 0 75 displaystyle 3 4 0 75 nbsp Gamma k 3 displaystyle k 3 nbsp 8 1 displaystyle theta 1 nbsp 3 displaystyle 3 nbsp 3 1 7321 displaystyle sqrt 3 approx 1 7321 nbsp 15 8 1 875 displaystyle 15 8 1 875 nbsp 5 8 0 625 displaystyle 5 8 0 625 nbsp Gamma k 4 displaystyle k 4 nbsp 8 1 displaystyle theta 1 nbsp 4 displaystyle 4 nbsp 2 displaystyle 2 nbsp 35 16 2 1875 displaystyle 35 16 2 1875 nbsp 35 64 0 546875 displaystyle 35 64 0 546875 nbsp Bernoulli 0 p 1 displaystyle 0 leq p leq 1 nbsp p displaystyle p nbsp p 1 p displaystyle sqrt p 1 p nbsp 2 p 1 p displaystyle 2p 1 p nbsp 2 1 p for p gt 0 displaystyle 2 1 p text for p gt 0 nbsp Student s t 2 d f n 2 displaystyle nu 2 nbsp 0 displaystyle 0 nbsp displaystyle infty nbsp p 2 2 2214 displaystyle frac pi sqrt 2 approx 2 2214 nbsp undefined B x y displaystyle mathrm B x y nbsp is the Beta functionSee also editMean absolute error Mean deviation Estimator Coefficient of variation L momentReferences edit Yitzhaki Shlomo 2003 Gini s Mean Difference A Superior Measure of Variability for Non Normal Distributions PDF Metron International Journal of Statistics Springer Verlag 61 2 285 316 Sources editXu Kuan January 2004 How Has the Literature on Gini s Index Evolved in the Past 80 Years PDF Department of Economics Dalhousie University Retrieved 2006 06 01 a href Template Cite journal html title Template Cite journal cite journal a Cite journal requires journal help Gini Corrado 1912 Variabilita e Mutabilita Bologna Tipografia di Paolo Cuppini Bibcode 1912vamu book G Gini Corrado 1921 Measurement of Inequality and Incomes The Economic Journal 31 121 124 126 doi 10 2307 2223319 JSTOR 2223319 Chakravarty S R 1990 Ethical Social Index Numbers New York Springer Verlag Mills Jeffrey A Zandvakili Sourushe 1997 Statistical Inference via Bootstrapping for Measures of Inequality Journal of Applied Econometrics 12 2 133 150 CiteSeerX 10 1 1 172 5003 doi 10 1002 SICI 1099 1255 199703 12 2 lt 133 AID JAE433 gt 3 0 CO 2 H Lomnicki Z A 1952 The Standard Error of Gini s Mean Difference Annals of Mathematical Statistics 23 4 635 637 doi 10 1214 aoms 1177729346 Nair U S 1936 Standard Error of Gini s Mean Difference Biometrika 28 3 4 428 436 doi 10 1093 biomet 28 3 4 428 Yitzhaki Shlomo 2003 Gini s Mean difference a superior measure of variability for non normal distributions PDF Metron International Journal of Statistics 61 285 316 Retrieved from https en wikipedia org w index php title Mean absolute difference amp oldid 1173977004 Relative mean absolute difference, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.