fbpx
Wikipedia

Average absolute deviation

The average absolute deviation (AAD) of a data set is the average of the absolute deviations from a central point. It is a summary statistic of statistical dispersion or variability. In the general form, the central point can be a mean, median, mode, or the result of any other measure of central tendency or any reference value related to the given data set. AAD includes the mean absolute deviation and the median absolute deviation (both abbreviated as MAD).

Measures of dispersion edit

Several measures of statistical dispersion are defined in terms of the absolute deviation. The term "average absolute deviation" does not uniquely identify a measure of statistical dispersion, as there are several measures that can be used to measure absolute deviations, and there are several measures of central tendency that can be used as well. Thus, to uniquely identify the absolute deviation it is necessary to specify both the measure of deviation and the measure of central tendency. The statistical literature has not yet adopted a standard notation, as both the mean absolute deviation around the mean and the median absolute deviation around the median have been denoted by their initials "MAD" in the literature, which may lead to confusion, since in general, they may have values considerably different from each other.

Mean absolute deviation around a central point edit

The mean absolute deviation of a set {x1, x2, ..., xn} is

 

The choice of measure of central tendency,  , has a marked effect on the value of the mean deviation. For example, for the data set {2, 2, 3, 4, 14}:

Measure of central tendency   Mean absolute deviation
Arithmetic Mean = 5  
Median = 3  
Mode = 2  

Mean absolute deviation around the mean edit

The mean absolute deviation (MAD), also referred to as the "mean deviation" or sometimes "average absolute deviation", is the mean of the data's absolute deviations around the data's mean: the average (absolute) distance from the mean. "Average absolute deviation" can refer to either this usage, or to the general form with respect to a specified central point (see above).

MAD has been proposed to be used in place of standard deviation since it corresponds better to real life.[1] Because the MAD is a simpler measure of variability than the standard deviation, it can be useful in school teaching.[2][3]

This method's forecast accuracy is very closely related to the mean squared error (MSE) method which is just the average squared error of the forecasts. Although these methods are very closely related, MAD is more commonly used because it is both easier to compute (avoiding the need for squaring)[4] and easier to understand.[5]

For the normal distribution, the ratio of mean absolute deviation from the mean to standard deviation is  . Thus if X is a normally distributed random variable with expected value 0 then, see Geary (1935):[6]

 
In other words, for a normal distribution, mean absolute deviation is about 0.8 times the standard deviation. However, in-sample measurements deliver values of the ratio of mean average deviation / standard deviation for a given Gaussian sample n with the following bounds:  , with a bias for small n.[7]

The mean absolute deviation from the mean is less than or equal to the standard deviation; one way of proving this relies on Jensen's inequality.

Proof

Jensen's inequality is  , where φ is a convex function, this implies for   that:

 
 

Since both sides are positive, and the square root is a monotonically increasing function in the positive domain:

 

For a general case of this statement, see Hölder's inequality.

Mean absolute deviation around the median edit

The median is the point about which the mean deviation is minimized. The MAD median offers a direct measure of the scale of a random variable around its median

 

This is the maximum likelihood estimator of the scale parameter   of the Laplace distribution.

Since the median minimizes the average absolute distance, we have  . The mean absolute deviation from the median is less than or equal to the mean absolute deviation from the mean. In fact, the mean absolute deviation from the median is always less than or equal to the mean absolute deviation from any other fixed number.

By using the general dispersion function, Habib (2011) defined MAD about median as

 
where the indicator function is
 

This representation allows for obtaining MAD median correlation coefficients.[citation needed]

Median absolute deviation around a central point edit

While in principle the mean or any other central point could be taken as the central point for the median absolute deviation, most often the median value is taken instead.

Median absolute deviation around the median edit

The median absolute deviation (also MAD) is the median of the absolute deviation from the median. It is a robust estimator of dispersion.

For the example {2, 2, 3, 4, 14}: 3 is the median, so the absolute deviations from the median are {1, 1, 0, 1, 11} (reordered as {0, 1, 1, 1, 11}) with a median of 1, in this case unaffected by the value of the outlier 14, so the median absolute deviation is 1.

For a symmetric distribution, the median absolute deviation is equal to half the interquartile range.

Maximum absolute deviation edit

The maximum absolute deviation around an arbitrary point is the maximum of the absolute deviations of a sample from that point. While not strictly a measure of central tendency, the maximum absolute deviation can be found using the formula for the average absolute deviation as above with  , where   is the sample maximum.

Minimization edit

The measures of statistical dispersion derived from absolute deviation characterize various measures of central tendency as minimizing dispersion: The median is the measure of central tendency most associated with the absolute deviation. Some location parameters can be compared as follows:

  • L2 norm statistics: the mean minimizes the mean squared error
  • L1 norm statistics: the median minimizes average absolute deviation,
  • L norm statistics: the mid-range minimizes the maximum absolute deviation
  • trimmed L norm statistics: for example, the midhinge (average of first and third quartiles) which minimizes the median absolute deviation of the whole distribution, also minimizes the maximum absolute deviation of the distribution after the top and bottom 25% have been trimmed off.

Estimation edit

 

The mean absolute deviation of a sample is a biased estimator of the mean absolute deviation of the population. In order for the absolute deviation to be an unbiased estimator, the expected value (average) of all the sample absolute deviations must equal the population absolute deviation. However, it does not. For the population 1,2,3 both the population absolute deviation about the median and the population absolute deviation about the mean are 2/3. The average of all the sample absolute deviations about the mean of size 3 that can be drawn from the population is 44/81, while the average of all the sample absolute deviations about the median is 4/9. Therefore, the absolute deviation is a biased estimator.

However, this argument is based on the notion of mean-unbiasedness. Each measure of location has its own form of unbiasedness (see entry on biased estimator). The relevant form of unbiasedness here is median unbiasedness.

 

See also edit

 

References edit

  1. ^ Taleb, Nassim Nicholas (2014). . Edge. Archived from the original on 2014-01-16. Retrieved 2014-01-16.{{cite web}}: CS1 maint: bot: original URL status unknown (link)
  2. ^ Kader, Gary (March 1999). "Means and MADS". Mathematics Teaching in the Middle School. 4 (6): 398–403. from the original on 2013-05-18. Retrieved 20 February 2013.
  3. ^ Franklin, Christine, Gary Kader, Denise Mewborn, Jerry Moreno, Roxy Peck, Mike Perry, and Richard Scheaffer (2007). Guidelines for Assessment and Instruction in Statistics Education (PDF). American Statistical Association. ISBN 978-0-9791747-1-1. (PDF) from the original on 2013-03-07. Retrieved 2013-02-20.{{cite book}}: CS1 maint: multiple names: authors list (link)
  4. ^ Nahmias, Steven; Olsen, Tava Lennon (2015), Production and Operations Analysis (7th ed.), Waveland Press, p. 62, ISBN 9781478628248, MAD is often the preferred method of measuring the forecast error because it does not require squaring.
  5. ^ Stadtler, Hartmut; Kilger, Christoph; Meyr, Herbert, eds. (2014), Supply Chain Management and Advanced Planning: Concepts, Models, Software, and Case Studies, Springer Texts in Business and Economics (5th ed.), Springer, p. 143, ISBN 9783642553097, the meaning of the MAD is easier to interpret.
  6. ^ Geary, R. C. (1935). The ratio of the mean deviation to the standard deviation as a test of normality. Biometrika, 27(3/4), 310–332.
  7. ^ See also Geary's 1936 and 1946 papers: Geary, R. C. (1936). Moments of the ratio of the mean deviation to the standard deviation for normal samples. Biometrika, 28(3/4), 295–307 and Geary, R. C. (1947). Testing for normality. Biometrika, 34(3/4), 209–242.

External links edit

  • Advantages of the mean absolute deviation

average, absolute, deviation, average, absolute, deviation, data, average, absolute, deviations, from, central, point, summary, statistic, statistical, dispersion, variability, general, form, central, point, mean, median, mode, result, other, measure, central,. The average absolute deviation AAD of a data set is the average of the absolute deviations from a central point It is a summary statistic of statistical dispersion or variability In the general form the central point can be a mean median mode or the result of any other measure of central tendency or any reference value related to the given data set AAD includes the mean absolute deviation and the median absolute deviation both abbreviated as MAD Contents 1 Measures of dispersion 2 Mean absolute deviation around a central point 2 1 Mean absolute deviation around the mean 2 2 Mean absolute deviation around the median 3 Median absolute deviation around a central point 3 1 Median absolute deviation around the median 4 Maximum absolute deviation 5 Minimization 6 Estimation 7 See also 8 References 9 External linksMeasures of dispersion editSeveral measures of statistical dispersion are defined in terms of the absolute deviation The term average absolute deviation does not uniquely identify a measure of statistical dispersion as there are several measures that can be used to measure absolute deviations and there are several measures of central tendency that can be used as well Thus to uniquely identify the absolute deviation it is necessary to specify both the measure of deviation and the measure of central tendency The statistical literature has not yet adopted a standard notation as both the mean absolute deviation around the mean and the median absolute deviation around the median have been denoted by their initials MAD in the literature which may lead to confusion since in general they may have values considerably different from each other Mean absolute deviation around a central point editFor arbitrary differences not around a central point see Mean absolute difference For paired differences also known as mean absolute deviation see Mean absolute error The mean absolute deviation of a set x1 x2 xn is1n i 1n xi m X displaystyle frac 1 n sum i 1 n x i m X nbsp The choice of measure of central tendency m X displaystyle m X nbsp has a marked effect on the value of the mean deviation For example for the data set 2 2 3 4 14 Measure of central tendency m X displaystyle m X nbsp Mean absolute deviationArithmetic Mean 5 2 5 2 5 3 5 4 5 14 5 5 3 6 displaystyle frac 2 5 2 5 3 5 4 5 14 5 5 3 6 nbsp Median 3 2 3 2 3 3 3 4 3 14 3 5 2 8 displaystyle frac 2 3 2 3 3 3 4 3 14 3 5 2 8 nbsp Mode 2 2 2 2 2 3 2 4 2 14 2 5 3 0 displaystyle frac 2 2 2 2 3 2 4 2 14 2 5 3 0 nbsp Mean absolute deviation around the mean edit The mean absolute deviation MAD also referred to as the mean deviation or sometimes average absolute deviation is the mean of the data s absolute deviations around the data s mean the average absolute distance from the mean Average absolute deviation can refer to either this usage or to the general form with respect to a specified central point see above MAD has been proposed to be used in place of standard deviation since it corresponds better to real life 1 Because the MAD is a simpler measure of variability than the standard deviation it can be useful in school teaching 2 3 This method s forecast accuracy is very closely related to the mean squared error MSE method which is just the average squared error of the forecasts Although these methods are very closely related MAD is more commonly used because it is both easier to compute avoiding the need for squaring 4 and easier to understand 5 For the normal distribution the ratio of mean absolute deviation from the mean to standard deviation is 2 p 0 79788456 textstyle sqrt 2 pi 0 79788456 ldots nbsp Thus if X is a normally distributed random variable with expected value 0 then see Geary 1935 6 w E X E X2 2p displaystyle w frac E X sqrt E X 2 sqrt frac 2 pi nbsp In other words for a normal distribution mean absolute deviation is about 0 8 times the standard deviation However in sample measurements deliver values of the ratio of mean average deviation standard deviation for a given Gaussian sample n with the following bounds wn 0 1 displaystyle w n in 0 1 nbsp with a bias for small n 7 The mean absolute deviation from the mean is less than or equal to the standard deviation one way of proving this relies on Jensen s inequality Proof Jensen s inequality is f E Y E f Y displaystyle varphi left mathbb E Y right leq mathbb E left varphi Y right nbsp where f is a convex function this implies for Y X m displaystyle Y vert X mu vert nbsp that E X m 2 E X m 2 displaystyle left mathbb E X mu right 2 leq mathbb E left X mu 2 right nbsp E X m 2 Var X displaystyle left mathbb E X mu right 2 leq operatorname Var X nbsp Since both sides are positive and the square root is a monotonically increasing function in the positive domain E X m Var X displaystyle mathbb E left X mu right leq sqrt operatorname Var X nbsp For a general case of this statement see Holder s inequality Mean absolute deviation around the median edit The median is the point about which the mean deviation is minimized The MAD median offers a direct measure of the scale of a random variable around its medianDmed E X median displaystyle D text med E X text median nbsp This is the maximum likelihood estimator of the scale parameter b displaystyle b nbsp of the Laplace distribution Since the median minimizes the average absolute distance we have Dmed Dmean displaystyle D text med leq D text mean nbsp The mean absolute deviation from the median is less than or equal to the mean absolute deviation from the mean In fact the mean absolute deviation from the median is always less than or equal to the mean absolute deviation from any other fixed number By using the general dispersion function Habib 2011 defined MAD about median asDmed E X median 2Cov X IO displaystyle D text med E X text median 2 operatorname Cov X I O nbsp where the indicator function is IO 1if x gt median 0otherwise displaystyle mathbf I O begin cases 1 amp text if x gt text median 0 amp text otherwise end cases nbsp This representation allows for obtaining MAD median correlation coefficients citation needed Median absolute deviation around a central point editMain article Median absolute deviation While in principle the mean or any other central point could be taken as the central point for the median absolute deviation most often the median value is taken instead Median absolute deviation around the median edit Main article Median absolute deviation The median absolute deviation also MAD is the median of the absolute deviation from the median It is a robust estimator of dispersion For the example 2 2 3 4 14 3 is the median so the absolute deviations from the median are 1 1 0 1 11 reordered as 0 1 1 1 11 with a median of 1 in this case unaffected by the value of the outlier 14 so the median absolute deviation is 1 For a symmetric distribution the median absolute deviation is equal to half the interquartile range Maximum absolute deviation editThe maximum absolute deviation around an arbitrary point is the maximum of the absolute deviations of a sample from that point While not strictly a measure of central tendency the maximum absolute deviation can be found using the formula for the average absolute deviation as above with m X max X displaystyle m X max X nbsp where max X displaystyle max X nbsp is the sample maximum Minimization editThe measures of statistical dispersion derived from absolute deviation characterize various measures of central tendency as minimizing dispersion The median is the measure of central tendency most associated with the absolute deviation Some location parameters can be compared as follows L2 norm statistics the mean minimizes the mean squared error L1 norm statistics the median minimizes average absolute deviation L norm statistics the mid range minimizes the maximum absolute deviation trimmed L norm statistics for example the midhinge average of first and third quartiles which minimizes the median absolute deviation of the whole distribution also minimizes the maximum absolute deviation of the distribution after the top and bottom 25 have been trimmed off Estimation editThis section needs expansion You can help by adding to it March 2009 nbsp The mean absolute deviation of a sample is a biased estimator of the mean absolute deviation of the population In order for the absolute deviation to be an unbiased estimator the expected value average of all the sample absolute deviations must equal the population absolute deviation However it does not For the population 1 2 3 both the population absolute deviation about the median and the population absolute deviation about the mean are 2 3 The average of all the sample absolute deviations about the mean of size 3 that can be drawn from the population is 44 81 while the average of all the sample absolute deviations about the median is 4 9 Therefore the absolute deviation is a biased estimator However this argument is based on the notion of mean unbiasedness Each measure of location has its own form of unbiasedness see entry on biased estimator The relevant form of unbiasedness here is median unbiasedness nbsp See also edit nbsp Deviation statistics Median absolute deviation Squared deviations Least absolute deviations Errors Mean absolute error Mean absolute percentage error Probable error Mean absolute difference Average rectified valueReferences edit Taleb Nassim Nicholas 2014 What scientific idea is ready for retirement Edge Archived from the original on 2014 01 16 Retrieved 2014 01 16 a href Template Cite web html title Template Cite web cite web a CS1 maint bot original URL status unknown link Kader Gary March 1999 Means and MADS Mathematics Teaching in the Middle School 4 6 398 403 Archived from the original on 2013 05 18 Retrieved 20 February 2013 Franklin Christine Gary Kader Denise Mewborn Jerry Moreno Roxy Peck Mike Perry and Richard Scheaffer 2007 Guidelines for Assessment and Instruction in Statistics Education PDF American Statistical Association ISBN 978 0 9791747 1 1 Archived PDF from the original on 2013 03 07 Retrieved 2013 02 20 a href Template Cite book html title Template Cite book cite book a CS1 maint multiple names authors list link Nahmias Steven Olsen Tava Lennon 2015 Production and Operations Analysis 7th ed Waveland Press p 62 ISBN 9781478628248 MAD is often the preferred method of measuring the forecast error because it does not require squaring Stadtler Hartmut Kilger Christoph Meyr Herbert eds 2014 Supply Chain Management and Advanced Planning Concepts Models Software and Case Studies Springer Texts in Business and Economics 5th ed Springer p 143 ISBN 9783642553097 the meaning of the MAD is easier to interpret Geary R C 1935 The ratio of the mean deviation to the standard deviation as a test of normality Biometrika 27 3 4 310 332 See also Geary s 1936 and 1946 papers Geary R C 1936 Moments of the ratio of the mean deviation to the standard deviation for normal samples Biometrika 28 3 4 295 307 and Geary R C 1947 Testing for normality Biometrika 34 3 4 209 242 External links editAdvantages of the mean absolute deviation Retrieved from https en wikipedia org w index php title Average absolute deviation amp oldid 1164534261 Maximum absolute deviation, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.