fbpx
Wikipedia

Winsorized mean

A winsorized mean is a winsorized statistical measure of central tendency, much like the mean and median, and even more similar to the truncated mean. It involves the calculation of the mean after winsorizing -- replacing given parts of a probability distribution or sample at the high and low end with the most extreme remaining values,[1] typically doing so for an equal amount of both extremes; often 10 to 25 percent of the ends are replaced. The winsorized mean can equivalently be expressed as a weighted average of the truncated mean and the quantiles at which it is limited, which corresponds to replacing parts with the corresponding quantiles.

Advantages

The winsorized mean is a useful estimator because by retaining the outliers without taking them too literally, it is less sensitive to observations at the extremes than the straightforward mean, and will still generate a reasonable estimate of central tendency or mean for almost all statistical models. In this regard it is referred to as a robust estimator.

Drawbacks

The winsorized mean uses more information from the distribution or sample than the median. However, unless the underlying distribution is symmetric, the winsorized mean of a sample is unlikely to produce an unbiased estimator for either the mean or the median.

Example

  • For a sample of 10 numbers (from x(1), the smallest, to x(10) the largest; order statistic notation) the 10% winsorized mean is
 
The key is in the repetition of x(2) and x(9): the extras substitute for the original values x(1) and x(10) which have been discarded and replaced.
This is equivalent to a weighted average of 0.1 times the 5th percentile (x(2)), 0.8 times the 10% trimmed mean, and 0.1 times the 95th percentile (x(9)).

Notes

  1. ^ Dodge, Y (2003) The Oxford Dictionary of Statistical Terms, OUP. ISBN 0-19-920613-9 (entry for "winsorized estimation")

References

  • Wilcox, R.R.; Keselman, H.J. (2003). "Modern robust data analysis methods: Measures of central tendency". Psychological Methods. 8 (3): 254–274. doi:10.1037/1082-989X.8.3.254. PMID 14596490.

winsorized, mean, this, article, needs, additional, citations, verification, please, help, improve, this, article, adding, citations, reliable, sources, unsourced, material, challenged, removed, find, sources, news, newspapers, books, scholar, jstor, september. This article needs additional citations for verification Please help improve this article by adding citations to reliable sources Unsourced material may be challenged and removed Find sources Winsorized mean news newspapers books scholar JSTOR September 2009 Learn how and when to remove this template message A winsorized mean is a winsorized statistical measure of central tendency much like the mean and median and even more similar to the truncated mean It involves the calculation of the mean after winsorizing replacing given parts of a probability distribution or sample at the high and low end with the most extreme remaining values 1 typically doing so for an equal amount of both extremes often 10 to 25 percent of the ends are replaced The winsorized mean can equivalently be expressed as a weighted average of the truncated mean and the quantiles at which it is limited which corresponds to replacing parts with the corresponding quantiles Contents 1 Advantages 2 Drawbacks 3 Example 4 Notes 5 ReferencesAdvantages EditThe winsorized mean is a useful estimator because by retaining the outliers without taking them too literally it is less sensitive to observations at the extremes than the straightforward mean and will still generate a reasonable estimate of central tendency or mean for almost all statistical models In this regard it is referred to as a robust estimator Drawbacks EditThe winsorized mean uses more information from the distribution or sample than the median However unless the underlying distribution is symmetric the winsorized mean of a sample is unlikely to produce an unbiased estimator for either the mean or the median Example EditFor a sample of 10 numbers from x 1 the smallest to x 10 the largest order statistic notation the 10 winsorized mean isx 2 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 9 10 displaystyle frac overbrace x 2 x 2 x 3 x 4 x 5 x 6 x 7 x 8 overbrace x 9 x 9 10 dd The key is in the repetition of x 2 and x 9 the extras substitute for the original values x 1 and x 10 which have been discarded and replaced This is equivalent to a weighted average of 0 1 times the 5th percentile x 2 0 8 times the 10 trimmed mean and 0 1 times the 95th percentile x 9 Notes Edit Dodge Y 2003 The Oxford Dictionary of Statistical Terms OUP ISBN 0 19 920613 9 entry for winsorized estimation This article includes a list of references related reading or external links but its sources remain unclear because it lacks inline citations Please help to improve this article by introducing more precise citations March 2012 Learn how and when to remove this template message References EditWilcox R R Keselman H J 2003 Modern robust data analysis methods Measures of central tendency Psychological Methods 8 3 254 274 doi 10 1037 1082 989X 8 3 254 PMID 14596490 Retrieved from https en wikipedia org w index php title Winsorized mean amp oldid 1102396261, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.