fbpx
Wikipedia

Scheffé's method

In statistics, Scheffé's method, named after American statistician Henry Scheffé, is a method for adjusting significance levels in a linear regression analysis to account for multiple comparisons. It is particularly useful in analysis of variance (a special case of regression analysis), and in constructing simultaneous confidence bands for regressions involving basis functions.

Scheffé's method is a single-step multiple comparison procedure which applies to the set of estimates of all possible contrasts among the factor level means, not just the pairwise differences considered by the Tukey–Kramer method. It works on similar principles as the Working–Hotelling procedure for estimating mean responses in regression, which applies to the set of all possible factor levels.

The method edit

Let   be the means of some variable in   disjoint populations.

An arbitrary contrast is defined by

 

where

 

If   are all equal to each other, then all contrasts among them are 0. Otherwise, some contrasts differ from 0.

Technically there are infinitely many contrasts. The simultaneous confidence coefficient is exactly  , whether the factor level sample sizes are equal or unequal. (Usually only a finite number of comparisons are of interest. In this case, Scheffé's method is typically quite conservative, and the family-wise error rate (experimental error rate) will generally be much smaller than  .)[1][2]

We estimate   by

 

for which the estimated variance is

 

where

  •   is the size of the sample taken from the  th population (the one whose mean is  ), and
  •   is the estimated variance of the errors.

It can be shown that the probability is   that all confidence limits of the type

 

are simultaneously correct, where as usual   is the size of the whole population. Norman R. Draper and Harry Smith, in their 'Applied Regression Analysis' (see references), indicate that   should be in the equation in place of  . The slip with   is a result of failing to allow for the additional effect of the constant term in many regressions. That the result based on   is wrong is readily seen by considering  , as in a standard simple linear regression. That formula would then reduce to one with the usual  -distribution, which is appropriate for predicting/estimating for a single value of the independent variable, not for constructing a confidence band for a range of values of the independent value. Also note that the formula is for dealing with the mean values for a range of independent values, not for comparing with individual values such as individual observed data values.[3]

Denoting Scheffé significance in a table edit

Frequently, subscript letters are used to indicate which values are significantly different using the Scheffé method. For example, when mean values of variables that have been analyzed using an ANOVA are presented in a table, they are assigned a different letter subscript based on a Scheffé contrast. Values that are not significantly different based on the post-hoc Scheffé contrast will have the same subscript and values that are significantly different will have different subscripts (i.e. 15a, 17a, 34b would mean that the first and second variables both differ from the third variable but not each other because they are both assigned the subscript "a").[citation needed]

Comparison with the Tukey–Kramer method edit

If only a fixed number of pairwise comparisons are to be made, the Tukey–Kramer method will result in a more precise confidence interval. In the general case when many or all contrasts might be of interest, the Scheffé method is more appropriate and will give narrower confidence intervals in the case of a large number of comparisons.

References edit

  1. ^ Maxwell, Scott E.; Delaney, Harold D. (2004). Designing Experiments and Analyzing Data: A Model Comparison. Lawrence Erlbaum Associates. pp. 217–218. ISBN 0-8058-3718-3.
  2. ^ Milliken, George A.; Johnson, Dallas E. (1993). Analysis of Messy Data. CRC Press. pp. 35–36. ISBN 0-412-99081-4.
  3. ^ Draper, Norman R; Smith, Harry (1998). Applied Regression Analysis (2nd ed.). John Wiley and Sons, Inc. p. 93. ISBN 9780471170822.

External links edit

  • Scheffé's method

  This article incorporates public domain material from the National Institute of Standards and Technology

scheffé, method, statistics, named, after, american, statistician, henry, scheffé, method, adjusting, significance, levels, linear, regression, analysis, account, multiple, comparisons, particularly, useful, analysis, variance, special, case, regression, analy. In statistics Scheffe s method named after American statistician Henry Scheffe is a method for adjusting significance levels in a linear regression analysis to account for multiple comparisons It is particularly useful in analysis of variance a special case of regression analysis and in constructing simultaneous confidence bands for regressions involving basis functions Scheffe s method is a single step multiple comparison procedure which applies to the set of estimates of all possible contrasts among the factor level means not just the pairwise differences considered by the Tukey Kramer method It works on similar principles as the Working Hotelling procedure for estimating mean responses in regression which applies to the set of all possible factor levels Contents 1 The method 2 Denoting Scheffe significance in a table 3 Comparison with the Tukey Kramer method 4 References 5 External linksThe method editLet m 1 m r textstyle mu 1 ldots mu r nbsp be the means of some variable in r textstyle r nbsp disjoint populations An arbitrary contrast is defined by C i 1 r c i m i displaystyle C sum i 1 r c i mu i nbsp where i 1 r c i 0 displaystyle sum i 1 r c i 0 nbsp If m 1 m r textstyle mu 1 ldots mu r nbsp are all equal to each other then all contrasts among them are 0 Otherwise some contrasts differ from 0 Technically there are infinitely many contrasts The simultaneous confidence coefficient is exactly 1 a textstyle 1 alpha nbsp whether the factor level sample sizes are equal or unequal Usually only a finite number of comparisons are of interest In this case Scheffe s method is typically quite conservative and the family wise error rate experimental error rate will generally be much smaller than a textstyle alpha nbsp 1 2 We estimate C textstyle C nbsp by C i 1 r c i Y i displaystyle hat C sum i 1 r c i bar Y i nbsp for which the estimated variance is s C 2 s e 2 i 1 r c i 2 n i displaystyle s hat C 2 hat sigma e 2 sum i 1 r frac c i 2 n i nbsp where n i textstyle n i nbsp is the size of the sample taken from the i textstyle i nbsp th population the one whose mean is m i textstyle mu i nbsp and s e 2 displaystyle hat sigma e 2 nbsp is the estimated variance of the errors It can be shown that the probability is 1 a textstyle 1 alpha nbsp that all confidence limits of the type C s C r 1 F a r 1 N r displaystyle hat C pm s hat C sqrt left r 1 right F alpha r 1 N r nbsp are simultaneously correct where as usual N textstyle N nbsp is the size of the whole population Norman R Draper and Harry Smith in their Applied Regression Analysis see references indicate that r textstyle r nbsp should be in the equation in place of r 1 textstyle r 1 nbsp The slip with r 1 textstyle r 1 nbsp is a result of failing to allow for the additional effect of the constant term in many regressions That the result based on r 1 textstyle r 1 nbsp is wrong is readily seen by considering r 2 textstyle r 2 nbsp as in a standard simple linear regression That formula would then reduce to one with the usual t textstyle t nbsp distribution which is appropriate for predicting estimating for a single value of the independent variable not for constructing a confidence band for a range of values of the independent value Also note that the formula is for dealing with the mean values for a range of independent values not for comparing with individual values such as individual observed data values 3 Denoting Scheffe significance in a table editFrequently subscript letters are used to indicate which values are significantly different using the Scheffe method For example when mean values of variables that have been analyzed using an ANOVA are presented in a table they are assigned a different letter subscript based on a Scheffe contrast Values that are not significantly different based on the post hoc Scheffe contrast will have the same subscript and values that are significantly different will have different subscripts i e 15a 17a 34b would mean that the first and second variables both differ from the third variable but not each other because they are both assigned the subscript a citation needed Comparison with the Tukey Kramer method editIf only a fixed number of pairwise comparisons are to be made the Tukey Kramer method will result in a more precise confidence interval In the general case when many or all contrasts might be of interest the Scheffe method is more appropriate and will give narrower confidence intervals in the case of a large number of comparisons References edit Maxwell Scott E Delaney Harold D 2004 Designing Experiments and Analyzing Data A Model Comparison Lawrence Erlbaum Associates pp 217 218 ISBN 0 8058 3718 3 Milliken George A Johnson Dallas E 1993 Analysis of Messy Data CRC Press pp 35 36 ISBN 0 412 99081 4 Draper Norman R Smith Harry 1998 Applied Regression Analysis 2nd ed John Wiley and Sons Inc p 93 ISBN 9780471170822 Bohrer Robert 1967 On Sharpening Scheffe Bounds Journal of the Royal Statistical Society Series B 29 1 110 114 JSTOR 2984571 Scheffe H 1999 1959 The Analysis of Variance New York Wiley ISBN 0 471 34505 9 External links editScheffe s method nbsp This article incorporates public domain material from the National Institute of Standards and Technology Retrieved from https en wikipedia org w index php title Scheffe 27s method amp oldid 1209443171, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.