fbpx
Wikipedia

Goodman and Kruskal's gamma

In statistics, Goodman and Kruskal's gamma is a measure of rank correlation, i.e., the similarity of the orderings of the data when ranked by each of the quantities. It measures the strength of association of the cross tabulated data when both variables are measured at the ordinal level. It makes no adjustment for either table size or ties. Values range from −1 (100% negative association, or perfect inversion) to +1 (100% positive association, or perfect agreement). A value of zero indicates the absence of association.

This statistic (which is distinct from Goodman and Kruskal's lambda) is named after Leo Goodman and William Kruskal, who proposed it in a series of papers from 1954 to 1972.[1][2][3][4]

Definition

The estimate of gamma, G, depends on two quantities:

  • Ns, the number of pairs of cases ranked in the same order on both variables (number of concordant pairs),
  • Nd, the number of pairs of cases ranked in reversed order on both variables (number of reversed pairs),

where "ties" (cases where either of the two variables in the pair are equal) are dropped. Then

 

This statistic can be regarded as the maximum likelihood estimator for the theoretical quantity  , where

 

and where Ps and Pd are the probabilities that a randomly selected pair of observations will place in the same or opposite order respectively, when ranked by both variables.

Critical values for the gamma statistic are sometimes found by using an approximation, whereby a transformed value, t of the statistic is referred to Student t distribution, where[citation needed]

 

and where n is the number of observations (not the number of pairs):

 

Yule's Q

A special case of Goodman and Kruskal's gamma is Yule's Q, also known as the Yule coefficient of association,[5] which is specific to 2×2 matrices. Consider the following contingency table of events, where each value is a count of an event's frequency:

Yes No Totals
Positive a b a+b
Negative c d c+d
Totals a+c b+d n

Yule's Q is given by:

 

Although computed in the same fashion as Goodman and Kruskal's gamma, it has a slightly broader interpretation because the distinction between nominal and ordinal scales becomes a matter of arbitrary labeling for dichotomous distinctions. Thus, whether Q is positive or negative depends merely on which pairings the analyst considers to be concordant, but is otherwise symmetric.

Q varies from −1 to +1. −1 reflects total negative association, +1 reflects perfect positive association and 0 reflects no association at all. The sign depends on which pairings the analyst initially considered to be concordant, but this choice does not affect the magnitude.

In term of the odds ratio OR, Yule's Q is given by

 

and so Yule's Q and Yule's Y are related by

 
 

See also

References

  1. ^ Goodman, Leo A.; Kruskal, William H. (1954). "Measures of Association for Cross Classifications". Journal of the American Statistical Association. 49 (268): 732–764. doi:10.2307/2281536. JSTOR 2281536.
  2. ^ Goodman, Leo A.; Kruskal, William H. (1959). "Measures of Association for Cross Classifications. II: Further Discussion and References". Journal of the American Statistical Association. 54 (285): 123–163. doi:10.1080/01621459.1959.10501503. JSTOR 2282143.
  3. ^ Goodman, Leo A.; Kruskal, William H. (1963). "Measures of Association for Cross Classifications III: Approximate Sampling Theory". Journal of the American Statistical Association. 58 (302): 310–364. doi:10.1080/01621459.1963.10500850. JSTOR 2283271.
  4. ^ Goodman, Leo A.; Kruskal, William H. (1972). "Measures of Association for Cross Classifications, IV: Simplification of Asymptotic Variances". Journal of the American Statistical Association. 67 (338): 415–421. doi:10.1080/01621459.1972.10482401. JSTOR 2284396.
  5. ^ Yule, G U. (1912). "On the methods of measuring association between two attributes". Journal of the Royal Statistical Society. 49 (6): 579–652. doi:10.2307/2340126. JSTOR 2340126.

Further reading

  • Sheskin, D.J. (2007) The Handbook of Parametric and Nonparametric Statistical Procedures. Chapman & Hall/CRC, ISBN 9781584888147

goodman, kruskal, gamma, statistics, measure, rank, correlation, similarity, orderings, data, when, ranked, each, quantities, measures, strength, association, cross, tabulated, data, when, both, variables, measured, ordinal, level, makes, adjustment, either, t. In statistics Goodman and Kruskal s gamma is a measure of rank correlation i e the similarity of the orderings of the data when ranked by each of the quantities It measures the strength of association of the cross tabulated data when both variables are measured at the ordinal level It makes no adjustment for either table size or ties Values range from 1 100 negative association or perfect inversion to 1 100 positive association or perfect agreement A value of zero indicates the absence of association This statistic which is distinct from Goodman and Kruskal s lambda is named after Leo Goodman and William Kruskal who proposed it in a series of papers from 1954 to 1972 1 2 3 4 Contents 1 Definition 2 Yule s Q 3 See also 4 References 5 Further readingDefinition EditThe estimate of gamma G depends on two quantities Ns the number of pairs of cases ranked in the same order on both variables number of concordant pairs Nd the number of pairs of cases ranked in reversed order on both variables number of reversed pairs where ties cases where either of the two variables in the pair are equal are dropped Then G N s N d N s N d displaystyle G frac N s N d N s N d This statistic can be regarded as the maximum likelihood estimator for the theoretical quantity g displaystyle gamma where g P s P d P s P d displaystyle gamma frac P s P d P s P d and where Ps and Pd are the probabilities that a randomly selected pair of observations will place in the same or opposite order respectively when ranked by both variables Critical values for the gamma statistic are sometimes found by using an approximation whereby a transformed value t of the statistic is referred to Student t distribution where citation needed t G N s N d n 1 G 2 displaystyle t approx G sqrt frac N s N d n 1 G 2 and where n is the number of observations not the number of pairs n N s N d displaystyle n neq N s N d Yule s Q EditA special case of Goodman and Kruskal s gamma is Yule s Q also known as the Yule coefficient of association 5 which is specific to 2 2 matrices Consider the following contingency table of events where each value is a count of an event s frequency Yes No TotalsPositive a b a bNegative c d c dTotals a c b d nYule s Q is given by Q a d b c a d b c displaystyle Q frac ad bc ad bc Although computed in the same fashion as Goodman and Kruskal s gamma it has a slightly broader interpretation because the distinction between nominal and ordinal scales becomes a matter of arbitrary labeling for dichotomous distinctions Thus whether Q is positive or negative depends merely on which pairings the analyst considers to be concordant but is otherwise symmetric Q varies from 1 to 1 1 reflects total negative association 1 reflects perfect positive association and 0 reflects no association at all The sign depends on which pairings the analyst initially considered to be concordant but this choice does not affect the magnitude In term of the odds ratio OR Yule s Q is given by Q O R 1 O R 1 displaystyle Q frac OR 1 OR 1 and so Yule s Q and Yule s Y are related by Q 2 Y 1 Y 2 displaystyle Q frac 2Y 1 Y 2 Y 1 1 Q 2 Q displaystyle Y frac 1 sqrt 1 Q 2 Q See also EditKendall tau rank correlation coefficient Goodman and Kruskal s lambda Yule s Y also known as the coefficient of colligationReferences Edit Goodman Leo A Kruskal William H 1954 Measures of Association for Cross Classifications Journal of the American Statistical Association 49 268 732 764 doi 10 2307 2281536 JSTOR 2281536 Goodman Leo A Kruskal William H 1959 Measures of Association for Cross Classifications II Further Discussion and References Journal of the American Statistical Association 54 285 123 163 doi 10 1080 01621459 1959 10501503 JSTOR 2282143 Goodman Leo A Kruskal William H 1963 Measures of Association for Cross Classifications III Approximate Sampling Theory Journal of the American Statistical Association 58 302 310 364 doi 10 1080 01621459 1963 10500850 JSTOR 2283271 Goodman Leo A Kruskal William H 1972 Measures of Association for Cross Classifications IV Simplification of Asymptotic Variances Journal of the American Statistical Association 67 338 415 421 doi 10 1080 01621459 1972 10482401 JSTOR 2284396 Yule G U 1912 On the methods of measuring association between two attributes Journal of the Royal Statistical Society 49 6 579 652 doi 10 2307 2340126 JSTOR 2340126 Further reading EditSheskin D J 2007 The Handbook of Parametric and Nonparametric Statistical Procedures Chapman amp Hall CRC ISBN 9781584888147 Retrieved from https en wikipedia org w index php title Goodman and Kruskal 27s gamma amp oldid 1123334445 Yule s Q, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.