fbpx
Wikipedia

Cochran–Mantel–Haenszel statistics

In statistics, the Cochran–Mantel–Haenszel test (CMH) is a test used in the analysis of stratified or matched categorical data. It allows an investigator to test the association between a binary predictor or treatment and a binary outcome such as case or control status while taking into account the stratification.[1] Unlike the McNemar test, which can only handle pairs, the CMH test handles arbitrary strata size. It is named after William G. Cochran, Nathan Mantel and William Haenszel.[2][3] Extensions of this test to a categorical response and/or to several groups are commonly called Cochran–Mantel–Haenszel statistics.[4] It is often used in observational studies where random assignment of subjects to different treatments cannot be controlled, but confounding covariates can be measured.

Definition edit

We consider a binary outcome variable such as case status (e.g. lung cancer) and a binary predictor such as treatment status (e.g. smoking). The observations are grouped in strata. The stratified data are summarized in a series of 2 × 2 contingency tables, one for each stratum. The i-th such contingency table is:

Treatment No treatment Row total
Case Ai Bi N1i
Controls Ci Di N2i
Column total M1i M2i Ti

The common odds-ratio of the K contingency tables is defined as:

 

The null hypothesis is that there is no association between the treatment and the outcome. More precisely, the null hypothesis is   and the alternative hypothesis is  . The test statistic is:

 

It follows a   distribution asymptotically with 1 df under the null hypothesis.[1]

Subset stability edit

The standard odds- or risk ratio of all strata could be calculated, giving risk ratios  , where   is the number of strata. If the stratification were removed, there would be one aggregate risk ratio of the collapsed table; let this be  .[citation needed]

One generally expects the risk of an event unconditional on the stratification to be bounded between the highest and lowest risk within the strata (or identically with odds ratios). It is easy to construct examples where this is not the case, and   is larger or smaller than all of   for  . This is comparable but not identical to Simpson's paradox, and as with Simpson's paradox, it is difficult to interpret the statistic and decide policy based upon it.

Klemens[5] defines a statistic to be subset stable iff   is bounded between   and  , and a well-behaved statistic as being infinitely differentiable and not dependent on the order of the strata. Then the CMH statistic is the unique well-behaved statistic satisfying subset stability.[citation needed]

Related tests edit

  • The McNemar test can only handle pairs. The CMH test is a generalization of the McNemar test as their test statistics are identical when each stratum shows a pair.[6]
  • Conditional logistic regression is more general than the CMH test as it can handle continuous variable and perform multivariate analysis. When the CMH test can be applied, the CMH test statistic and the score test statistic of the conditional logistic regression are identical.[7]
  • Breslow–Day test for homogeneous association. The CMH test supposes that the effect of the treatment is homogeneous in all strata. The Breslow-Day test allows to test this assumption. This is not a concern if the strata are small e.g. pairs.

Notes edit

  1. ^ a b Agresti, Alan (2002). Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons, Inc. pp. 231–232. ISBN 0-471-36093-7.
  2. ^ William G. Cochran (December 1954). "Some Methods for Strengthening the Common χ2 Tests". Biometrics. 10 (4): 417–451. doi:10.2307/3001616. JSTOR 3001616.
  3. ^ Nathan Mantel and William Haenszel (April 1959). "Statistical aspects of the analysis of data from retrospective studies of disease". Journal of the National Cancer Institute. 22 (4): 719–748. doi:10.1093/jnci/22.4.719. PMID 13655060.
  4. ^ Nathan Mantel (September 1963). "Chi-Square Tests with One Degree of Freedom, Extensions of the Mantel–Haenszel Procedure". Journal of the American Statistical Association. 58 (303): 690–700. doi:10.1080/01621459.1963.10500879. JSTOR 2282717.
  5. ^ Ben Klemens (June 2021). "An Analysis of U.S. Domestic Migration via Subset-stable Measures of Administrative Data". Journal of Computational Social Science. 5: 351–382. doi:10.1007/s42001-021-00124-w. S2CID 236308711.
  6. ^ Agresti, Alan (2002). Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons, Inc. p. 413. ISBN 0-471-36093-7.
  7. ^ Day N.E., Byar D.P. (September 1979). "Testing hypotheses in case-control studies-equivalence of Mantel–Haenszel statistics and logit score tests". Biometrics. 35 (3): 623–630. doi:10.2307/2530253. JSTOR 2530253. PMID 497345.

External links edit

  • Introduction to the Cochran-Mantel-Haenszel Test

cochran, mantel, haenszel, statistics, statistics, cochran, mantel, haenszel, test, test, used, analysis, stratified, matched, categorical, data, allows, investigator, test, association, between, binary, predictor, treatment, binary, outcome, such, case, contr. In statistics the Cochran Mantel Haenszel test CMH is a test used in the analysis of stratified or matched categorical data It allows an investigator to test the association between a binary predictor or treatment and a binary outcome such as case or control status while taking into account the stratification 1 Unlike the McNemar test which can only handle pairs the CMH test handles arbitrary strata size It is named after William G Cochran Nathan Mantel and William Haenszel 2 3 Extensions of this test to a categorical response and or to several groups are commonly called Cochran Mantel Haenszel statistics 4 It is often used in observational studies where random assignment of subjects to different treatments cannot be controlled but confounding covariates can be measured Contents 1 Definition 2 Subset stability 3 Related tests 4 Notes 5 External linksDefinition editWe consider a binary outcome variable such as case status e g lung cancer and a binary predictor such as treatment status e g smoking The observations are grouped in strata The stratified data are summarized in a series of 2 2 contingency tables one for each stratum The i th such contingency table is Treatment No treatment Row totalCase Ai Bi N1iControls Ci Di N2iColumn total M1i M2i TiThe common odds ratio of the K contingency tables is defined as R i 1 K A i D i T i i 1 K B i C i T i displaystyle R sum i 1 K frac A i D i T i over sum i 1 K B i C i over T i nbsp The null hypothesis is that there is no association between the treatment and the outcome More precisely the null hypothesis is H 0 R 1 displaystyle H 0 R 1 nbsp and the alternative hypothesis is H 1 R 1 displaystyle H 1 R neq 1 nbsp The test statistic is 3 CMH i 1 K A i N 1 i M 1 i T i 2 i 1 K N 1 i N 2 i M 1 i M 2 i T i 2 T i 1 displaystyle xi text CMH frac left sum i 1 K left A i frac N 1i M 1i T i right right 2 sum i 1 K N 1i N 2i M 1i M 2i over T i 2 T i 1 nbsp It follows a x 2 displaystyle chi 2 nbsp distribution asymptotically with 1 df under the null hypothesis 1 Subset stability editThe standard odds or risk ratio of all strata could be calculated giving risk ratios r 1 r 2 r n displaystyle r 1 r 2 dots r n nbsp where n displaystyle n nbsp is the number of strata If the stratification were removed there would be one aggregate risk ratio of the collapsed table let this be R displaystyle R nbsp citation needed One generally expects the risk of an event unconditional on the stratification to be bounded between the highest and lowest risk within the strata or identically with odds ratios It is easy to construct examples where this is not the case and R displaystyle R nbsp is larger or smaller than all of r i displaystyle r i nbsp for i 1 n displaystyle i in 1 dots n nbsp This is comparable but not identical to Simpson s paradox and as with Simpson s paradox it is difficult to interpret the statistic and decide policy based upon it Klemens 5 defines a statistic to be subset stable iff R displaystyle R nbsp is bounded between min r i displaystyle min r i nbsp and max r i displaystyle max r i nbsp and a well behaved statistic as being infinitely differentiable and not dependent on the order of the strata Then the CMH statistic is the unique well behaved statistic satisfying subset stability citation needed Related tests editThe McNemar test can only handle pairs The CMH test is a generalization of the McNemar test as their test statistics are identical when each stratum shows a pair 6 Conditional logistic regression is more general than the CMH test as it can handle continuous variable and perform multivariate analysis When the CMH test can be applied the CMH test statistic and the score test statistic of the conditional logistic regression are identical 7 Breslow Day test for homogeneous association The CMH test supposes that the effect of the treatment is homogeneous in all strata The Breslow Day test allows to test this assumption This is not a concern if the strata are small e g pairs Notes edit a b Agresti Alan 2002 Categorical Data Analysis Hoboken New Jersey John Wiley amp Sons Inc pp 231 232 ISBN 0 471 36093 7 William G Cochran December 1954 Some Methods for Strengthening the Common x2 Tests Biometrics 10 4 417 451 doi 10 2307 3001616 JSTOR 3001616 Nathan Mantel and William Haenszel April 1959 Statistical aspects of the analysis of data from retrospective studies of disease Journal of the National Cancer Institute 22 4 719 748 doi 10 1093 jnci 22 4 719 PMID 13655060 Nathan Mantel September 1963 Chi Square Tests with One Degree of Freedom Extensions of the Mantel Haenszel Procedure Journal of the American Statistical Association 58 303 690 700 doi 10 1080 01621459 1963 10500879 JSTOR 2282717 Ben Klemens June 2021 An Analysis of U S Domestic Migration via Subset stable Measures of Administrative Data Journal of Computational Social Science 5 351 382 doi 10 1007 s42001 021 00124 w S2CID 236308711 Agresti Alan 2002 Categorical Data Analysis Hoboken New Jersey John Wiley amp Sons Inc p 413 ISBN 0 471 36093 7 Day N E Byar D P September 1979 Testing hypotheses in case control studies equivalence of Mantel Haenszel statistics and logit score tests Biometrics 35 3 623 630 doi 10 2307 2530253 JSTOR 2530253 PMID 497345 External links editIntroduction to the Cochran Mantel Haenszel Test Retrieved from https en wikipedia org w index php title Cochran Mantel Haenszel statistics amp oldid 1199287891, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.