fbpx
Wikipedia

Sequential analysis

In statistics, sequential analysis or sequential hypothesis testing is statistical analysis where the sample size is not fixed in advance. Instead data is evaluated as it is collected, and further sampling is stopped in accordance with a pre-defined stopping rule as soon as significant results are observed. Thus a conclusion may sometimes be reached at a much earlier stage than would be possible with more classical hypothesis testing or estimation, at consequently lower financial and/or human cost.

History edit

The method of sequential analysis is first attributed to Abraham Wald[1] with Jacob Wolfowitz, W. Allen Wallis, and Milton Friedman[2] while at Columbia University's Statistical Research Group as a tool for more efficient industrial quality control during World War II. Its value to the war effort was immediately recognised, and led to its receiving a "restricted" classification.[3] At the same time, George Barnard led a group working on optimal stopping in Great Britain. Another early contribution to the method was made by K.J. Arrow with D. Blackwell and M.A. Girshick.[4]

A similar approach was independently developed from first principles at about the same time by Alan Turing, as part of the Banburismus technique used at Bletchley Park, to test hypotheses about whether different messages coded by German Enigma machines should be connected and analysed together. This work remained secret until the early 1980s.[5]

Peter Armitage introduced the use of sequential analysis in medical research, especially in the area of clinical trials. Sequential methods became increasingly popular in medicine following Stuart Pocock's work that provided clear recommendations on how to control Type 1 error rates in sequential designs.[6]

Alpha spending functions edit

When researchers repeatedly analyze data as more observations are added, the probability of a Type 1 error increases. Therefore, it is important to adjust the alpha level at each interim analysis, such that the overall Type 1 error rate remains at the desired level. This is conceptually similar to using the Bonferroni correction, but because the repeated looks at the data are dependent, more efficient corrections for the alpha level can be used. Among the earliest proposals is the Pocock boundary. Alternative ways to control the Type 1 error rate exist, such as the Haybittle–Peto bounds, and additional work on determining the boundaries for interim analyses has been done by O’Brien & Fleming[7] and Wang & Tsiatis.[8]

A limitation of corrections such as the Pocock boundary is that the number of looks at the data must be determined before the data is collected, and that the looks at the data should be equally spaced (e.g., after 50, 100, 150, and 200 patients). The alpha spending function approach developed by Demets & Lan[9] does not have these restrictions, and depending on the parameters chosen for the spending function, can be very similar to Pocock boundaries or the corrections proposed by O'Brien and Fleming. Another approach that has no such restrictions at all is based on e-values and e-processes.

Applications of sequential analysis edit

Clinical trials edit

In a randomized trial with two treatment groups, group sequential testing may for example be conducted in the following manner: After n subjects in each group are available an interim analysis is conducted. A statistical test is performed to compare the two groups and if the null hypothesis is rejected the trial is terminated; otherwise, the trial continues, another n subjects per group are recruited, and the statistical test is performed again, including all subjects. If the null is rejected, the trial is terminated, and otherwise it continues with periodic evaluations until a maximum number of interim analyses have been performed, at which point the last statistical test is conducted and the trial is discontinued.[10]

Other applications edit

Sequential analysis also has a connection to the problem of gambler's ruin that has been studied by, among others, Huygens in 1657.[11]

Step detection is the process of finding abrupt changes in the mean level of a time series or signal. It is usually considered as a special kind of statistical method known as change point detection. Often, the step is small and the time series is corrupted by some kind of noise, and this makes the problem challenging because the step may be hidden by the noise. Therefore, statistical and/or signal processing algorithms are often required. When the algorithms are run online as the data is coming in, especially with the aim of producing an alert, this is an application of sequential analysis.

Bias edit

Trials that are terminated early because they reject the null hypothesis typically overestimate the true effect size.[12] This is because in small samples, only large effect size estimates will lead to a significant effect, and the subsequent termination of a trial. Methods to correct effect size estimates in single trials have been proposed.[13] Note that this bias is mainly problematic when interpreting single studies. In meta-analyses, overestimated effect sizes due to early stopping are balanced by underestimation in trials that stop late, leading Schou & Marschner to conclude that "early stopping of clinical trials is not a substantive source of bias in meta-analyses".[14]

The meaning of p-values in sequential analyses also changes, because when using sequential analyses, more than one analysis is performed, and the typical definition of a p-value as the data “at least as extreme” as is observed needs to be redefined. One solution is to order the p-values of a series of sequential tests based on the time of stopping and how high the test statistic was at a given look, which is known as stagewise ordering,[12] first proposed by Armitage.

See also edit

Notes edit

  1. ^ Wald, Abraham (June 1945). "Sequential Tests of Statistical Hypotheses". The Annals of Mathematical Statistics. 16 (2): 117–186. doi:10.1214/aoms/1177731118. JSTOR 2235829.
  2. ^ Berger, James (2008). "Sequential Analysis". (2nd ed.). pp. 438–439. doi:10.1057/9780230226203.1513. ISBN 978-0-333-78676-5. {{cite book}}: |journal= ignored (help); Missing or empty |title= (help)
  3. ^ Weigl, Hans Günter (2013). Abraham Wald : a statistician as a key figure for modern econometrics (PDF) (Doctoral thesis). University of Hamburg.
  4. ^ Kenneth J. Arrow, David Blackwell and M.A. Girshick (1949). "Bayes and minimax solutions of sequential decision problems". Econometrica. 17 (3/4): 213–244. doi:10.2307/1905525. JSTOR 1905525.
  5. ^ Randell, Brian (1980), "The Colossus", A History of Computing in the Twentieth Century, p. 30
  6. ^ W., Turnbull, Bruce (2000). Group sequential methods with applications to clinical trials. Chapman & Hall. ISBN 9780849303166. OCLC 900071609.{{cite book}}: CS1 maint: multiple names: authors list (link)
  7. ^ O'Brien, Peter C.; Fleming, Thomas R. (1979-01-01). "A Multiple Testing Procedure for Clinical Trials". Biometrics. 35 (3): 549–556. doi:10.2307/2530245. JSTOR 2530245. PMID 497341.
  8. ^ Wang, Samuel K.; Tsiatis, Anastasios A. (1987-01-01). "Approximately Optimal One-Parameter Boundaries for Group Sequential Trials". Biometrics. 43 (1): 193–199. doi:10.2307/2531959. JSTOR 2531959. PMID 3567304.
  9. ^ Demets, David L.; Lan, K. K. Gordon (1994-07-15). "Interim analysis: The alpha spending function approach". Statistics in Medicine. 13 (13–14): 1341–1352. doi:10.1002/sim.4780131308. ISSN 1097-0258. PMID 7973215.
  10. ^ Korosteleva, Olga (2008). Clinical Statistics: Introducing Clinical Trials, Survival Analysis, and Longitudinal Data Analysis (First ed.). Jones and Bartlett Publishers. ISBN 978-0-7637-5850-9.
  11. ^ Ghosh, B. K.; Sen, P. K. (1991). Handbook of Sequential Analysis. New York: Marcel Dekker. ISBN 9780824784089.[page needed]
  12. ^ a b Proschan, Michael A.; Lan, K. K. Gordan; Wittes, Janet Turk (2006). Statistical monitoring of clinical trials : a unified approach. Springer. ISBN 9780387300597. OCLC 553888945.
  13. ^ Liu, A.; Hall, W. J. (1999-03-01). "Unbiased estimation following a group sequential test". Biometrika. 86 (1): 71–78. doi:10.1093/biomet/86.1.71. ISSN 0006-3444.
  14. ^ Schou, I. Manjula; Marschner, Ian C. (2013-12-10). "Meta-analysis of clinical trials with early stopping: an investigation of potential bias". Statistics in Medicine. 32 (28): 4859–4874. doi:10.1002/sim.5893. ISSN 1097-0258. PMID 23824994. S2CID 22428591.

References edit

  • Wald, Abraham (1947). Sequential Analysis. New York: John Wiley and Sons.
  • Bartroff, J., Lai T.L., and Shih, M.-C. (2013) Sequential Experimentation in Clinical Trials: Design and Analysis. Springer.
  • Ghosh, Bhaskar Kumar (1970). Sequential Tests of Statistical Hypotheses. Reading: Addison-Wesley.
  • Chernoff, Herman (1972). Sequential Analysis and Optimal Design. SIAM.
  • Siegmund, David (1985). Sequential Analysis. Springer Series in Statistics. New York: Springer-Verlag. ISBN 978-0-387-96134-7.
  • Bakeman, R., Gottman, J.M., (1997) Observing Interaction: An Introduction to Sequential Analysis, Cambridge: Cambridge University Press
  • Jennison, C. and Turnbull, B.W (2000) Group Sequential Methods With Applications to Clinical Trials. Chapman & Hall/CRC.
  • Whitehead, J. (1997). The Design and Analysis of Sequential Clinical Trials, 2nd Edition. John Wiley & Sons.

External links edit

  • R Package: Wald's Sequential Probability Ratio Test by OnlineMarketr.com
  • Software for conducting sequential analysis and in the study of group interaction in computer-mediated communication by Dr. Allan Jeong at Florida State University
Commercial

sequential, analysis, confused, with, sequence, analysis, statistics, sequential, analysis, sequential, hypothesis, testing, statistical, analysis, where, sample, size, fixed, advance, instead, data, evaluated, collected, further, sampling, stopped, accordance. Not to be confused with Sequence analysis In statistics sequential analysis or sequential hypothesis testing is statistical analysis where the sample size is not fixed in advance Instead data is evaluated as it is collected and further sampling is stopped in accordance with a pre defined stopping rule as soon as significant results are observed Thus a conclusion may sometimes be reached at a much earlier stage than would be possible with more classical hypothesis testing or estimation at consequently lower financial and or human cost Contents 1 History 2 Alpha spending functions 3 Applications of sequential analysis 3 1 Clinical trials 3 2 Other applications 4 Bias 5 See also 6 Notes 7 References 8 External linksHistory editThe method of sequential analysis is first attributed to Abraham Wald 1 with Jacob Wolfowitz W Allen Wallis and Milton Friedman 2 while at Columbia University s Statistical Research Group as a tool for more efficient industrial quality control during World War II Its value to the war effort was immediately recognised and led to its receiving a restricted classification 3 At the same time George Barnard led a group working on optimal stopping in Great Britain Another early contribution to the method was made by K J Arrow with D Blackwell and M A Girshick 4 A similar approach was independently developed from first principles at about the same time by Alan Turing as part of the Banburismus technique used at Bletchley Park to test hypotheses about whether different messages coded by German Enigma machines should be connected and analysed together This work remained secret until the early 1980s 5 Peter Armitage introduced the use of sequential analysis in medical research especially in the area of clinical trials Sequential methods became increasingly popular in medicine following Stuart Pocock s work that provided clear recommendations on how to control Type 1 error rates in sequential designs 6 Alpha spending functions editWhen researchers repeatedly analyze data as more observations are added the probability of a Type 1 error increases Therefore it is important to adjust the alpha level at each interim analysis such that the overall Type 1 error rate remains at the desired level This is conceptually similar to using the Bonferroni correction but because the repeated looks at the data are dependent more efficient corrections for the alpha level can be used Among the earliest proposals is the Pocock boundary Alternative ways to control the Type 1 error rate exist such as the Haybittle Peto bounds and additional work on determining the boundaries for interim analyses has been done by O Brien amp Fleming 7 and Wang amp Tsiatis 8 A limitation of corrections such as the Pocock boundary is that the number of looks at the data must be determined before the data is collected and that the looks at the data should be equally spaced e g after 50 100 150 and 200 patients The alpha spending function approach developed by Demets amp Lan 9 does not have these restrictions and depending on the parameters chosen for the spending function can be very similar to Pocock boundaries or the corrections proposed by O Brien and Fleming Another approach that has no such restrictions at all is based on e values and e processes Applications of sequential analysis editClinical trials edit In a randomized trial with two treatment groups group sequential testing may for example be conducted in the following manner After n subjects in each group are available an interim analysis is conducted A statistical test is performed to compare the two groups and if the null hypothesis is rejected the trial is terminated otherwise the trial continues another n subjects per group are recruited and the statistical test is performed again including all subjects If the null is rejected the trial is terminated and otherwise it continues with periodic evaluations until a maximum number of interim analyses have been performed at which point the last statistical test is conducted and the trial is discontinued 10 Other applications edit Sequential analysis also has a connection to the problem of gambler s ruin that has been studied by among others Huygens in 1657 11 Step detection is the process of finding abrupt changes in the mean level of a time series or signal It is usually considered as a special kind of statistical method known as change point detection Often the step is small and the time series is corrupted by some kind of noise and this makes the problem challenging because the step may be hidden by the noise Therefore statistical and or signal processing algorithms are often required When the algorithms are run online as the data is coming in especially with the aim of producing an alert this is an application of sequential analysis Bias editTrials that are terminated early because they reject the null hypothesis typically overestimate the true effect size 12 This is because in small samples only large effect size estimates will lead to a significant effect and the subsequent termination of a trial Methods to correct effect size estimates in single trials have been proposed 13 Note that this bias is mainly problematic when interpreting single studies In meta analyses overestimated effect sizes due to early stopping are balanced by underestimation in trials that stop late leading Schou amp Marschner to conclude that early stopping of clinical trials is not a substantive source of bias in meta analyses 14 The meaning of p values in sequential analyses also changes because when using sequential analyses more than one analysis is performed and the typical definition of a p value as the data at least as extreme as is observed needs to be redefined One solution is to order the p values of a series of sequential tests based on the time of stopping and how high the test statistic was at a given look which is known as stagewise ordering 12 first proposed by Armitage See also editOptimal stopping Sequential estimation Sequential probability ratio test CUSUMNotes edit Wald Abraham June 1945 Sequential Tests of Statistical Hypotheses The Annals of Mathematical Statistics 16 2 117 186 doi 10 1214 aoms 1177731118 JSTOR 2235829 Berger James 2008 Sequential Analysis 2nd ed pp 438 439 doi 10 1057 9780230226203 1513 ISBN 978 0 333 78676 5 a href Template Cite book html title Template Cite book cite book a journal ignored help Missing or empty title help Weigl Hans Gunter 2013 Abraham Wald a statistician as a key figure for modern econometrics PDF Doctoral thesis University of Hamburg Kenneth J Arrow David Blackwell and M A Girshick 1949 Bayes and minimax solutions of sequential decision problems Econometrica 17 3 4 213 244 doi 10 2307 1905525 JSTOR 1905525 Randell Brian 1980 The Colossus A History of Computing in the Twentieth Century p 30 W Turnbull Bruce 2000 Group sequential methods with applications to clinical trials Chapman amp Hall ISBN 9780849303166 OCLC 900071609 a href Template Cite book html title Template Cite book cite book a CS1 maint multiple names authors list link O Brien Peter C Fleming Thomas R 1979 01 01 A Multiple Testing Procedure for Clinical Trials Biometrics 35 3 549 556 doi 10 2307 2530245 JSTOR 2530245 PMID 497341 Wang Samuel K Tsiatis Anastasios A 1987 01 01 Approximately Optimal One Parameter Boundaries for Group Sequential Trials Biometrics 43 1 193 199 doi 10 2307 2531959 JSTOR 2531959 PMID 3567304 Demets David L Lan K K Gordon 1994 07 15 Interim analysis The alpha spending function approach Statistics in Medicine 13 13 14 1341 1352 doi 10 1002 sim 4780131308 ISSN 1097 0258 PMID 7973215 Korosteleva Olga 2008 Clinical Statistics Introducing Clinical Trials Survival Analysis and Longitudinal Data Analysis First ed Jones and Bartlett Publishers ISBN 978 0 7637 5850 9 Ghosh B K Sen P K 1991 Handbook of Sequential Analysis New York Marcel Dekker ISBN 9780824784089 page needed a b Proschan Michael A Lan K K Gordan Wittes Janet Turk 2006 Statistical monitoring of clinical trials a unified approach Springer ISBN 9780387300597 OCLC 553888945 Liu A Hall W J 1999 03 01 Unbiased estimation following a group sequential test Biometrika 86 1 71 78 doi 10 1093 biomet 86 1 71 ISSN 0006 3444 Schou I Manjula Marschner Ian C 2013 12 10 Meta analysis of clinical trials with early stopping an investigation of potential bias Statistics in Medicine 32 28 4859 4874 doi 10 1002 sim 5893 ISSN 1097 0258 PMID 23824994 S2CID 22428591 References editWald Abraham 1947 Sequential Analysis New York John Wiley and Sons Bartroff J Lai T L and Shih M C 2013 Sequential Experimentation in Clinical Trials Design and Analysis Springer Ghosh Bhaskar Kumar 1970 Sequential Tests of Statistical Hypotheses Reading Addison Wesley Chernoff Herman 1972 Sequential Analysis and Optimal Design SIAM Siegmund David 1985 Sequential Analysis Springer Series in Statistics New York Springer Verlag ISBN 978 0 387 96134 7 Bakeman R Gottman J M 1997 Observing Interaction An Introduction to Sequential Analysis Cambridge Cambridge University Press Jennison C and Turnbull B W 2000 Group Sequential Methods With Applications to Clinical Trials Chapman amp Hall CRC Whitehead J 1997 The Design and Analysis of Sequential Clinical Trials 2nd Edition John Wiley amp Sons External links editR Package Wald s Sequential Probability Ratio Test by OnlineMarketr com Software for conducting sequential analysis and applications of sequential analysis in the study of group interaction in computer mediated communication by Dr Allan Jeong at Florida State University Commercial PASS Sample Size Software includes features for the setup of group sequential designs Retrieved from https en wikipedia org w index php title Sequential analysis amp oldid 1193641352, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.