Post hoc analysis
In a scientific study, post hoc analysis (from Latin post hoc, "after this") consists of statistical analyses that were specified after the data were seen.[1][2][3] A post hoc analysis is usually used to explore specific, statistically significant differences between the means of three or more independent groups-- differences detected with an analysis of variance (ANOVA).[4] An ANOVA does not identify the group(s); for that, a post hoc analysis is required.[5]
Because each post hoc analysis is effectively a statistical test, conducting multiple post hoc comparisons introduces a family-wise error rate problem, which is a type of multiple testing problem. This increases the likelihood of false positives unless corrected.
Post hoc tests are follow-up tests performed after a significant ANOVA result[6] to identify where the differences lie (which specific groups differ). To compensate, multiple post hoc testing procedures are sometimes used, but that is often difficult or impossible to do precisely. Post hoc analysis that is conducted and interpreted without adequate consideration of this problem is sometimes called data dredging (p-hacking) by critics because the statistical associations that it finds are often spurious.[7] In other words, findings from data dredging are invalid or not trustworthy.
Post hoc analyses are acceptable when transparently reported as exploratory. In other words, post hoc analyses are not inherently unethical.[8] The main requirement for their ethical use is simply that their results not be mispresented as the original hypothesis.[8] Modern editions of scientific manuals have clarified this point; for example, APA style now specifies that "hypotheses should now be stated in three groupings: preplanned–primary, preplanned–secondary, and exploratory (post hoc). Exploratory hypotheses are allowable, and there should be no pressure to disguise them as if they were preplanned."[8]
Types of post hoc analysis
[edit]Types or categories of post hoc analyses include[9]:
- Pairwise comparisons: Tests all possible pairs
- Trend analysis: Tests for linear or quadratic trends across ordered groups
- Simple effects analysis: Examines effects within factorial ANOVA
- Interaction probing: Analyzes interaction constraints within factorial ANOVA
- Restricted Sets of Contrasts: Testing smaller families of comparisons
In addition, a subgroup analysis[10] examines whether findings differ between discrete categories of subjects in the sample. This approach is common in clinical and observational studies.
Common post hoc tests
[edit]Common post hoc tests include:[11][12]
- Holm-Bonferroni Procedure
- Newman-Keuls
- Rodger's Method
- Scheffé's Method
- Tukey's Test and Honestly Significance Difference (HSD) (see also: Studentized Range Distribution)
However, with the exception of Scheffès Method, these tests should be specified "a priori" despite being called "post-hoc" in conventional usage. For example, a difference between means could be significant with the Holm-Bonferroni method but not with the Turkey Test and vice versa. It would be poor practice for a data analyst to choose which of these tests to report based on which gave the desired result.
Causes
[edit]Sometimes the temptation to engage in post hoc analysis is motivated by a desire to produce positive results or see a project as successful. In the case of pharmaceutical research, there may be significant financial consequences to a failed trial.[citation needed]
See also
[edit]References
[edit]- ^ "What is the significance and use of post-hoc analysis studies?". www.cwauthors.com. Retrieved 2022-12-09.
- ^ "11.8: Post Hoc Tests". Statistics LibreTexts. 2019-11-12. Retrieved 2022-12-09.
- ^ "Post Hoc". FORRT - Framework for Open and Reproducible Research Training. Retrieved 2025-11-02.
- ^ "SAGE Research Methods - The SAGE Encyclopedia of Communication Research Methods". methods.sagepub.com. Retrieved 2022-12-09.
- ^ "11.8: Post Hoc Tests". Statistics LibreTexts. 2019-11-12. Retrieved 2025-11-02.
- ^ Bobbitt, Zach (2019-04-14). "A Guide to Using Post Hoc Tests with ANOVA". Statology. Retrieved 2025-11-02.
- ^ Zhang, Yiran; Hedo, Rita; Rivera, Anna; Rull, Rudolph; Richardson, Sabrina; Tu, Xin M. (2019-08-01). "Post hoc power analysis: is it an informative and meaningful analysis?". General Psychiatry. 32 (4) e100069. doi:10.1136/gpsych-2019-100069. ISSN 2517-729X. PMC 6738696.
- ^ a b c American Psychological Association (2020). Publication Manual of the American Psychological Association: the Official Guide to APA Style (7th ed.). Washington, DC: American Psychological Association. ISBN 978-1-4338-3217-8.
- ^ Beaton, Albert E.; Keppel, Geoffrey (1975). "Design and Analysis: A Researcher's Handbook". American Educational Research Journal. 12 (1): 101. doi:10.2307/1162588. ISSN 0002-8312.
- ^ Andrade, Chittaranjan (2023-11-01). "Types of Analysis: Planned (prespecified) vs Post Hoc, Primary vs Secondary, Hypothesis-driven vs Exploratory, Subgroup and Sensitivity, and Others". Indian Journal of Psychological Medicine. 45 (6): 640–641. doi:10.1177/02537176231216842. ISSN 0253-7176. PMC 10964884. PMID 38545527.
- ^ "Post Hoc Definition and Types of Tests". Statistics How To. Retrieved 2022-12-09.
- ^ Pamplona, Fabricio (2022-07-28). "Post Hoc Analysis: Process and types of tests". Mind the Graph Blog. Retrieved 2022-12-09.