RelationDigest

Thursday, 9 May 2024

P-Curve corrects for publication bias that does not exist.

This is another blog post illustrating the problems of p-curve analyses. P-curve was developed to correct the results of meta-analyses when studies report too many significant results. In this case, p-curve disregards the evidence from non-significant r…
Read on blog or Reader
Site logo image Replicability-Index Read on blog or Reader

P-Curve corrects for publication bias that does not exist.

Ulrich Schimmack

May 9

This is another blog post illustrating the problems of p-curve analyses. P-curve was developed to correct the results of meta-analyses when studies report too many significant results. In this case, p-curve disregards the evidence from non-significant results and applies a correct to the significant results. If the significant results were obtained without a real effect, the distribution of the significant p-values would be uniform. In contrast, the distribution would be 'right-skewed' (i.e., there are more p-values below .01 than p-values between .01 and .05). The distribution of the observed p-values is compared against the uniform distribution using visual inspection and statistical tests.

The main problem of p-curve is that it assumes bias rather than testing for bias. As a result, non-significant results are discarded even when there is no bias and the non-significant results provide valuable information about the presence of an effect. The main reason, p-curve does not test or bias is that the method was developed in a research area, social psychology and behavioral economics, where the use of questionable practices were rampant and non-significant results were rarely published. This is not the case in other research areas like medicine, where non-significant results are reported more often.

The example is from a meta-analysis of the use of hypno-therapy for the treatment of depression (Milling et al. 2019).

Leonard S. Milling, Keara E. Valentine, Hannah S. McCarley & Lindsey
M. LoStimolo (2019) A Meta-Analysis of Hypnotic Interventions for Depression Symptoms:
High Hopes for Hypnosis?, American Journal of Clinical Hypnosis, 61:3, 227-243, DOI:
10.1080/00029157.2018.1489777

The meta-analysis was based on 13 studies and the results are reported in Table 2.

Only 7 of the 13 results are statistically significant, p < .05 in the last column. The z-scores were entered in the p-curve app.

The results show a right-skewed distribution but the slope is flat and the significance test does not allow to reject the null-hypothesis that the deviation from a uniform distribution is just chance, p = .18. Based on these results, the data provide no evidence for the effectiveness of hypnotherapy.

Is there Publication Bias?

To justify the use of a selection model, it is necessary that there is publication bias. A powerful tool to test for the presence of publication bias is the comparison of the percentage of significant results against the mean power of studies. Without bias, the success rate should match mean power (Brunner & Schimmack, 2021). To test this hypothesis, I entered the data into the R-Index spreadsheet. The spreadsheet converts test statistics into two-sided p-values, z-scores, and observed power. In addition, it is recorded whether a result was significant or not.

In this case, the test statistics are z-scores and the conversion to p-values and back to z-scores is not necessary. The key finding is that the success rate is 7 out of 13 (54%) and that mean power is 41%. Thus, there is only a relatively small amount of bias at best. It is therefore problematic to discard the information from the non-significant results, which reduces the statistical power to demonstrate an effect.

Z-Curve

Z-curve was developed as a selection model like p-curve. However, z-curve can also be used for all observed values including non-significant results, if there is no selection bias. Using all 13 observations, the data provide sufficient evidence that hypnotherapy can be effective. The null-hypothesis implies that the expected discovery rate (EDR) and the expected replication rate (ERR) are 5%, which is the percentage of significant results expected by chance alone. The 95%CI excludes a value of 5%. Thus, there is evidence of an effect. At the same time, the results show that the studies had low power to produce significant results individually. The reason is that the sample sizes were small ranging from N = 20 to 53. Thus, future studies should use larger samples to avoid false negative (type II) errors.

Conclusion

The main conclusion is that researchers should use selection models only after they have demonstrated that selection bias is present. Assuming selection and discarding non-significant results reduces power and can lead to false negative results. A comparison of success rates and observed power using the R-Index spreadsheet provides a simple method to examine publication bias in small sets of studies.

R-Index Spreadsheet

Comment

Replicability-Index © 2024. Manage your email settings or unsubscribe.

WordPress.com and Jetpack Logos

Get the Jetpack app

Subscribe, bookmark, and get real-time notifications - all from one app!

Download Jetpack on Google Play Download Jetpack from the App Store
WordPress.com Logo and Wordmark title=

Automattic, Inc. - 60 29th St. #343, San Francisco, CA 94110  

at May 09, 2024
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest

No comments:

Post a Comment

Newer Post Older Post Home
Subscribe to: Post Comments (Atom)

A cake that tastes like summer

on hydrangeas, rural baking competitions, and peach canning ͏     ­͏     ­͏     ­͏     ­͏     ­͏     ­͏     ­͏     ­͏     ­͏     ­͏     ­͏  ...

  • [New post] Wiggle Kingdom: April Earnings on Spring Savings!
    Betsi...
  • [New post] Balancing the ‘E’ and ‘S’ in Environment, Social and Governance (ESG) crucial to sustaining liquidity and resilience in the African loan market (By Miranda Abraham)
    APO p...
  • Something plus something else
    Read on bl...

Search This Blog

  • Home

About Me

RelationDigest
View my complete profile

Report Abuse

Blog Archive

  • August 2025 (8)
  • July 2025 (59)
  • June 2025 (53)
  • May 2025 (47)
  • April 2025 (42)
  • March 2025 (30)
  • February 2025 (27)
  • January 2025 (30)
  • December 2024 (37)
  • November 2024 (31)
  • October 2024 (28)
  • September 2024 (28)
  • August 2024 (2729)
  • July 2024 (3249)
  • June 2024 (3152)
  • May 2024 (3259)
  • April 2024 (3151)
  • March 2024 (3258)
  • February 2024 (3046)
  • January 2024 (3258)
  • December 2023 (3270)
  • November 2023 (3183)
  • October 2023 (3243)
  • September 2023 (3151)
  • August 2023 (3241)
  • July 2023 (3237)
  • June 2023 (3135)
  • May 2023 (3212)
  • April 2023 (3093)
  • March 2023 (3187)
  • February 2023 (2865)
  • January 2023 (3209)
  • December 2022 (3229)
  • November 2022 (3079)
  • October 2022 (3086)
  • September 2022 (2791)
  • August 2022 (2964)
  • July 2022 (3157)
  • June 2022 (2925)
  • May 2022 (2893)
  • April 2022 (3049)
  • March 2022 (2919)
  • February 2022 (2104)
  • January 2022 (2284)
  • December 2021 (2481)
  • November 2021 (3146)
  • October 2021 (1048)
Powered by Blogger.