{"id":456,"date":"2018-07-31T12:05:19","date_gmt":"2018-07-31T12:05:19","guid":{"rendered":"https:\/\/onemol.org.uk\/?page_id=456"},"modified":"2025-09-21T18:34:10","modified_gmt":"2025-09-21T18:34:10","slug":"papers-about-p-values","status":"publish","type":"page","link":"https:\/\/onemol.org.uk\/?page_id=456","title":{"rendered":"Some papers about <em>p<\/em> values"},"content":{"rendered":"\n<p><a href=\"#follow\"><span class=\"smallprint\">Jump to follow-up<\/span><\/a><\/p>\n\n\n\n<p>These papers have nothing much to do with single molecule kinetics. They were written by David Colquhoun after his retirement from the world of single ion channels, as a way to keep him off the streets. They are listed here as a convenient place to keep a record.<\/p>\n\n\n\n<p>The papers concern the misinterpretation of tests of significance. Such tests were barely ever used in our single ion channel work. They represent a return to the interest of DC in statistical inference that he had in the 1960s, and which culminated on the publication of a textbook, <a title=\"Download pdf of the book\" href=\"http:\/\/www.dcscience.net\/Lectures_on_biostatistics-ocr4.pdf\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Lectures on Biostatistics<\/em><\/a> (OUP, 1971). The textbook has aged quite well, with the exception of the parts on interpretation of <em>p<\/em> values. In the 1960s, I missed entirely the problems of null hypothesis significance testing. But better late than never.<\/p>\n\n\n\n<p>The problem lies in the fact that most people still think that the <em>p<\/em> value is the probability that your results occurred by chance -see, for example. <a title=\"Download pdf of Gigerenzer 2004\" href=\"https:\/\/onemol.org.uk\/Gigerenzer-2004.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Gigerenzer <em>et al<\/em>.,(2006) [download pdf]<\/a>. It is nothing of the sort.<\/p>\n\n\n\n<p>The false positive risk (FPR) is the probability that a result that has been labelled as &#8220;statistically significant&#8221; is in fact a false positive. It is always bigger than the <em>p<\/em> value, often much bigger.<\/p>\n\n\n\n<p><strong>My recommendations<\/strong>. In brief, I suggest that <em>p<\/em> values and confidence intervals should still be cited, but they should be supplemented by a single number that gives an idea of the false positive risk (FPR). The simplest way to do this is to calculate the false positive risk that corresponds to a prior probability of there being a real effect of 0.5. This would still be optimistic for implausible hypotheses but it would be a great improvement on <em>p<\/em> values. The FPR<sub>50<\/sub>, calculated in this way is just a more comprehensible way of citing likelihood ratio (see <a href=\"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/00031305.2018.1529622\" target=\"_blank\" rel=\"noopener noreferrer\">2019 paper<\/a>). And the likelihood ratio (not the <em>p<\/em> value) is what sums up the evidence from your experiment; it does so in an entirely frequentist way, so avoiding the problem of priors that is the bane of Bayesian approaches.<\/p>\n\n\n\n<p><strong>Please note<\/strong>: the term &#8220;false discovery rate&#8221;, which was used in earlier papers, has now been replaced by &#8220;false positive risk&#8221;. The reasons for this change are explained in the introduction of the <a href=\"https:\/\/royalsocietypublishing.org\/doi\/full\/10.1098\/rsos.171085\" target=\"_blank\" rel=\"noopener noreferrer\">2017 paper<\/a>.<\/p>\n\n\n\n<p>If you prefer a video to reading, try <a href=\"#DCvideo2018\" target=\"_blank\" rel=\"noopener noreferrer\">this, on YouTube<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Original papers about the problem<\/h3>\n\n\n\n<p><a name=\"DC2014\"><\/a><br><strong>Colquhoun, D. (2014)<\/strong>. An investigation of the false discovery rate and the misinterpretation of <em>p<\/em>-values. Open access at <a title=\"Link to full text\" href=\"http:\/\/rsos.royalsocietypublishing.org\/content\/1\/3\/140216\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Royal Society Open Science<\/em><\/a> <span style=\"color: #800000;\"><em>This first paper looked at the risk of false positive results by simulation of Student&#8217;s <em>t<\/em> test. The advantage of simulation is that it makes the assumptions very clear without much mathematics. The disadvantage is that the results aren&#8217;t very general.<\/em>   #DC2014                       <\/span><\/p>\n\n\n\n<p><a name=\"DC2017\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D. (2017)<\/strong>. The reproducibility of research and the misinterpretation of <em>p<\/em>-values .  Open Access at <a title=\"Full text. open access\" href=\"http:\/\/rsos.royalsocietypublishing.org\/content\/4\/12\/171085\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Royal Society Open Science<\/em><\/a>. <span style=\"color: #800000;\"><em>This paper gives, in the appendix, mathematically exact solutions or the false positive risk, calculated by the p-equals method. This allows the false positive risk to be calculated, as a function of the observed p value, for a range of sample sizes. A web calculator is provided that makes the calculations simple to do.<\/em><\/span>                     #DC2017<\/p>\n\n\n\n<p><a name=\"fpr-calc\"><\/a><\/p>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<p><strong>Colquhoun, D. and Longstaff, C. (2017)<\/strong>. <strong>Web calculator<\/strong> for false positive risk <a href=\"https:\/\/fpr-calc.ucl.ac.uk\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/fpr-calc.ucl.ac.uk\/<\/a><\/p>\n<\/div><\/div>\n\n\n\n<p><p><strong><em>Update<\/em><\/strong>. Thanks to a cock-up by UCL, the web calculator was offline for a while.  Thanks to a ingenious solution provided by the wonderful people at our web hosts, <a href=\"https:\/\/positive-internet.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Positive Internet<\/a>, the original link, <a href=\"http:\/\/fpr-calc.ucl.ac.uk\/\" target=\"_blank\" rel=\"noreferrer noopener\">http:\/\/fpr-calc.ucl.ac.uk\/<\/a>, now works again. That link is actually redirected to an html page (http:\/\/fpr-calc.positive-dedicated.net\/) on the Positive Internet server in which is embedded the back up copy that is kindly  hosted by Daniel Lakens in Eindhoven, Netherlands. The only difference is that this version on longer rearranges on a mobile phone.                                #fpr-calc <\/p><\/p>\n\n\n\n<p>You can also go directly the server at Eindhoven  University of Technology, at <a href=\"https:\/\/shiny.ieis.tue.nl\/fpr_calc\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/shiny.ieis.tue.nl\/fpr_calc\/<\/a><span style=\"font-size: revert; color: initial;\">.<\/span><\/p>\n\n\n\n<p>There is also a copy of the web calculator at <a href=\"https:\/\/davidcolquhoun.shinyapps.io\/fpr-calc-ver1-7\/\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/davidcolquhoun.shinyapps.io\/fpr-calc-ver1-7\/<\/a><\/p>\n\n\n\n<p><a name=\"DC2019\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D. (2019)<\/strong>. The false positive risk: a proposal concerning what to do about <em>p<\/em> values, <a href=\"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/00031305.2018.1529622\" target=\"_blank\" rel=\"noopener noreferrer\"><em>The American Statistician<\/em>, 2019<\/a> (open access to full text). Also available on arXiv). <span style=\"color: #800000;\"><em>This paper examines more closely than before the assumptions that are made in calculations of FPR. It makes concrete proposals about how to solve the problem posed by the inadequacy of <em>p<\/em> values, with examples.<\/em><\/span>                                                            #DC2019<\/p>\n\n\n\n<p>In the same online edition, <em>The American Statistician<\/em> <a title=\"List of the 43 papers\" href=\"https:\/\/www.tandfonline.com\/toc\/utas20\/73\/sup1?nav=tocList\" target=\"_blank\" rel=\"noopener noreferrer\">published 43 papers<\/a> that were designed to say what <em>should<\/em> be done about the problem of abuse of <em>p<\/em> values.  One of these makes suggestions that are similar to mine, but based on even simpler calculations -see <a href=\"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/00031305.2018.1543135\">Benjamin and Berger (2019)<\/a>.  The only problem with their approach is that it doesn&#8217;t cater for the <a href=\"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/00031305.2018.1529622#d1e1079\" target=\"_blank\" rel=\"noreferrer noopener\">Jeffreys-Lindley phenomenon<\/a> and it is therefore not suitable for very large samples.<\/p>\n\n\n\n<p>At the same time, <em>Nature<\/em> published a <a href=\"https:\/\/www.nature.com\/articles\/d41586-019-00857-9\" target=\"_blank\" rel=\"noopener noreferrer\">comment piece<\/a> on the <em>p<\/em> value problem. The gist of this piece was a plea to abandon the term &#8220;statistically-significant&#8221;, because it involves the obviously silly idea that observing <em>p<\/em> = 0.049 tells you something different from <em>p<\/em> = 0.051. It was <a href=\"https:\/\/www.nature.com\/magazine-assets\/d41586-019-00857-9\/data-and-list-of-co-signatories\" target=\"_blank\" rel=\"noopener noreferrer\">co-signed by 840 people<\/a> (including me). <em>Nature<\/em> also published <a href=\"https:\/\/www.nature.com\/articles\/d41586-019-00874-8\" target=\"_blank\" rel=\"noopener noreferrer\">an editorial<\/a> which half-understood the problem and, sadly, said &#8220;<em>Nature<\/em> is not seeking to change how it considers statistical analysis in evaluation of papers at this time&#8221;. This sums up the problem: it is in the interests of both authors, and of journals, to continue to publish too many false positive results. Until this problem is solved, the corruption will continue.<\/p>\n\n\n\n<p><a name=\"DC2019b\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D. (2019b)<\/strong>. A response to critiques of \u2018The reproducibility of research and the misinterpretation of <em>p<\/em>-values\u2019. <a title=\"Full text. open access\" href=\"https:\/\/royalsocietypublishing.org\/doi\/10.1098\/rsos.190819\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Royal Society Open Science<\/em><\/a>. <span style=\"color: #800000;\"><em>This one started life as a response to a critique of my 2017 paper. But it evolved into a more general discussion of the assumptions made in my approach, and concluded with a summary of my present views about what should be done about <em>p<\/em> values. In brief, I now think that <em>p<\/em> values and confidence intervals should continue to be given, but they should be supplemented by an estimate of the false positive risk. I suggest the notation FPR<sub>50<\/sub> for the false positive risk that&#8217;s calculated on the basis that the prior probability of a real effect existing is 0.5. <\/em><\/span><\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Popular accounts of the problem<\/h3>\n\n\n\n<p>Colquhoun, D. (2015) False discovery rates and P values: the movie. <a title=\"Now superseded.\" href=\"https:\/\/www.youtube.com\/watch?v=tRZMD1cYX_c\" target=\"_blank\" rel=\"noopener noreferrer\">On YouTube<\/a>. <span style=\"color: #800000;\"><em>This slide show is now <a title=\"YouTube talk\" href=\"https:\/\/www.youtube.com\/watch?v=jZWgijUnIxI\" target=\"_blank\" rel=\"noopener noreferrer\">superseded by the 2018 version<\/a>.<\/em><\/span><\/p>\n\n\n\n<p><a name=\"DC2015\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D. (2015)<\/strong>. The perils of p-values. In <em>Chalkdust magazine<\/em>. Available at <a title=\"Link to Chalkdust magazine\" href=\"http:\/\/chalkdustmagazine.com\/features\/the-perils-of-p-values\/\" target=\"_blank\" rel=\"noopener noreferrer\">http:\/\/chalkdustmagazine.com\/features\/the-perils-of-p-values\/<\/a>. <span style=\"color: #800000;\"><em> Chalkdust is a magazine run by students at UCL. This article deals with the principles of randomisation tests as a non-mathematical way to get p values, plus a bit about what&#8217;s wrong with p values.<\/em><\/span><\/p>\n\n\n\n<p><a name=\"DCrantest\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun,D.(2015)<\/strong>. Randomisation tests. How to get a P value with no mathematics. A short (6 slides, 15 min) <a href=\"https:\/\/www.youtube.com\/watch?v=RFLbkJbd46Y\" target=\"_blank\" rel=\"noopener noreferrer\">video on YouTube<\/a>. <span style=\"color: #800000;\"><em>Forget t tests. The randomisation test is at least as powerful and it makes no assumption of normal distributions. Furthermore it makes very clear the fact that random allocation of treatments is an essential assumption for all tests of statistical significance. Of course the result is just a p value. It doesn&#8217;t tell you the probability that you are wrong: for that, see the other stuff on this page.<\/em><\/span><\/p>\n\n\n\n<p><a name=\"DCaeon2015\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D.(2016)<\/strong>.&nbsp;The problem with <em>p<\/em>-values. <a href=\"https:\/\/aeon.co\/essays\/it-s-time-for-science-to-abandon-the-term-statistically-significant\" target=\"_blank\" rel=\"noopener noreferrer\">Aeon magazine<\/a>. (This attracted <a href=\"https:\/\/aeon.co\/conversations\/what-should-be-done-to-improve-statistical-literacy\" target=\"_blank\" rel=\"noopener noreferrer\">147 comments<\/a>.<br><span style=\"color: #800000;\"><em>This essay is about the logic of inductive inference<em>e<\/em>. It is a non-mathematical introduction to the ideas raised in my 2014 paper.<\/em> <\/span><\/p>\n\n\n\n<p><a name=\"DCNature2017\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D. (2017)<\/strong>. Five ways to fix statistics. State false positive risk, too. <a title=\"Download pdf (free)\" href=\"https:\/\/www.nature.com\/magazine-assets\/d41586-017-07522-z\/d41586-017-07522-z.pdf\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Nature<\/em>, volume 251<\/a>. <span style=\"color: #800000;\"><em>A collection of short comments by five authors on what should be done about p values.<\/em><\/span><\/p>\n\n\n\n<p><a name=\"DCvideo2018\"><\/a><\/p>\n\n\n\n<p><strong>Colquhoun, D. (2018)<\/strong>. The false positive risk: a proposal concerning what to do about <em>p<\/em>-values (version 2). <span style=\"color: #800000;\"><em>This video is a slightly extended version of a talk that I gave at the Evidence Live meeting, June 2018, at the Centre for Evidence-Based Medicine, Oxford. It supersedes my earlier 2015 video on the same topic. It is an exposition of the ideas that are given in more detail in the 2017 paper and in the 2019 paper. In November 2019 an new version was posted -the content is the same, but the volume of the sound track is better. <\/em> <\/span><\/p>\n\n\n\n<p><iframe loading=\"lazy\" src=\"https:\/\/www.youtube.com\/embed\/jZWgijUnIxI?rel=0\" width=\"500\" height=\"281\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Why p values can\u2019t tell you what you need to know and what to do about it<\/strong><\/h4>\n\n\n\n<p><strong> 2020 version&nbsp;&nbsp;<\/strong>I gave a talk to the RIOT science club, on 1 October 2020.&nbsp; It has appeared <a href=\"https:\/\/www.youtube.com\/watch?v=agC-SG5-Qyk\" target=\"_blank\" rel=\"noopener noreferrer\">on YouTube<\/a>.&nbsp; This gave me a chance to update my ideas about what to do about <em>p<\/em> values.&nbsp; After the talk. Chris F. Carroll kindly sent me a transcript of it.&nbsp; I took the chance to improve a bit on some of the explanations that I&#8217;d given in the talk -especially in the Q&amp;A and I&#8217;ve posted the result on my blog, with links to the original talk. <a href=\"http:\/\/www.dcscience.net\/2020\/10\/18\/why-p-values-cant-tell-you-what-you-need-to-know-and-what-to-do-about-it\/\" target=\"_blank\" rel=\"noopener noreferrer\">It&#8217;s here.<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image\"><a href=\"http:\/\/www.dcscience.net\/2020\/10\/18\/why-p-values-cant-tell-you-what-you-need-to-know-and-what-to-do-about-it\/\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/onemol.org.uk\/slide1-500.jpg\" alt=\"slide 1\"\/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\">2021 version<a name=\"follow\"><\/a><\/h3>\n\n\n\n<p>This is a recording of a Zoom seminar, for the UCL Department of Statistical Science, given on 6 May 2021,  It&#8217;s <em>slightly<\/em> more technical than earlier versions. It explains better than earlier versions the assumptions that underlie my suggestions.<\/p>\n\n\n\n<p>It\u2019s remarkable that statisticians are still at war about how best to decide whether the difference between the means of two independent samples is a result of sampling error alone or whether it\u2019s real.<\/p>\n\n\n\n<p><strong>The FPR<sub>50<\/sub>: a simple, but rough, solution to the p values war (?)<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<span class=\"embed-youtube\" style=\"text-align:center; display: block;\"><iframe loading=\"lazy\" class=\"youtube-player\" width=\"500\" height=\"282\" src=\"https:\/\/www.youtube.com\/embed\/Kp-jpQCLL1o?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en-US&#038;autohide=2&#038;wmode=transparent\" allowfullscreen=\"true\" style=\"border:0;\" sandbox=\"allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox\"><\/iframe><\/span>\n<\/div><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\">Latest video: 2022<\/h3>\n\n\n\n<figure class=\"wp-block-video\"><\/figure>\n\n\n\n<p>This is a recording of a talk to the UCL R users&#8217; group, via Zoom, on 8 December 2022. It has a bit more about the web calculator (an R Shiny app) than other talks.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<span class=\"embed-youtube\" style=\"text-align:center; display: block;\"><iframe loading=\"lazy\" class=\"youtube-player\" width=\"500\" height=\"282\" src=\"https:\/\/www.youtube.com\/embed\/tFTc_kQ9kSM?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en-US&#038;autohide=2&#038;wmode=transparent\" allowfullscreen=\"true\" style=\"border:0;\" sandbox=\"allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox\"><\/iframe><\/span>\n<\/div><\/figure>\n\n\n\n\n\n<h3 class=\"wp-block-heading\">Follow-up<\/h3>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Jump to follow-up These papers have nothing much to do with single molecule kinetics. They were written by David Colquhoun after his retirement from the world of single ion channels, as a way to keep him off the streets. They &hellip; <a href=\"https:\/\/onemol.org.uk\/?page_id=456\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":10,"menu_order":0,"comment_status":"open","ping_status":"closed","template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"jetpack_post_was_ever_published":false,"footnotes":""},"class_list":["post-456","page","type-page","status-publish","hentry"],"jetpack_shortlink":"https:\/\/wp.me\/P3xlLM-7m","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/pages\/456","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/onemol.org.uk\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=456"}],"version-history":[{"count":44,"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/pages\/456\/revisions"}],"predecessor-version":[{"id":768,"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/pages\/456\/revisions\/768"}],"up":[{"embeddable":true,"href":"https:\/\/onemol.org.uk\/index.php?rest_route=\/wp\/v2\/pages\/10"}],"wp:attachment":[{"href":"https:\/\/onemol.org.uk\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=456"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}