class: left, bottom, inverted, title-slide .title[ # Perspectives on the Bayes factor ] .author[ ### Jorge N. Tendeiro
Hiroshima University
tendeiro@hiroshima-u.ac.jp
Materials:
https://www.jorgetendeiro.com/talk/2023_unilisboa/
] .date[ ### 01 March 2023
] --- background-image: url(Figures/pexels-max-fischer-5212343.png) background-size: cover <!-- To print to PDF with pauses included: --> <!-- renderthis::to_pdf("name_of_file.Rmd", partial_slides = TRUE) --> # Outline The <span style="color:#A97F12">Bayes factor</span>: 1. Introduction. 2. In practice. 3. Properties. 4. In applied research. 5. Conclusions, next steps. <br> The contents of this talk include materials that I recently presented at a conference: [https://www.jorgetendeiro.com/talk/2023_csp/](https://www.jorgetendeiro.com/talk/2023_csp/) --- # Setting For this talk, I do _not_ assume that everyone is... - ... acquainted with the <span style="color:#A97F12">Bayesian framework</span>. - ... acquainted with the <span style="color:#A97F12">Bayes factor</span>. - ... familiar with <span style="color:#A97F12">R</span> nor <span style="color:#A97F12">JASP</span>. -- <br><br><br> I included <span style="color:#A97F12">more</span> material than I can discuss in today's talk, _on purpose_. <br> Those interested should have enough info to follow up afterwards! --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 1. Bayes factor — Introduction --- ## Bayes factor <span style="color:#A97F12">Bayes factors</span> are being increasingly advocated as a better alternative to _null hypothesis significance testing_ (NHST).<sup>1,2,3,4,5</sup> .footnote[ <sup>1</sup>Jeffreys (1961) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Wagenmakers et al. (2010) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Vanpaemel (2010) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Masson (2011) <span style="display:inline-block; width: 20px;"></span> <sup>5</sup>Dienes(2014) ] --- ## Bayes factor — Definition The Bayes factor<sup>1,2</sup> quantifies the change from <span style="color:#A97F12">prior odds</span> to <span style="color:#A97F12">posterior odds</span> due to the data observed. <br> Consider: - Two hypotheses (or models) to compare, `\(\mathcal{H}_0\)` _vs_ `\(\mathcal{H}_1\)`. - Data `\(D\)`. -- Assume that either `\(\mathcal{H}_0\)` or `\(\mathcal{H}_1\)` must hold true. <br> Then by Bayes’ rule ( `\(i=0, 1\)`): `$$p(\mathcal{H}_i|D) = \frac{p(\mathcal{H}_i)p(D|\mathcal{H}_i)} {p(\mathcal{H}_0)p(D|\mathcal{H}_0) + p(\mathcal{H}_1)p(D|\mathcal{H}_1)},$$` -- and dividing member by member leads to `$$\underset{\text{prior odds}}{\underbrace{\frac{p(\mathcal{H}_0)}{p(\mathcal{H}_1)}}} \times \underset{\color{#A97F12}{\text{Bayes factor, }BF_{01}}}{\underbrace{\frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)}}} = \underset{\text{posterior odds}}{\underbrace{\frac{p(\mathcal{H}_0|D)}{p(\mathcal{H}_1|D)}}}.$$` .footnote[ <sup>1</sup>Jeffreys(1939) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Kass and Raftery (1995) ] --- ## Bayes factor — Interpretation (1/2) `$$\boxed{ BF_{01} = \frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)} }$$` <br><br><br> For instance, `\(BF_{01} = 5\)`: > _The data are <span style="color:#A97F12">five times more likely</span> to have occurred under `\(\mathcal{H}_0\)` than under `\(\mathcal{H}_1\)`._ --- ## Bayes factor — Interpretation (2/2) `$$\boxed{ \underset{\text{prior odds}}{\underbrace{\frac{p(\mathcal{H}_0)}{p(\mathcal{H}_1)}}}\times \underset{\color{#A97F12}{\text{Bayes factor, }BF_{01}}}{\underbrace{\frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)}}} = \underset{\text{posterior odds}}{\underbrace{\frac{p(\mathcal{H}_0|D)}{p(\mathcal{H}_1|D)}}} }$$` <br> For instance, `\(BF_{01} = 5\)`: > _After observing the data, my relative belief in `\(\mathcal{H}_0\)` over `\(\mathcal{H}_1\)` increased by 5 times._ -- <br><br> This holds regardless of the initial relative belief (i.e., prior odds) of a rational agent. <center> <img src="Figures/BF_table.png" alt="Bayes factor interpretation." style="width:80%;"/> </center> --- ## Bayes factor — Possible values `\(BF_{01}=\frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)} \in [0, \infty)\)`: - `\(BF_{01} > 1 \longrightarrow\)` Evidence in favor of `\(\mathcal{H}_0\)` over `\(\mathcal{H}_1\)`. - `\(BF_{01} = 1 \longrightarrow\)` Equal support for either model. - `\(BF_{01} < 1 \longrightarrow\)` Evidence in favor of `\(\mathcal{H}_1\)` over `\(\mathcal{H}_0\)`. -- <br> Some qualitative cutoff labels have been suggested, for instance<sup>1,2,3</sup>. Here's Kass and Raftery's classifier: <center> <img src="Figures/BF_classifier.png" alt="Bayes factor classifier." style="width:60%;"/> </center> .footnote[ <sup>1</sup>Jeffreys (1939) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Kass and Raftery (1995) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Lee and Wagenmakers (2013) ] --- ## Bayes factor — Computation `$$\boxed{BF_{01} = \frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)}}$$` -- <br> Essentially, any two statistical models that make predictions are in theory eligible to be compared via the Bayes factor. We ''just'' need to evaluate each model's <span style="color:#A97F12">marginal likelihood</span>: `$$P(D|\mathcal{H}_i) = \displaystyle\int_{\Theta_i} \underbrace{p(D|\theta, \mathcal{H}_i)}_{\text{likelihood}}\underbrace{p(\theta|\mathcal{H}_i)}_{\text{prior}}d\theta.$$` -- There are various numerical procedures for this.<sup>1,2,3,4,5,6,7,8</sup> <br> As of recently, bridge sampling<sup>7</sup> has been of great practical use (in combination JAGS, Stan, or NIMBLE). .footnote[ <sup>1</sup>Berger and Pericchi (2001) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Carlin and Chib (1995) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Chen, Shao, and Ibrahim (2000) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Gamerman and Lopes (2006) <span style="display:inline-block; width: 20px;"></span> <br> <sup>5</sup>Gelman and Meng (1998) <span style="display:inline-block; width: 20px;"></span> <sup>6</sup>Green (1995) <span style="display:inline-block; width: 20px;"></span> <sup>7</sup>Gronau et al. (2017) <span style="display:inline-block; width: 20px;"></span> <sup>8</sup>Kass and Raftery (1995) <span style="display:inline-block; width: 20px;"></span> ] --- ## Bayes factor — Computation `$$\boxed{BF_{01} = \frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)}}$$` For simpler models there are a few R packages available to assist with the computations: - `BayesFactor`<sup>1</sup> (mostly used). - `bain`<sup>2</sup>. - `easystats`<sup>3</sup>. - `bayestestR`<sup>4</sup>. - `brms`<sup>5</sup> and `rstanarm`<sup>6</sup>, relying on the `bridgesampling`<sup>7</sup> package. There is also [JASP](https://jasp-stats.org/), a handy and open source GUI. .footnote[ <sup>1</sup>Morey and Rouder (2022) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Gu et al. (2021) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Lüdecke et al. (2022) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Makowski, Ben-Shachar, and Lüdecke (2019) <span style="display:inline-block; width: 20px;"></span> <br> <sup>5</sup>Bürkner (2021) <span style="display:inline-block; width: 20px;"></span> <sup>6</sup>Goodrich et al. (2022) <span style="display:inline-block; width: 20px;"></span> <sup>7</sup>Gronau, Singmann, and Wagenmakers (2020) ] --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 2. Bayes factor — In practice --- ## Bayes factor — In JASP <br> <center> <video width="700" controls> <source src="Videos/video_JASP.mp4" type="video/mp4"> </video> </center> --- ## Bayes factor — In R <br> <center> <video width="700" controls> <source src="Videos/video_R.mp4" type="video/mp4"> </video> </center> --- ## Bayes factor — Default priors <center> <img src="Figures/ttestBF_priors.png" alt="Bayes factor default priors (independent samples t-test)." style="width:80%;"/> </center> --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 3. Bayes factor — Properties --- ## Bayes factor — Critical appraisal Bayes factor have been praised in many instances.<sup>1,2,3,4,5</sup> But, surprisingly, I could not find many sources with <span style="color:#A97F12">critical</span> appraisals of the Bayes factor. -- <br><br><br><br> I have been doing this for a few years now.<sup>6,7,8,9</sup> .footnote[ <sup>1</sup>Dienes (2011) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Dienes (2014) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Masson (2011) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Vanpaemel (2010) <span style="display:inline-block; width: 20px;"></span> <sup>5</sup>Wagenmakers et al. (2018) <span style="display:inline-block; width: 20px;"></span> <sup>6</sup>Tendeiro and Kiers (2019) <span style="display:inline-block; width: 20px;"></span> <br> <sup>7</sup>Tendeiro, Kiers, and Ravenzwaaij (2022) <span style="display:inline-block; width: 20px;"></span> <sup>8</sup>Tendeiro and Kiers (2023a) <span style="display:inline-block; width: 20px;"></span> <sup>9</sup> Tendeiro and Kiers (2023b) ] --- ## Bayes factor — Some properties - Bayes factors are <span style="color:#A97F12">not</span> posterior odds! - Bayes factors are (at least _can be_) <span style="color:#A97F12">sensitive</span> to priors! - Bayes factors are a measure of <span style="color:#A97F12">relative</span> evidence! - Bayes factors can <span style="color:#A97F12">not</span> establish absence/presence! - Bayes factors are <span style="color:#A97F12">not</span> an effect size measure! - Inconclusive evidence is <span style="color:#A97F12">not</span> evidence of absence! - Bayes factors are a <span style="color:#A97F12">continuous</span> measure of relative evidence! --- ## Bayes factor — Some properties For the rest of this presentation, I will: - Present the results of a study aiming at studying the occurrence of misconceptions in the literature. - Explain each misconception. - Speculate on why these misconceptions come about. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research --- ## Bayes factors — In applied research Until recently, there was <span style="color:#A97F12">no</span> characterization of the use of the Bayes factor in applied research. <br> Wong and colleagues<sup>1</sup> were the first to start unveiling the current state of affairs. -- <br><br><br><br> In an ongoing effort, I am currently extending the work of Wong et al.. <br> Here I report the details and main findings of my study. <br> Work with [Henk Kiers](https://www.rug.nl/staff/h.a.l.kiers/research?lang=en), [Rink Hoekstra](https://www.rug.nl/staff/r.hoekstra/), [Tsz Keung Wong](https://hk.linkedin.com/in/tsz-keung-wong-a93738161?trk=people_directory), and [Richard Morey](https://richarddmorey.com/). <br><br><br><br> Preprint (under review): <br> [https://psyarxiv.com/du3fc/](https://psyarxiv.com/du3fc/) .footnote[ <sup>1</sup>Wong, Kiers, and Tendeiro (2022) ] --- ## Context **Background** <br> Social Sciences. <br><br><br> **Target:** <br> NHBT and the Bayes factor in particular. <br><br><br> **Motivation:** <br> Bayes factors have been regularly used since, say, 2010. <br> It is very recent. <br> Not many researchers have received formal training. <br> It is unclear how things are working out. --- ## Advanced literature search _Google Scholar_ (2010—): <small> > `\(\texttt{("bayes factor" AND "bayesian test" AND psychol)}\)` </small> <br> _Web of Science_: <small> > `\(\texttt{(TI=((bayes factor OR bayes* selection OR bayes* test*) AND psycho*) OR}\)` > `\(\texttt{AB=((bayes factor OR bayes* selection OR bayes* test* OR bf*) AND psychol*) OR}\)` > `\(\texttt{AK=((bayes factor OR bayes* selection OR bayes* test* OR bf*) AND psychol*))}\)` > `\(\texttt{AND PY=(2010-2022)}\)` </small> <br><br> `\(109 + 58 = 167\)` papers (after selection). --- ## Grading criteria <center> <img src="Figures/BF_QRIPs.png" alt="QRIPs." style="width:90%;"/> </center> --- ## Results <center> <img src="Figures/BF_results.png" alt="Bayes factor study results" style="width:53%;"/> </center> --- ## Results <br><br><br><br> Overall: - 149 papers (<span style="color:#A97F12">89.2%</span>) displayed at least one QRIP. - 104 papers (<span style="color:#A97F12">62.3%</span>) displayed at least two QRIPs. --- ## Discussion of the results <br><br><br><br> We reasoned over the reasons behind the found problems. Below is a selected synopsis of our considerations. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Bayes factors are _not_ posterior odds --- ## Bayes factors are _not_ posterior odds — _Explanation_ `$$\underset{\text{prior odds}}{\underbrace{\frac{p(\mathcal{H}_0)}{p(\mathcal{H}_1)}}} \times \underset{\color{#A97F12}{\text{Bayes factor, }BF_{01}}}{\underbrace{\frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)}}} = \underset{\color{#A97F12}{\text{posterior odds}}}{\underbrace{\frac{p(\mathcal{H}_0|D)}{p(\mathcal{H}_1|D)}}}.$$` -- <br> Say that `\(BF_{01} = 32\)`; what does this mean? > _After looking at the data, we revise our belief towards `\(\mathcal{H}_0\)` by 32 times._ -- <br> **Q:** What does this imply concerning the probability of each model, given the observed data? <br> **A:** On its own, <span style="color:#A97F12">nothing at all</span>! -- <br><br> Bayes factors `\(=\)` rate of _change_ of belief, <span style="color:#A97F12">not</span> the _updated_ belief.<sup>1</sup> .footnote[ <sup>1</sup>Edwards, Lindman, and Savage (1963) ] --- ## Bayes factors are _not_ posterior odds — _What we found..._ > *"The alternative hypothesis is 2 times more likely than the null hypothesis ( `\(B_{+0}=2.46\)`; Bayesian 95% CI [0.106, 0.896])."* .pull-right-30[ <font color="#808080"> <i> Incidence: <br> - 13.2% as definition <br> - 20.4% as interpretation </i> </font> ] <br><br> **Possible explanations:** - Principle of indifference. - Overselling Bayes as the _theory of inverse probability_.<sup>1</sup> - Cognitive dissonance. .footnote[ <sup>1</sup>Jeffreys(1961) ] --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Bayes factors are (at least can be) _sensitive_ to priors --- ## Bayes factors are (at least can be) _sensitive_ to priors — _Explanation_ Very well known.<sup>1,2,3,4,5</sup> `$$\boxed{P(D|\mathcal{H}_i) = \displaystyle\int_{\Theta_i} p(D|\theta, \mathcal{H}_i)\color{#A97F12}{p(\theta|\mathcal{H}_i)}d\theta}$$` -- **Example: Bias of a coin**<sup>6</sup> - `\(\mathcal{H}_0: \theta = .5\)` _vs_ `\(\mathcal{H}_1: \theta \not= .5\)` - Data: 60 successes in 100 throws. - Four within-model priors; all `\(Beta(a, b)\)`. <center> <img src="Figures/BF_priors.png" alt="The effect of priors." style="width:60%;"/> </center> .footnote[ <sup>1</sup>Kass (1993) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Gallistel (2009) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Vanpaemel (2010) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Robert (2016) <span style="display:inline-block; width: 20px;"></span> <sup>5</sup>Withers (2002) <span style="display:inline-block; width: 20px;"></span> <sup>6</sup>Liu and Aitkin (2008) <span style="display:inline-block; width: 20px;"></span> ] --- ## Bayes factors are (at least can be) _sensitive_ to priors — _What we found..._ Reporting nothing at all (29.9%) or relying on software defaults (35.3%) was quite common. -- <br><br><br> **Possible explanations:** - Lack of awareness. - Economic writing style. - Default priors to... <br> ... ease comparison, avoid specification, meet 'objectivity'. <br> Also: improve peer-review chances, principle of indifference, preregistration. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Bayes factors are a measure of _relative_ evidence --- ## Bayes factors are a measure of _relative_ evidence — _Explanation_ Say that `\(BF_{01} = 100\)`; what does this mean? > *The observed data are 100 times more likely under `\(\mathcal{H}_0\)` <span style="color:#A97F12">than under this particular `\(\mathcal{H}_1\)`</span>.* -- <br><br><br> - Evidence is _relative_.<sup>1</sup> - A model may actually be dreadful, but simply less so than its competitor.<sup>2,3</sup> - Little is known as to how Bayes factors behave under model misspecification (but see<sup>4</sup>). .footnote[ <sup>1</sup>Morey, Romeijn, and Rouder (2016) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Rouder (2014) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Gelman and Rubin (1995) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Ly, Verhagen, and Wagenmakers (2016) <span style="display:inline-block; width: 20px;"></span> ] --- ## Bayes factors are a measure of _relative_ evidence — _What we found..._ > *"With this 'stronger' VB05 prior, we found strong evidence for the null hypothesis ( `\(\text{BFs}_\text{null}\)` ranging from 12.7 to 22.7 for the 5 ROIs)."* .pull-right-30[ <font color="#808080"> <i> Incidence 62.3% </i> </font> ] <br><br> **Possible explanations:** - Writing style. - Implicitly assumed. - Increased impact. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Bayes factors can _not_ establish absence/presence --- ## Bayes factors can _not_ establish absence/presence — _Explanation_ Say that `\(BF_{01} = 100\)`, for `\(\mathcal{H}_0: \mu=0\)` vs `\(\mathcal{H}_1: \mu\not=0\)`. > *This does not imply that `\(\mu=0\)`.* -- <br><br><br> - First of all, the Bayes factor (as the `\(p\)`-value) is a stochastic endeavor, not a factual proof. - Furthermore, the Bayes factor provides a relative assessment of the likelihood of the observed data, not of the entertained hypotheses. --- ## Bayes factors can _not_ establish absence/presence — _What we found..._ > *"For 6-year-olds, there was no difference between environments ( `\(M_\textit{smooth} = 2.11\)` vs. `\(M_\textit{rough} = 1.93\)`, `\(t(52) = 1.0\)`, `\(p = 0.31\)`, `\(d = 0.3\)`, `\(BF = .42\)`)."* .pull-right-30[ <font color="#808080"> <i> Incidence 35.3% </i> </font> ] <br><br> **Possible explanations:** - Increased impact. - Avoid uncertainty. - Writing style. - Influence from NHST. - Decision making. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Bayes factors are _not_ an effect size measure --- ## Bayes factors are _not_ an effect size measure — _Explanation_ **Example:** - Bayesian one sample `\(t\)`-test: <br> `\(\mathcal{H}_0: \mu=0\)` vs `\(\mathcal{H}_1: \mu\not=0\)`. - JZS default prior ( `\(r=.707\)`). - `\(\overline{x}=0.1\)`, `\(sd=1\)` at each sample size (thus, the effect size is fixed throughout). <center> <img src="Figures/BFvsES.png" alt="Bayes factor vs effect sizes." style="width:80%;"/> </center> --- ## Bayes factors are _not_ an effect size measure — _What we found..._ > *"Pupil size was larger in a higher tracking load (...). However, the Bayesian test showed only positive, but smaller, effect of Load on tracking pupil size ( `\(BF_\text{incl.} = 7.506\)`)."* .pull-right-30[ <font color="#808080"> <i> Incidence 4.2% </i> </font> ] <br><br> **Possible explanations:** - Recreating a similar misconception based on `\(p\)`-values. - Bayes factor labels in use. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Inconclusive evidence is _not_ evidence of absence --- ## Inconclusive evidence is _not_ evidence of absence — _Explanation_ `$$\boxed{BF_{01} = \frac{p(D|\mathcal{H}_0)}{p(D|\mathcal{H}_1)}\color{#A97F12}{=1}}$$` > _Data are equally likely under either model._ -- <br><br> Data are perfectly uninformative. This does not equate to ''_there is nothing to be found_''. --- ## Inconclusive evidence is _not_ evidence of absence — _What we found..._ > *"In contrast there was no difference in meaning between the thinking without examples and planning conditions; the Bayes factor provided anecdotal evidence in favor of the null ( `\(BF_{10} = .86\)`)."* .pull-right-30[ <font color="#808080"> <i> Incidence 3.6% </i> </font> ] <br><br> **Possible explanations:** - Recreating a similar misconception based on `\(p\)`-values. - Absence as default. - Dichotomization. - Increased impact. - Preference for parsimony. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 4. Bayes factors — In applied research ## Bayes factors are a _continuous_ measure of relative evidence --- ## <small>Bayes factors are a _continuous_ measure of relative evidence — _Explanation_</small> Bayes factors are a <span style="color:#A97F12">continuous</span> measure of evidence in `\([0, \infty)\)`. <br> For instance, if `\(BF_{01} > 1\)` then - The observed data are <span style="color:#A97F12">more likely</span> under `\(\mathcal{H}_0\)` than under `\(\mathcal{H}_1\)`. - The <span style="color:#A97F12">larger</span> `\(BF_{01}\)`, the <span style="color:#A97F12">stronger</span> the evidence for `\(\mathcal{H}_0\)` over `\(\mathcal{H}_1\)`. -- <br><br> **Q:** Can ''_more likely than_'' be qualified? <br> **A:** Several categorizations of strength of evidence (what is weak?, moderate?, strong?) exist.<sup>1,2,3,4</sup> <br><br> But this is problematic in various ways. .footnote[ <sup>1</sup>Jeffreys (1961) <span style="display:inline-block; width: 20px;"></span> <sup>2</sup>Kass and Raftery (1995) <span style="display:inline-block; width: 20px;"></span> <sup>3</sup>Lee and Wagenmakers (2013) <span style="display:inline-block; width: 20px;"></span> <sup>4</sup>Dienes (2016) <span style="display:inline-block; width: 20px;"></span> ] --- ## <small>Bayes factors are a _continuous_ measure of relative evidence — _What we found..._</small> > *"(...) In terms of Bayes factor ( `\(BF\)`), evidence for greater disgust in the experimental group was strong ( `\(BF_{10} > 10\)`), but there was only weak evidence for a difference in other emotions ( `\(BF_{10}\text{’s} < 3\)`)."* .pull-right-30[ <font color="#808080"> <i> Incidence 5.4% </i> </font> ] <br><br> **Possible explanations:** - Summary. - Seeking authority. - Avoiding criticism. - Borrowing from the literature and JASP. - NHST ('significant', 'not significant'). --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # 5. Conclusions, next steps --- ## Conclusions (1/2) <br><br> I think that, concerning <span style="color:#A97F12">testing</span>: - Model comparison (including hypothesis testing) is really important. - However, and clearly, researchers test _way_ too much. - Testing says very little about how well a model fits to data. --- ## Conclusions (2/2) And what about <span style="color:#A97F12">estimation</span>? I think that: - Testing need <span style="color:#A97F12">not</span> be a prerequisite for estimation, unlike what some advocate.<sup>1</sup> - Estimation quantifies uncertainty in ways that Bayes factors simply can not. - Estimating effect sizes (direction, magnitude) is crucial. Bayes factors ignore this! - Avoiding the dichotomous reasoning subjacent to Bayes factors can help. <br><br><br> Bayes factors can be very useful (I use them!). But they should not always be the end of our inference. .footnote[ <sup>1</sup>Wagenmakers et al. (2018) <span style="display:inline-block; width: 20px;"></span> ] --- ## What’s next? <br><br> A follow-up study is in preparation. - Create and deploy a Shiny app that illustrates correct and incorrect usage of the Bayes factor. - Assess the efficacy of this app by means of an experiment. --- class: center, middle background-image: url(Figures/pexels-ryutaro-tsukata-5191373_SEP.png) background-size: cover # Questions?