Simplifying Bayesian Inference

Transcription

1 Simplifying Bayesian Inference Stefan Krauß, Laura Martignon & Ulrich Hoffrage Max Planck Institute For Human Development Lentzeallee 94, Berlin-Dahlem Probability theory can be used to model inference under uncertainty. The particular way in which Bayes formula is stated, which is of only minor importance in standard probability textbooks, becomes central in this context. When events can be interpreted as evidences and hypotheses, Bayes formula allows one to update one s belief in a hypothesis in light of new data. Is unaided human reasoning Bayesian? Kahneman and Tversky (1972) affirmed: In his evaluation of evidence, man is not Bayesian at all. In their book Judgment under uncertainty (1982), they attempted to prove that human judgment is riddled with systematic deviations from the logical and probabilistic norm. In chapter 18 of the same book David M. Eddy stressed that medical doctors do not follow Bayes formula when solving the following task: The probability that a woman at age 40 has (B) is 1%. (P(B) = prevalence = 1%) According to the literature, the probability that the disease is detected by a mammography (M) is 80%. (P(M+ B) = sensitivity = 80%) The probability that the test misdetects the disease although the patient does not have it is 9.6%. (P(M+ B) = 1 - specificity = 9.6%) If a woman at age 40 is tested as positive, what is the probability that she indeed has breast cancer (P(B M+)? Bayes formula yields the following result: P(B M+) = P(M+ B)?p(B) P(M+ B)?P(B) + P(M+ B)?P( B) = 80%?1% 80%?1% + 9.6%?99% = Thus, the probability of is only 7.8%, while Eddy reports that 95 out of 100 doctors estimated this probability to be between 70% and 80%. Gigerenzer and Hoffrage (1995) focused on another aspect of the problem: the representation of uncertainty. In Eddy s task, quantitative information was given in probabilities. Gigerenzer and Hoffrage presented Eddy s problem to medical doctors replacing probabilities with a different representation of uncertainty, namely natural frequencies. In their formulation the task was: 100 out of every at age 40 who participate in routine screening have breast cancer. 80 of every 100 women with will get a positive mammography. 950 out of every 9900 women without will also get a positive mammography. Here is a new representative sample of women at age forty who get a positive mammography in routine screening. How many of these women do you expect to actually have? Now nearly half (46%) of all doctors gave the Bayesian answer: 80 out of 1030 (7.8%).

2 Probabilities Natural Frequencies p(b) p(t+ B) p(t+ B) =.01 =.80 =.096 breast cancer no breast cancer positiv (T+) negativ (T-) positiv (T+) negativ (T-) = p(b T+).01 x x x.096 = p(b T+) Figure 1 What is the crucial property that helps one to find the Bayesian solution? To answer this question, it is helpful to consider a more general case. In real-life situations, decisions are usually based on several cues. A medical doctor, for instance, seldom diagnoses a disease based on a single test. The usual procedure after a mammography is to perform an ultrasound test (U). For an ultrasound test, sensitivity and specificity are usually given in the instructions: P(U+ B) = 95% P(U+ B) = 4% In an empirical study, we presented this information together with P(B), P(M+ B) and P(M+ B) to a group of participants. They were asked: What is the probablity that a woman at age 40 has, given that she has a positive mammography and a positive ultrasound test? When given this probability format, only 12.2% of our participants reached the correct solution ( 2 3 ). D. Massaro (1998) gave an example describing the same situation with frequencies 1 :

3 no M+&U+ M+&U- M+&U+ M+&U- M-&U+ M-&U- M-&U+ M-&U- Figure 2 Massaro writes that in the case of two cues a frequency algorithm will not work and it might not be reasonable to assume that people can maintain exemplars of all possible symptom configurations. However, his statements are not based on experimental evidence, and his frequency configuration is not really equivalent to the probability format because he works with combined sensitivity P(M+ & U+ B) and combined specificity 1-P(M- & U- B). One possible frequency format, which does correspond to our probability format, is 2 : no no M+ 80 M M M- U+ 95 U U U- Figure 3

4 In words: 100 out of every woman at age 40 who participate in routine screening have breast cancer. 80 of every 100 women with will get a positive mammography. 950 out of every 9900 woman without will also get a positive mammography. 95 out of 100 women with cancer will get a positive ultrasound test. 396 out of 9900 women, although they do not have cancer, nevertheless obtain a positive ultrasound test. How many of the women who get a positive mammography and a positive ultrasound test do you expect to actually have? 14.6% of our participants solved this version correctly. Another possibility is to consider the tests sequentially. This is possible because the ultrasound test and the mammography are conditionally independent, i.e. P(U+ B) = P(U+ B & M+). Now we have 3 : 100 no 9900 M+ 80 M M M U+ U- U+ U- U+ U- U+ U- Figure 4 In words: 100 out of every at age 40 who participate in routine screening have breast cancer. 80 of every 100 women with will get a positive mammography. 950 out of every 9900 women without will also get a positive mammography. 76 out of 80 women who had a positive mammography and have cancer also have a positive ultrasound test. 38 out of 950 women who had a positive mammography, although they do not have cancer, also have a positive ultrasound test. How many of the women who get a positive mammography and a positive ultrasound test do you expect to actually have? 53.7% of our participants solved this task correctly.

5 Not all frequencies in the tree were actually used. The next step is to eliminate all frequencies irrelevant to the task. Thus we obtain: 100 no 9900 M M+ U U+ Figure 5 These frequencies, namely those that really foster insight, deserve a special name. We decided to call them Markov frequencies because of the natural analogy with Markov chains. In fact: 1) Our tree consists of two chains which are joined at the root. 2) Each node corresponds to the reference class that determines the next node. As in a Markov chain, the frequency in each node depends only upon its predecessor, not upon previous nodes. Being able to think in chains seems crucial for human insight and fits the modern view that problem solving, unlike perception, is sequential rather than parallel. Markov frequencies are task-oriented, i.e., only information that is relevant for the task appears in the tree. Gigerenzer and Hoffrage (1995) also used a tree (see Figure 1). Their tree contains the information (P(T- B) and P(T- B)), which is not relevant to the question P(B T+) =?. In our chains, the odds of the problem can be read directly from the last two nodes. This is because the tree with Markov frequencies corresponds to the well-known likelihood-combination rule (see, for instance, Spies, 1993): prior odds product of the likelihood ratios = posterior odds The prior odds for are Multiplying this with the likelihood ratio for the mammography 4, we obtain Again multiplying this with the likelihood ratio of the ultrasound test, we finally get By using Markov frequencies, it is not only clear which information should be given to experts, but also which information should be omitted 5. Appropriately deleting useless information is part of the overall computation, as we know from information theory.

6 References Eddy, D. M. (1982). Probabilistic reasoning in clinical medicine: Problems and opportunities. In D. Kahneman, P. Slovic & A. Tversky (Eds.), Judgment under uncertainty: Heuristics and biases (pp ). Cambridge, England: Cambridge University Press. Gigerenzer, G. & Hoffrage, U. (1995). How to improve bayesian reasoning without instruction: Frequency formats. Psychological Review, 102, Kahneman, D. & Tversky, A. (1972). Subjective probability: A judgement of representativeness. Cognitive Psychology, 3, Massaro, D. (1998). Perceiving talking faces (pp ). Boston. MIT Press. Spies, M. (1993). Unsicheres Wissen: Wahrscheinlichkeit, Fuzzy-Logik, neuronale Netze und menschliches Denken (pp.51-54). Heidelberg, Berlin, Oxford: Spektrum Akademischer Verlag. Footnotes 1) To integrate the research on this topic, we borrowed concepts from various sources and explored them in the example. In fact, Gigerenzer and Hoffrage used a sample of (not ) women, Massaro speaks of symptoms instead of tests and we tested our subjects with tuberculosis tasks instead of tasks. 2) Gigerenzer and Hoffrage stressed that only frequencies work that can be sampled naturally. A doctor would get information of this kind when he samples instructions for different tests and translates the information therein into frequencies. 3) A doctor would get information of this kind when he samples patients with respect to their state of illness. 4) The likelihood ratio L(B, M+) is defined by The likelihood ratio L(B, U+) therefore is 95% 4% = P(M + / B) 80%, which is P(M + / B) 9,6% 8.3 5) Because Bayes formula can be used to model inference under uncertainty, it is also a tool in scientific reasoning. Klaus Hasselmann from the Max Planck Institute for Meteorology in Hamburg is presently applying a Bayesian analysis to hypotheses about changes in climate. The Society for Mathematics and Data Analysis in St. Augustin is investigating various methods for

7 estimating credit risks, such as analysis of discriminance, fuzzy-pattern classification, and neural networks with the help of Bayes theorem. The Krebsatlas (almanac of cancer patients) for Germany is being reviewed at the Ludwig Maximilian University in Munich by means of Bayesian methods. The task is to detect and eliminate spurious correlations. Even the Microsoft Office Assistant uses Bayesian procedures. The mathematician Anthony O Hagan elicits on behalf of the Britsh government hydrological conductivity of the rock at Sellafield from experts. He uses their beliefs to determine a prior distribution, with which the appropriateness of the area as a permanent diposal site for nuclear waste can be estimated (Neue Zürcher Zeitung, May 13, 1998, S.39.). Even the most expert systems are based on Bayes formula. A famous example is MUNIN (Muscle and Nerve Inference Network) from Lauritzen and Spiegelhalter (1988), which is used for making diagnoses on the basis of measurements of muscular electrical impulses ( electromyography ). Maybe Markov frequencies can also help to facilitate programming those expert systems. Acknowledgments We thank Valerie Chase, Martin Lages, Donna Alexander and Matthias Licha for helpful comments and Ursula Dohme for running the experiments. (Submission to the 1998 Conference on Model-Based Reasoning in Scientific Discovery )

: What are natural frequencies, and why are they relevant for physicians and patients? It just takes a look at the daily newspaper to remind us that statistics are an important part of our everyday lives:

Datum 15.06.2012 Humanities new methods Challenges for confirmation theory Presentation for The Making of the Humanities III Jan-Willem Romeijn Faculty of Philosophy University of Groningen Interactive

Ignoring base rates People were told that they would be reading descriptions of a group that had 30 engineers and 70 lawyers. People had to judge whether each description was of an engineer or a lawyer.

Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges

CMPSCI 683 Artificial Intelligence Questions & s 1. General Learning Consider the following modification to the restaurant example described in class, which includes missing and partially specified attributes:

STA 371G: Statistics and Modeling Decision Making Under Uncertainty: Probability, Betting Odds and Bayes Theorem Mingyuan Zhou McCombs School of Business The University of Texas at Austin http://mingyuanzhou.github.io/sta371g

The role of probability theory in statistics We collect data so as to provide evidentiary support for answers we give to our many questions about the world (and in our particular case, about the business

In Proceedings of the Eleventh Conference on Biocybernetics and Biomedical Engineering, pages 842-846, Warsaw, Poland, December 2-4, 1999 A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka

Review Bayesianism and Reliability Models and Simulations in Philosophy April 14th, 2014 Last Class: Difference between individual and social epistemology Why simulations are particularly useful for social

Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

psychology and economics lecture 9: biases in statistical reasoning tomasz strzalecki failures of Bayesian updating how people fail to update in a Bayesian way how Bayes law fails to describe how people

University of Michigan Dearborn Graduate Psychology Assessment Program Graduate Clinical Health Psychology Program Goals 1 Psychotherapy Skills Acquisition: To train students in the skills and knowledge

Pooling and Meta-analysis Tony O Hagan Pooling Synthesising prior information from several experts 2 Multiple experts The case of multiple experts is important When elicitation is used to provide expert

BEHAVIORAL OPERATIONS MANAGEMENT: A BLIND SPOT AND A RESEARCH PROGRAM KONSTANTINOS V. KATSIKOPOULOS Max Planck Institute for Human Development GERD GIGERENZER Max Planck Institute for Human Development

Prospect Theory Ayelet Gneezy & Nicholas Epley Word Count: 2,486 Definition Prospect Theory is a psychological account that describes how people make decisions under conditions of uncertainty. These may

FORMULATING THE RESEARCH QUESTION James D. Campbell, PhD Department of Family and Community Medicine University of Missouri Where do questions come from? From patient-centered questions in routine clinical

Course Title: Honors Algebra Course Level: Honors Textbook: Algebra Publisher: McDougall Littell The following is a list of key topics studied in Honors Algebra. Identify and use the properties of operations

Behavioral Interventions Based on the Theory of Planned Behavior Icek Ajzen Brief Description of the Theory of Planned Behavior According to the theory, human behavior is guided by three kinds of considerations:

Chances are you ll learn something new about probability CONFERENCE NOTES First off, let me say that I was very pleased (and honoured) to have such a great turnout for the workshop. I really enjoyed the

Science Practices Standard SP.1: Scientific Questions and Predictions Asking scientific questions that can be tested empirically and structuring these questions in the form of testable predictions SP.1.1

22S:101 Biostatistics: J. Huang 1 Bayes Theorem For two events A and B, if we know the conditional probability P (B A) and the probability P (A), then the Bayes theorem tells that we can compute the conditional

Bayesian Tutorial (Sheet Updated 20 March) Practice Questions (for discussing in Class) Week starting 21 March 2016 1. What is the probability that the total of two dice will be greater than 8, given that

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

191 The probability distribution on a single variable must sum to 1 It is also true that any joint probability distribution on any set of variables must sum to 1 Recall that any proposition a is equivalent

cis20.2 design and implementation of software applications 2 spring 2010 lecture # IV.1 introduction to intelligent systems AI is the science of making machines do things that would require intelligence

The result of the bayesian analysis is the probability distribution of every possible hypothesis H, given one real data set D. This prestatistical approach to our problem was the standard approach of Laplace

Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the

Chapter 4 PRINCIPLE OF MATHEMATICAL INDUCTION Analysis and natural philosophy owe their most important discoveries to this fruitful means, which is called induction Newton was indebted to it for his theorem

Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

Mathematical Philosophy LMU Munich Hannes Leitgeb September 2010 Mathematical Philosophy the application of logical and mathematical methods in philosophy is experiencing a tremendous boom in various areas

Statistics in Geophysics: Introduction and Steffen Unkel Department of Statistics Ludwig-Maximilians-University Munich, Germany Winter Term 2013/14 1/32 What is Statistics? Introduction Statistics is the

Study Manual Probabilistic Reasoning 2015 2016 Silja Renooij August 2015 General information This study manual was designed to help guide your self studies. As such, it does not include material that is

General Information on Mammography and Breast Cancer Screening General Information on Mammography and Breast Cancer Screening You have made an appointment with your doctor for a mammogram. If this is your

Eliciting beliefs to inform parameters within a decision analytic model Laura Bojke Centre for Health Economics University of York Uncertainty in decision modelling Uncertainty is pervasive in any assessment

APPENDIX Revised Bloom s Taxonomy In 1956, Benjamin Bloom and his colleagues published the Taxonomy of Educational Objectives: The Classification of Educational Goals, a groundbreaking book that classified

Kapitel 1 Multiplication of Long Integers (Faster than Long Multiplication) Arno Eigenwillig und Kurt Mehlhorn An algorithm for multiplication of integers is taught already in primary school: To multiply

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior

8 Resampling: The New Statistics INTRODUCTION Uses of Probability and Statistics Introduction What Kinds of Problems Shall We Solve? Probabilities and Decisions Whether to Buy More Trucks Types of Statistics

THE PRIZE IN ECONOMIC SCIENCES 2012 INFORMATION FOR THE PUBLIC Stable matching: Theory, evidence, and practical design This year s Prize to Lloyd Shapley and Alvin Roth extends from abstract theory developed