Conditional Probability, Hypothesis Testing, and the Monty Hall Problem

Similar documents
CSSS/SOC/STAT 321 Case-Based Statistics I. Introduction to Probability

INTRODUCTION TO HYPOTHESIS TESTING. Unit 4A - Statistical Inference Part 1

Discussion Notes for Bayesian Reasoning

A Layperson s Guide to Hypothesis Testing By Michael Reames and Gabriel Kemeny ProcessGPS

Sample Cross-Examination Questions That the Prosecutor May Ask

Module - 02 Lecturer - 09 Inferential Statistics - Motivation

Classroom Voting Questions: Statistics

1/17/2018 ECE 313. Probability with Engineering Applications Section B Y. Lu. ECE 313 is quite a bit different from your other engineering courses.

Bounded Rationality. Gerhard Riener. Department of Economics University of Mannheim. WiSe2014

Introduction to Inference

CS485/685 Lecture 5: Jan 19, 2016

NPTEL NPTEL ONINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture-59 Ensemble Methods- Bagging,Committee Machines and Stacking

Introduction to Statistical Hypothesis Testing Prof. Arun K Tangirala Department of Chemical Engineering Indian Institute of Technology, Madras

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21

MLLunsford, Spring Activity: Conditional Probability and The Law of Total Probability

The State s Case. 1. Why did fire investigators believe the cause of the fire wasn t accidental?

Error and the Law Exchanges with Larry Laudan

Grade 7 Math Connects Suggested Course Outline for Schooling at Home 132 lessons

Computational Learning Theory: Agnostic Learning

Types of Error Power of a Hypothesis Test. AP Statistics - Chapter 21

I think, therefore I am. - Rene Descartes

About Type I and Type II Errors: Examples

The Addition Rule. Lecture 44 Section 9.3. Robb T. Koether. Hampden-Sydney College. Mon, Apr 14, 2014

Boxes and envelopes. 1. If the older child is a girl. What is the probability that both children are girls?

Class #14: October 13 Gödel s Platonism

Marcello Pagano [JOTTER WEEK 5 SAMPLING DISTRIBUTIONS ] Central Limit Theorem, Confidence Intervals and Hypothesis Testing

Logical (formal) fallacies

Curriculum Guide for Pre-Algebra

Trial Roles. Attorney Witness Research Assistant Jury Prepare testimony with witnesses Prepare questions for crossexamination

ANSWER SHEET FINAL EXAM MATH 111 SPRING 2009 (PRINT ABOVE IN LARGE CAPITALS) CIRCLE LECTURE HOUR 10AM 2PM FIRST NAME: (PRINT ABOVE IN CAPITALS)

Module 02 Lecture - 10 Inferential Statistics Single Sample Tests

Philosophy of Science. Ross Arnold, Summer 2014 Lakeside institute of Theology

Take Home Exam #1. PHI 1500: Major Issues in Philosophy Prof. Lauren R. Alpert

Betting With Sleeping Beauty

This is the central lesson of Calculated Risks, a fascinating book by Gerd Gigerenzer, a cognitive

I thought I should expand this population approach somewhat: P t = P0e is the equation which describes population growth.

Guide Reading Monster Group Guide Readin

Responses to Respondents RESPONSE #1 Why I Reject Exegetical Conservatism

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 3

1. Introduction Formal deductive logic Overview

THE ROLE OF COHERENCE OF EVIDENCE IN THE NON- DYNAMIC MODEL OF CONFIRMATION TOMOJI SHOGENJI

The following content is provided under a Creative Commons license. Your support

Lies, Damned Lies and Statistics

Prof. Bryan Caplan Econ 812

INSTRUCTION NO. 1 - INTRODUCTION

Family Studies Center Methods Workshop

Probability Foundations for Electrical Engineers Prof. Krishna Jagannathan Department of Electrical Engineering Indian Institute of Technology, Madras

Waiting to Inhale: 3. Mucking the Figs. A sermon preached by James F. McIntire. Text: Luke 13:1-9. March 3, 2013 Lent 3

Appendix: The Logic Behind the Inferential Test

We all generalize about things. That is, we all make broad comments about a group of people or things. We say things like:

PHLA10 Reason and Truth Exercise 1

Middle School Sunday School Lessons by. rfour.org

ECE 5424: Introduction to Machine Learning

McDougal Littell High School Math Program. correlated to. Oregon Mathematics Grade-Level Standards

The St. Petersburg paradox & the two envelope paradox

Asking the Right Questions: A Guide to Critical Thinking M. Neil Browne and Stuart Keeley

Uncommon Priors Require Origin Disputes

Sins of the Epistemic Probabilist Exchanges with Peter Achinstein

MATH 1000 PROJECT IDEAS

Do we have knowledge of the external world?

Shalom Mission Team Application. Contact Information Complete Street Address: City: State: ZIP: Details

POLS 205 Political Science as a Social Science. Making Inferences from Samples

ABC News' Guide to Polls & Public Opinion

Causal fallacies; Causation and experiments. Phil 12: Logic and Decision Making Winter 2010 UC San Diego 2/26/2010

Corporate Team Training Session # 2 June 8 / 10

Firing Squads and Fine-Tuning: Sober on the Design Argument Jonathan Weisberg

Impact Hour. April 10, 2016

Verdicts: Trial D. The Church banned his Dialogue. The book remained banned for the next 200 years.

MITOCW watch?v=4hrhg4euimo

Philosophy Epistemology Topic 5 The Justification of Induction 1. Hume s Skeptical Challenge to Induction

Introduction To Probability Theory By Paul G. Hoel, Sidney C. Port

UNIT 3 MODULE 5 PROBABILITIES INVOLVING NEGATIONS, DISJUNCTIONS, and CONDITIONAL PROBABILITY

Sunday, July 19, Solitude

Other Logics: What Nonclassical Reasoning Is All About Dr. Michael A. Covington Associate Director Artificial Intelligence Center

Jury Service: Is Fulfilling Your Civic Duty a Trial?

Baumann on the Monty Hall problem and single-case probabilities

Against Coherence: Truth, Probability, and Justification. Erik J. Olsson. Oxford: Oxford University Press, Pp. xiii, 232.

MISSOURI S FRAMEWORK FOR CURRICULAR DEVELOPMENT IN MATH TOPIC I: PROBLEM SOLVING

The end of the world & living in a computer simulation

Everything s An Argument. Chapter 1: Everything Is an Argument

Chapter 20 Testing Hypotheses for Proportions

Statistics for Experimentalists Prof. Kannan. A Department of Chemical Engineering Indian Institute of Technology - Madras

This report is organized in four sections. The first section discusses the sample design. The next

Excel Lesson 3 page 1 April 15

Pulling Rabbits from Hats (Conditional Probability), Part I

Talkin' to America. Interview with Doug Friesen - Part 2 August 5th 2009

2014 THE BIBLIOGRAPHIA ISSN: Online First: 21 October 2014

Module 5. Knowledge Representation and Logic (Propositional Logic) Version 2 CSE IIT, Kharagpur

Philosophy 148 Announcements & Such. Inverse Probability and Bayes s Theorem II. Inverse Probability and Bayes s Theorem III

Explanationist Aid for the Theory of Inductive Logic

YouGov April 7-8, US Adults

CHURCH GROWTH UPDATE

Bar Mock Trial Competition 2017/18. Student Role Guide: Barrister England, Wales and Northern Ireland

6. Truth and Possible Worlds

Primer For The Nonmathematically Inclined On Mathematical Evidence In Criminal Cases: People V. Collins And Beyond

Some questions about Adams conditionals

VOIR DIRE AND JURY SELECTION

Boosting. D. Blei Interacting with Data 1 / 15

Statistical Inference Casella

Observation and Categories

Transcription:

Conditional Probability, Hypothesis Testing, and the Monty Hall Problem Ernie Croot August 29, 2008 On more than one occasion I have heard the comment Probability does not exist in the real world, and most recently I heard in the context of plane crashes: If you ask me what the probability that the next plane I get on will crash, I would say probably 1 in 10,000,000; but if you told me that the pilot was drunk, I might say 1 in 1,000,000. The problem here is that there is no the probability by the probability we mean the probability conditioned on the given information, making certain natural assumptions. Thus, as the information and our assumptions change, so will our probability measure. That is the difficulty in applying probability theory to real-world problems: We often have to infer from the data a natural probability measure to use; however, in the purely theoretical world, that measure is usually just given to us. In this short note I will work through two examples in depth to show how conditional probability and Bayes s theorem can be used to solve them: the alcohol testing problem, and the Monty Hall problem. 1 Hypothesis Testing A prosecutor stands and stares at a jury and holds up a slip of paper containing the results of a lab test: The defendant was high on drugs while flying a commercial airliner, putting 150 people s lives at risk. The prosecutor says that the test is 98% accurate; that is, if a high person is given the test, the test responds high 98% of the time, and if a sober person is given the test, the test responds clean 98% of the time. So, the prosecutor concludes that Beyond a reasonable doubt, the defendant is guilty. You must convict. 1

The defense is then allowed to make its case, and it calls a statistician to the stand. The statistician starts by saying, Wait a minute, the test given to the defendant is only supposed to be used to eliminate a suspect as being high, not to determine whether the person was actually high the error rate of 2% is far too high, no pun intended. The jury doesn t seem to be buying it. The only thing in their mind is that 98%, and what is that business about eliminating a suspect as being different from determining whether the person is high, they wonder. The statistician continues by saying that if only 2% of the population uses illegal drugs, if you get a test that says high, then it would be correct only 50% of the time! This seems to thoroughly confuse the jury how did they go all the way from 98% down to 50%? The statistician explains Suppose you have 5000 people, 2% or 100 of whom are illegal users, and so 4900 of whom are clean. If you give these people the test, 2% of those 4900 will result in high test results (that is, false positives) while 98 of the 100 drug users will result in a high test result. In total you will get (0.02)4900 + 98 196 test results that read high ; and, 98 of these will actually be illegal users, and 98 will be innocent. Thus, the test is only 50-50! In the language of conditional probability and Bayes s theorem, P(User Tests High) P(Tests High User)P(User) P(Tests High User)P(User) + P(Tests High Clean)P(Clean) (0.98)(0.02) (0.98)(0.02) + (0.02)(0.98) 1 2. After the rest of the case is presented, the jury adjourns, and they carefully pour over the facts of the case. One of the jurors looks at the numbers and the testimony of the statistician, and becomes flustered that he can t understand it, and he remarks to the foreman I was always good at math in high school. I think we should convict. The foreman rolls his eyes. In the end, the jury does the right thing, and votes Not Guilty. Fortunately for the defendant, the foreman of the jury was an actuary! 2

1.1 Eliminating Suspects Just what did the statistician mean by there being a difference between eliminating a suspect and deciding guilt? Well, of course you know what the difference is, but maybe what you don t know is just how different they can be. Let me explain: If you want to determine how reliable your test is for eliminating a suspect, you want to compute P(Clean Tests Clean), and hope that it is nearly 100%; and if you want your test to be good for deciding if a person is guilty, you want that P(User Tests High) is near 100%. We know that the latter is not true if 2% of the population are users, because in that case we computed P(User Tests High) 0.5. But what about the former probability? Again by Bayes s theorem we have that P(Clean Tests Clean) P(Tests Clean Clean)P(Clean) P(Tests Clean Clean)P(Clean) + P(Tests Clean User)P(User) (0.98)(0.98) (0.98)(0.98) + (0.02)(0.02) 0.9604 0.9608 99.96%. So, the test is extremely good at eliminating suspects, but very poor at deciding whether a person is guilty! 2 The Monty Hall Problem Here I will state a variant of the problem, where the math is easier to analyze: There are three doors. Behind two are goats, and behind one is a prize. You begin by picking door 1 as the door you want the host to open. The host looks at you and says Wait a minute. Let me show you what is behind this door, and you see a goat behind one of doors 2 or 3. He then asks if you want to stay with door 1, or switch to the remaining unopened door. It turns out that your chances of winning if you switch are 2/3, making certain natural assumptions, such as that the prize was chosen at random to be behind doors 1,2,3, each with equal probability; and, that if the prize was behind door 1, then the host picked door 2 or 3 with equal probability, for the one to reveal. 3

A quick way to get to this answer of 2/3 is to realize that if after initially selecting door 1 you shut your ears and eyes, and pay no attention to the host (Monty Hall), your chances of winning with door 1 must be 1/3. So, the chances that the remaining door was the winner must be 1 1/3 2/3. In the next subsections we will give alternative ways to solve the problem. 2.1 Exhausting the Possibilities One way to solve the problem is to write down the sample space S, and reason for the probability of all the elementary elements; and, then, interpret the event you win if you switch as a subset of S. A natural definition for S is {(1, 2), (1, 3), (2, 3), (3, 2)}, where the first coordinates are the doors that the prize is behind, and the second coordinate is the door Monty decides to reveal. Since it is equally likely that the prize is behind doors 1, 2, or 3, we know that P({(1, 2), (1, 3)}) P({(2, 3)}) P({(3, 2)}) 1 3. Also, we assume that P({(1, 2)}) P({(1, 3)}), as Monty is not picky about whether to reveal door 2 or door 3 when the prize is behind door 1. With these assumptions, it follows that P({(1, 2)}) 1 6 P({(1, 3)}) 1 6 P({(2, 3)}) 1 3 P({(3, 2)}) 1 3. Now, the event You win if you switch to door 2 or 3 that Monty did not reveal is E {(2, 3), (3, 2)}, 4

which has probability P(E) P({(2, 3)}) + P({(3, 2)}) 2 3. A common mistake here is to assume that the probabilities of all the events of S are equal, which would give 1/2 for P(E). This problem is a good example of a case where your probability measure on S is not uniform (like it is for fair die or fair coin experiments)! 2.2 Conditional Probability Approach Although I solved the problem in class using Bayes s theorem, there is actually a much better way to do it using the following fact (which itself is a part of the general Bayes s Theorem): Suppose that E 1, E 2,..., E k are k disjoint events whose union is all of S, the sample space. Then, given an event E we have P(E) P(E E 1 )P(E 1 ) + + P(E E k )P(E k ). The way we apply this theorem is as follows: As in the previous subsection, let E be the event that you win if you switch to the door 2 or 3 that Monty didn t show you; and, let E 1 be the event that the prize is behind door 1, E 2 is the event the prize is behind door 2, and E 3 is the event the prize is behind door 3. Clearly, E 1, E 2, E 3 are disjoint and their union is S. We now have P(E) P(E E 1 )P(E 1 ) + P(E E 2 )P(E 2 ) + P(E E 3 )P(E 3 ). Clearly, P(E E 1 ) 0, P(E 1 ) P(E 2 ) P(E 3 ) 1/3, and P(E E 2 ) P(E E 3 ) 1. So, 2.3 Two Variations P(E) 0 + 1 3 + 1 3 2 3. We consider the following two variations of the problem which look almost identical, but are subtlely different (actually, it isn t that subtle if one thinks about it). 5

Problem 1. Given that the host reveals door 3, what is the probability of winning if you switch? Problem 2. Given that door 3 has a goat, what is the probability of winning if you switch? The conditional information in problem 1 is the set A {(1, 3), (2, 3)}, and the conditional information in problem 2 is the set B {(1, 3), (1, 2), (2, 3)}. So, in problem 2 you are given less information (that is, a larger set of possibilities). The solution to problem 1 is thus P(E A) P(E A) P(A) P({(2, 3)} P(A) 1/3 1/6 + 1/3 2 3. The solution to problem 2 is P(E B) P(E B) P(B) 1/3 1/6 + 1/6 + 1/3 1 2. 6