The Birthday Problem

Similar documents
Introduction to Inference

Module 02 Lecture - 10 Inferential Statistics Single Sample Tests

The following content is provided under a Creative Commons license. Your support

Introductory Statistics Day 25. Paired Means Test

CSSS/SOC/STAT 321 Case-Based Statistics I. Introduction to Probability

INTRODUCTION TO HYPOTHESIS TESTING. Unit 4A - Statistical Inference Part 1

MISSOURI S FRAMEWORK FOR CURRICULAR DEVELOPMENT IN MATH TOPIC I: PROBLEM SOLVING

Discussion Notes for Bayesian Reasoning

Rationalizing Denominators

Torah Code Cluster Probabilities

The end of the world & living in a computer simulation

Chapter 20 Testing Hypotheses for Proportions

MITOCW watch?v=ogo1gpxsuzu

Curriculum Guide for Pre-Algebra

In general, the simplest of argument maps will take the form of something like this:

Math 11 Final Exam Review Part 3 #1

SUMMARY COMPARISON of 6 th grade Math texts approved for 2007 local Texas adoption

The Effect of Religiosity on Class Attendance. Abstract

Grade 6 Math Connects Suggested Course Outline for Schooling at Home

The Fixed Hebrew Calendar

Excel Lesson 3 page 1 April 15

Project: The Power of a Hypothesis Test

Detachment, Probability, and Maximum Likelihood

occasions (2) occasions (5.5) occasions (10) occasions (15.5) occasions (22) occasions (28)

Grade 6 correlated to Illinois Learning Standards for Mathematics

There are various different versions of Newcomb s problem; but an intuitive presentation of the problem is very easy to give.

Module - 02 Lecturer - 09 Inferential Statistics - Motivation

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21

McDougal Littell High School Math Program. correlated to. Oregon Mathematics Grade-Level Standards

Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur

Introduction to Statistical Hypothesis Testing Prof. Arun K Tangirala Department of Chemical Engineering Indian Institute of Technology, Madras

How many imputations do you need? A two stage calculation using a quadratic rule

Decimals_Mid-term Exam Review #4

Ambassador College and Recent Calendar History

Class #14: October 13 Gödel s Platonism

The following content is provided under a Creative Commons license. Your support

Summary of Registration Changes

Junior Soldiers. Consider & Prepare Read: Luke 2:8 20 Matthew 1:18 25 Matthew 2:1 12. Unit 2 : Lesson 9

Friends and strangers

Georgia Quality Core Curriculum

Marcello Pagano [JOTTER WEEK 5 SAMPLING DISTRIBUTIONS ] Central Limit Theorem, Confidence Intervals and Hypothesis Testing

Content Area Variations of Academic Language

Decimals_Mid-term Exam Review #3

Metropolitan Chicago Synod Part-time Ministry Guidelines

November Frank W. Nelte A CALENDAR FOR THE CHURCH OF GOD TODAY

2018 Unit Charter Renewal Guide

Statistics, Politics, and Policy

Probability Distributions TEACHER NOTES MATH NSPIRED

III Knowledge is true belief based on argument. Plato, Theaetetus, 201 c-d Is Justified True Belief Knowledge? Edmund Gettier

6.00 Introduction to Computer Science and Programming, Fall 2008

Please complete the report by March 31

Logicola Truth Evaluation Exercises

DOUBTS AND QUESTIONS ON THE CALCULUS OF PROBABILITIES

Pulling Rabbits from Hats (Conditional Probability), Part I

NEWS AND RECORD / HIGH POINT UNIVERSITY POLL MEMO RELEASE 3/29/2018

I thought I should expand this population approach somewhat: P t = P0e is the equation which describes population growth.

WORKING WITH AND MANAGING TEMPLE INFORMATION IN FAMILY SEARCH, FAMILY TREE

Logic & Proofs. Chapter 3 Content. Sentential Logic Semantics. Contents: Studying this chapter will enable you to:

MATH 1000 PROJECT IDEAS

CHAPTER 17: UNCERTAINTY AND RANDOM: WHEN IS CONCLUSION JUSTIFIED?

Pastoral Research Online

OPENRULES. Tutorial. Determine Patient Therapy. Decision Model. Open Source Business Decision Management System. Release 6.0


Number, Part I of II

HIGH POINT UNIVERSITY POLL MEMO RELEASE 11/29/2017 (UPDATE)

This report is organized in four sections. The first section discusses the sample design. The next

History of Probability and Statistics in the 18th Century. Deirdre Johnson, Jessica Gattoni, Alex Gangi

HAS DAVID HOWDEN VINDICATED RICHARD VON MISES S DEFINITION OF PROBABILITY?

ECE 5424: Introduction to Machine Learning

Grade 7 Math Connects Suggested Course Outline for Schooling at Home 132 lessons

A Stroke of Genius: Striving for Greatness in All You Do

IN THE TIME OF ALGORITHMS. A sermon preached by Galen Guengerich All Souls Unitarian Church, New York City January 13, 2013

RootsWizard User Guide Version 6.3.0

Aquinas Cosmological argument in everyday language

Philosophy 12 Study Guide #4 Ch. 2, Sections IV.iii VI

Keeping Your Kids On God s Side - Natasha Crain

Have complete fun with Online Casino Games!

Studying Adaptive Learning Efficacy using Propensity Score Matching

Houghton Mifflin MATHEMATICS

Running head: PERSONALITY TYPE/PREFERENCE REFLECTION PAPER 1

On the Relationship between Religiosity and Ideology

Math Matters: Why Do I Need To Know This? 1 Logic Understanding the English language

Semantic Entailment and Natural Deduction

Surveying Prof. Bharat Lohani Department of Civil Engineering Indian Institute of Technology, Kanpur. Module - 7 Lecture - 3 Levelling and Contouring

Robert Formaini's illuminating work throws into question a

Why Did Jesus Catch 153 Fish?

NEWS AND RECORD / HIGH POINT UNIVERSITY POLL MEMO RELEASE 3/1/2017

2016 Statement of ROI

Nassau BOCES IDW User Group Meeting January 15, 2013

Rational and Irrational Numbers 2

HIGH POINT UNIVERSITY POLL MEMO RELEASE 2/10/2017 (UPDATE)

Peter Healed a Beggar

ABOUT THE STUDY Study Goals

Take Home Exam #1. PHI 1500: Major Issues in Philosophy Prof. Lauren R. Alpert

This pamphlet was produced by Leadership Ministries.

Practical Intuition: How To Harness The Power Of Your Instinct And Make It Work For You PDF

Near and Dear? Evaluating the Impact of Neighbor Diversity on Inter-Religious Attitudes

FamilySearch / FamilyTree Lesson #7 Submitting Names for Temple Ordinances

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 3

NATIONAL PUBLIC OPINION SURVEY

Transcription:

The Birthday Problem

In 1939, a mathematician named Richard von Mises proposed what we call today the birthday problem. He asked: How many people must be in a room before the probability that two share a birthday is at least 50%? He only considered the month and day, and not the year, of a person s birthday. 2

This question is only interested in whether or not there is a match; if there are 3 or more people with the same birthday, or more than 1 pair of people with the same birthday, that does not matter. 3

Is there a match in our class? Let s check by having each person say their birthday and see if there is a match. Does this particular experiment give us an idea of the probability of having a match? 4

In order to get an idea about this problem, we either need to find a way to calculate the theoretical probability, or gather some experimental data. There is a formula for the theoretical probability (which we will see); however, we will focus on experimental data. 5

The course website has an excel spreadsheet which simulates the problem. However, there is a website which does the same thing, and is a little easier to use. Its address is www.mste.uiuc.edu/reese/birthday/ 6

To start, we make the assumption that each day is just as likely as another to be a birthday. We also ignore leap years. The probability of a given day is a person s birthday is then 1/365, since there are 365 possible birthdays. 7

Ignoring leap years simplifies the problem, and shouldn t affect the probabilities very much. The assumption that each day is just as likely to be a birthday is probably a fair assumption. I do not know how accurate it is. I suspect it does not affect the answer to the birthday problem very much. 8

We will use the webpage to estimate some probabilities. First of all, if you enter a number to represent the number of people in a room, it will randomly select that many birthdays, check to see if there is a match, and calculate an experimental probability. This is one trial. The website can do one trial at a time or 100 trials by one click of a button. 9

If it runs multiple trials, it calculates the experimental probability by taking the number of trials in which there is a birthday match and divides it by the total number of trials. So, an experimental probability of.5 means that half of all the trials had matches. 10

Before starting, what do you think the probability is that two players on an NFL team have the same birthday? NFL teams have aroung 60-70 players. You can find team rosters at nfl.com. 11

Before starting, what do you think the probability is that two players on an NFL team have the same birthday? NFL teams have aroung 60-70 players. You can find team rosters at nfl.com. Two years ago I had a class check each team. They found that there was a birthday match on every team! 12

We will first do some simulations for rooms of 20, 30, and 60 people to get an idea about the answer to the birthday problem. 13

I ran simulations over the weekend to get the data in the following table. This data will not be exactly the same as what we get today; that is the nature of experimental data. However, if the number of trials is large for this problem, then we should get similar data today. 14

The simulations from the website, by running 1000 trials for each number, indicates that the probabilities of a birthday match among a group of m people are approximately as follows. m approximate probability of a match 20 0.41 30 0.68 60 0.99 15

This data indicates that the number needed to have a probability of 50% of a match is somewhere between 20 and 30. 16

By doing some more simulations, we have estimates for the probability of a match among m people in the following table. m approximate probability of a match 22 0.46 23 0.48 24 0.53 25 0.56 17

This indicates that to have a probability of 50% that two people in a room will have the same birthday, the room needs 23 or 24 people. This is far smaller than what most people s intuition would say. 18

What are the theoretical probabilities for the birthday problem?

While the theoretical probability is not so easy to determine, we will see the formula and some ideas about how to find it. First, it turns out to be more direct to calculate, for a group of people, what is the probability that there is NO birthday match. If we find this probability, then subtracting it from 1 gives the probability of a match. 20

If there are 2 people, the probability that the second person doesn t have the same birthday as the first is 364/365. This is because of the 365 possible birthdays, 364 of the days represent not sharing the birthday of the first person. 21

If there are 3 people, and assuming the first two do not share a birthday, then the probability that the third person doesn t share a birthday with the first two is 363/365. This is because of the 365 possible birthdays, 363 are neither the birthday of the first or the second person. 22

If we recall the formula for conditional probability, then the probability that there is not a common birthday among three people is probability of no common birthday = probability of no common birthday among the first two * probability the third doesn t share a birthday given the first two do not share a birthday. 23

This is equal to 364/365 * 363/365. If one continues with this type of reasoning, for m people, the probability that there is not a common birthday is 364/365 * 363/365 * 362/365 *... * (365-m+1)/365 24

If the room has m people, this formula can be written as 365Pm / 365 m The term 365Pm is related to binomial coefficients. If your calculator has an ncr button, then it also has an npr button. By using this formula, we obtain the following theoretical probabilities for a given number of people. 25

n probability of a common birthday 20 41.1% 21 44.4% 22 47.6% 23 50.7% 24 53.8% 25 56.9% 30 70.6% 40 89.1% 50 97% 60 99.4% 100 99.99997% 26

What this shows is that the probability of a match is about 50% for a room of 23 people. It also shows that it is extremely likely that there is a match in a room of 60 people, and that it is practically impossible not to have a match in a room of 100 people. 27