Philosophy 148 Announcements & Such. Inverse Probability and Bayes s Theorem II. Inverse Probability and Bayes s Theorem III

Similar documents
Jeffrey, Richard, Subjective Probability: The Real Thing, Cambridge University Press, 2004, 140 pp, $21.99 (pbk), ISBN

This is the central lesson of Calculated Risks, a fascinating book by Gerd Gigerenzer, a cognitive

Philosophy 125 Day 1: Overview

Philosophy 125 Day 21: Overview

THE ROLE OF COHERENCE OF EVIDENCE IN THE NON- DYNAMIC MODEL OF CONFIRMATION TOMOJI SHOGENJI

Retrospective Remarks on Events (Kim, Davidson, Quine) Philosophy 125 Day 20: Overview. The Possible & The Actual I: Intensionality of Modality 2

Philosophy 125 Day 12: Overview

Philosophy 125 Day 4: Overview

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 3

Nominalism III: Austere Nominalism 1. Philosophy 125 Day 7: Overview. Nominalism IV: Austere Nominalism 2

The argument from so many arguments

Overview of Today s Lecture

The end of the world & living in a computer simulation

Molnar on Truthmakers for Negative Truths

Is There Immediate Justification?

Oxford Scholarship Online Abstracts and Keywords

Is Epistemic Probability Pascalian?

Who or what is God?, asks John Hick (Hick 2009). A theist might answer: God is an infinite person, or at least an

Philosophy 125 Day 13: Overview

Quine on the analytic/synthetic distinction

Avicenna, Proof of the Necessary of Existence

LGCS 199DR: Independent Study in Pragmatics

NOTES ON WILLIAMSON: CHAPTER 11 ASSERTION Constitutive Rules

Van Fraassen: Arguments Concerning Scientific Realism

Bayesian Probability

Boghossian & Harman on the analytic theory of the a priori

Reply to Kit Fine. Theodore Sider July 19, 2013

Philosophy Epistemology Topic 5 The Justification of Induction 1. Hume s Skeptical Challenge to Induction

Bayesian Probability

What Should We Believe?

Richard L. W. Clarke, Notes REASONING

1. Introduction. Against GMR: The Incredulous Stare (Lewis 1986: 133 5).

Against Coherence: Truth, Probability, and Justification. Erik J. Olsson. Oxford: Oxford University Press, Pp. xiii, 232.

HIGH CONFIRMATION AND INDUCTIVE VALIDITY

Lecture 1 The Concept of Inductive Probability

PHILOSOPHICAL PROBLEMS & THE ANALYSIS OF LANGUAGE

Subjective Logic: Logic as Rational Belief Dynamics. Richard Johns Department of Philosophy, UBC

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21

Statistical Inference Without Frequentist Justifications

x Philosophic Thoughts: Essays on Logic and Philosophy

Scientific Realism and Empiricism

Justified Inference. Ralph Wedgwood

Introduction: Belief vs Degrees of Belief

PHILOSOPHY 4360/5360 METAPHYSICS. Methods that Metaphysicians Use

Outline. The argument from so many arguments. Framework. Royall s case. Ted Poston

Class #14: October 13 Gödel s Platonism

HANDBOOK. IV. Argument Construction Determine the Ultimate Conclusion Construct the Chain of Reasoning Communicate the Argument 13

There are various different versions of Newcomb s problem; but an intuitive presentation of the problem is very easy to give.

Overview. Is there a priori knowledge? No: Mill, Quine. Is there synthetic a priori knowledge? Yes: faculty of a priori intuition (Rationalism, Kant)

Resemblance Nominalism and counterparts

Detachment, Probability, and Maximum Likelihood

Phil/Ling 375: Meaning and Mind [Handout #10]

1.2. What is said: propositions

Truth and Modality - can they be reconciled?

THE MEANING OF OUGHT. Ralph Wedgwood. What does the word ought mean? Strictly speaking, this is an empirical question, about the

Defending A Dogma: Between Grice, Strawson and Quine

Contradictory Information Can Be Better than Nothing The Example of the Two Firemen

Anaphoric Deflationism: Truth and Reference

5: Preliminaries to the Argument

This handout follows the handout on The nature of the sceptic s challenge. You should read that handout first.

Choosing Rationally and Choosing Correctly *

Faults and Mathematical Disagreement

Constructing the World

Tom Vinci. Dalhousie University

What is a counterexample?

Is it rational to have faith? Looking for new evidence, Good s Theorem, and Risk Aversion. Lara Buchak UC Berkeley

G. H. von Wright Deontic Logic

Logic: inductive. Draft: April 29, Logic is the study of the quality of arguments. An argument consists of a set of premises P1,

RATIONALITY AND SELF-CONFIDENCE Frank Arntzenius, Rutgers University

17. Tying it up: thoughts and intentionality

Bertrand Russell Proper Names, Adjectives and Verbs 1

Logic is the study of the quality of arguments. An argument consists of a set of

Philosophy 5340 Epistemology Topic 4: Skepticism. Part 1: The Scope of Skepticism and Two Main Types of Skeptical Argument

THE SEMANTIC REALISM OF STROUD S RESPONSE TO AUSTIN S ARGUMENT AGAINST SCEPTICISM

Review Tutorial (A Whirlwind Tour of Metaphysics, Epistemology and Philosophy of Religion)

Issue 4, Special Conference Proceedings Published by the Durham University Undergraduate Philosophy Society

POLLOCK ON PROBABILITY IN EPISTEMOLOGY. 1. Some Remarks on Pollock s Critique of Bayesian Epistemology

Grounding and Analyticity. David Chalmers

Phil 1103 Review. Also: Scientific realism vs. anti-realism Can philosophers criticise science?

Understanding Belief Reports. David Braun. In this paper, I defend a well-known theory of belief reports from an important objection.

British Journal for the Philosophy of Science, 62 (2011), doi: /bjps/axr026

The Rightness Error: An Evaluation of Normative Ethics in the Absence of Moral Realism

6. Truth and Possible Worlds

Has Nagel uncovered a form of idealism?

Kant Lecture 4 Review Synthetic a priori knowledge

The SAT Essay: An Argument-Centered Strategy

Reading/Study Guide: Rorty and his Critics. Richard Rorty s Universality and Truth. I. The Political Context: Truth and Democratic Politics (1-4)

THE CONCEPT OF OWNERSHIP by Lars Bergström

PHILOSOPHY 5340 EPISTEMOLOGY

Que sera sera. Robert Stone

HOW TO BE (AND HOW NOT TO BE) A NORMATIVE REALIST:

Is the Existence of the Best Possible World Logically Impossible?

Cory Juhl, Eric Loomis, Analyticity (New York: Routledge, 2010).

From Grounding to Truth-Making: Some Thoughts

Analyticity and reference determiners

A romp through the foothills of logic Session 3

The Philosophy of Education. An Introduction By: VV.AA., Richard BALEY (Ed.) London: Continuum

Evidential Support and Instrumental Rationality

From Necessary Truth to Necessary Existence

Knowledge and Authority

Transcription:

Branden Fitelson Philosophy 148 Lecture 1 Branden Fitelson Philosophy 148 Lecture 2 Philosophy 148 Announcements & Such Administrative Stuff I ll be using a straight grading scale for this course. Here it is: A+ > 97, A (94,97], A- (90,94], B+ (87,90], B (84,87], B- (80,84], C+ (77,80], C (74,77], C- (70,74], D [50,70], F < 50. People did very well on the quiz (µ 93). HW #1 assigned (due 2/28). Today s Agenda Some real world probability examples (and problems with them) Then, starting over from scratch with a guiding analogy: truth-on-i :: probability-in-m truth probability That is, we ll start again (from scratch) by comparing the informal notions of truth and probability, and their analogue formal or analytic notions truth-on-i and probability-in-m. This will give us a bottom-up approach for the rest of the course. Inverse Probability and Bayes s Theorem II Here s a famous example, illustrating the subtlety of Bayes s Theorem: The (unconditional) probability of breast cancer is 1% for a woman at age forty who participates in routine screening. The probability of such a woman having a positive mammogram, given that she has breast cancer, is 80%. The probability of such a woman having a positive mammogram, given that she does not have breast cancer, is 10%. What is the probability that such a woman has breast cancer, given that she has had a positive mammogram in routine screening? We can formalize this, as follows. Let H such a woman (age 40 who participates in routine screening) has breast cancer, and E such a woman has had a positive mammogram in routine screening. Then: Pr(E H) 0.8, Pr(E H) 0.1, and Pr(H) 0.01. Question (like Hacking s O.Q. #5): What is Pr(H E)? What would you guess? Most experts guess a pretty high number (near 0.8, usually). Branden Fitelson Philosophy 148 Lecture 3 If we apply Bayes s Theorem, we get the following answer: Pr(E H) Pr(H) Pr(H E) Pr(E H) Pr(H) + Pr(E H) Pr( H) 0.8 0.01 0.8 0.01 + 0.1 0.99 0.075 We can also use our algebraic technique to compute an answer. E H Pr(s i ) T T a 1 0.008 T F a 2 0.099 F T a 3 0.002 F F a 4 0.891 Pr(E H) Pr(E H) Pr(E & H) Pr(H) Pr(E & H) Pr( H) a 1 a 1 + a 3 0.8 Pr(H) a 1 + a 3 0.01 a 2 1 (a 1 + a 3 ) 0.1 Note: The posterior is about eight times the prior in this case, but since the prior is so low to begin with, the posterior is still pretty low. This mistake is usually called the base rate fallacy. I will return to this example later in the course, and ask whether it really is a mistake to report a large number in this example. Perhaps it is not a mistake. Branden Fitelson Philosophy 148 Lecture 4 Inverse Probability and Bayes s Theorem III Hacking s O.Q. #6: You are a physician. You think it is quite probable (say 90% probable) that one of your patients has strep throat (S). You take some swabs from the throat and send them to the lab for testing. The test is imperfect, with the following likelihoods (Y is + result, N is ): Pr(Y S) 0.7, Pr(Y S) 0.1 You send five successive swabs to the lab, from the same patient. You get the following results, in order: Y, N, Y, N, Y. What is Pr(S Y NY NY )? Hacking: Assume that the 5 test results are conditionally independent, given both S and S, i.e., that S screens-off the 5 tests results. So: Pr(Y NY NY S) 0.7 0.3 0.7 0.3 0.7 0.03087 Pr(Y NY NY S) 0.1 0.9 0.1 0.9 0.1 0.00081 Pr(Y NY NY S) Pr(S) Pr(S Y NY NY ) Pr(Y NY NY S) Pr(S) + Pr(Y NY NY S) Pr( S) 0.03087 0.9 0.03087 0.9 + 0.00081 0.1 0.997

Branden Fitelson Philosophy 148 Lecture 5 General Analysis of Hacking s Odd Question #6 Branden Fitelson Philosophy 148 Lecture 6 An Anecdotal Prelude to Interpretations of Probability If n is the number of Y results, then (5 n) is the number of N results (out of 5 results). Bayes s theorem allows us to calculate Pr(S E n ), where E n is evidence consisting of n Y results and (5 n) N results (any order): Pr(E n S) Pr(S) Pr(E n S) Pr(S) + Pr(E n S) Pr( S) 0.7 n 0.3 5 n 0.9 0.7 n 0.3 5 n 0.9 + 0.1 n 0.9 5 n 0.1 1 0.8 0.6 0.4 0.2 0 0 1 2 3 4 5 After the O.J. trial, Alan Dershowitz remarked that fewer than 1 in 1,000 women who are abused by their mates go on to be killed by them. He said the probability that Nicole Brown Simpson (N.B.S.) was killed by her mate (O.J.) given that he abused her was less than 1 in 1,000. Presumably, this was supposed to have some consequences for people s degrees of confidence (degrees of belief ) in the hypothesis of O.J. s guilt. The debate that ensued provides a nice segué from our discussion of the formal theory of probability calculus to its interpretation(s). Let A be the proposition that N.B.S. is abused by her mate (O.J.), let K be the proposition that N.B.S. is killed by her mate (O.J.), and let Pr( ) be whatever probability function Dershowitz has in mind here, over the salient algebra of propositions. Dershowitz is saying the following: (1) Pr(K A) < 1 1000 Branden Fitelson Philosophy 148 Lecture 7 Shortly after Dershowitz s remark, the statistician I.J. Good wrote a brief response in Nature. Good pointed out that, while Dershowitz s claim may be true, it is not salient to the case at hand, since it ignores evidence. Good argues that what s relevant here is is the probability that she was killed by O.J., given that she was abused by O.J. and that she was killed. After all, we do know that Nicole was killed, and (plausibly) this information should be taken into account in our probabilistic musings. To wit: let K be the proposition that N.B.S was killed (by someone). Using Dershowitz s (1) as a starting point, Good does some ex cathedra back-of-the-envelope calculations, and he comes up with the following: (2) Pr(K A & K ) 1 2 1 1000 This would seem to make it far more probable that O.J. is the killer than Dershowitz s claim would have us believe. Using statistical data about murders committed in 1992, Merz & Caulkins estimated that: (3) Pr(K A & K ) 4 5 Branden Fitelson Philosophy 148 Lecture 8 This would seem to provide us with an even greater estimate of the probability that N.B.S. was killed by O.J. Dershowitz replied to analyses like those of Good and Merz & Caulkins with the following rejoinder:... whenever a woman is murdered, it is highly likely that her husband or her boyfriend is the murderer without regard to whether battery preceded the murder. The key question is how salient a characteristic is the battery as compared with the relationship itself. Without that information, the 80 percent figure [as in Merz & Caulkins estimation] is meaningless. I would expect that a couple of statisticians would have spotted this fallacy. Dershowitz s rejoinder seems to trade on something like the following: (4) Pr(K K ) Pr(K A & K ) [i.e., K, not A, is doing the real work here] Not to be outdone, Merz & Caulkins give the following estimate of the salient probabilities (again, this is based on statistics for 1992): (5) Pr(K K ) 0.29 Pr(K A & K ) 0.8 We could continue this dialectic ad nauseam. I ll stop here. This anecdote raises several key issues about interpretations and applications of Pr.

Branden Fitelson Philosophy 148 Lecture 9 Our discussants seem to be talking about some kind of objective probabilities involving N.B.S. s murder (and murderer) in particular. But, the estimates Merz & Caulkins appeal to involve statistical frequencies of murders in some population. First, are there such things as objective probabilities at all? If so, what are they (are there different kinds?) and what determines them? More specifically, are there objective probabilities of token events, or only frequencies (in populations)? If there are such probabilities of token events, then how (if at all) do they relate to frequencies? Specifically, which population is the right one in which to include the token event (this is known as the reference class problem)? Finally, how are objective probabilities related to degrees of belief. Generally, how are objective and subjective probabilities related? We ll be thinking more about some of these questions in the next unit. But, first, we re going to back-up and start from scratch... Branden Fitelson Philosophy 148 Lecture 10 T-on-I : Truth : : Pr-on-M : Probability (I) In logic (and formal semantics), we have a formal notion called truth-on-an-interpretation (or truth-on-i). This is not truth (simpliciter). It s useful to think about examples now. Here s a very simple example. Consider a 2-atom sentential language L, where the atoms are extra-systematically understood as having the following content: X John is unmarried. Y John is a bachelor. As usual, we can picture all four interpretations of L, as follows: X Y Interpretations T T I 1 T F I 2 F T I 3 F F I 4 Facts about truth-on-i i do not depend on (extra-systematic) content. Branden Fitelson Philosophy 148 Lecture 11 T-on-I : Truth : : Pr-on-M : Probability (II) X Y Interpretations T T I 1 T F I 2 F T I 3 F F I 4 Specifically, we have the following facts about truth-on-i i : X is T-on-I 1 and T-on-I 2, but X is F-on-I 3 and F-on-I 4. Y is T-on-I 1 and T-on-I 3, but X is F-on-I 2 and F-on-I 4. Indeed, all facts about truth-on-i i are determined for all sentences p of L just by our conventions about truth-tables for truth-functional logic. In this sense, truth-on-i i does not depend on the extra-sysetmatic content of the sentences of L. But, the truth (simpliciter) of sentences does. In this sense, truth is external to logic (and to formal semantics). OK, so then what is truth, and how is it related to truth-on-i? Branden Fitelson Philosophy 148 Lecture 12 T-on-I : Truth : : Pr-on-M : Probability (III) For each interpretation I i of L, there is a corresponding state-description s i of L. As a result, p is T-on-I i is synonymous with s i p. What this reveals is that truth-on-i is a systematic logical concept. On the other hand, truth is an extra-systematic concept. In our example, Y extra-systematially entails (or conceptually necessitates) X [Y X], since it is a conceptual truth that all bachelors are unmarried. This allows us to extra-systematically rule-out the truth of the third state description s 3 of L. That is, s 3 cannot be true, despite the fact that Y X. This is a very strong sense of ruling-out an interpretation. There are also two weaker senses of ruling out that can obtain: Although Y X, Y conceptually probabilifies X. Y epistemically (but not conceptually) probabilifies X. Next: examples of each of these other two grades of ruling-out.

Branden Fitelson Philosophy 148 Lecture 13 T-on-I : Truth : : Pr-on-M : Probability (IV) Consider the following example, again involving two sentences X and Y. X The coin will land heads when it is tossed. Y The coin is heavily biased in favor of heads. Here, we have neither Y X nor Y X. But, I 3 still seems somehow inappropriate. If the coin was 2-headed, then we would have Y X. There is some sort of extra-systematic conceptual probability relation between Y and X. But, X and Y are not conceptually inconsistent here. The natural thing to do here is to try to represent this as some sort of probabilistic extra-systematic constraint. But, which constraint is it? 1. Y Pr(X) 1. [Y e.-s.-entails that X is highly probable. Meaningful?] 2. Pr(Y X) 1. [The conditional Y X is highly probable.] 3. Pr(X Y ) 1. [The conditional probability of X, given Y, is high.] None of these rules-out the truth of s 3. But, they all place e.-s.-constraints on how probable s 3 is. For instance, (2) forces Pr(s 3 ) to be low (why?). Branden Fitelson Philosophy 148 Lecture 14 T-on-I : Truth : : Pr-on-M : Probability (V) Initially, we have only systematic constraints. Specifically, we have no systematic logical relations between atomic sentences, and the only systematic probabilistic constraints are a i [0, 1] and i a i 1. E.g.: X Y Interpretations/S.D. s Models (M) T T I 1 / s 1 a 1 [0, 1] T F I 2 / s 2 a 2 [0, 1] F T I 3 / s 3 a 3 [0, 1] F F I 4 / s 4 a 4 1 (a 1 + a 2 + a 3 ) Then, we associate extra-systematic contents with the atoms, e.g.: X John is unmarried. Y John is a bachelor. In this case, we can conceptually rule-out interpretation I 3 on extra-sysetmatic grounds. In other words, s 3 is (necessarily) false. Branden Fitelson Philosophy 148 Lecture 15 That leads to the following extra-systematic revision of our initial STT: X Y Interpretations/S.D. s Models (M) T T I 1 / s 1 a 1 [0, 1] T F I 2 / s 2 a 2 [0, 1] F T I 3 / s 3 0 F F I 4 / s 4 a 4 1 (a 1 + a 2 ) In other cases, we will not be able to rule-out any interpretations. But, we will be able to rule-out certain probability assignments/models. X The coin will land heads when it is tossed. Y The coin is heavily biased in favor of heads. probability models. Finally, there is a third grade of ruling-out... E.g.: In this case, let s assume the right constraint is Pr(X Y ) 1. Then, this will impose the following extra-systematic constraint on our initial STT: a 1 a 1 + a 3 1 This doesn t rule-out any interpretations, but it does rule-out some Branden Fitelson Philosophy 148 Lecture 16 T-on-I : Truth : : Pr-on-M : Probability (VI) Here is another example of a pair of sentences: X The ball is black. Y The ball is either black or white. Some philosophers claim that there is some sense in which we should have Pr(X Y ) 1 2 here as an extra-systematic constraint, of course. But, intuitively, it s a different sort of constraint than the one in our last example. In our last example biased was itself a probabilistic concept. Here, there is no probabilistic extra-systematic content involved. As such, if some extra-systematic probabilistic constraint is called for here, it s not for purely conceptual reasons. I will call this an epistemic extra-systematic constraint (an instance of the Principle of Indifference ). This can be motivated by unpacking Pr(X Y ) as (something like) the degree of confidence one should have in X if Y were all one knew. We ll come back to this epistemic understanding of probabilities shortly.

Branden Fitelson Philosophy 148 Lecture 17 T-on-I : Truth : : Pr-on-M : Probability (VII) We ll come back to the probabilistic issues soon enough. Let s back up first, and think more about (extra-systematic) truth (simpliciter). There are various Theories or Philosophical Explications of truth. I have posted a nice overview by Haack (and the SEP entry by Glanzberg). I will separate the philosophical theories of truth into two categories: Objective Theories of Truth. Correspondence theories. Subjective Theories of Truth. Epistemic theories. Coherence theories. Pragmatic theories. There are also theories that are neutral on the subjective/objective question. For instance, deflationary theories (like the redundancy theory). Branden Fitelson Philosophy 148 Lecture 18 T-on-I : Truth : : Pr-on-M : Probability (VIII) According to correspondence theories of truth, p is true if p corresponds to some truthmaker t p (that is, if there exists a truthmaker t p for p). There are different views on the bearers of truth-values (sentences, propositions, beliefs) and truthmakers (facts, states of affairs). Moreover, there are different views about whether truthmakers must exist in some mind-independent realm. Realists will require that there is a mind-independent realm of truthmakers. Anti-realists will not. Sentence (s): "John loves Mary." expresses Proposition (p): that John loves Mary. corresponds to Truthmaker (t John's (actual) loving of Mary. p ): If p is false, there is no corresponding t p at the bottom of the diagram. Branden Fitelson Philosophy 148 Lecture 19 T-on-I : Truth : : Pr-on-M : Probability (IX) Subjective theories of truth do not involve any sort of correspondence between sentences/propositions/beliefs and some realm of truthmakers. The epistemic theory of truth, for instance, holds that (Alston): The truth of a truth bearer consists not in its relation to some transcendent state of affairs, but in the epistemic virtues the former displays within our thought, experience, and discourse. Truth value is a matter of whether, or the extent to which, a belief is justified, warranted, rational, well grounded, or the like. The coherence theory of truth is a instance of the epistemic theory (where coherence with one s other beliefs is the salient epistemic virtue ). The pragmatic theory of holds that truth is satisfactory to believe. Basically, a belief is true if believing it works for its believer. We will adopt an objective/realist stance toward truth in this course. I find it hard to understand the other conceptions of truth. Explain. Branden Fitelson Philosophy 148 Lecture 20 T-on-I : Truth : : Pr-on-M : Probability (X) Just as we can talk about p being true-on-i i, which is synonymous with s i p, we can also talk about p having probability-r -on-m. And, like truth-on-i i, probability-on-m is a logical/formal concept. That is, once we have specified a probability model M, this logically determines the probability-on-m values of all sentences in L. Moreover, just as the truth-on-i i of sentence p does not imply anything about p s truth (simpliciter), neither does the probability-on-m of p imply anything about p s probability (simpliciter) if there be such a thing. Finally, just as we have different philosophical theories of truth, we will also have different philosophical theories of probability. And, as in the case of truth, there will be objective theories and subjective theories of probability. However, there will be more compelling reasons for going subjective in the probability case than in the truth case. Ultimately, we will be most interested in the assessment of arguments.