Same-different and A-not A tests with sensr. Same-Different and the Degree-of-Difference tests. Outline. Christine Borgen Linander

Similar documents
Exersices and solutions ANOVA tests for d-primes in sensr

Module 02 Lecture - 10 Inferential Statistics Single Sample Tests

Family Studies Center Methods Workshop

INTRODUCTION TO HYPOTHESIS TESTING. Unit 4A - Statistical Inference Part 1

Grade 6 Math Connects Suggested Course Outline for Schooling at Home

Introduction to Statistical Hypothesis Testing Prof. Arun K Tangirala Department of Chemical Engineering Indian Institute of Technology, Madras

MITOCW watch?v=4hrhg4euimo

Scientific errors should be controlled, not prevented. Daniel Eindhoven University of Technology

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21

It is One Tailed F-test since the variance of treatment is expected to be large if the null hypothesis is rejected.

Grade 7 Math Connects Suggested Course Outline for Schooling at Home 132 lessons

CS485/685 Lecture 5: Jan 19, 2016

Introductory Statistics Day 25. Paired Means Test

ECE 5424: Introduction to Machine Learning

Deconstructing Data Science

Marcello Pagano [JOTTER WEEK 5 SAMPLING DISTRIBUTIONS ] Central Limit Theorem, Confidence Intervals and Hypothesis Testing

Particle Sizes and Clumps from Stellar Occultations

Radiomics for Disease Characterization: An Outcome Prediction in Cancer Patients

POLS 205 Political Science as a Social Science. Making Inferences from Samples

Probability Distributions TEACHER NOTES MATH NSPIRED

Social Perception Survey. Do people make prejudices based on appearance/stereotypes? We used photos as a bias to test this.

CHAPTER FIVE SAMPLING DISTRIBUTIONS, STATISTICAL INFERENCE, AND NULL HYPOTHESIS TESTING

Some basic statistical tools. ABDBM Ron Shamir

A Layperson s Guide to Hypothesis Testing By Michael Reames and Gabriel Kemeny ProcessGPS

About Type I and Type II Errors: Examples

Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur

Georgia Quality Core Curriculum

MISSOURI S FRAMEWORK FOR CURRICULAR DEVELOPMENT IN MATH TOPIC I: PROBLEM SOLVING

Introduction to Inference

occasions (2) occasions (5.5) occasions (10) occasions (15.5) occasions (22) occasions (28)

Knowledge, Trade-Offs, and Tracking Truth

Introduction Chapter 1 of Social Statistics

A FIRST COURSE IN PARAMETRIC INFERENCE BY B. K. KALE DOWNLOAD EBOOK : A FIRST COURSE IN PARAMETRIC INFERENCE BY B. K. KALE PDF

SUMMARY COMPARISON of 6 th grade Math texts approved for 2007 local Texas adoption

The World Wide Web and the U.S. Political News Market: Online Appendices

Chapter 20 Testing Hypotheses for Proportions

Report about the Latest Results of Precipitation Verification over Italy

McDougal Littell High School Math Program. correlated to. Oregon Mathematics Grade-Level Standards

The distinctive should of assertability

EXPERIMENTAL PSYCHOLOGY NST PART IB PBS PART 2A TIMETABLE (DRAFT COPY) Course Organiser: Dr GJ Davis (

Brandeis University Maurice and Marilyn Cohen Center for Modern Jewish Studies

NPTEL NPTEL ONINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture-59 Ensemble Methods- Bagging,Committee Machines and Stacking

PSY 202 Sample 2. Question/Prompt: It is logical that others see us differently than we see ourselves, and there is

The following content is provided under a Creative Commons license. Your support

August Parish Life Survey. Saint Benedict Parish Johnstown, Pennsylvania

Computational Learning Theory: Agnostic Learning

NPTEL NPTEL ONLINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture 31

A Path Forward for NDE Reliability

Module - 02 Lecturer - 09 Inferential Statistics - Motivation

Logical (formal) fallacies

Biometrics Prof. Phalguni Gupta Department of Computer Science and Engineering Indian Institute of Technology, Kanpur. Lecture No.

How many imputations do you need? A two stage calculation using a quadratic rule

Who wrote the Letter to the Hebrews? Data mining for detection of text authorship

Experimental Design. Introduction

1. Introduction Formal deductive logic Overview

Belief in the Claim of an Argument Increases Perceived Argument Soundness

In Our Own Words 2000 Research Study

Types of Error Power of a Hypothesis Test. AP Statistics - Chapter 21

ECE 5424: Introduction to Machine Learning

Factors related to students focus on God

and Voting for Evangelicals in Latin America Appendix

ANSWER SHEET FINAL EXAM MATH 111 SPRING 2009 (PRINT ABOVE IN LARGE CAPITALS) CIRCLE LECTURE HOUR 10AM 2PM FIRST NAME: (PRINT ABOVE IN CAPITALS)

Torah Code Cluster Probabilities

TECHNICAL WORKING PARTY ON AUTOMATION AND COMPUTER PROGRAMS. Twenty-Fifth Session Sibiu, Romania, September 3 to 6, 2007

MITOCW watch?v=ogo1gpxsuzu

Now you know what a hypothesis is, and you also know that daddy-long-legs are not poisonous.

Overview of College Board Noncognitive Work Carol Barry

Statistics for Experimentalists Prof. Kannan. A Department of Chemical Engineering Indian Institute of Technology - Madras

Observation and categories. Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/8/2010

Agnostic KWIK learning and efficient approximate reinforcement learning

Bounded Rationality. Gerhard Riener. Department of Economics University of Mannheim. WiSe2014

ECE 5424: Introduction to Machine Learning

Leadership Survey Report for Adventist Review Russian Edition

AN EXPLORATORY SURVEY EXAMINING THE FAMILIARITY WITH AND ATTITUDES TOWARD CRYONIC PRESERVATION. W. Scott Badger, Ph.D. ABSTRACT INTRODUCTION

Final Exam (PRACTICE-2) #2

Studying Religion-Associated Variations in Physicians Clinical Decisions: Theoretical Rationale and Methodological Roadmap

Overview of the ATLAS Fast Tracker (FTK) (daughter of the very successful CDF SVT) July 24, 2008 M. Shochet 1

Christians Say They Do Best At Relationships, Worst In Bible Knowledge

Survey of Pastors. Source of Data in This Report

Content Area Variations of Academic Language

Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur. Lecture No. # 18 Acceptance Sampling

Survey Report New Hope Church: Attitudes and Opinions of the People in the Pews

I'm just curious, even before you got that diagnosis, had you heard of this disability? Was it on your radar or what did you think was going on?

Supplement to: Aksoy, Ozan Motherhood, Sex of the Offspring, and Religious Signaling. Sociological Science 4:

PROBABILITY DISTRIBUTIONSOF THE VERSES, WORDS, AND LETTERS OF THE HOLY QURAN

Outline. The argument from so many arguments. Framework. Royall s case. Ted Poston

Recoding of Jews in the Pew Portrait of Jewish Americans Elizabeth Tighe Raquel Kramer Leonard Saxe Daniel Parmer Ryan Victor July 9, 2014

Identity and Curriculum in Catholic Education

Nigerian University Students Attitudes toward Pentecostalism: Pilot Study Report NPCRC Technical Report #N1102

A Statistical Scientist Meets a Philosopher of Science: A Conversation between Sir David Cox and Deborah Mayo (as recorded, June, 2011)

Young Adult Catholics This report was designed by the Center for Applied Research in the Apostolate (CARA) at Georgetown University for the

January Parish Life Survey. Saint Paul Parish Macomb, Illinois

Summary of Research about Denominational Structure in the North American Division of the Seventh-day Adventist Church

Sociology Exam 1 Answer Key February 18, 2011

Factors Influencing on Peaceful Co-Existence: Christian s Living in Tehran

Review Articles THE LOGIC OF STATISTICAL INFERENCE1

Laboratory Exercise Saratoga Springs Temple Site Locator

Statistical Inference Casella

Fusion Confusion? Comments on Nancy Reid: BFF Four Are we Converging?

Ability, Schooling Inputs and Earnings: Evidence from the NELS

Transcription:

Same-different and -not tests with sensr Christine Borgen Linander DTU Compute Section for Statistics Technical University of Denmark chjo@dtu.dk huge thank to a former colleague of mine Rune H B Christensen. ugust 20th 2015 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 1 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 2 / 23 Outline Outline Same-Different and the Degree-of-Difference tests 1 2 The -not protocol 3 Measures of sensitivity 4 The -not with sureness protocol 2 products 2 confusable stimuli: Chocolate bar (standard) B Chocolate bar with less saturated fat Setting: One pair of samples evaluated at each trial Question: re the samples the same or different? Stimuli: Same stimuli pairs: and BB Different stimuli pairs: B and B Same-Different test: Same Different Degree-of-Difference test: Same 2 3 4 Different Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 3 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 4 / 23

Characteristics of the DOD test Giving answers τ criteria and the decision rule Same-Different: B Intensity B > τ different n unspecified test (like Triangle, Duo-Trio, Tetrad) Only 2 samples compared at each trial Easily understood test (by consumers) (O Mahony and Rousseau, 2002) No prior knowledge of products required (unlike -not ) Response bias (like -not ) τ τ Degree of difference: B Intensity 4 3 2 1 2 3 4 B Intensity τ 1 τ 2 τ 3 B < τ same Rating scale: 1 2 3 4 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 5 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 6 / 23 Thurstonian model for the DOD test Thurstonian distributions: Difference distributions 0.5 0.5 different same 0.4 0.4 τ 3 τ 2 τ 1 τ 1 τ 2 different τ 3, BB 0.3 B 0.3 0.2 0.2 σ 2 = 2 0.1 0.1 B, B Same-different example Examples in R difference and similarity assessments. 0.0 0.0 0 δ Probability of answer in the j th category: P( j Same-pair) = f s (τ ) P( j Different-pair) = f d (τ, δ) Maximum likelihood estimation of parameters: likelihood f s (τ ) + f d (τ, δ) 0 δ Sample Response Same Different Total Same 8 5 13 Different 4 9 13 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 7 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 8 / 23

The -not protocol The -not protocol The -not protocol Example: the -not test Situation: 2 products: and B ( not ) ssessors are familiarized with samples (and sometimes B samples as well) ssessors are served one sample either or B Question: Is the sample an or a not sample? Known as the yes-no method in Signal Detection Theory (Macmillan and Creelman, 2005) Example data: Sample Response Not- Total 26 29 55 Not- 14 41 55 Null hypothesis, H 0 : products are similar lternative hypothesis, H : products are different Problem: There are many tests to choose from. What is the p-value? Can we reject H 0? Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 9 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 10 / 23 The -not protocol The Thurstonian model for the -not test The -not protocol Estimation of d with sensr Density 0.0 0.1 0.2 0.3 0.4 0 θ d' Not Estimation of d with sensr: > library(sensr) > not(26, 55, 14, 55) Call: not(x1 = 26, n1 = 55, x2 = 14, n2 = 55) Results for the -Not test: Estimate Std. Error Lower Upper P-value d-prime 0.591838 0.2492597 0.1032979 1.080378 0.01431559 Psychological continuuum d : Sensory difference θ: Decision threshold Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 11 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 12 / 23

The -not protocol Likelihood confidence intervals for -not tests Similarity testing The -not protocol > (ana <- not(26, 55, 14, 55)) Call: not(x1 = 26, n1 = 55, x2 = 14, n2 = 55) Results for the -Not test: Estimate Std. Error Lower Upper P-value d-prime 0.591838 0.2492597 0.1032979 1.080378 0.01431559 Standard normal based confidence intervals: CI 95% = d ± 1.96se(d ) Improved likelihood based confidence intervals: > confint(ana) 2.5 % 97.5 % threshold -0.4007727 0.2627993 d.prime 0.1063875 1.0842269 im: Prove that products are identical Prove that products are identical Establish similarity within a similarity bound at some α-level How: Interchange the roles of the hypotheses: Example: H 0 : d is larger than 1 H : d is less than 1 Huge practical challenge: How to choose the similarity bound? Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 13 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 14 / 23 The -not protocol Similarity testing with d for -not tests Measures of sensitivity Discrimination measures in equal-variance models Use d for similarity testing: Hypotheses: H 0 : d is larger than 1 H : d is less than 1 p-value = P(Z < (d d 0 )/se(d ) H 0 ): > ## The Wald statistic: > statistic <- (0.592-1)/ 0.249 > ## Compute p-value: > pnorm(statistic, lower.tail=true) [1] 0.05065307 d = (µ 2 µ 1 )/σ is the (relative) distance between normal distributions λ = 2Φ( d /2) is the distribution overlap (0 < λ 1) Sensitivity, S = P(x 1 < x 2 ) = Φ(d / 2) is the probability that a random sample from the low-intensity distribution has a lower intensity than a random sample from the high-intensity distribution overlap 1.0 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.05 0.01 d.prime 0.0 0.25 0.51 0.77 1.05 1.35 1.68 2.07 2.56 3.29 3.92 5.15 UC 0.5 0.60 0.69 0.78 0.85 0.91 0.95 0.98 0.99 1.00 1.00 1.00 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 15 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 16 / 23

Measures of sensitivity The Receiver Operating Characteristic (ROC) curve The -not with sureness protocol Basics of the -not with sureness protocol What is a ROC curve? visual description of the discriminative ability central concept in Signal Detection Theory ( plot of False positive ratio True positive ratio (alt. Hit rate False-alarm Rate) The real number of interest the area under the ROC curve, UC: Examples in R S = UC = Φ(d / 2) nswers are given on a sureness scale with J categories The model assumes J 1 thresholds are adopted by the assessors Multinomial response: several ordered response categories Many parameters: Thresholds, θ j and effect, δ Table: Soup data Reference Not Reference Product Sure Not Sure Guess Guess Not Sure Sure Reference 134 162 66 41 122 222 Test 101 101 51 57 157 653 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 17 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 18 / 23 The -not with sureness protocol Thurstonian model for the -not with sureness protocol The -not with sureness protocol Unequal variances model 0.0 0.1 0.2 0.3 0.4 θ 1 0 θ 2 δ σ θ 3 Psychological continuum Multiple thresholds (θ) parameters. NOT 0.0 0.1 0.2 0.3 0.4 : σ 1 = 1 NOT : σ 2 = 1.3 0 δ σ k Psychological continuum Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 19 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 20 / 23

The -not with sureness protocol n unequal-variance model in practice The -not with sureness protocol Cumulative link models for -not with sureness Reference Test products products N(0, 1) N(δ, σ 22 ) δ θ 1 θ 2 θ 3 θ 4 θ 5 Sensory intensity Reference Not Reference Product Sure Not Sure Guess Guess Not Sure Sure Reference 132 161 65 41 121 219 Test 96 99 50 57 156 650 Table: Discrimination of packet soup (Christensen, Cleaver and Brockhoff, 2011) Christensen showed that the -not protocol (with and without sureness) is a version of a cumulative link model: ( ) θj δ(prod P(S i θ j ) = Φ i ) σ(prod i ) where σ is the ratio of scales (std. dev). This provides (optimal) ML estimates of the parameters, standard errors etc. profile likelihood confidence intervals available with confint. > fm1 <- clm(sureness ~ prod, data=my_data, link="probit") > summary(fm1) ## print d-prime etc. > confint(fm1) ## likelihood confidence interval for d-prime Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 21 / 23 Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 22 / 23 The -not with sureness protocol Discrimination measures in unequal-variance models d = (µ 2 µ 1 )/σ (no change) Distribution overlap: use the overlap function. Sensitivity: S = Φ(d / 1 + σ 2 2 ) where σ 2 is the scale ratio of the high-intensity distribution relative to the low-intensity distribution. ll measureas are equivalent in the equal-variance model, but not so in the unequal-variance model. In the unequal-variance model d can be a poor measure of discrimination. Christine Borgen Linander (DTU) Same-different and -not tests Sensometrics Summer School 23 / 23