It is One Tailed F-test since the variance of treatment is expected to be large if the null hypothesis is rejected.

Similar documents
Factors related to students focus on God

Introduction to Statistical Hypothesis Testing Prof. Arun K Tangirala Department of Chemical Engineering Indian Institute of Technology, Madras

Sociology Exam 1 Answer Key February 18, 2011

Module 02 Lecture - 10 Inferential Statistics Single Sample Tests

Introductory Statistics Day 25. Paired Means Test

Multiple Regression-FORCED-ENTRY HIERARCHICAL MODEL Dennessa Gooden/ Samantha Okegbe COM 631/731 Spring 2018 Data: Film & TV Usage 2015 I. MODEL.

POLS 205 Political Science as a Social Science. Making Inferences from Samples

CHURCH PLANTING SPONSORSHIP: A STATISTICAL ANALYSIS OF SPONSORING A CHURCH PLANT AS A MEANS OF REVITALIZATION OF THE SPONSOR CHURCH

Computational Learning Theory: Agnostic Learning

INTRODUCTION TO HYPOTHESIS TESTING. Unit 4A - Statistical Inference Part 1

Exersices and solutions ANOVA tests for d-primes in sensr

This is certainly a time series. We can see very strong patterns in the correlation matrix. This comes out in this form...

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21

Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur

Experimental Design. Introduction

Near and Dear? Evaluating the Impact of Neighbor Diversity on Inter-Religious Attitudes

Family Studies Center Methods Workshop

Same-different and A-not A tests with sensr. Same-Different and the Degree-of-Difference tests. Outline. Christine Borgen Linander

Netherlands Interdisciplinary Demographic Institute, The Hague, The Netherlands

The following content is provided under a Creative Commons license. Your support

ANSWER SHEET FINAL EXAM MATH 111 SPRING 2009 (PRINT ABOVE IN LARGE CAPITALS) CIRCLE LECTURE HOUR 10AM 2PM FIRST NAME: (PRINT ABOVE IN CAPITALS)

NPTEL NPTEL ONINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture-59 Ensemble Methods- Bagging,Committee Machines and Stacking

CHAPTER FIVE SAMPLING DISTRIBUTIONS, STATISTICAL INFERENCE, AND NULL HYPOTHESIS TESTING

How many imputations do you need? A two stage calculation using a quadratic rule

Module - 02 Lecturer - 09 Inferential Statistics - Motivation

Torah Code Cluster Probabilities

Grade 7 Math Connects Suggested Course Outline for Schooling at Home 132 lessons

Probability Distributions TEACHER NOTES MATH NSPIRED

Appendix 1. Towers Watson Report. UMC Call to Action Vital Congregations Research Project Findings Report for Steering Team

A Layperson s Guide to Hypothesis Testing By Michael Reames and Gabriel Kemeny ProcessGPS

Okay, good afternoon everybody. Hope everyone can hear me. Ronet, can you hear me okay?

Lampiran 1. Daftar Sampel Reksa dana campuran syariah

Identity and Curriculum in Catholic Education

Now you know what a hypothesis is, and you also know that daddy-long-legs are not poisonous.

A Statistical Scientist Meets a Philosopher of Science: A Conversation between Sir David Cox and Deborah Mayo (as recorded, June, 2011)

This is certainly a time series. We can see very strong patterns in the correlation matrix. This comes out in this form...

Brandeis University Maurice and Marilyn Cohen Center for Modern Jewish Studies

ECE 5424: Introduction to Machine Learning

PHILOSOPHIES OF SCIENTIFIC TESTING

The distinctive should of assertability

Grade 6 Math Connects Suggested Course Outline for Schooling at Home

Biometrics Prof. Phalguni Gupta Department of Computer Science and Engineering Indian Institute of Technology, Kanpur. Lecture No.

AMERICAN SECULARISM CULTUR AL CONTOURS OF NONRELIGIOUS BELIEF SYSTEMS. Joseph O. Baker & Buster G. Smith

AN EXPLORATORY SURVEY EXAMINING THE FAMILIARITY WITH AND ATTITUDES TOWARD CRYONIC PRESERVATION. W. Scott Badger, Ph.D. ABSTRACT INTRODUCTION

Types of Error Power of a Hypothesis Test. AP Statistics - Chapter 21

Content Area Variations of Academic Language

Perceptions of Spiritual Formation Among Nontraditional Seminary Students

The World Wide Web and the U.S. Political News Market: Online Appendices

and Voting for Evangelicals in Latin America Appendix

Nigerian University Students Attitudes toward Pentecostalism: Pilot Study Report NPCRC Technical Report #N1102

Religious affiliation, religious milieu, and contraceptive use in Nigeria (extended abstract)

Studying Adaptive Learning Efficacy using Propensity Score Matching

From the Greek Oikos = House Ology = study of

Statistics for Experimentalists Prof. Kannan. A Department of Chemical Engineering Indian Institute of Technology - Madras

MISSOURI S FRAMEWORK FOR CURRICULAR DEVELOPMENT IN MATH TOPIC I: PROBLEM SOLVING

In Our Own Words 2000 Research Study

Project: The Power of a Hypothesis Test

ASSESSMENT OF CUSTOMER SATISFACTION OF SAMSUNG

International Journal of Multidisciplinary Research and Modern Education (IJMRME) Impact Factor: 6.725, ISSN (Online):

Georgia Quality Core Curriculum

Key Words Bhramari Pranayama, State and Trait Anxiety

Survey of Pastors. Source of Data in This Report

Curriculum Guide for Pre-Algebra

Introduction to Inference

Church Leader Survey. Source of Data

IJRIM Volume 6, Issue 6 (June, 2016) (ISSN ) International Journal of Research in IT & Management (IMPACT FACTOR 5.96)

Factors Influencing on Peaceful Co-Existence: Christian s Living in Tehran

Houghton Mifflin MATHEMATICS

ECE 5424: Introduction to Machine Learning

McDougal Littell High School Math Program. correlated to. Oregon Mathematics Grade-Level Standards

Prioritizing Issues in Islamic Economics and Finance

This report is organized in four sections. The first section discusses the sample design. The next

Ability, Schooling Inputs and Earnings: Evidence from the NELS

University of Warwick institutional repository:

University Faculty and Their Knowledge & Acceptance of Biological Evolution

Work Ethic, Social Ethic, no Ethic: Measuring the Economic Values of Modern Christians

About Type I and Type II Errors: Examples

Discussion Notes for Bayesian Reasoning

Marcello Pagano [JOTTER WEEK 5 SAMPLING DISTRIBUTIONS ] Central Limit Theorem, Confidence Intervals and Hypothesis Testing

Radiomics for Disease Characterization: An Outcome Prediction in Cancer Patients

ABSTRACT. Religion and Economic Growth: An Analysis at the City Level. Ran Duan, M.S.Eco. Mentor: Lourenço S. Paz, Ph.D.

Tests of Homogeneity and Independence

Belief in the Claim of an Argument Increases Perceived Argument Soundness

MITOCW watch?v=4hrhg4euimo

Factors related to students spiritual orientations

Evangelicals, the Gospel, and Jewish People

IS GOD JUST A BIG PERSON?: THE INFLUENCE OF RELIGIOUS BACKGROUND ON THE DEVELOPMENT OF GOD CONCEPTS. Melanie A. Nyhof. B.A., St. Olaf College, 1998

Appendix A: Scaling and regression analysis

EMPOWERMENT OF WOMEN IN THE INFORMAL SECTOR IN KERALA: A STUDY WITH SPECIAL REFERENCE TO KUDUMBASHREE PROJECT

Grade 6 correlated to Illinois Learning Standards for Mathematics

RootsWizard User Guide Version 6.3.0

Department of Economics, Faculty of Economics and Political Sciences, Omdurman Islamic University, Sudan Shaqra University, KSA (Secondment)

Working Paper No Two National Surveys of American Jews, : A Comparison of the NJPS and AJIS

SRJIS/Bimonthly/P.N.Srakaew and A.V.Jagtap ( )

T his study aimed to analyze the comparative

Analysis of Heart Rate Variability during Meditative and Non-Meditative State using Analysis Of variance

Laboratory Exercise Saratoga Springs Temple Site Locator

Social Satisfaction towards Islamic Banking in Saudi Arabia: A Survey

BIRTH CONTROL: CHRISTIAN ETHICAL PERSPECTIVE

Transcription:

EXST 7014 Experimental Statistics II, Fall 2018 Lab 10: ANOVA and Post ANOVA Test Due: 31 st October 2018 OBJECTIVES Analysis of variance (ANOVA) is the most commonly used technique for comparing the means of groups of measurement data. There are lots of different experimental designs that can be analyzed with different kinds of ANOVA. For this week s lab, the most basic type of ANOVA, one-way ANOVA will be introduced. In a one-way ANOVA, there is one continuous variable (Response Variable) and one categorical variable (Treatment). Multiple observations of the response variable are made for each level of the treatment. One-way ANOVA corresponds to the completely randomized designed experiment (CRD) with one fixed treatment effect, with its linear model as the following: Y ij = μ + τ i + ε ij = μ i + ε ij ( i = 1, 2,, t; j = 1, 2,, n) Where μ is the overall mean; τi are the treatment level effects, and εij is the random error. μ i is the mean of the i th level of treatment. The null hypothesis test of one-way ANOVA is that the means of the response variable are the same for the different levels of treatment (H 0 : μ 1 = μ 2 = = μ t ); the alternative hypothesis is that they are not all the same. There are three assumptions need to be considered for ANOVA: the treatments are independently sampled; residuals or deviation of observations within groups should be normally distributed (evaluated by residual plot, normality test); and the variance from each level of treatment is the same (i.e. homogenous variance, Bartlett test is preferred method to evaluate the homogeneity of variance). ANOVA usually proceeds with an F-test of the MSTreatment (d.f.= t-1) over MSError (d.f. = t(n-1)). The MSError estimates a variance σ 2 ε, and MSTreatment estimates the same σ 2 ε plus the difference between the levels of treatment (σ 2 ε + nσ 2 τ). So the F-test can be written as the following: F= MSTreatment/MSError = ( σ 2 ε + nσ 2 τ)/ σ 2 ε It is One Tailed F-test since the variance of treatment is expected to be large if the null hypothesis is rejected. Once the null hypothesis of ANOVA is rejected, i.e., the significant difference among the levels of treatment is concluded from F-Test of ANOVA, Post-ANOVA tests are needed to determine how the treatment levels are interrelated. Common Post hoc tests will be introduced in this lab. ANOVA and post-anova tests can be performed by suing PROC GLM that has been used in precious labs to fit the regression models. Actually its form does not look very different from what we have used for the regression model, with only a couple of additional statements and options that will be explained following the SAS codes. In addition, mixed model analysis will be briefly introduced to construct ANOVA and post-anova tests by using PROC MIXED, which has many options not available in the traditional

analysis of variance. LABORATORY INSTRUCTIONS Housekeeping Statements dm 'log; clear; output; clear'; options nodate nocenter pageno = 1 ls=78 ps=53; title1 'EXST7014 lab 10, Name, Section#'; ods rtf file = c:/temp/lab9.rtf ; ods html file = c:/temp/lab9.html ; Data set The word lists are standard audiology tools for assessing hearing. They are calibrated to be equally difficult to perceive. However, the original calibration was performed with normal hearing subjects and no noise background. The experimenter wished to determine whether the lists were still equally difficult to understand in the presence of a noisy background. 24 subjects with normal hearing listened to standard audiology tapes of four standard 50-English-word lists at low volume with a noisy background. They repeated the words and were scored correct or incorrect in their perception of the words. The percent of words heard correctly was recorded and will be used as the response variable. Note that this is actually a randomized block design, which has not been covered. Therefore, we will pretend that the data was collected without control on the subjects and treat it as a CRD. Detailed information and references can be found at http://lib.stat.cmu.edu/dasl/datafiles/hearing.html. The variables in the dataset are: 1. List: ID of the word lists, which has four levels (Treatment): - List 1 - List 2 - List 3 - List 4 2. Score: Percent of the words heard correctly (Response Variable). data hearing; input id list $ score; drop id; cards; 1 List1 28 2 List1 24 3 List1 32 4 List1 30 5 List1 34 6 List1 30 7 List1 36 8 List1 32 9 List1 48 10 List1 32 11 List1 32 12 List1 38 13 List1 32 14 List1 40 15 List1 28

16 List1 48 17 List1 34 18 List1 28 19 List1 40 20 List1 18 21 List1 20 22 List1 26 23 List1 36 24 List1 40 1 List2 20 2 List2 16 3 List2 38 4 List2 20 5 List2 34 6 List2 30 7 List2 30 8 List2 28 9 List2 42 10 List2 36 11 List2 32 12 List2 36 13 List2 28 14 List2 38 15 List2 36 16 List2 28 17 List2 34 18 List2 16 19 List2 34 20 List2 22 21 List2 20 22 List2 30 23 List2 20 24 List2 44 1 List3 24 2 List3 32 3 List3 20 4 List3 14 5 List3 32 6 List3 22 7 List3 20 8 List3 26 9 List3 26 10 List3 38 11 List3 30 12 List3 16 13 List3 36 14 List3 32 15 List3 38 16 List3 14 17 List3 26 18 List3 14 19 List3 38 20 List3 20 21 List3 14 22 List3 18 23 List3 22 24 List3 34

1 List4 26 2 List4 24 3 List4 22 4 List4 18 5 List4 24 6 List4 30 7 List4 22 8 List4 28 9 List4 30 10 List4 16 11 List4 18 12 List4 34 13 List4 32 14 List4 34 15 List4 32 16 List4 18 17 List4 20 18 List4 20 19 List4 40 20 List4 26 21 List4 14 22 List4 14 23 List4 30 24 List4 42 ; proc boxplot data=hearing; plot score*list; PROC BOXPLOT will constructs boxplots for each level of treatment. Analysis of Variance using PROC GLM proc glm data=hearing; class list; model score = list ; means list / lsd tukey hovtest=bartlett; output out=outdata p=pred residual=resid; proc univariate data=outdata normal plot; var resid; proc plot data=outdata; plot resid*pred; ClASS statement tells the GLM procedure to treat the variable as categorical variable. MEANS statement: tests for differences between different pairs of treatment levels. LSD TUKEY Options following the MEANS statement specifies post hoc tests. Other methods such as sheffe, Dunnett s can be specified as needed.

HOVTEST = Bartlett: Bartlett s test is generally considered the best test for homogeneity of variance. This test uses a calculation which follows a Chi-Square distribution. The null hypothesis test is that the model output residuals are distributed homogeneously about the hypothesized mean of zero. It must be used with the MEANS statement in the PROC GLM. In addition to Bartlett, other methods such as Levene and Obrien could be used to test the homogeneity of variance. Mixed Model Analysis using PROC MIXED proc mixed data=hearing; class list; model score=list / ddfm=satterth outp=outdata; repeated / group=list; lsmeans list / adjust=tukey pdiff; proc univariate data=outdata normal plot; var resid; REPEATED /GROUP=List: returns the results of the homogeneity of variance. LSMEAN effect /adjust = <adjust method> pdiff: The LSMEANS statement tests for differences between different pairs of treatment levels, where EFFECT is the treatment effect variable, and the option adjust= specifies adjustment methods. If this options Statement is omitted then SAS will conduct LSD tests by default. Pdiff will provide actual probabilities for each pairwise comparison. LAB ASSIGNMENT 1. Write the linear model to test the hypothesis that there is no treatment effect. Clearly describe each term in the model, and the range of the subscripts. Write the null hypothesis that you are testing. 2. Use both proc glm and proc mixed to test the hypothesis you stated in Question 1. Report your results, including your F-value, p-value, and conclusions. 3. List the assumptions necessary for your analysis and determine whether they have been violated. Include any relevant SAS output in you report. 4. If there is a significant treatment effect, describe which pairs of means are different. Explain which adjustment method you chose.