Hypothesis Testing - Relationships

Transcription

1 - Relationships Session 3 AHX43 (28) 1 Lecture Outline Correlational Research. The Correlation Coefficient. An example. Considerations. One and Two-tailed Tests. Errors. Power. for Relationships AHX43 (28) 2 Correlational Research Correlational research is concerned with the relationships between variables. Scattergram: method for gaining an impression of the nature of a relationship between two variables. Whether high scores on one variable go with high (or low) scores on another, or whether there is no observable pattern. AHX43 (28) 3 Construction: Independent variable goes on the horizontal axis (X) and dependent variable on the vertical axis (Y). Determine the range of raw scores and mark them on the axes from the lowest to highest (from the origin). Plot each cases score on the Y axis with their corresponding score on the X axis. AHX43 (28) 4 Some examples Closeness Scattergram of Months Known By Closeness (males) AHX43 (28) Months Known Closeness Scattergram of Months Known By Closeness (females) AHX43 (28) 6 Months Known Research Methods & Design 1

3 The Correlation Coefficient Z score formula for Pearson s r : Z X Z r= N The sum of cross products divided by the number of pairs of scores. Y AHX43 (28) 13 The Correlation Coefficient Underlying Logic: Positive correlation - positive values on one variable correspond with positive values on the other, and negative values on one variable correspond with negative values on the other resulting in a positive sum of cross products. Negative correlation - positive values on one variable correspond with negative values on the other, and visa versa resulting in a negative sum of cross products. Weak or no correlation - mixture of positive and negative values canceling each other out resulting in a near zero sum of cross products. AHX43 (28) 14 The Correlation Coefficient The logic of Pearson s r: Using Z scores converts the two variables onto the same scale. Dividing by the number of cases gives the average of the cross products. Results can range from -1 to = perfect negative linear correlation. = no correlation. +1 = perfect positive linear correlation. The Correlation Coefficient Steps in calculating Pearson s r: 1. Construct a scatter diagram. 2. Determine if curvilinear - if so do not use Pearson s r. 3. Estimate degree and direction of correlation. 4. Compute correlation coefficient.. Check with estimation. AHX43 (28) 1 AHX43 (28) 16 An example... A researcher predicts a strong positive linear relationship between managers stress levels and the number of employees they supervise. In other words, the more employees managers supervise, the greater the managers perceived stress levels. The researcher randomly selects five managers working in a major corporation and obtains information on their stress levels (via a pencil and paper test), and the number of employees they supervise. AHX43 (28) 17 An example Construct a scattergram Employees Supervised Perceived Stress Level Perceived Stress Employees Supervised by Perceived Stress AHX43 (28) 18 Employees Supervised Research Methods & Design 3

5 The branch of statistics that helps you determine whether or not the prediction you made about something occurred by chance, or may actually represent a generalisable observation. A hypothesis test allows us to draw conclusions or make decisions regarding population data from sample data. A hypothesis is a claim or statement about a property of a population A hypothesis test (or test of significance) is a standard procedure for testing a claim or statement about a property of a population. Hypothesis testing does not result in definitive conclusions. We are dealing in probabilities. We either conclude that the results we get are likely (or unlikely) to be due to chance. AHX43 (28) 2 AHX43 (28) 26 The null hypothesis (denoted H ) is a statement that the value of a population parameter (such as proportion or mean) is equal to some claimed value. The alternative hypothesis (denoted H 1 ) is a statement that the value of a population parameter somehow differs from the null hypothesis. The symbolic form must be a >, < or statement. Hypothesis testing is based on the preliminary assumption that the null hypothesis is true. The null hypothesis generally represents a distribution based on no relationship or no difference between groups. Similar to the notion of innocent until proven guilty. AHX43 (28) 27 AHX43 (28) 28 Normal curve: distribution of scores that is characterised by a bell shaped curve in which the probability of a score drops off rapidly from the midpoint to the tails of the distribution. Normal curve tables enable us to determine precise areas under the normal curve. AHX43 (28) 29 AHX43 (28) 3 Research Methods & Design

6 Probability: the expected relative frequency of a particular outcome, or how certain we are that a particular thing will happen. Probability (p) = the number of successful outcomes divided by the number of possible outcomes. Population: the entire set of things of interest. Sample: the subset of the population about which you actually have information. AHX43 (28) 31 AHX43 (28) 32 In conducting inferential research we collect information from a sample to make probabilistic inferences about the parameters of a normally distributed population. A test statistic is a value computed from the sample data, used in making the decision whether or not to reject the null hypothesis. Examples are t, z, F, r etc. The test statistic indicates how far our sample deviates from the assumed population parameter. AHX43 (28) 33 AHX43 (28) 34 Critical region (or rejection region) is an area extremely different enough from the null hypothesis that would allow us to reject it. Significance level (α) is the probability that the test statistic will fall in the critical region when the null hypothesis is actually true. Common values are.1,. (i.e. 1% & %). Significance level α A critical value is any value that separates the critical region from values of the test statistic that would not cause us to reject the null hypothesis. AHX43 (28) 3 Critical region AHX43 (28) 36 Research Methods & Design 6

7 One and Two-tailed Tests One-tailed test: a situation where a researcher predicts a relationship or difference in a specific direction. One and Two-tailed Tests E.g. A researcher predicts that there will be a significant positive relationship between height and weight. AHX43 (28) 37 AHX43 (28) 38 One and Two-tailed Tests E.g. A researcher predicts there will be a significant negative relationship between body fat and fitness. One and Two-tailed Tests Two-tailed test: a situation where a researcher predicts a relationship or difference but does not specify in what direction. AHX43 (28) 39 AHX43 (28) 4 One and Two-tailed Tests E.g. A researcher predicts that a particular drug will have a significant effect on body weight, but is unsure in which direction. Errors in Type 1 error: The mistake of rejecting the null hypothesis when it is actually true. α is the probability of a type 1 error. Researcher concludes that treatment had an effect when it really did not. May be a serious problem. E.g. Stating that a drug is effective in treating cancer and it is really ineffective (can cost time, hope, & $). AHX43 (28) 41 AHX43 (28) 42 Research Methods & Design 7

8 Errors in Errors in Type 2 error: The mistake of failing to reject the null hypothesis when it actually false. The symbol β is used to represent the probability of a type 2 error Decision Outcome if Ho is true Outcome if Ho is false Treatment or difference really exists, but hypothesis test fails to show this. E.g. Test for cancer (telling someone they do not have it when in fact they do). Do not reject Ho Reject Ho Correct decision Type I error Type II error Correct decision AHX43 (28) 43 AHX43 (28) 44 Power Power In relation to Type II error, power is define as 1 - beta. In other words, power is the probability of detecting a true significant difference. for Relationships Assumptions for hypothesis testing using Pearson s r: 1. Both variables should be normally distributed. 2. Each variable should have approximately equal variance. Minor violations are OK. AHX43 (28) 4 AHX43 (28) 46 for Relationships Five Stages in : 1. Reframe the question into a research hypothesis and a null hypothesis about the populations. 2. Determine the characteristics of the comparison distribution. 3. Determine the cutoff sample score on the comparison distribution at which the null hypothesis should be rejected. 4. Determine the score of your sample on the comparison distribution.. Compare the scores obtained in steps 3 and 4 to decide whether or not to reject the null hypothesis. for Relationships 1. Reframe the question into a research hypothesis and a null hypothesis about the populations. Population 1: Managers like those in this study. Population 2: Managers for whom there is no correlation between numbers of employees supervised and stress. Ho: The two populations have the same correlation. H1: Population 1 has a significantly higher correlation than Population 2. AHX43 (28) 47 AHX43 (28) 48 Research Methods & Design 8

9 for Relationships 2. Determine the characteristics of the comparison distribution. The comparison distribution is an r distribution with a mean of. for Relationships 3. Determine the cutoff sample score on the comparison distribution at which the null hypothesis should be rejected. Read the appropriate critical r value from the table. You need to know: The number of pairs of scores (n = ). The level of significance (α =.). Type of hypothesis (directional). In this instance the critical r score is.8 AHX43 (28) 49 AHX43 (28) for Relationships 4. Determine the score of your sample on the comparison distribution. Calculate r. In the example r was calculated to be.6 for Relationships. Compare the scores obtained in steps 3 and 4 to decide whether or not to reject the null hypothesis. The r value calculated (r =.6) was not more extreme than the critical r value found in the table (r =.8) therefore we cannot reject the null hypothesis. In terms of statistical significance, the results of this study remain inconclusive. AHX43 (28) 1 AHX43 (28) 2 for Relationships Our observed r value (.6) did not reach the critical r value (.8) for Relationships Correlation Matrix: A table in which each variable is listed across the top and down the left side, and the calculated r values between each pair of variables are shown inside the table. Table 1 Correlations between the CTAI-2 (cognitive), CTAI-2 (somatic), CSAI-2 (cognitive) CSAI-2 (somatic) scores. population mean µ = Sample r... therefore, we cannot reject the null hypothesis that µ=. Measure r n r n r n r n 1. CTAI-2 (cognitive) CTAI-2 (somatic).4*** SAS (worry).6*** 93.3** SAS (concentration d.).36*** SAS (somatic).28** 93.8*** 93.22* 94.29** 94 * p <.. ** p <.. *** p <.. AHX43 (28) 3 AHX43 (28) 4 Research Methods & Design 9

10 Cromwell, R. L., & Newton, R. A. (24). Relationship Between Balance and Gait Stability in Healthy Older Adults. Journal of Aging and Physical Activity, 24, 11, 9-1. Gait patterns of young adults are characterised by phases of instability that allow for efficient forward progression and lateral shifting of the body's center of mass with each step With aging, adaptations in older adults' walking pattern increase stability and decrease the capacity for moving the body forward. Previous research has revealed that age-related changes in gait create a more stable walking pattern, and measures of balance are related to walking performance. AHX43 (28) AHX43 (28) 6 The purpose of this study was to determine the relationship between walking stability and measures of balance in healthy older adults. Method 17 older and 2 young adults performed the Berg Balance Test (BBT) and walked 1 m. Walking velocity (WV) and cadence were measured, and a gait-stability ratio (GSR) was calculated. Higher GSR indicated that a greater portion of the gait cycle was spent in double-limb support. AHX43 (28) 7 AHX43 (28) 8 AHX43 (28) 9 Conclusions A strong inverse relationship was observed between walking stability and dynamic balance in healthy older adults. Significant relationships were determined for walking velocity and GSR with BBT Item 12 (alternate stepping on a stool) for older adult participants. Results suggest that GSR provides a better indication than walking velocity for assessing balance deficits during walking in older adults. AHX43 (28) 6 Research Methods & Design 1

Chapter 16 Multiple Choice Questions (The answers are provided after the last question.) 1. Which of the following symbols represents a population parameter? a. SD b. σ c. r d. 0 2. If you drew all possible

DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

AMS7: WEEK 8. CLASS 1 Correlation Monday May 18th, 2015 Type of Data and objectives of the analysis Paired sample data (Bivariate data) Determine whether there is an association between two variables This

Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis

Chapter 4: Data & the Nature of Graziano, Raulin. Research Methods, a Process of Inquiry Presented by Dustin Adams Research Variables Variable Any characteristic that can take more than one form or value.

1 Linear Regression 1.1 Simple Linear Regression Model The linear regression model is applied if we want to model a numeric response variable and its dependency on at least one numeric factor variable.

8-2 Basics of Hypothesis Testing Definitions This section presents individual components of a hypothesis test. We should know and understand the following: How to identify the null hypothesis and alternative

Chapter 8 Hypothesis Testing Hypothesis In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test (or test of significance) is a standard procedure for testing

Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

Simple Regression and Correlation Today, we are going to discuss a powerful statistical technique for examining whether or not two variables are related. Specifically, we are going to talk about the ideas

Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to

Scatter Plot, Regression Line, Linear Correlation Coefficient, and Coefficient of Determination What is a Scatter Plot? A Scatter Plot is a plot of ordered pairs (x, y) where the horizontal axis is used

FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

Module 5 Hypotheses Tests: Comparing Two Groups Objective: In medical research, we often compare the outcomes between two groups of patients, namely exposed and unexposed groups. At the completion of this

Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is

Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

Ch. 8 Hypothesis Testing 8.1 Foundations of Hypothesis Testing Definitions In statistics, a hypothesis is a claim about a property of a population. A hypothesis test is a standard procedure for testing

Hypothesis test In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test (or test of significance) is a standard procedure for testing a claim about a property

Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

Hypothesis Testing Summary Hypothesis testing begins with the drawing of a sample and calculating its characteristics (aka, statistics ). A statistical test (a specific form of a hypothesis test) is an

Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

THE LOGIC OF HYPOTHESIS TESTING Hypothesis testing is a statistical procedure that allows researchers to use sample to draw inferences about the population of interest. It is the most commonly used inferential

Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

CHAPTER SEVEN Hypothesis Testing with z Tests NOTE TO INSTRUCTOR This chapter is critical to an understanding of hypothesis testing, which students will use frequently in the coming chapters. Some of the

PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

GCSE Statistics Revision notes Collecting data Sample This is when data is collected from part of the population. There are different methods for sampling Random sampling, Stratified sampling, Systematic

Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

Population and sample Sampling and Hypothesis Testing Allin Cottrell Population : an entire set of objects or units of observation of one sort or another. Sample : subset of a population. Parameter versus

496 STATISTICAL ANALYSIS OF CAUSE AND EFFECT * Use a non-parametric technique. There are statistical methods, called non-parametric methods, that don t make any assumptions about the underlying distribution

This is Dr. Chumney. The focus of this lecture is hypothesis testing both what it is, how hypothesis tests are used, and how to conduct hypothesis tests. 1 In this lecture, we will talk about both theoretical

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

The Analysis of Research Data The design of any project will determine what sort of statistical tests you should perform on your data and how successful the data analysis will be. For example if you decide

Homework #3 is due Friday by 5pm. Homework #4 will be posted to the class website later this week. It will be due Friday, March 7 th, at 5pm. Political Science 15 Lecture 12: Hypothesis Testing Sampling

Lecture #10 Chapter 10 Correlation and Regression The main focus of this chapter is to form inferences based on sample data that come in pairs. Given such paired sample data, we want to determine whether

UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 All inferential statistics have the following in common: Use of some descriptive statistic Use of probability Potential for estimation

PS 511: Advanced Statistics for Psychological and Behavioral Research 1 Both examine linear (straight line) relationships Correlation works with a pair of scores One score on each of two variables ( and

Introduction to Hypothesis Testing A Hypothesis Test for μ Heuristic Hypothesis testing works a lot like our legal system. In the legal system, the accused is innocent until proven guilty. After examining

Introduction to Hypothesis Testing A Hypothesis Test for Heuristic Hypothesis testing works a lot like our legal system. In the legal system, the accused is innocent until proven guilty. After examining

277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies

Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.

Two-sample hypothesis testing, I 9.07 3/09/2004 But first, from last time More on the tradeoff between Type I and Type II errors The null and the alternative: Sampling distribution of the mean, m, given

Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

Chapter 1 Hypothesis Testing Principles of Hypothesis Testing tests for one sample case 1 Statistical Hypotheses They are defined as assertion or conjecture about the parameter or parameters of a population,

Lesson 4 Part 1 Relationships between two numerical variables 1 Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables

STAT E-150 Statistical Methods Multiple Regression Three percent of a man's body is essential fat, which is necessary for a healthy body. However, too much body fat can be dangerous. For men between the

Pearson s Correlation Correlation the degree to which two variables are associated (co-vary). Covariance may be either positive or negative. Its magnitude depends on the units of measurement. Assumes the

Lesson Part Relationships between two numerical variables Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear between two numerical variables Relationship

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) HIGHER CERTIFICATE IN STATISTICS, 2007 Paper II : Statistical Methods Time Allowed: Three Hours

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

Spearman s correlation Introduction Before learning about Spearman s correllation it is important to understand Pearson s correlation which is a statistical measure of the strength of a linear relationship

Five types of statistical analysis General Procedure for Hypothesis Test Descriptive Inferential Differences Associative Predictive What are the characteristics of the respondents? What are the characteristics

WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club

Hypothesis Testing PSY 360 Introduction to Statistics for the Behavioral Sciences Reminder of Inferential Statistics All inferential statistics have the following in common: Use of some descriptive statistic

Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis