Fairness in Testing: Introduction Suzanne Lane University of Pittsburgh Member, Management Committee for the JC on Revision of the 1999 Testing Standards.

Similar presentations

Presentation on theme: "Fairness in Testing: Introduction Suzanne Lane University of Pittsburgh Member, Management Committee for the JC on Revision of the 1999 Testing Standards."— Presentation transcript:

1
Fairness in Testing: Introduction Suzanne Lane University of Pittsburgh Member, Management Committee for the JC on Revision of the 1999 Testing Standards

3
Proposed Revision Combine three of the chapters in Part II into a single chapter: Fairness in Testing Chapter 7: Fairness in Testing and Test Use Chapter 9: Testing Individuals of Diverse Linguistic Backgrounds Chapter 10: Testing Individuals with Disabilities Move combined chapter to Part I: Foundational Chapters

4
Why Reorganize the Chapters? Fairness in testing cannot be separated from accessibility Individuals should be able to understand and respond without performance being influenced by construct irrelevant characteristics All examinees that test is intended for should have an unobstructed opportunity to demonstrate their standing on the construct(s) being measured by the assessment

5
Accessibility is Essential for all Members of the Testing Population Accessibility is a fundamental aspect of fairness and is the right of all members of the intended test taking population

6
Draft Fairness Chapter Four sections: Section I: General Views of Fairness Section II: Threats to the Fair and Valid Interpretations of Test scores Section III: Minimizing Construct Irrelevant Components Through the Use of Test Design and Testing Adaptations Section IV: The Standards

7
Four Themes or Clusters 1. Use test design, development administration and scoring procedures that minimize barriers to valid test interpretations for all individuals. 2. Conduct studies to examine the validity of test score inferences for the intended examinee population. 3. Provide appropriate accommodations to remove barriers to the accessibility of the construct measured by the assessment and to the valid interpretation of the assessment scores. 4. Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups.

10
Use test design, development, administration, and scoring procedures that minimize barriers to valid test interpretations for all individuals. Test Design: use strategies to be as inclusive as possible for wide range of individuals Universal Design Administration Clearly delineate construct

11
Use test design, development, administration, and scoring procedures that minimize barriers to valid test interpretations for all individuals. Test Design: linguistic and reading demands consistent with construct Removes construct irrelevant variance Enhances validity of score interpretation; clarifies interpretation of standing on intended construct Even when language is part of construct, demand should be commensurate with needed levels for performance

15
Use test design, development, administration, and scoring procedures that minimize barriers to valid test interpretations for all individuals. Documentation: include aspects of testing process that supports valid score interpretations Specify how construct irrelevant variance was addressed in test design and development Include results of technical studies to examine measurement quality for subgroups Include studies of impact of accommodations and modifications on valid score interpretations

18
Conduct studies to examine the validity of test score inferences for the intended examinee population the reliability and validity of score inferences for individuals from relevant subgroups should be specifically examined

19
Conduct studies to examine the validity of test score inferences for the intended examinee population When differential prediction is an issue, use regression equations computed separately for each group under consideration or an analysis in which group is entered as moderator variable.

20
Conduct studies to examine the validity of test score inferences for the intended examinee population When tests require scoring of constructed responses, evidence of reliability and validity of inferences should be obtained for relevant subgroups.

22
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores Provide test accommodations, when appropriate and feasible, to remove construct irrelevant barriers that otherwise would interfere with an examinees ability to demonstrate their standing on the target construct(s).

23
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores When test accommodations and/or modifications are permitted, test developers and/or test users are responsible for documenting provisions for their use.

24
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores Whoever assigns, administers or documents the use of permissible test accommodations and/or modifications should have sufficient information available to them and sufficient expertise to carry out this role.

25
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores When a test is changed to remove barriers to the construct being measured, empirical evidence of the reliability, validity, and comparability of inferences made from the scores should be obtained and documented.

26
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores When tests are translated to a different language, empirical evidence of the reliability, validity, and comparability of inferences made from the scores from the changed test should be documented.

27
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores A test generally should be administered in the test takers most proficient language for the testing context, unless proficiency in the language of the test is part of the construct that is being measured.

28
Provide Appropriate Accommodations to Remove Barriers to the Accessibility of the Construct Measured by the Assessment and to the Valid Interpretation of Scores When an interpreter is used in testing, the interpreter should be sufficiently fluent in the language and content of the test and the examinee's native language and culture to translate the test instructions and questions, and, where required, to explain the examinees test responses. Procedures for administering a test when an interpreter is used should be standardized.

30
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. Focus of this theme is on the use of test scoresinterpretation and consequences. As with the previous themes, the goal is to apply the general principles to relevant subgroups. ELLs, cultural minorities, immigrants, older individuals

31
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. Test developers and publishers need to provide information supporting claims that a test can be used with examinees from specific subgroups (e.g., individuals from different linguistic or cultural backgrounds, individuals with disabilities).

32
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. Research evidence is necessary to support the comparability of scores, when test scores are disaggregated and reported for subgroups (e.g., gender, ethnicity, age, language proficiency, disability).

33
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. Tests should not be used with subgroups if credible evidence suggests that examinees scores are affected by construct-irrelevant characteristics of the test or of the examinees.

34
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. It is inappropriate to use test scores as the sole indicator of an individuals functioning, competence, attitudes and/or predisposition for the purposes of diagnosis and intervention.

35
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. When alternative and equal measures of a construct exist, group differences (e.g., in mean scores or in percentages of subgroups of examinees passing) should be considered in deciding which test to use.

36
Guard against inappropriate interpretations, use, and/or unintended consequences of test results for individuals or subgroups. When a test is used as an instrument of public policy, test users and policy makers must provide evidence (e.g., reliability, validity, and comparability of scores, likely consequences for individuals from relevant subgroups) in support of the proposed use.