Basic descriptive and item statistics for criterion-referenced tests

James Dean Brown; Thom Hudson

doi:10.1017/CBO9781139524803.006

4 - Basic descriptive and item statistics for criterion-referenced tests

Published online by Cambridge University Press: 05 October 2012

James Dean Brown and

Thom Hudson

Show author details

James Dean Brown: Affiliation:
University of Hawaii, Manoa
Thom Hudson: Affiliation:
University of Hawaii, Manoa

Book contents

Get access

Summary

Introduction

In this chapter, we will cover the basic statistics testers use for describing and revising criterion-referenced tests. Although this is a book about CRT, the chapter will include a fairly detailed discussion of NRT statistics as well. This is done in order to provide a foundation against which to compare the differing orientations utilized by each. We feel this is important in order for the reader to put the kinds of issues each approach attempts to accommodate into perspective. The chapter first discusses assumptions that each approach makes about the distributions of test scores, and presents basic concepts of descriptive statistics. It then turns to details of item analysis. This examination should make clear that NRT item analyses are designed to help achieve a test which distributes examinees across a scale whereas the CRT item analyses are more concerned with finding items that distribute examinees into known or predicted categories according to their knowledge of the domain criteria.

Since tests are made up of units called items, the chapter will examine the types of item-related analyses that are used for the two basic families of tests. For NRTs, the techniques described here for developing, analyzing, selecting, and improving items will include item format analysis, item facility, and item discrimination indices, as well as distractor efficiency analysis. For CRTs, some of the same analyses will often be used plus others: a focus on item quality analysis, an index which compares item performance of masters and non-masters (called the difference index), and three statistics that are based on whether students passed or failed the test, called the B-index, agreement statistic, and item phi (ϕ).

Type: Chapter
Information: Criterion-Referenced Language Testing , pp. 101 - 148

DOI: https://doi.org/10.1017/CBO9781139524803.006 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2002

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

4 - Basic descriptive and item statistics for criterion-referenced tests

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive