Professional Documents
Culture Documents
Sample dependence
Limitation to the specific test situation.
Dependence on the parallel forms
Same error variance for all
Sample Dependence
The first shortcoming of CTS is that the values of
commonly used item statistics in test development
such as item difficulty and item discrimination
depend on the particular examinee samples in
which they are obtained. The average level of
ability and the range of ability scores in an
examinee sample influence, often substantially, the
values of the item statistics.
Difficulty level changes with the level of samples
ability and discrimination index is different
between heterogeneous sample and the
homogeneous sample.
Assumptions of IRT
Unidimensionality
Local independence
Unidimensionality Assumption
It is possible to estimate an examinee's ability on
the same ability scale from any subset of items in
the domain of items that have been fitted to the
model. The domain of items needs to be
homogeneous in the sense of measuring a single
ability: If the domain of items is too heterogenous,
the ability estimates will have little meaning.
Most of the IRT models that are currently being
applied make the specific assumption that the
items in a test measure a single, or unidimensional
ability or trait, and that the items form a
unidimensional scale of measurement.
Local Independence
This assumption states that an examinee's
responses to different items in a test are
statistically independent. For this
assumption to be true, an examinee's
performance on one item must not affect,
either for better or for worse, his or her
responses on any other items in the test.
ICC
Probability
Ability Scale
pseudo-chance parameter
c: p=0.20 for two items
difficulty parameter b:
halfway
between the
pseudo-chance parameter
and one
discrimination parameter
a: proportional to the slop
of the ICC at the point of
the difficulty parameter
The steeper the slope, the
greater the discrimination
parameter.
Ability Score
1. The test developer collects a set of observed
item responses from a relatively large number of
test takers.
2. After an initial examination of how well
various models fit the data, an IRT model is
selected.
3. Through an iterative procedure, parameter
estimates are assigned to items and ability scores
to individuals, so as to maximize the agreement, or
fit between the particular IRT model and the test
data.
Ability Score
Item Bank
If there is a need for regular test administration and
analysis, the construction of item bank may be
taken into consideration.
Item bank is not a simple collection of test items
that is organized in their raw form, but with
parameters assigned on the basis of CTS or IRT
models.
Item bank should also have a data processing
system that assures the steady quality of the data in
the bank (describing, classifying, accepting, and
rejecting items)
Form of items
Type of item parts
Describing data
Classifying data
Form of Items
Dichotomous
Listening comprehension
Statement + question + choices
Short conversation +question + choices
Long conversation / passage + some questions + choices
Reading comprehension
Passage + some questions + choices
Passage + T/F questions
Syntactic knowledge / vocabulary
Question stem with blank/underlined parts + choices
Cloze
Passage + choices
Form of Items
Nondichotomous
Listening comprehension
Dictation
Dictation passage with blanks to be filled
Describing data
Ability measured
Difficulty index
Discrimination
Storage code