Professional Documents
Culture Documents
Scale Construction
Session 6
Prof. Swati Dhir
Step 1
Step 2
Step 3
Step 4
Step 5
Step 6
Step 7
Step 8
Purpose
Step 1
Step 2
Step 3
Theory as an aid
to clarity
Specificity as an
aid to clarity
Step 4
Step 5
Step 6
Step 7
Step 8
Optimize scale
Length
Creating Items
Writing good items for a scale is definitely an art rather than a science
Think creatively about the construct you seek to measure
Make the questions simple, specific and straightforward
Avoid biased language (emotional words, emphasized text)
Multidimensional
LOC
What to include in
a measure
Only people in the military should be allowed to personally own assault rifles.
Step 1
Step 2
Step 3
Step 4
Step 5
Step 6
Step 7
Step 8
Optimize scale
Length
7/10/2015
Creating Items
Cognitive
Component
Affective
Component
Behavioral
A persons behavioral
predisposition to
respond to an attitude
object in a certain way
Step 1
Step 2
Step 3
Step 4
Step 5
Step 6
Step 7
Step 8
Optimize scale
Length
Measurement
The term questionnaire item is used to
denote a single question on a survey,
corresponding to a single column in a
dataset.
Scales typically denote sets of questions
which become mathematical
combinations of survey items.
Step 1
Step 2
Determine
Generate an
clearly What it Item Pool
is you want to
measure
Measurement/Scaling Properties
Assignment
You can assign objects to categories
Step 4
Step 6
Step 7
Consider
Administer
inclusion of
Items to a
Validation items Development
Sample
Step 5
Evaluate the
items
Step 8
Optimize scale
Length
Types of Scales
Nominal Scale
Has Assignment Only (What is Your Gender?)
Ordinal
Has Assignment, Order (Education)
Order (Magnitude)
You can order objects in terms of having more or less of some quality
Distance (Equal Intervals)
The distance between adjacent points on the scale is identical
Step 3
Interval
Has Assignment, Order, Equal Intervals (Temperature)
Ratio
Has Assignment, Order, Equal Intervals, Absolute Zero (Number of Cars,
weight)
7/10/2015
Thurstone
Scaling
Guttman
Scaling
Semantic
Differential
Likert Scale
Extremely
Dissatisfied
Dissatisfied
Somewhat
Dissatisfied
Neither
Somewhat
Satisfied
Satisfied
Extremely
Satisfied
Very
Unlikely
Unlikely
Somewhat
Unlikely
Somewhat
Likely
Likely
Very
Likely
1
Very
Unlikely
2
Unlikely
3
Somewhat
Unlikely
4
Neither
5
Somewhat
Likely
2
Dissatisfied
3
Somewhat
Dissatisfied
4
Neither
5
Somewhat
Satisfied
6
Satisfied
7
Extremely
Satisfied
7
Very
Likely
1
Extremely
Dissatisfied
6
Likely
1
Strongly
Disagree
7
Strongly
Agree
1
Strongly
Disagree
2
Moderately
Disagree
3
Slightly
Disagree
4
Neither Agree
or Disagree
5
Slightly
Agree
6
Moderately
Agree
7
Strongly
Agree
7
Very
Satisfied
7/10/2015
Moderately
Disagree
2
-2
Slightly
Disagree
3
-1
Neither Agree
or Disagree
4
0
Slightly
Agree
5
1
Moderately
Agree
6
2
Strongly
Agree
7
3
Should we have
numbers here?
Comparative question
Compared to your current brand, how would you evaluate
Pepsodent toothpaste?
Comparative questions establish the referent and can be useful if you
need to know how your product compares to a specific competitor or
the customers current brand
Direction of Scale?
Typical direction (lower values, negative connotation on left):
Strongly
Disagree
1
Moderately
Disagree
2
Slightly
Disagree
3
Neither Agree
or Disagree
4
Slightly
Agree
5
Moderately
Agree
6
Strongly
Agree
7
Some scales are not valenced, so must be careful about positioning. For
a semantic differential scale, with amusing positioning:
Unpleasant
-2
-1
Pleasant
Flimsy
Male
-2
-2
-1
-1
0
0
1
1
2
2
Sturdy
Female
Formative
items
Can be combined to
measure the multiple
aspects of a construct,
though not necessary
that respondents answer
each item similarly
Reflective
items
7/10/2015
Timeliness
Pricing
It upsets me to know others on the same flight have paid a lower price for their seat.
JA ticketing personnel are polite.
Staff
Service
The two-item restriction on carry-on luggage is insensitive to the needs of todays passengers.
JA has ample leg-room for me in coach seating.
Travelling Comfort
The things I own say a lot about how well Im doing in life.
I dont pay much attention to the material objects other people own.*
* Reverse coded
I have not been bumped from a JA flight in the last two years.
Reviewed By Experts
Ask panel of expert to rate how relevant they think each item
is to what you intend to measure
Provide the expert the working definition of the construct
Can evaluate the items clarity and conciseness (by rating
relevance as high, moderate or low)
Step 2
Determine
Generate an
clearly What it is Item Pool
you want to
measure
Step 3
Determine the
format for
Measurement
Step 4
Step 5
Step 6
Step 7
Evaluate the
items
Step 8
Optimize scale
Length
Step 1
Step 2
Step 3
Step 4
Step 5
Step 6
Step 7
Step 8
Optimize scale
Length
7/10/2015
Reverse Scoring
Item Scale co relation- an uncorrected itemtotal co relation makes good conceptual
sense , the reality is that the items inclusion
in scale can inflate the co relation coefficient
Step 1
Step 2
Step 3
Step 5
Step 6
Step 7
Step 8
Optimize scale
Length
Step 1
Step 2
Step 3
Step 4
Step 4
Step 5
Step 6
Step 7
Step 8
Optimize scale
Length
Psychological and
Psychometric testing
7/10/2015
Linda Croker
Selected response
Multiple choice
Likert scale
Q-sort
Constructed response
Free response
Fill-in-the-blank
Essay tests
Portfolios
In-basket technique
A. Selected response
Task is to choose between set answers
Multiple
choice or
forced choice Advantage: Ease of scoring &
scoring requires little skill
Disadvantage: may test memory rather
than comprehension
Correct response must be distinct
Distracters should not be obvious or
ambiguous
A. Selected response
Multiple choice or
forced choice
Likert format
A. Selected response
Multiple choice or
forced choice
Likert format
Q-sort
7/10/2015
Strengths
Essay tests
Portfolios
In-basket
technique
Weaknesses
Time consuming to use
Possible subjectivity in
scoring
A.
B.
C.
D.
E.
F.
Define clearly
Generate a pool of potential items
Monitor reading level
Use unitary items
Avoid long items
Break any response set
4. Item analysis
Multiple
choice
distracter
analysis
Item difficulty
measure P
Discrimination
index D
Item total
correlation
Distracters should be
equally attractive
Correct choice should be
based on knowledge
Where knowledge is
lacking, choice should be
random
7/10/2015
Estimation Methods
Method for
Dichotomously
Scored Item
Method for
Polytomously
Scored Item
Grouping
Method
Difficulty Factor
Range 0 -1; Optimal Level is .5
R
P
N
Guided Practice
What is the P for Items 1-3
Example 1
There are 80 high school students attending a
science achievement test, and 61 students pass item
1, 32 students pass item 10. Please calculate the
difficulty for item 1 and 10 separately.
P1= 0.76; P10= 0.4
Student
Raw
score
Item 1
Item 2
Item 3
Item 4
Item 5
10
7/10/2015
Difficulty Factor
X
X max
PU
is the proportion for examinees of upper group who get the item
correct.
PL
is the proportion for examinees of lower group who get the item
correct.
Example 3
There are 371 examinees attending a language test.
Known that 64 examinees of 27% upper extreme
group pass item 5, and 33 examinees of 27%
lower extreme group pass the same item. Please
compute the difficulty of item 5.
Key : 0.49
CP
KP 1
K 1
ANSWER
CP1
KP 1 5 0.5 1
0.38
K 1
5 1
CP2
KP 1 4 0.53 1
0.37
K 1
4 1
10
7/10/2015
Item Discrimination
Item-total
correlation
Discrimination
index D
Discrimination Index D
D= U L
nU
nL
Example 1
Good item
High correlation
People who get item correct have high score on
the test
Poor item
11
7/10/2015
Choice Analysis
Session 8&9
Prof. Swati Dhir
Excel Add-ins
Use the Analysis ToolPak to perform complex
data analysis
If data analysis command is not available
Command: File_Option_Add
Ins_Manage_Select_ Analysis Toolpak (check
box and ok
Research Methodology
Item Generation
Content validation
Adding some criterion related construct
Context of the study
Interitem Analysis
Exploratory Factor Analysis
Construct validity (Convergent and Divergent)
External Validity
Sampling Adequacy
Reliability
Criterion Validity (Predictive and Concurrent)
Content Validity
Rating by experts
80% consensus
Drop the items if it is not consistent
Items may be reworded
Command: Analyze_ Descriptive Statistics_
Cross tabs
Select rater 1 as row and rater 2 as column
Click statistics_ select kappa_ continue
12
7/10/2015
Example
Content Validity
Kappa might be interpreted (Landis & Koch,1977)
Kappa
Data Entry
Files export
Variable view
Missing Values (Analyze_Missing Value)
Descriptive Statistics (DS):
Frequency (Analyze_DS_Frequency
Data cleaning
Interpretation
<0
Poor agreement
0.0 0.20
Slight agreement
0.21 0.40
Fair agreement
0.41 0.60
Moderate agreement
0.61 0.80
Substantial agreement
0.81 1.00
Interitem Analysis
Selection of closely associated items thereby
increasing the reliability of the scale
Mean, Standard Deviation and Intercorrelations
Though, there is no definite cutoff score for
adequate variability
However, SD of 1 represents adequate amount of
variability for usefulness of an item
Any item that correlates at less than 0.40 with all
other items should be dropped
Too high means for particular item_ Outliers
Command: Analyze_Correlate_Bivariate
13
7/10/2015
Eigen Values 1
Scree Test
components,
with the first value reflecting the variance explained by the
strongest component,
Example of a Scree Plot
Limitations
There is no clear definition of what constitutes a
major drop.
Sometimes the data may produce a gradual
decreasing slope with no major break points
The scree test has been found to function reasonably
well in cases where strong PCs are present.
External Validity
Means and medians should not be very
different
Skewness: measure of symmetry or more
precisely the lack of symmetry (<2)
Kurtosis: measure of whether the data are
peaked or flat relative to a normal distribution
(<5)
Command: Analyze_Descritive statistics_
Descriptives_Option_Distribution_Kurtosis
and Skewness
Discriminant Validity
No cross loading
Correlations among factors should be low
Variance Extracted between construct > Correlation
Construct
Sampling Adequacy
Kaiser Meyer Olkin KMO: To check the case to
variable ratio for the analysis
Range= 0-1
Acceptance limit >0.6
14
7/10/2015
Command: Analyze_Scale_Reliability
Analysis_Alpha
15