Professional Documents
Culture Documents
PAPER
Representing intentions in self and other: studies of autism
and typical development
David Williams1 and Francesca Happé2
1. Institute of Child Health, University College London, UK
2. Social, Genetic and Developmental Psychiatry Research Centre, Kings College London, UK
Abstract
Two experiments were conducted to explore the extent to which individuals with Autism Spectrum Disorder (ASD), as well as
young typically developing (TD) children, are explicitly aware of their own and others’ intentions. In Experiment 1, participants
with ASD were significantly less likely than age- and ability-matched comparison participants to correctly recognize their own
knee-jerk reflex movements as unintentional. Performance on this knee-jerk task was associated with performance on measures
of false belief understanding, independent of age and verbal ability, in both participants with ASD and TD children. In
Experiment 2, participants with ASD were significantly less able than comparison participants to correctly recognize their own
or another person’s mistaken actions as unintended, in a ‘Transparent Intentions’ task (Russell & Hill, 2001; Russell, Hill &
Franco, 2001). Performance on aspects of the Transparent Intentions task was associated with performance on measures of false
belief understanding, independent of age and verbal ability, in both participants with ASD and TD children. This study suggests
that individuals with ASD have a diminished awareness of their own and others’ intentions and that this diminution is associated
with other impairments in theory of mind.
Introduction trying to complete the action but failing (the adult’s hand
would ‘accidentally’ slip off the dumbbell, resulting in the
Actions, unlike accidents, are considered to have two halves failing to separate). Meltzoff found that when
distinctive mental causes, namely an intention to infants in this latter ‘failed attempt’ condition were
perform the action and a belief that such an action is offered the materials they had seen the adult act upon,
possible (e.g. Davidson, 1963). In order to account for they did not simply copy the surface behaviour they had
actions which are not obviously accompanied by any observed (i.e. slipping), but would actually complete the
conscious mental state (e.g. changing gear in a car), action the adult had intended but failed to achieve
Searle (1983) has distinguished between two different (pulling the dumbbell apart). Indeed, infants in the failed
kinds of intention. A ‘prior intention’ is a purely mental- attempt condition pulled the dumbbell apart as
representational state which is conscious and which can frequently as infants in the ‘full demonstration’
occur regardless of whether any action is actually condition, who had observed the adult complete the
performed. An ‘intention-in-action’, by contrast, serves action. This finding suggests that from at least
to guide ongoing action and is often, but not always (as 18 months, infants can accurately distinguish the
in the example of automatically changing car gears), mental states of the actor from the behaviour the actor
explicitly conscious. actually displays. Similarly, Carpenter, Akhtar and
The upshot of this position is that to grasp the Tomasello (1998) showed that 14- to 18-month-olds
distinction between an action and an accident, a person would copy an adult’s action when the action was
may need to recognize the mental states – specifically, the accompanied by the vocalization ‘There!’ (to signal the
intentions – that underlie behaviour. In this context, action had been intentional), but not when the action
several studies of young typically developing (TD) was accompanied by the vocalization ‘Whoops!’ (to
infants have indicated the presence of a rudimentary signal that the action was unintended). In this case,
appreciation of others’ intentions from at least young infants appear to distinguish very similar actions
18 months of age. In a study by Meltzoff (1995; see on the basis of the actor’s underlying goal.
also, Bellagamba & Tomasello, 1999), 18-month-olds saw Although the results of studies employing imitation
an adult either successfully completing an intended paradigms to explore mental state recognition in young
action (e.g. pulling apart two halves of a dumbbell), or infants are not uncontroversial (see Huang, Heyes &
Address for correspondence: David Williams, Department of Language and Communication Science, City University, London ECIV OHB;
e-mail: david.williams.2@city.ac.uk
2009 The Authors. Journal compilation 2010 Blackwell Publishing Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and
350 Main Street, Malden, MA 02148, USA.
308 David Williams and Francesca Happ
Charman, 2002), they do suggest that some aspects of satisfied). On the other hand, the pattern of performance
others’ minds are registered by infants from at least shown by 4-year-olds on this task suggested that they did
18 months of age. However, it is highly debatable not understand their own intentions as differentiated
whether young infants ⁄ children have the same grasp of from their own desires. When there was a discrepancy
intentions as older children do. Several researchers have between their intention and their desire, 4-year-olds
argued that young children, before approximately 4 years tended to perform poorly. For example, unlike 5-year-
of age, do not properly distinguish intentions from olds, if they succeeded in hitting the target of their choice
desires (e.g. Astington, 1993; Feinfield, Lee, Flavell, (i.e. fulfilled their intention) but that target did not
Green & Flavell, 1999; Moses, 1993). Desires and contain a prize (i.e. did not satisfy their desire), 4-year-
intentions are similar in many ways. Both are mental olds denied having intended to hit the target.
states which are either fulfilled or unfulfilled, unlike Further suggesting that young children have difficulty
mental states like beliefs which are either true or false, in explicitly representing their own intentions, Shultz,
and much of the time desires and intentions coincide in Wells and Sarda (1980) found that children below
any particular individual. However, the two mental states approximately 5 years of age tended to incorrectly
are not synonymous, as is evident from the fact that one report their own knee-jerk reflex as an intentional
can desire an outcome that one has no intention of action. Reflexes, unlike deliberate actions, are
bringing about and, conversely, one can intend to do accompanied neither by desires nor intentions. With no
something that one does not desire. Astington (1993, pp. possibility of matching goals and outcomes, one
91–92) highlights the crucial distinction between desires may require a coherent concept of intention,
and intentions as follows, ‘Desires are fulfilled so long as differentiated from desire, in order to recognize a reflex
the outcome is achieved, it doesn’t matter how, but movement as an ‘accident’. Further evidence for a
intentions are carried out only if the intention causes the relationship between understanding reflexes and a
action that achieves the outcome.’ Now, the results meta-representational ToM was provided by Lang and
provided by Meltzoff (1995) and Carpenter et al. (1998), Perner (2002). They found that the performance of TD
among others, do not establish whether infants represent children on an unexpected contents false belief task was
(and therefore copy) the intentions underlying others’ highly correlated with performance on the knee-jerk task,
behaviour, or whether their representation is of an even after the influences of age and verbal ability were
undifferentiated desire-intention state. In other words, controlled.
infants could simply have produced an action which they Using a different methodology, Russell, Hill and
interpreted as having been desired by the adult actor, Franco (2001) found further evidence that young TD
rather than intended as such. children do not possess a coherent concept of intention.
Some researchers (e.g. Perner, 1991) have suggested In a ‘Transparent Intentions’ task, children were asked to
that recognition of intentions as mental states that are complete a drawing of, for instance, a boy with a missing
differentiated from desires emerges later in development ear, on a transparency like those used with overhead
and depends upon a meta-representational theory of projectors. Unknown to the child, a second transparency
mind (ToM). Traditionally, performance on so-called with a different, unfinished drawing (e.g. of a cup with a
false belief tasks is seen as the ‘acid test’ of a meta- missing handle) was laid in precise alignment on top of
representational ToM (Dennett, 1978). Typically the first transparency. Therefore, participants ended up
developing children pass such tasks from approximately unintentionally finishing off the top drawing (of a handle
age 4 to 5 years (Wellman, Cross & Watson, 2001) and, on a cup) rather than the bottom drawing (of an ear on a
importantly, there is evidence that their understanding of boy). When their mistake was revealed, children were
intentional action undergoes a developmental shift in asked both what they had meant to draw and what they
this same period. For example, in line with the arguments had thought they were drawing throughout their action.
of Astington (1993) and others, young TD children have These two types of test question were designed to assess
difficulty representing intentions (independently of participants’ awareness of their own prior intentions and
desires), in self or other, in scenarios where the desire is intentions-in-action, respectively (see below and also
not obvious (e.g. Astington & Lee, 1991; Feinfield et al., Russell et al., 2001, p. 780 for further discussion). Russell
1999) or when the desire conflicts with the intention (e.g. et al. also included an Other-person condition in which
Feinfield et al., 1999; Phillips, Baron-Cohen & Rutter, children observed a glove puppet performing the same
1998). actions and making the same errors.
In the study by Phillips et al. (1998), children Findings from Russell et al.’s (2001) study indicated
attempted to hit a prespecified target in a rigged that 4-year-olds were significantly better than 3-year-olds
shooting game. Some of the targets contained prizes at answering correctly what they had meant to draw. In
while others did not. Phillips et al. found that 5-year-olds contrast, 3-year-olds were as able as 4-year-olds to
were able to report their prior intention to hit a correctly distinguish between what they had thought they
particular target whether or not their intention was were drawing and what they had mistakenly drawn. The
fulfilled and regardless of whether or not the target same pattern also held in the Other-person condition.
yielded a prize (i.e. whether or not their desire was Finally, Russell et al. found that performance on the
Transparent Intentions task (as measured by a composite successful performance), it seems probable that
of performance on the four test questions, across Self and participants need only represent the experimenter’s
Other-person conditions) was not related significantly to desires, rather than intentions, in order to succeed. In
ToM (as measured by a composite of performance across this regard, it is important to highlight how children with
three different ToM tests) in these TD participants. autism appear well able to represent the simple desires of
It is debatable, however, whether the different types of other people despite their poor performance on multiple
Transparent Intentions test question are measuring the other measures of ToM (Baron-Cohen, 1991; Tan &
same underlying knowledge of intentions. Russell et al. Harris, 1991).
(2001) suggest, first (p. 780), that both the ‘Mean’ and Together, these findings appear to support the
‘Think’ question-types assess awareness of intentions- suggestion made above that successful performance on
in-action. In a footnote (p. 780), however, they seem to such imitation tasks does not rely on an awareness of the
suggest that the Mean question refers to a prior mental-representational aspects of intentions. Indeed, in
intention, whereas the Think question refers more to the small number of studies exploring intention
an intention-in-action: ‘one can regard the mean understanding in ASD that have used different
question as referring to a discrete mental event and the paradigms, results tend to indicate impairment in
think question as referring to a continuous experience. affected children. For example, Phillips et al. (1998)
An intention-in-action, in contrast to a prior intention, assessed children with ASD on the previously mentioned
would seem to invite the use of continuous aspect.’ If target-shooting task that they also implemented with
these different forms of question measure different young TD children. Like young TD children,
underlying knowledge, it is questionable why Russell participants with ASD (but not matched comparison
et al. implemented a composite measure of performance participants) performed poorly when there was a
across both types of question for comparison with ToM discrepancy between their intention and their desire.
performance. If only one of the question types reliably For instance, when participants with ASD mistakenly hit
measures knowledge of the mental-representational an unspecified target (i.e. did not fulfil their intention)
aspects of intention, then perhaps answering only these but that target contained a prize (i.e. fulfilled their
questions (in Self and Other conditions) requires ToM, desire), they tended to incorrectly claim that they had
whereas answering the other type of question does not. meant to hit the target. Like young TD children,
therefore, individuals with ASD appear to confuse their
own desires and intentions (although see Russell & Hill,
Understanding of intentions in Autism Spectrum
2001, Experiment 2, for alternative findings).
Disorder
In contrast, Russell and Hill (2001, Experiment 3)
Autism Spectrum Disorder (ASD) is a condition found that, relative to matched comparison participants
diagnosed on the basis of severe impairments in social with moderate learning difficulties (MLD), children with
interaction and communication, and a rigid and ASD were relatively unimpaired on the ‘Transparent
repetitive behavioural repertoire (American Psychiatric Intentions’ task implemented with TD participants by
Association, 2000). On the cognitive level, ASD is Russell et al. (2001). In the Self condition of the task,
characterized by deficits in ToM. Individuals with ASD Russell and Hill found that when the ‘Mean’ question
show delayed (e.g. Happ, 1995) and atypical (e.g. Leslie (‘What did you mean to draw?’) was asked before the
& Thaiss, 1992) performance on false belief tasks, and in ‘Think’ question (‘What did you think you were
everyday life their social and communicative difficulties drawing?’), children with ASD were significantly
suggest that they do not fully grasp the mental states impaired relative to comparison children. Only
underlying people’s behaviour. approximately 50% of participants with ASD correctly
Studies exploring the awareness of intentions by recognized that they had not intended the actual
individuals with ASD have produced mixed results. outcome. However, no other between-group differences
Aldridge, Stone, Sweeney and Bower (2000) found that reached significance, leading Russell and Hill (p. 326) to
young, preverbal children with ASD were unexpectedly conclude that ‘while there was a weak trend in the
competent at completing the ‘failed attempts’ of actors, transparent intentions task for children with autism to
in the behavioural re-enactment paradigm implemented perform at a lower level than children with MLD …
by Meltzoff (1995) and Bellagamba and Tomasello there was certainly nothing in these data to justify the
(1998) with TD infants. This finding led Aldridge et al. term ‘‘autistic deficit’’.’
(p. 294) to claim that ‘preverbal children with There are, however, potential concerns about the data
autism understand the intentions of others’. However, and design of Russell and Hill’s (2001) experiment which
as discussed above, such imitation paradigms provide suggest that the results should be interpreted cautiously.
little information about the content of children’s First, the use of a repeated-measures design, in which the
representations of mental states. Even if one accepts order of (Self ⁄ Other) condition presentation was
that successful performance on such tasks indicates a counterbalanced, resulted in some of Russell and Hill’s
rudimentary awareness of mind (and see Huang et al., participants receiving the Other-person condition of the
2002, for an ecological, non-representational account of task first. Having already seen the crucial experimental
manipulation – that there were two transparencies even in the studies by Russell and Hill (2001) and Russell et al.
though there only appeared to be one – participants who (2001) were eliminated. First, it was predicted that
undertook the Other-person condition first would have participants with ASD would perform significantly less
had some knowledge that they were being ‘tricked’ well than DD participants on each of the four test
throughout their own action in the subsequent Self questions, incorporating both the ‘Mean’ (i.e. ‘What did
condition. This may have altered the nature of the task you mean to draw?’) and ‘Think’ (i.e. ‘What did you think
for these participants. Counterbalancing of conditions you were drawing?’) question types, across both Self and
could well have confounded results, therefore. Other conditions. Second, it was predicted that
A second difficulty with Russell and Hill’s (2001) performance on the Transparent Intentions task would
experiment, which may also have affected the results of be associated significantly with ToM performance,
Russell et al. (2001), was the nature of the stimuli used. independently of CA and VMA, in all participants.
In two out of the four drawing combinations used, the
missing piece on each transparency was very similar.
Hence, in one combination, the participant intended to Experiment 1
draw smoke emerging from the chimney of a house (on
the bottom transparency) and ended up drawing smoke
Method
emerging from the chimney of a boat (on the top
transparency). In another example, the participant
Participants
intended to draw a face on a girl and ended up
drawing a face on a boy. In these cases, children ended Ethical approval for this research was obtained from the
up drawing objects so similar to those that they intended joint South London and Maudsley NHS Trust ⁄ Institute
to draw, that there may have been no perceived mistake of Psychiatry Research Ethics Committee. Twenty-two
to report. children with ASD and 22 comparison children
One further possibility for the lack of group differences participated in Experiment 1, after parents ⁄ guardians
in Russell and Hill’s (2001) study was the surprisingly had given written, informed consent for their children
poor performance of their comparison group on a to be included. The participants in the ASD group
standard unexpected contents false belief task. Nearly had received formal diagnoses, by a trained
two-thirds of participants from each diagnostic group psychiatrist or pediatrician, of autistic disorder (n = 20),
failed the Other-person test question from this measure. Asperger’s disorder (n = 1), or atypical autism ⁄ pervasive
As such, it is perhaps not surprising that both groups developmental disorder not otherwise specified (PDD-
found the experimental task difficult, given its potential NOS; n = 1) according to established criteria (American
relation to ToM. Indeed, the numbers of participants in Psychiatric Association, 2000). All participants in this
each group failing each of the four test questions on the group attended specialist autism schools, which required
Transparent Intentions task was closely in line with the a diagnosis of autism, Asperger’s syndrome or PDD-
numbers failing the false belief task, although Russell NOS for entry into the school. The comparison
and Hill did not analyse the interactions between these (developmentally delayed; DD) group consisted of 19
tasks. children with general learning disability of unknown
The two experiments reported in this paper assessed origin and three TD children who were recruited for
the explicit awareness that children with ASD, as well as comparison with those (three) children in the ASD group
young TD children, have of their own and others’ who achieved IQ scores in the ‘average’ range (i.e. over
intentions. In Experiment 1, participants’ awareness of 90). Finally, a separate group of 32 TD children also
their own knee-jerk reflex movements was assessed. First, took part.
it was predicted that participants with ASD would
incorrectly report their reflex movements as intentional
Background assessments
significantly more often than comparison participants
with developmental delay (DD). Second, it was predicted Baseline verbal and non-verbal abilities were assessed by
that performance on the knee-jerk task would be an appropriate measure for the developmental level of
associated significantly with performance on measures each participant. The verbal abilities of 16 (out of 22)
of false belief understanding, independently of children with ASD and 17 (out of 22) DD children were
chronological age (CA) and verbal mental age (VMA). assessed by performance on the Vocabulary and
Regarding TD participants, it was predicted that Information subtests of the Wechsler Intelligence Scale
performance on the knee-jerk task would be for Children - Third Edition UK (WISC-III; Wechsler,
significantly associated with false belief understanding, 1991). The verbal IQ estimate gained from this short
independently of CA and VMA (cf. Lang & Perner, form has high reliability (Sattler, 1992). Because the
2002). lowest test age-equivalent offered by the WISC-III is
In Experiment 2, participants undertook a revised 6 years and 2 months, the VMA of any participant
version of the Transparent Intentions task, in which the who fell below this level on either of the verbal subtests
methodological confounds that may have biased results could not be calculated. Under these circumstances,
Table 1 Participant characteristics for Experiment 1: Means VIQ: t(42) = 0.80, p = .43, r = .12; PIQ: t(38) = 0.86,
and (standard deviations) p = .40, r = .14.
Given that some ASD and DD participants received
ASD ASD-Extended DD TD
the Wechsler Scales (Wechsler, 1991), while others
n 22 30 22 32 received the BPVS (Dunn et al., 1997), independent
CA: yrs 10.61 (3.51) 10.02 (3.21) 11.20 (3.56) 4.43 (0.68)
t-tests were conducted comparing ASD and DD
VMA:yrs 7.69 (3.06) 7.02 (2.90) 7.51 (2.42) 4.86 (1.24) participants from each sub-sample to ensure adequacy
VIQ 77.32 (17.38) 76.50 (15.58) 73.32 (15.71) 104.1 (10.40) of matching in each case, as well as overall. ASD and DD
a b a
PIQ 77.95 (22.06) 78.88 (20.50) 72.10 (21.19) _
participants who received the Wechsler Scales were well
a
Due to limited child availability, PIQ data were available for n = 20 participants matched on all variables (all ts < 1.14, all ps > .27), as
with ASD and n = 20 comparison participants. b In the Extended ASD sample, were ASD and DD participants who received the BPVS
PIQ data were available for n = 24 participants.
(all ts < 0.80, all ps > .46).
(95.5%) DD participants and 17 ⁄ 32 (53.1%) TD deemed valid after initial analyses revealed that
participants passing the task. The difference in performance on each of the three measures was
performance between ASD and DD participants was strongly associated, in this sample (/ = .73–.76
significant, v2(1) = 5.50, Fisher’s Exact p = .04, / = .35. between the three test questions).
Bivariate point-biserial correlations revealed that
performance on the knee-jerk task was significantly
Relation between reflex understanding and ToM
associated with the false belief composite score,
Table 2 shows the percentage of ASD, DD, and TD rpb = .53, p = .003. When CA and VMA were
participants passing each false belief measure, as well as controlled, in a partial correlation, this association
each group’s mean total score across the three tasks. remained significant, rpb = .40, p = .04.
Participants with ASD (including those from the
extended sample) performed significantly less well than Developmentally delayed sample. Given that only one
DD participants on each measure: Plasters Self: comparison participant failed the knee-jerk test, the
v2(1) = 9.65, p = .002, / = .43; Plasters Other-person: relationship between reflex understanding and ToM
v2(1) = 5.30, p = .02, / = .38; Sally-Anne: v2(1) = 3.42, could not be assessed statistically.
p (one-tailed) < .04, / = .26; Total False Belief Score:
t(49.91) = )2.91, p = .005, r = .38. Typically developing sample. In TD participants, there
Given the theoretical and empirical links between was a significant association between performance on the
understanding reflex actions and ToM, a series of chi- Plasters Self question and performance on the knee-jerk
square analyses were conducted to assess the degree of task, v2(1) = 7.04, p = .008, / = .47. In this sample,
association in each group between performance on the however, performance on the knee-jerk task was not
knee-jerk task and performance on each false belief significantly associated with performance on either
measure. the Plasters Other-person question, v2(1) = 0.98,
p = .76, / = .06, or the Sally-Anne task, v2(1) = 2.92,
ASD sample. In the extended sample of 30 participants Fisher’s exact p = .15, / = .31.
with ASD, there was a significant association between In these participants, success on the knee-jerk task was
performance on the Plasters Self question and perfor- significantly related to VMA, rpb = .35, p = .05, but not
mance on the knee-jerk task, v2(1) = 9.60, p = .002, / = to CA, rpb = .09, p = .65. The next step, therefore, was to
.57. The association between performance on the Plasters ensure that the association between knee-jerk task
Other-person question and performance on the knee-jerk performance and performance on the Plasters Self
task was also significant, v2(1) = 4.34, p = .04, / = .38. question was not an artifact of any mediating relation-
Finally, there was a significant association between ship with VMA. A forced-entry logistic regression was
performance on the Sally-Anne task and performance therefore conducted with performance on the knee-jerk
on the knee-jerk task, v2(1) = 7.18, Fisher’s Exact task (pass ⁄ fail) entered as the dependent variable. In the
p = .007, / = .49. first instance, Plasters Self performance was entered as a
In this expanded sample of ASD participants, success predictor variable in the first block and VMA was
on the knee-jerk task was significantly related to VMA, included as a second predictor variable in block 2.
rpb = .40, p = .03, and CA, rpb = .31, p (one-tailed) = Table 3 displays the regression statistics for this initial
.05. The next step, therefore, was to ensure that the above analysis.
associations between knee-jerk task performance and The inclusion of Plasters Self in block 1 resulted in a
ToM performance were not simply an artifact of each significantly improved model, v2 = 7.53, df = 1,
task’s relationship with VMA and CA. In order to assess p = .006, accounting for 28% of the variance in knee-
this, the false belief composite score (see Table 2) was jerk task performance (R2 = .28; see Table 3). In block 1,
employed as a measure of ToM. This composite was Plasters Self was a significant predictor of knee-jerk task
performance. The inclusion of VMA in block 2 did not
significantly improve the model, v2 = 0.05, df = 1,
Table 2 Percentage of participants from each group in p = .83. VMA did not account for any additional
Experiment 1 passing the Plasters Self, Plasters Other and variance in knee-jerk task performance over and above
Sally-Anne tasks, as well as overall performance across the Plasters Self (R2 = .28; 4R2 = .00). However, after the
three tasks inclusion of VMA in block 2, Plasters Self, as a predictor
ASD-extended DD TD
of knee-jerk task performance, only approached
(n = 30) (n = 22) (n = 32) significance.
In a second regression analysis (see Table 4), Plasters
Plasters Self 50.0 90.9 37.5
Plasters Other 56.7 86.4 56.3 Self and VMA were entered in reverse order. The
Sally-Anne 63.3 86.4 83.9* inclusion of VMA in block 1 resulted in a significantly
Total false belief 1.70 (SD 1.37) 2.64 (SD 0.95) 1.81 (SD 1.04)* improved model, v2 = 4.19, df = 1, p = .04, accounting
score (out of 3)
for 16% of the variance in knee-jerk task performance
* Based on 31 ⁄ 32 TD participants. (R2 = .16). The significance of VMA as a predictor of
Table 3 Summary of logistic regression analysis for variables describes a large and worrying difference as 74.43 units.
predicting knee-jerk task performance Given these checks, it is unlikely that multicollenearity
was biasing the above regression.
95% confidence
interval for Exp B
their own mistaken actions on a ‘Transparent Intentions’ Picture B (bottom transparency) Picture A (top transparency)
task.
Experiment 2
Method
Participants
Thirty-four children with ASD, 30 comparison children,
and 35 TD children completed the Transparent
Intentions task. The (developmentally disabled; DD)
comparison group consisted of 25 children with general
learning disability and five TD children who were
recruited for comparison with those children in the
ASD group who achieved IQ scores in the ‘average’ range
(i.e. over 90). The verbal ability of 14 (out of 34) children
with ASD and 17 (out of 30) comparison children was
assessed by performance on the Vocabulary and
Information subtests of the Wechsler Intelligence Scale
for Children - Third Edition UK (WISC-III; Wechsler,
1991). The verbal ability of the remaining 20 participants
with ASD and 13 DD participants was assessed by
performance on the BPVS (Dunn et al., 1997). Non-
verbal ability was assessed by the Block Design and
Picture Completion subtests of the WISC-III. Due to
Figure 1 Experimental materials for the Transparent
limited child availability, the non-verbal ability of five Intentions task.
(out of 34) participants with ASD and six (out of 30)
DD comparison participants was not assessed. The
characteristics of each participant group are presented
in Table 5. Statistical analyses showed that the ASD and t(29) = 2.56, p = .02, r = .43. Participants with ASD
DD groups were well matched for: CA: t(62) = )0.07, (mean VMA = 10.13, SD = 2.11) also had significantly
p = .95, r = 0.01; VMA: t(59.66) = 0.25, p = .80, r = higher VMAs than DD participants (mean VMA = 8.51,
0.03; VIQ: t(62) = 0.65, p = .52, r = 0.08. However, SD = 1.68), t(29) = 2.38, p = .02, r = .40. No other
compared to DD participants, ASD participants had significant differences were found between groups in
significantly higher PIQs: t(51) = 2.29, p = .03, r = 0.31. the Wechsler sub-sample (all ts < 1.74, all ps > .09).
Given that some ASD and DD participants received In the BPVS sub-sample, participants with ASD had
the WISC-III (Wechsler, 1991), while others received the significantly higher PIQs (mean PIQ = 75.87,
BPVS (Dunn et al., 1997), independent t-tests were SD = 24.74) than DD participants (mean PIQ = 53.29,
conducted on each sub-sample to ensure adequacy of SD = 8.69), t(20) = 2.32, p = .03, r = .46. No other
matching in each case, as well as overall. significant differences were found between groups in
In the WISC-III sub-sample, participants with ASD the BPVS sub-sample (all ts < 0.47, all ps > .64).
(mean CA = 14.10, SD = 1.04) were significantly older
than DD participants (mean CA = 12.93, SD = 1.42),
Design and procedures
Transparent Intentions task. The experimental task,
Table 5 Participant characteristics for Experiment 2: Means based on that used by Russell and Hill (2001), involved
and (standard deviations) two sheets of transparent acetate (A and B) each with a
different (incomplete) drawing on. When the sheets were
ASD DD TD
placed on top of each other, the drawings became
n 34 30 35 perfectly aligned resulting in only one – the drawing on
the bottom sheet – being visible. For example, one pair of
CA: years 11.06 (3.59) 11.12 (3.55) 4.46 (0.67)
VMA: years 7.48 (2.79) 7.33 (2.00) 4.81 (1.23)
pictures involved a teacup with a missing handle (picture
VIQ 76.03 (17.70) 73.10 (18.60) 103.6 (9.79) A) and a choir boy with a missing ear (picture B). When
PIQa 75.69 (19.93) 63.67 (17.81) – A was placed on B, only B could be seen. Figure 2 shows
a
Due to limited child availability, PIQ data were collected for n = 29 participants
the experimental pictures (taken from Russell & Hill, p.
with ASD and n = 24 DD participants. 324) that were used.
p = .003
ASD that the experimenter had ‘played this game with another
100
DD
p = .05 p = .01 person’ and had filmed it. Participants were then shown
TD a 30-second clip of the experimenter administering the
80 task to a colleague who pretended to be ignorant of
the set-up. The clip ended at the moment when the
Percent passing
Table 6 Percentage of participants from each group in Table 7 Correlations between ToM total score (0–3) and
Experiment 2 passing the Plasters Self, Plasters Other and performance on each test question from the Transparent
Sally-Anne tasks, as well as overall performance across the Intentions task. Figures in parentheses are partial correlations
three tasks after controlling age and verbal ability
* Based on 33 ⁄ 34 ASD participants, 29 ⁄ 30 DD participants, and 34 ⁄ 35 TD * p < .05; ** p < .01; *** p < .001.
participants. ** Based on 33 ⁄ 35 TD participants.
passing the Mean question, only two (5.7%) children ToM – independent of age and verbal ability. In TD
displayed the opposite pattern of performance. For these participants, also, the ability to attribute false beliefs to
participants, therefore, the Think question was self and other was significantly associated with
significantly more difficult than the Mean question in performance on aspects of each of the tasks from
the Self condition of the Transparent Intentions task, Experiments 1 and 2. These findings support the
v2(1) = 0.70, McNemar’s p = .01. notion that recognizing intentions and false beliefs
In the Other-person condition of the Transparent depends upon the same representational mechanism
Intentions task, five (14.7%) participants with ASD (e.g. Perner, 1991).
failed the Think question despite passing the Mean It is intriguing that among both ASD and TD
question and two (5.9%) participants showed the participants, success on the knee-jerk task in
opposite pattern of performance. For participants with Experiment 1 was most strongly related to success on
ASD, the two test questions did not vary systematically the Plasters Self false belief question. Note that both
in difficulty, v2(1) = 10.46, McNemar’s p = .45. In the semantically and pragmatically, the test questions on the
TD sample, nine (25.7%) participants failed the Think respective tasks are quite different: ‘Did you mean to
question despite passing the Mean question, whereas move your leg?’ in the knee-jerk task, versus ‘What did
only two (5.7%) children displayed the opposite pattern you think was inside the box before you looked?’ in the
of performance. This difference was very nearly Plasters Self task. The answers required are also quite
significant, v2(1) = 0.70, McNemar’s p = .06. different: a yes ⁄ no response in the knee-jerk task, versus
an open-ended response in the Plasters task. Again, these
findings suggest that the underlying representational
Discussion
demands are similar in each task, both requiring, it is
Relative to matched comparison participants, children argued, recognition of mental states as casual of actions:
with ASD were clearly impaired on the Transparent in the knee-jerk task, there was no intention to move and
Intentions task. Over one-third of participants with ASD therefore the movement could not have been ‘meant’. In
incorrectly reported that both they and another person the Plasters Self task, one’s (false) belief, rather than
had (a) intended a mistaken action and (b) thought that reality itself, caused one to choose the plasters box.
this (mistake) was the action they were performing The findings from Experiment 2 that participants with
throughout. In contrast, comparison participants were ASD were significantly impaired on a Transparent
generally highly proficient at distinguishing their own Intentions task clearly differs from the findings of
and another’s unfulfilled intentions from reality. In this Russell and Hill (2001) which provided little support
regard, comparison participants were significantly for an ASD-specific impairment on this task. However,
superior to participants with ASD, except on the Self concerns over aspects of Russell and Hill’s design, most
‘Think’ question which approximately 20% failed. These notably their counterbalancing of the order of
findings stand in contrast to those of Russell and Hill presentation of the Self and Other conditions and their
(2001) who found children with ASD to be impaired, use of potentially insensitive stimulus materials, suggest
relative to matched comparison participants, on the that the present results may be a more accurate reflection
‘Mean’ question in the Self condition only. of the ability of children with ASD to represent their own
Among both participants with ASD and TD children, (unfulfilled) intentions.
ToM was associated significantly with performance only Perhaps the discrepancy between the findings of the
on the Think questions (in Self and Other conditions) of current study and those of Russell and Hill (2001) can be
the Transparent Intentions task, independent of age and explained in terms of the comparison participants used
verbal ability. This contrasts with Russell et al.’s (2001) in each study. Comparison participants in Russell and
finding that ToM was not related to overall performance Hill’s study performed quite poorly on standard false
on the Transparent Intentions task in TD children. belief tasks and their performance on the Transparent
Intentions task appeared to reflect this. However,
comparison participants in the current study performed
General discussion significantly better than participants with ASD on a false
belief composite measure, which may explain their
The findings from the two experiments reported here superior performance on the experimental task. This
provide further evidence that individuals with ASD are explanation is partially supported by the finding that
impaired in their understanding of their own and others’ among both participants with ASD and TD children in
intentions. Relative to age- and ability-matched DD the current study, ToM ability was significantly
comparison participants, participants with ASD were associated with performance on the Think test
less able to recognize their own reflex movements as questions of the Transparent Intentions task, after the
unintentional (in Experiment 1) or their own mistaken effects of age and verbal ability were controlled.
actions as unintended (in Experiment 2). In each case, The finding that ToM was related only to performance
these impairments were associated with a specific on the Think questions (in Self and Other-person
difficulty in representing false beliefs – a proxy for conditions) of the Transparent Intentions task, but not
to the Mean questions (in either condition), suggests exists one cognitive mechanism for the representation of
that only the former question-type taps a meta- one’s own propositional attitudes and another
representational ToM. Indeed, for TD participants, the mechanism for representing others’ propositional
Mean test questions were significantly less challenging attitudes. Furthermore, they claim that ASD involves a
than the Think test questions. With hindsight, this result deficit only in reading the minds of other people, leaving
might have been predicted. A number of studies have introspection of own mental states intact. The current
shown that before they are able to pass false belief tasks, findings argue strongly against this position, suggesting
young TD children can report an action as ‘mistaken’ that (a) individuals with ASD have as much difficulty
when a behaviourally specified goal does not match an recognizing their own mental states as they do
outcome (e.g. Feinfield et al., 1999; Phillips et al., 1998; recognizing others’ mental states and (b) that one
Shultz et al., 1980). It was argued above that young cognitive mechanism (or process) is involved in both
children’s success in reporting unintended actions in such recognizing one’s own and others’ mental states (cf.
tasks is based on their recognition of desires, rather than Carruthers, 2009). Whether this mechanism ⁄ process
intentions. On this basis, young TD children in involves a form of simulation, such that knowledge of
Experiment 2 may have succeeded on the Mean test one’s own mental states is employed to read others’
questions of the Transparent Intentions task by minds, or whether introspection of own mental states
defaulting to their concepts of desire, in the absence of involves the same theoretical underpinnings as other-
a coherent concept of intention. Therefore, this may be person mentalizing is a matter of debate. We believe there
cause for concern over the validity of the Transparent are persuasive arguments why simulation from one’s case
Intentions task as a measure of intention understanding cannot alone account for the ability to recognize mental
in TD children. However, in Experiment 1 TD children states in others (e.g. Carruthers, 2009; Hobson, 1990;
had a great deal of difficulty correctly reporting their Strawson, 1962; Williams & Happe, 2009a; Wittgenstein,
reflex movements as unintentional, with participants’ 1953), although it is beyond the scope of this paper to
performance on the knee-jerk closely associated with explore these arguments directly. What is clear from the
their ability to recognize their own false beliefs. The current findings is that, in addition to difficulties with
poorer performance of TD children on the test question representing false beliefs, individuals with ASD also
from the knee-jerk task (‘Did you mean to move your show deficits in recognizing their own and others’
leg?’) than the Mean test question from the Transparent intentions.
Intentions task (‘Did you mean to draw X?’) may be
explained by the fact that no desire was evident in the
knee-jerk task. Hence, unlike on the Transparent Acknowledgements
Intentions task, participants in Experiment 1 could not
succeed on the test question by reporting on their desire, This study was supported by a Medical Research
rather than intention, to have acted. Overall, therefore, Council Studentship, awarded to the first author. The
confidence can still be maintained in the conclusion that manuscript was prepared during an Economic and
young TD children, as well as individuals with ASD, Social Research Council Fellowship awarded to the
have difficulty in recognizing intentions as mental states first author. The authors would like to thank Sophie
differentiated from desires. Lind and Peter Hobson for their helpful comments on
an early draft of the manuscript. We would like to thank
Dr Elizabeth Hill for her assistance in the planning of
Wider theoretical implications of the current research
this study and for providing us with pictures for Figure 1
Several theoretical implications follow from the current of the manuscript. The authors would also like to thank
set of findings. Specifically in relation to ASD, the the staff and students at the schools from which
findings indicate that impairments in representing mental participants were recruited: Cambridge School,
states in self are at least as profound as impairments in Christchurch School, Dalmain School, Garratt Park
representing the mental states of other people. This adds School, Linden Bridge School, Queensmill School,
to the growing body of literature indicating diminution Springhallow School, St Winifred’s School, The Vines
of at least some aspects of self-awareness in ASD (e.g. School, Turney School.
Ben-Shalom, Mostofsky, Hazlett, Goldberg, Mcloud &
Hoehn-Saric, 2003; Hobson, Chidambi, Lee & Meyer,
2006; but see Williams & Happ, 2009b), and in References
particular Theory of own Mind (Frith & Happ, 1999;
Williams & Happ, 2009a). Aldridge, M.A., Stone, K.R., Sweeney, M.H., & Bower,
More generally, the results provide a direct challenge T.G.R. (2000). Preverbal children with autism understand
to models of mindreading that postulate distinct the intentions of others. Developmental Science, 3, 294–301.
mechanisms for representing one’s own and others’ American Psychiatric Association. (2000). Diagnostic and
statistical manual: 4th edition text revised (DSM IV-TR).
mental states (e.g. Nichols & Stich, 2003; Raffman,
New York: Author.
1999). Nichols and Stich, for instance, claim that there
Astington, J.W. (1993). The child’s discovery of mind. London: Menard, S. (1995). Applied logistic regression analysis. Thou-
Fontana Press. sand Oaks, CA: Sage.
Astington, J.W., & Lee, E. (1991, April). What do children Moses, L.J. (1993). Young children’s understanding of belief
know about intentional causation. Paper presented at the constraints on intention. Cognitive Development, 8, 1–25.
Biennial Meeting of the Society for Research in Child Myers, R. (1990). Classical and modern regression with appli-
Development, Seattle, USA. cations. Boston, MA: Duxbury.
Baron-Cohen, S. (1991). The development of a theory of mind Nichols, S., & Stich, S. (2003). Mindreading: An integrated
in autism: deviance and delay. Psychiatric Clinics of North account of pretence, self-awareness, and understanding others'
America, 14 (1), 33–51. minds. Oxford: Oxford University Press.
Bellagamba, F., & Tomasello, M. (1999). Re-enacting intended Perner, J. (1991). Understanding the representational mind.
acts: comparing 12- and 18-month-olds. Infant Behavior and Cambridge, MA: MIT Press.
Development, 22, 277–282. Perner, J., Frith, U., Leslie, A.M., & Leekam, S.R. (1989).
Ben-Shalom, D., Mostofsky, S.H., Hazlett, R.L., Goldberg, Explorations of the autistic child’s theory of mind: knowl-
M.C., Mcloud, D., & Hoehn-Saric, R.L. (2003). Intact GSRs edge, belief, and communication. Child Development, 60,
and impaired self-reports in response to emotional pictures in 689–700.
high-functioning autism. Annals of the New York Academy of Phillips, W., Baron-Cohen, S., & Rutter, M. (1998).
Sciences, 985, 501–504. Understanding intention in normal development and autism.
Carpenter, M., Akhtar, N., & Tomasello, M. (1998). Four- British Journal of Developmental Psychology, 16, 337–348.
teen- through 18-month-old infants differentially imitate Raffman, D. (1999). What autism may tell us about self-
intentional and accidental actions. Infant Behavior and awareness: a commentary on Frith and Happe. Mind &
Development, 21, 315–330. Language, 14 (1), 23–31.
Carruthers, P. (2009). How we know our own minds: the Russell, J., & Hill, E.L. (2001). Action monitoring and inten-
relationship between mindreading and metacognition. tion reporting in children with autism. Journal of Child
Behavioral and Brain Sciences, 32, 1–18. Psychology and Psychiatry, 42, 317–328.
Cohen, J. (1992). A power primer. Psychological Bulletin, 112 Russell, J., Hill, E., & Franco, F. (2001). The role of belief
(1), 155–159. veracity in understanding intentions-in-action: preschool
Davidson, D. (1963). Actions, reasons, and causes. Journal of children’s performance on the transparent intentions task.
Philosophy, 60 (23), 685–700. Cognitive Development, 16, 775–792.
Dennett, D.C. (1978). Brainstorms. Montgomery, VT: Bradford Sattler, J.M. (1992). Assessment of children: WISC-III and
Books. WPPSI-R supplement. La Mesa, CA: Jerome M. Sattler
Dunn, L.M., Dunn, L.M., Whetton, C., & Burley, L. (1997). Publishers.
British Picture Vocabulary Scale (2nd edn.). Windsor: Searle, J. (1983). Intentionality. Cambridge: Cambridge Uni-
NFER-Nelson. versity Press.
Feinfield, K.A., Lee, P.P., Flavell, E.R., Green, F.L., & Flavell, Shultz, T.R., Wells, D., & Sarda, M. (1980). Development of
J.H. (1999). Young children’s understanding of intention. the ability to distinguish intended actions from mistakes,
Cognitive Development, 14, 463–486. reflexes, and passive movements. British Journal of Social and
Field, A. (2005). Discovering statistics using SPSS (2nd edn.). Clinical Psychology, 19, 301–310.
London: Sage. Strawson, P.F. (1962, originally 1958). Persons. In V.C.
Frith, U., & Happ, F. (1999). Theory of mind and self- Chappell (Ed.), The philosophy of mind (pp. 127–146).
consciousness: what is it like to be autistic? Mind and Englewood Cliffs, NJ: Prentice Hall.
Language, 14, 1–22. Tan, J., & Harris, P.L. (1991). Autistic children understand
Happ, F. (1995). The role of age and verbal ability in the seeing and wanting. Development and Psychopathology, 3,
theory of mind task performance of subjects with autism. 163–174.
Child Development, 66, 843–855. Wechsler, D. (1991). Wechsler intelligence scale for children (3rd
Hobson, R.P. (1990). On the origins of self and the case of edn.). London: The Psychological Corporation.
autism. Development and Psychopathology, 2, 163–181. Wellman, H., Cross, D., & Watson, J. (2001). Meta-analysis of
Hobson, R.P., Chidambi, G., Lee, A., & Meyer, J. (2006). theory of mind development: the truth about false belief.
Foundations for self-awareness: an exploration through aut- Child Development, 72, 655–684.
ism. Monographs of the Society for Research in Child Devel- Williams, D., & Happ, F. (2009b). Pre-conceptual aspects of
opment, 71 (2, Serial no. 284). self-awareness in autism spectrum disorder: the case of action
Huang, C.-T., Heyes, C.M., & Charman, T. (2002). Infants’ monitoring. Journal of Autism and Developmental Disorders,
behavioral re-enactment of ‘failed attempts’: exploring the 39, 251–259.
roles of emulation learning, stimulus enhancement and under- Williams, D., & Happ, F. (2009a). ‘What did I say?’ versus
standing of intentions. Developmental Psychology, 38, 840–855. ‘what did I think?’: attributing false beliefs to self amongst
Lang, B., & Perner, J. (2002). Understanding of intention and children with and without autism. Journal of Autism and
false belief and the development of self-control. British Developmental Disorders, 39, 865–873.
Journal of Developmental Psychology, 20, 67–76. Wittgenstein, L. (1953). The philosophical investigations. New
Leslie, A.M., & Thaiss, L. (1992). Domain specificity in con- York: Macmillan.
ceptual development: neuropsychological evidence from
autism. Cognition, 43, 225–251.
Meltzoff, A.N. (1995). Understanding the intentions of others:
re-enactment of intended acts by 18-month-old children. Received: 22 July 2008
Developmental Psychology, 31, 1–16. Accepted: 20 January 2009