Journal of Experimental Psychology:
Learning. Memory, and Cognition
1995, Vol. 21, No. 5,1155-1168
Copyright 1995 by the American Psychological Association, Inc.
0278-7393/95/$3.00
The Inconsistency of Consistency Effects in Reading:
The Case of Japanese Kanji
Taeko N. Wydell and Brian Butteworth
University College London
Karalyn Patterson
Medical Research Council Applied Psychology Unit
Most Japanese Kanji characters have several different pronunciations, at least one ON-reading (of
Chinese origin) and a KUN-reading (of Japanese origin); the appropriate pronunciation is
determined by intraword wntext. There are also Kanji characters that have a single ON-reading
and no KUN-reading. With 2-character ON-reading Kanji words as stimuli, naming experiments
were carried out to investigate print-to-soundconsistencyeffects. The consistent Kanji words were
those in which neither constituent character had an alternative ON-reading or a KUN-reading,
hence there can be no pronunciation ambiguity for these words. The inconsistent items were
ON-reading words composed of characters that have KUN-readings that are appropriate to other
words in which the characters occur, hence there should be some ambiguity about the
pronunciation of the constituent characters. Six experiments yielded reliable effects of both word
and character frequency and familiarity on speed and accuracy of word naming but virtually no
evidence for consistency effects. It was concluded that for Kanji, phonology is dominantly
computed at the word rather than at the character level.
A number of theoretical issues regarding reading processes
may fruitfully be addressed by comparing different written
languages or writing systems, for example, alphabetic English
and logographic Japanese Kanji. The broader issue considered
here is the process by which phonology is computed from
orthography, and the more specific question is whether various
"sizes of unit" contribute to this process for the two writing
systems. A prominent technique for investigating the question
of unit size is to assess the impact, on accuracy and speed of
word naming, of the consistency of spelling-sound correspondences within neighborhoods of orthographically similar words.
The principle underlying this line of investigation is that all
whole words have a single conventionally correct pronunciation (except for rare homographs like wind and lead in English,
in which the reader requires extraword wntext to select
between two pronunciations); however, in many writing systems, including both English and Japanese, subword sized
units may take several different alternative pronunciations,
with the correct one determined by intraword context. If the
reader translates spelling to sound at the level of the whole
word only, then the existence of several competing pronunciaTaeko N. Wydell and Brian Butterworth, Department of Psychology, University College London, London, United Kingdom; Karalyn
Patterson, Medical Research Council Applied Psychology Unit, Cambridge, United Kingdom.
Part of this research was presented at a joint meeting of the Brain,
Behaviour and Cognitive Science Society of Canada and the Experimental Psychology Society of Britain in Toronto, Canada, July 1993. The
research was supported by a project grant (G9015838N) from the
Medical Research Council.
We are grateful to A. Jonckheere for statistical advice and to Debra
Jared for helpful comments on the original version of this article.
Correspondence concerning this article should be addressed to
Taeko N. Wydell, Department of Psychology, University College
London, Gower Street, London WC1E 6BT United Kingdom. Electronic mail may be sent via Internet to ucjtrtw@ucl.ac.uk.
tions for subword chunks should not interfere; however, if the
computation of phonology from orthography also occurs for
units smaller than the word, then words containing "inconsistent" units-that is, those with different pronunciations in
other words-might be at a disadvantage in accuracy or speed
(or both) in word-naming tasks.
Many studies have addressed this question in English (e.g.,
Andrews, 1982, 1989, 1992; Glushko, 1979; Jared, McRae, &
Seidenberg, 1990; Parkin, 1982; Seidenberg, Waters, Barnes,
& Tanenhaus, 1984; Stanhope & Parkin, 1987; Taraban &
McClelland, 1987; Waters & Seidenberg, 1985). On the basis
of the frequent observation (e.g., Coltheart, 1978; Parkin,
1982; Waters & Seidenberg, 1985) that words with regular or
typical spelling-sound correspondences (such as five) produce
shorter naming latencies and lower error rates than words with
exceptional correspondences (e.g., give), regularity was originally considered to be the critical variable. However, Glushko
(1979) argued that consistency, rather than rule-defined regularity, provided a better account of empirical results. Although
five may be a regular word "by rule," its spelling-sound
relationship is inconsistent with orthographically similar words
such as give. To the extent that the process of computing
phonology from orthography is sensitive to the characteristics
of the neighborhood, then performance on a regular but
inconsistent word like five should also adversely be affected.
Glushko did indeed demonstrate longer response times (RTs)
for regular inconsistent words than for regular words from
consistent neighborhoods, though this result was not always
obtained in subsequent experiments (e.g., Stanhope & Parkin,
1987; Taraban & McClelland, 1987; see also Brown, 1987;
Patterson & Coltheart, 1987). In all studies manipulating
either regularity or consistency, the effect has been shown to
be stronger for low- than for high-frequency target items and is
often statistically reliable only for the lower frequency words.
Jared et al. (1990) produced a more sophisticated hypothesis
that captures aspects of results not handled by previous
T. WYDELL, B. BUTTERWORTH, AND K. PATTERSON
accounts referring solely to either regularity or consistency.
According to Jared et a]., the magnitude of the consistency
effect for a given word is related to the summed frequency of
that word's "friends" (words with a similar spelling pattern
and a similar pronunciation) and of its "enemies" (words with
a similar spelling pattern but with a different pronunciation),
For example, an inconsistent word such as haste has a number
of friends (e.g., waste, paste, taste, and so forth) and just a single
enemy, caste. Against the strength of friends, the single enemy
(which in the case of caste, moreover, is a rarely encountered
word) cannot exert a marked influence; its negative impact on
computing the pronunciation of haste will thus be small or even
undetectable. An inconsistent word like gown, however, with
many enemies (e.g., blown, mown, sown, and so forth), as well
as friends (e.g., town, clown, and frown), gives rise to a more
substantial effect. The commonly observed effect of regularity
also finds a natural explanation within this account because
many words with irregular spelling-sound correspondences
defined by rule (such as caste and pint) happen to have few or
no friends and many enemies. The theory (and supporting
data) of Jared et al. also mesh well with (a) other results demonstrating the inadequacy of a simple regular-irregular dichotomy, such as
the degrees of regularity effect described by Shallice, Warrington,
and McCarthy (1983), and (b) computational models of translation
from print to sound that demonstrate strong frequency and
consistency effects (Plaut, McClelland, Seidenberg, & Patterson, 1994; Seidenberg & McClelland, 1989).
It thus appears that the computation of a word's pronunciation in English has contributions from both whole-word and
subword levels; the former is demonstrated by the fact that
consistency effects are modulated by the frequency of the
target word, with very common words less vulnerable to the
existence of unfriendly neighbors; the impact of the consistency of a word's orthographic neighbors provides evidence of
generalization at the subword level. In a series of experiments
reported in this article, we attempted to establish whether
Japanese readers naming words written in Kanji show consistency effects paralleling those for English. To demonstrate
what is meant by consistency in Kanji, we begin with a brief
description of Japanese orthography.
Japanese Orthography and Pronunciation Consistency
The Japanese orthography comprises two scripts: syllabic
Kana and logographic Kanji (see Sampson, 1985; Sasanuma,
1986; Wydell, Patterson, & Humphreys, 1993, for more detailed descriptions). The feature of Kanji that is particularly
germane to an understanding of how Japanese readers might
translate orthography to phonology is that a typical Kanji
character has two (or sometimes more) pronunciations: a
KUN-reading and one or more ON-readings. A KUN-reading
pronunciation is part of the original Japanese language and
was assigned to Chinese characters from their meanings when
Japan adopted the Chinese writing system. At the time of
importing Chinese characters, however, not only were Chinese
characters introduced, but Chinese words and their pronunciations were also added to the Japanese vocabulary-these are
ON-readings. Also, a number of Kanji characters now have
more than one ON-reading, as a consequence of the same
Chinese characters being introduced to Japan at several
different periods.
A Kanji word may have from one to four characters; words
with more than one character typically, but by no means
always, take ON-readings. For Kanji characters with more
than a single ON-reading (be it a KUN- or an additional
ON-reading), the appropriate pronunciation is determined by
the intraword context, that is, the other character(s) with
which the particular character combines to constitute the word
in question. For example, the Kanji character :X (meaning
parent) is pronounced "oya" (the KUN-reading) when it
occurs in compound words such as '.SX oyakata (meaning
foreman) o r XSB chichioya (meaning father); however, the
same character is assigned the ON-reading "shin" when it
occurs in the compound word NfJX shinrui (relatives) or
ryoshin (parents). Any word containing the character
9 can therefore be described as an inconsistent word because
pronunciation of the character varies across the orthographic
neighborhood of words containing it.
There are, however, some Kanji characters that do not have
KUN-readings and also have just a single ON-reading. Kanji
compound words made up of these characters, therefore, have
no pronunciation ambiguity because each constituent character has but a single possible pronunciation (e.g., for UK Idkou,
meaning climate, each constituent character K Id and <( b u
has no KUN-reading and no other ON-reading). These words,
which can be described as consistent, must have been the ones
added to the Japanese vocabulary at the time of borrowing
Chinese characters.
It should be noted that there are at least two respects in
which issues of "regularity" and "consistency" of print-sound
relationships differ between Kanji and English. The first of
these is that, despite some ambiguity about how to define
regularity in English, most investigators would agree that-for
the inconsistent spelling pattern int, for example-words like
mint, hint, and lint have the regular or typical pronunciation,
with pint being the exception. There is no exact equivalent of
such a regular or typical correspondence at the subword level
in an inconsistent two-character Kanji word. It could be argued
that the (dominant) ON pronunciation is the more typical
because multicharacter words most often have ON pronunciations. On the other hand, because single-character words with
KUN-readings almost always take that KUN-reading, it could
also be argued that, at the character level, the KUN-reading
for these items is the more typical. Because Kanji words do not
lend themselves to classification as regular or exceptional (as
well as for the reasons given above favoring a description in
terms of consistency even for English), we refer to consistency
rather than regularity in Kanji print-sound correspondences.
The issue does, however, have some bearing on the design of
empirical investigations. In the first experiments we reported
in this article, we assess consistency effects by comparing
two-character ON-reading consistent and inconsistent words
primarily because there is a reasonable (though limited)
number of characters with just a single ON-reading; there are
very few characters with just a KUN-reading, which wouid
make it impossible to select a set of multicharacter consistent
KUN-reading words. On the grounds that the ON-reading of
an inconsistent character in a multicharacter word might in
READING JAPANESE KANJI
some sense be the more typical, however-which could make
consistency effects more difficult to detect (just as consistency
effects are harder to see in regular inconsistent than in
exception inconsistent words in English)-in the last two
experiments we focus on KUN-reading words.
A second notable difference between English and Japanese
is that each KANJI character is a morphographic element that
cannot phonemically be decomposed in the way that an
alphabetic word can be. There are no separate components of
the character R that correspond to the individual phonemes in
oya or shin. Whereas incompatible words from a neighborhood
of shared-body words in English (such as haste and caste) will
often have several phonemes in common, incompatible words
from a neighborhood of shared-character words in Kanji (such
as oyakata and shinrui) will typically have no phonological
elements in common. Furthermore, because each character of
an inconsistent Kanji word is a morpheme (with not only
whole-morpheme pronunciation but also with meaning), in
some sense the English equivalent would be not so much an
example like haste versus caste but rather something more like
English compound words containing heterophonic homographs (e.g., bowtie vs. bow-wow or lead-in vs. lead-free).l We
know of no evidence on consistency effects in such English
compounds, nor indeed is it clear that there are a sufficient number
of compounds containing heterophonic homographs to form the
stimuli for an experiment to investigate such putative effects.
Despite these rather different senses of consistency in
alphabetic writing systems like English and in the morphographic multireadings system of Japanese Kanji, with regard to
the potential impact of inconsistency on word naming, the
theoretical interpretation seems approximately parallel in the
two cases. The fact that inconsistent English words give rise to
longer RTs and higher error rates than consistent words
suggests that subword levels play a significant role in the
computation of word pronunciation. Analogously, if inconsistent Kanji words were to yield longer RTs and higher error
rates than consistent words, this would then suggest that the
subword level of constituent character plays a major role in
computing the pronunciation of written Kanji words.
In this study, therefore, we designed the experiments to
evaluate whether Kanji words with consistent character-tosound correspondences have a naming advantage over Kanji
words with inconsistent correspondences. A significant impact
of consistency should reveal itself either in longer RTs or more
errors (or both) to inconsistent words. A further prediction is
that just as errors to inconsistent words in English typically
take the form of assigning a pronunciation appropriate to other
members of the orthographicneighborhood (e.g.,pint pronounced
to rhyme with hint), a consistency effect on accuracy in Kanji word
naming should produce errors in which inconsistent characters
are assigned an alternative ON- or KUN-reading.
Experiment 1
Method
Participants
Twenty Japanese male and female native speakers (aged between 18
and 41 years), who were brought up in Japan until at least age 18 and
had lived in the United Kingdom
for less than 18 months, served as
participants in this experiment. All had normal or corrected-to-normal
vision. Each participant was paid a small fee for participating.
Apparatus
A Macintosh I1 computer with the software Psychlab (Gum & Bub,
1988) was used to run the experiment. Japanese Kanji stimulus words
were generated by the Macintosh software SweetJAM 4.5 (A & A
Company, Ltd., Tokyo, 1990) installed on the English operating
system. RTs were recorded by the computer via a throat microphone
connected to a voice key. Responses were also tape recorded for later
checking.
Stimuli
Ail stimulus words in this and all subsequent experiments in the
series were composed of characters from approximately the 3,000 most
commonly used Kanji characters and would therefore be known by any
average literate ~ a p a n e s eadult. In Experiment 1, 80 two-character
ON-reading Kanji nouns, all with three-syllable (or rather threemorae) pronunciations, were divided into four different conditions of
20 words each as follows.
Consistent. Each component character of a two-character word has
only a single ON-reading, hence the pronunciations of words in this
condition are consistent.
Intennedlate-1. Either the first or the second character of a
two-character word has a KUN-reading as well as an ON-reading, but
the KUN-readings do not apply to any two-character words. Therefore, the pronunciations of these words are still fairly consistent.
Intermediate-2. Both the first and second characters of a word have
KUN-readings, but these KUN-readings do not occur in other twocharacter words.
Inconsistent. Either one or both component characters have KUNreadings, and these KUN-readings are appropriate to other words
containing the characters)). Hence, the pronunciations of these words
are inconsistent.
In an experiment comparing groups of words (here, differing in
consistency), it is obviously important to balance the different sets of
stimuli for other factors, such as word frequency or familiarity, that
might influence performance. Unfortunately, there are no published
word norms for Japanese Kanji; therefore, values for word frequency
and word familiarity were taken from a corpus of 2,357 Japanese nouns
rated for frequency, familiarity, and imageability on a scale of 1 (very
low) to 7 (very high) that were collected by Wydell (1991; see also
Wydell, Quinlan, & Butterworth, 1995). A pilot study preceding
Experiment 1 established rated familiarity as a better predictor than
rated frequency of naming RT. The same pilot study revealed that, in
addition to effects of whole-word familiarity, naming RT was significantly predicted by individual Kanji character frequency, a measure
for which there are published norms in Gendai Shinbun No Kanji
(National Language Institute, 1976,A Study of Uses ofKanji Characters
in Modem Newspapers). Unfortunately, it is not possible to achieve
perfect matching of component character frequencies across conditions because a character's frequency of occurrence and its ambiguity
of pronunciation are closely correlated in the Japanese language:
Characters with multiple pronunciations appear in many more words
than do characters with consistent pronunciations.
Stimulus words were matched across conditions as closely as
possible for both rated word frequency and rated word familiarity
(mean values for the four conditions of Experiment 1 are given in
Table 1). We made an attempt to match both first and second
character frequencies across the four conditions, although (for the
We are grateful to an anonymous reviewer for raising this point.
T.WYDELL, B. BUTTERWORTH, AND K. PATTERSON
reasons explained above) the matching achieved was far from ideal;
this imbalance was handled with an analysis of covariance (ANCOVA).
Finally, the initial phonemes of the stimulus words were matched as
closely as possible across the four conditions, in which it was not
possible to match at the level of the identical initial phoneme, then at
least matching was achieved within groups of similar phonemes (e.g.,
fricatives, nasals, and so forth).
Procedure
Each trial began with the presentation of a fixation dot for 500 ms.
The fixation dot was then replaced by a target word, which remained
visible until a response was made or a maximum of 3,000 ms had
elapsed. The order of the stimulus words was randomized by the
computer program for each participant. We instructed the participants
to pronounce the target word as quickly and accurately as possible on
its presentation. Naming latencies were recorded by the computer. T.
Wydell sat next to each participant throughout the experiment and
made notes of errors and accidental triggerings of the voice key (e.g.,
throat clearing before the response). Twenty practice trials were given
before the experimental trials. For the analysis, a cutoff point of two
standard deviations above or below the mean for each participant for
that wndition was used to clean the raw RT data.
Results
Table 1 shows mean RTs (in milliseconds) and percentage
of error rates for each condition.
A one-way analysis of variance (ANOVA) with experimental condition as a variable, performed on the RTs correspondTable 1
Word Characteristics, Mean Reaction Times (RTs; in
Milliseconds), and Error Rates (in %) in Kanji Naming With
Four Conditions in Experiment 1 (Straightforward Naming)
and Experiment 2 (Speeded Naming)
Condition
Variable
1
2
3
4
581
77.06
584
8
569
76.42
569
10
Word characteristics
Word frequency
M
SD
Word familiarity
M
SD
Experiment 1
RT
M
SD
Adjusted RT
Error rate
Experiment 2
RT
M
SD
Adjusted RT
Error rate
575
65.34
579
6
584
74.85
585
12
Note. 1 = consistent condition; 2 = Intermediate-1 condition; 3 =
Intermediate-2 condition; 4 = inconsistent condition.
Table 2
Number of Different Types of Errors for the Four Conditions in
Experiments 1 and 2
Experiment 1
Experiment 2
Condition
Condition
Error type
1
2
3
4
1
2
3
4
ON-KUNreading
Startstop
Visual
Semantic
Other
0
7
1
5
1
1
7
1
0
13
0
34
5
0
0
8
17
2
1
6
3
29
6
0
3
23
12
31
41
Total
1
0
3
0
4
2
14
15
1
4
23
2
2
1
25
9
0
5
47
8
Note. 1 = consistent condition: 2 -v Intermediate-1 condition: 3 =
Intermediate-2 wndition; 4 = inconsistent condition.
ing to correct word-naming responses, yielded no significant
difference because of condition, either over subjects, Fi(3, 19)
< 1, or over items, Fz (3,76) < 1.
Analyses of regression and covariance, with whole-word
familiarity and individual character frequencies (log transformed) as variables, revealed only a significant effect of
familiarity: for the regression, ((74) = - 3 . 0 2 , ~ < .01; for the
ANCOVA, ((74) = - 3 . 2 0 , ~< .01. The adjusted mean RTs for
the four conditions (as if the covariate of character frequency
had properly been matched across the four conditions) are
shown in Table 1.
Errors
Error scores were square root transformed after the addition of a constant of 0.1 to each error score and then submitted
to a one-way ANOVA. There was no significant difference
between the four conditions ( F < 1). A priori mean comparisons between the four conditions also revealed no significant
differences between these conditions.
The errors were categorized according to the error types in
Table 2. The error types here (a) alternative ON- or KUNreading (one of the characters was given a KUN-reading or a
different inappropriate ON-reading), (b) start-stop errors (the
participant uttered the first character correctly then stopped
and named the whole word correctly), (c) visual error (a
different word visually similar to the target word was produced), (d) semantic error (a different word semantically
related to the target was produced), and (e) other (including
nonwords). For the consistent condition, there were 10 (out of
23 total errors) visual errors, but one particular item produced
all 10 errors. Similarly, in the consistent group, one word
Sii (a relatively uncommon word, meaning protection) yielded
all five semantic errors; for the inconsistent condition, a
particular item produced 6 out of 15 start-stop errors. There
was no obvious meaningful pattern in the error types.
Discussion
The results of Experiment 1 are straightforward: Wordnaming latency did not vary as a function of the degree of
character-to-sound consistency (as analyzed by both ANOVA
and ANCOVA). The only variable significantly influencing
naming latency was rated word familiarity, with faster RTs to
high- than to low-familiarity Kanji words. Error analyses also
failed to reveal significant consistency effects. If pronunciation
ambiguity of constituent characters were having a marked
impact, then one would expect to see errors for Intermediate-2
and especially inconsistent words, in which participants substitute a KUN-reading for the correct ON-reading. Only one
error of this type occurred in each of these two conditions.
which was decreased by 15-ms steps until the participant reached a
critical exposure duration, defined as 80%-85% correct responses.
Response latencies were recorded by the computer from the onset
of the target. Despite the fact that participants were instructed to try to
"beat" the deadline, all reaction times were included in the analyses,
provided that the words were pronounced correctly. This is the
procedure used in other experiments with speeded naming of either
words (Strain et al., 1995) or pictures (Vitkovitch & Humphreys,
1991). In all other respects, the procedure was identical to that in
Experiment 1.
Experiment 2
Results
One possible reason for our failure to observe consistency
effects in Experiment 1 is that character-to-sound consistency
may have its impact very early in processing; by the time that a
word is uttered-when of course any ambiguity must be
resolved-these effects might be attenuated (Monsell, Doyle,
& Haggard, 1989). In this case, the paradigm of straightforward naming may not ideally be suited to observing consistency
effects. In Experiment 2, therefore, a speeded naming technique was used. Strain, Patterson, and Seidenberg (1995),
using this technique for naming of English words, obtained a
high rate of regularization errors to lower frequency words
with atypical spelling-sound correspondences. They argued
that this may indicate the prominence of correspondences
between subword orthographic and phonological segments in
the direct translation from spelling to sound; these will
especially reveal their strength when the whole-word influence
of meaning-level representations is prevented from having an
impact on the naming process, as in the case of speeded
naming. If the same argument applied to Kanji, then perhaps
speeded naming would produce the kind of ON-KUN confusion errors that we failed to observe in Experiment 1.
Table 1 shows correct mean RTs (in milliseconds) and
percentage of error rates for the four conditions.
Correct RTs were submitted to a one-way ANOVA with
experimental condition as a variable, which yielded a significant difference between conditions over subjects, Fi(l, 19) =
4.10, MSE = 223.48,~< .01, but not over items, F2(3, 76) < 1.
Planned comparisons between conditions over subject RTs
revealed that RTs in the inconsistent condition were significantly faster than those in both intermediate-1, F(1, 57) =
1 0 . 4 0 , ~< .002, and Intermediate-2, F(l,57) = 6 . 6 0 , ~< .012.
Other comparisons were nonsignificant.
As with Experiment 1, ANCOVA with first and second
character frequencies (log transformed), word frequency,
familiarity, and imageability as covariates yielded no significant differences between the four experimental conditions
(F < 1). The ANCOVA-adjusted means for the four conditions are shown in Table 1. A multiple regression analysis with
the same variables showed, as in Experiment 1, a significant
effect only for rated word familiarity, ((73) = - 3 . 2 8 , ~ < .002.
Method
Participants
Twenty Japanese male and female native speakers (aged between 19
and 25 years), who were brought up in Japan until at least age 18 and
had lived in the United Kingdom for less than 18 months, served as
participants in this experiment. All had normal or corrected-to-normal
vision. Each participant was paid a small fee for participating.
Apparatus and Stimuli
The apparatus and stimuli were identical to those used in Experiment 1.
Procedure
Each trial began with the presentation of a fixation cross (for 500
ms), followed by a target word exposed for a duration calibrated
individually for each participant in the course of 50 practice trials.
Participants were asked to try to start naming each target word before
the deadline, as indicated by the disappearance of the stimulus word
from the screen. Target-word exposure durations varied across participants between 316 ms and 449 ms (average was 384 ms). T o enable
settings of the exposure durations, we increased the number of
practice trials from 30 to 50. Practice began with a 600-ms duration,
Errors
A one-way ANOVA on error scores (square root transformed after a constant of 0.1 was added to each error score)
just failed to reach significance, F(3, 19) = 2.46, MSE = 0.215,
p = .07. However, the a priori comparison between the
consistent and inconsistent groups revealed that more errors
were made to words in the inconsistent condition, F(1, 57) =
4.14, p < ,046. Also, Intermediate-1 words produced significantly more errors than consistent words, F ( l , 5 7 ) = 6 . 2 1 , ~<
.015.
As with Experiment 1, errors were categorized according to
the error types in Table 2. By comparison with straightforward
naming in Experiment 1, speeded naming produced a higher
rate of errors, except in the consistent condition. This increase
is almost entirely attributable to one category of response,
start-stop, which in a sense is not a true error. These are cases
in which the participant pronounced the first character (or,
much more rarely, the second character) of a two-character
Kanji word correctly, paused, and then pronounced the whole
word correctly. These responses, which clearly result from the
pressure to start speaking quickly, probably correspond to the
anticipation errors characteristic of speeded naming in English, i n which the participant utters the first phoneme or two
T.WYDELL. B. BUTTERWORTH. AND K PAof the target word, pauses, and then produces the whole word
(see Strain et al., 1995). Speeded naming also yielded a few
more errors, relative to straightforward naming, of the type
labeled visual errors, in which participants produced a different
two-character Kanji word containing the same first (or, much
more rarely, second) character as the target word.
Although the increased incidence of visual errors and
especially start-stop errors under conditions of speeded naming is not without interest, it is not.especially germane to the
primary issue of consistency effects because the likelihood of
these errors did not vary systematically across the four conditions. In particular, the largest number of start-stop errors
occurred in the Intermediate-1 condition. The e r r o r that
would be most indicative of consistency effects are alternative
ON- or KUN-reading errors, in which a participant initially
produced either a KUN-reading (or different ON-reading) of
the first character of a target word, then realized his or her
mistake and pronounced the whole word correctly. As noted
earlier, only two errors of this type occurred in Experiment 1:
one each in the Intermediate-2 and inconsistent groups. Under
conditions of speeded naming, the corresponding numbers of
ON-KUN errors for the Intermediate-2 and inconsistent
conditions were five and three. Although this is still an almost
vanishingly small number of such errors, the fact that they
occurred at all, and more under conditions of speeded naming,
provides the first hint of character-level processing in this
series of experiments.
Discusswn
Mean latencies in Experiment 2 were on average 82 ms
shorter than those in Experiment 1, suggesting the general
effectiveness of the speeded naming instructions. Initial analyses of Experiment 2 suggested that the inconsistent words were
named more quickly than the other types of words (Intermediate-1, -2, and consistent words). However, this difference
disappeared under ANCOVA. In other words, once again,
naming latencies of two-character Kanji words did not differ
significantly as a function of degree of consistency of their
component characters. Significantly more errors were made to
the inconsistent words than to the consistent words; however,
this apparent small consistency effect itself was not very
consistent because the error rate in the inconsistent group was
very similar to that for the fairly consistent words of the
Intermediate-1 condition. Thus, with the exception of a very
few ON-KUN confusion errors in the Intermediate-2 and
inconsistent conditions, both latency and error analyses of
speeded naming of Kanji words have once again failed to
reveal any prominent character-to-sound consistency effects.
Experiment 3
In Experiments 1 and 2, word familiarity ranged over a
relatively wide spectrum; apart from the best possible matching of familiarity across conditions, however, we made no
attempt to manipulate these variables. Consistency effects in
English (e.g., Jared et al., 1990), and also in other writing
systems such as Italian (Colombo, 1992) and Chinese (Hue,
1992), are more prominent for lower frequency words; we
could thus argue W.96ayfll)~sisweffectsin k n j i will have
been diluted by. u&tf^~fdsets ranging ftOtn low to high
& r e f ~ ' ~ ^ Ã § Q pfamiliarity was
frequency. fa
treated as a
~fbdhemore, given our
even between
previous fad
inconsistent
the most extram
more subtle
words, much less
3 liemanipulation of
gradations of
consistency was çM
mous classification. If there is
w'@p%&b
Eleven Japanese male d f q p a l e native t o f f ~ t
(-aged
~ kn.18
and 46 years), who wre b r d t IQ inJapsai.+l
at W a g e l&&
had lived in the United Kingdom tott thy 20 umflls, s e e as
participants in this experiment. AN h a d d o r c u k t e d - t & e l
vision. Each participant.~~-pafel
a small fee fbrpafikbaling.
Apparatus
The apparatus was identic&*.-
Stimuli
*
-*
4
I
There were 0 0 N W - ~ y ^
consistent oronunciationa C a t
ments 1 and 2) and
inconsistent group in the p*$
each consistency group
lb&k
ity: abo\ie 4.45 on i sia l e
(low fanniliarity: be10w 3
in each consistency--fat
tions, and half four-moraci WoQlfan
matched, pairwise, between
Procedure
Presentation of sti
that is, participants n
all of the inconsistent
with consistent words,
presentation such that emh p
order. The rest of the procedwrr
Table 3 shows mean RTs for correct responses (in rniUiseconds percentage of error rates) par condition.
READING JAPANESE KANJI
RTs were submitted to a two-way ANOVA, with independent variables of consistency and familiarity, and with repeated measures over subjects and items. The results showed a
significant main effect of familiarity: Fi(l, 10) = 47.90, MSE =
910, p < .001; F2(l, 76) = 21.16, MSE = 4,187, p < .001.
Although there was a small RT advantage of consistent over
inconsistent words, neither the main effect of consistency nor
the interaction between familiarity and consistency were
significant in either subject or item analyses (all Fs < 1).
Table 3
Mean Reaction Times (in Milliseconds) and Error Rates (in %)
in Kanji Naming With Blocked Stimuli Presentation
(Consistency and Familiarity) in Experiment 3
Variable
Familiarity
M
SD
Consistency
Note.
Error type
ON-KUN reading
Start-Stop
Visual
Semantic
Other
Note.
Error scores were square root transformed after addition of
a constant of 0.1 and were submitted to a two-way ANOVA
with the same variables as the RT analyses. Subject analyses
showed that the main effects of consistency, Fl(l, 10) = 7.84,
MSE = 0.040, p < ,018, and familiarity, Fi(l, 10) = 74.64,
MSE = .008,p < .001, as well as the interaction between
consistency and familiarity, F1(l, 10) = 14.41, MSE = 0.011,
p < .003, were all significant. More errors were made with
inconsistent than with consistent words and with lowfamiliarity than high-familiarity words. Further, a priori mean
comparison tests revealed that the effect of consistency was
significant only for the low-familiar words and that there was
no significant difference between consistent and inconsistent
high-familiarity words. Item analyses, however, showed only a
significant effect of familiarity, F,(l, 76) = 6,31, MSE = 0.002,
p < .01; the consistency effect was only approaching significance, F l ( l , 76) = 3.40, p = .069. The interaction between
consistency and familiarity was not significant (F < 1).
Errors were categorized according to the usual scheme and
are shown in Table 4. Here it can be seen that, despite the
higher error rate for inconsistent than for consistent words, (a)
the actual number of errors was still very low, (b) half of the
errors on inconsistent words (9 out of 17) were of the
start-stop variety that-as indicated before-are not exactly
incorrect responses, and (c) there were once again rather few
of the alternative ON- or KUN-reading errors that would be
most indicative of character-level consistency effects,
ER
Experiment 3
Condition
Total
Errors
M
SD
Table 4
Number of Different Types of Errors for the Two Conditions in
Experiments 3 and 4
Consistent
High
Low
familiarity familiarity
Inconsistent
Low
familiarity familiarity
High
4.95
0.32
3.05
0.46
5.10
0.46
3.20
0.29
640
92.73
0.5
700
102.37
2
646
83.72
0.9
713
106.40
7
ER = error rate.
Experiment 4
Condition
1
4
1
4
0
4
1
0
1
5
9
0
1
2
7
1
1
8
0
4
2
2
2
6
17
17
1
11
1 = consistent condition;4 = inconsistent condition.
Discussion
In Experiment 3, naming latency for two-character Kanji
nouns was significantly affected by word familiarity but not by
pronunciation consistency of component characters; there was
also no significant Familiarity x Consistency interaction.
Virtually all of the relatively few errors occurred on lower
familiarity words; although more errors were made on inconsistent than on consistent words, the ANOVA produced a
significant main effect of consistency and a significant Familiarity x Consistency interaction for errors over subjects but not
over items. Once again, then, we have failed to uncover any R T
effects of character-sound consistency in naming Kanji words
and have detected only a tiny trend toward such effects in the
accuracy data.
Experiment 4
In Experiment 4, pronounceable two-character Kanji nonwords were included along with the word stimuli used in
Experiment 3. These nonwords have unambiguous pronunciations because they were constructed by combining into nonlexical two-character strings characters from consistent words in
which constituent characters have only one ON-reading each.
Like the frequency and blocking manipulations of Experiment
3, we hypothesized that the inclusion of nonwords in Experiment 4 might coaxout the consistency effects that we have thus
far failed to observe. The basis for this hypothesis is that
because the Kanji nonwords are unfamiliar (indeed, illegitimate) combinations of characters, the only way in which
participants can pronounce the nonwords is to process them at
the level of the individual component characters. If this
character-by-character reading for the nonword stimuli were
to induce a similar strategy for the words, then-because both
word and character levels would yield only one possible
pronunciation for the consistent words but could yield conflicting pronunciations for the inconsistent words-wnsistency
effects in Kanji reading might emerge. The parallel to this
predicted phenomenon in English has been demonstrated by
Monsell, Patterson, Graham, Hughes, and Milroy (1992), who
found a significantly higher error rate (particularly regularization errors) to exceptional inconsistent English words when
nonwords were mixed in with the words.
T. WYDELL, B. BUTTERWORTH, AND K. PATTERSON
inconsistent words was not significant. Item analyses were not
carried out because the number of stimulus items differed
between conditions, consistent and inconsistent words (40
each), and nonwords (80).
Errors. Error scores (square root transformed after a
constant of 0.1 was added) were submitted to a one-way
ANOVA As with the RT analyses, the main effect of stimulus
type was significant, Fl(2, 9) = 44.50, MSE = 0.105, p < .001.
A priori mean comparisons between different stimulus types
revealed that there were significant differences between wnsistent words and nonwords, F,<l, 18) = 54.06, p < .001, and
between inconsistent words and nonwords, F2(l, 18) = 77.55,
p < .001. However, the difference between consistent and
inconsistent words was m t significant (F < 1).
Method
Participants
Ten Japanese male and female native speakers (aged between 18
and 38 years), who were brought up in Japan until at least age 18 and
had lived in the United Kingdom for less than 18 months, served as
participants in this experiment. All had normal or corrected-to-normal
vision. Each participant was paid a small fee for participating.
Apparatus
The apparatus was the same as in Experiment 1.
Stimuli
The stimulus list consisted of 80 Kanji words and 80 nonwords. The
word stimuli were the same as those used in Experiment 3 (40
consistent and 40 inconsistent words, further divided into 20 each of
high- and low-familiarity words). The nonwords were constructed from
two-character Kanji words in which each constituent character has
only a single ON-reading. One character from such a word was
combined with another character from a different word, resulting in
two-character Kanji nonwords that were unambiguously pronounceable but meaningless. The words used to construct the nonwords items
were different from those used as word stimuli, thus preventing any
intralist repetition of characters.
Analyses Without Nonwur&
Further analyses were catricd out on word stimuli only, with
word type (consistent or inconsistent) end familiarity (either
high o r low) as variables.
RTs. Mean RTs for correct ~KSOCS~SKSwere submitted to a
two-way ANOVA. There WB a significant main effect of
familiarity, with shorter RTs to high- than to low-familiarity
words, Fi(l, 9) = 85.W,MSE = 95&,p < ,001. There was also a
significant main effect of wnsatQiKf,f+(l, 9) 7.80, MSE =
555,p < .02, but in the oppoule d i h to (bat predicted,
with faster RTs to inconsbeat h to mz!&ent d s . [tern
analyses, however, revealed thaf only fartillarity was significant, Fz(l, 76) = 26, MSE = 6,531,~< .001, and neither the
main effect of consistency nor the interaction between familiarity and consistency were reliable (Fs < 1).
As with Experiment 1, we carried out a n ANCOVA to
estimate the differences between consistent and inconsistent
words, had the variable of character frequency been properly
matched between conditions. Character frequencies might be
especially important in t h p experiment because the Kanji word
stimuli were mixed with K@ mnwords, which can only b e
named character-by-charader. The adjusted means were 740
ms for the consistent words and 735 ms for the inconsistent
words, a nonsignificant difference (F < 1).
Errors. Error scores (transformed in the usual way) were
submitted to a two-way ANOVA with word type and familiarity as variables. The results showed only a significant main
effect of familiarity: F i ( l , 9) = 14.65, MSE = 0.041, p < .004;
Fz(l, 76) = 4.73, MSE =Â 0.177.p < .03 (all other F s < 1).
More errors were made with low-familiarity words than with
high-familiarity words.
Procedure
All stimulus words and nonwords were randomized by the computer
so that each participant saw a different randomization order of the
stimuli. The rest of the procedure was the same as in Experiments 1-3
by using a straightforward naming technique.
Results
Mean RTs for correct responses (in milliseconds) and
percentage of error rates per condition are shown in Table 5.
Analyses With Norwards
RTs. Mean RTs for correct responses were submitted to a
one-way ANOVA. The main effect of stimulus type was
significant, Fi(2, 9) = 102.10, MSE = 1 , 0 5 5 , ~< .001. A priori
mean comparisons between three different stimulus types
revealed that there were significant differences between consistent words and nonwords, F i ( l , 18) = 134.30, p < .001, and
between inconsistent words and nonwords, Fz(l, 18) = 170.16,
p < .001. However, the difference between consistent and
Table 5
Mean Reaction Times (RTs; in Milliseconds) and Error Rates (ERs; in %) in Kanji Naming of
Words With Nonwords in Experiment 4
High familiarity
Condition
M
SD
ER
Words
Consistent
698
81.15
2
Inconsistent
685
75.12
2
Nonwords
Note. Adj. = adjusted.
Low familiarity
Total mean
M
SD
ER
RT
%
795
767
102.50
88.69
7
4
745
725
914
5
3
10
Adj.
M
740
735
1
READING JAPANESE KANJI
-tabulated
according to type and are
Table 4.Asin most of the preceding experiments#there were few errprs in no discernible pattern and
essentially qdtem+ivc ON- or KUN-reading errors even on
low-frequency inconsistent words.
'*-f
J-^
b-
Â¥':
and doubk-memy words. If alternative pronunciations of a
diameter axe computed, and the correct pronunciation must
be chosen amongst these alternatives, in the course of naming
a Kanji word,then the double-enemy words should perhaps
yield longer RTs or higher error rates than the single-enemy
words.
Discussion
experiments on word-nonword naming in English,
therewas an advantage in both speed and accuracy for naming
of Kanji real words compared with Kanji nonwords. In fact,the
RT difference between words and nonwords (an average of
about 180ins)was extremely large compared with the lexicality
advantage typically observed in Englishnaming studies, a point
that will be taken up in the General Discussion section.
Relative to previous experiments in this series (especially
Experiment 3, in which we used exactly the samewoxdft), the
presence of nonwords also seems to have slowed the latencies
for words: The mean RT for the words in Experiment 4 was
735 ins, compared with 676 ms in Experiment 3 (though of
course it must be recalled that the two exoeriments had
different individualsas participants). This apparent RT difference might be an indication that participants shifted their
attention more toward character-by-characterreading lather
than whole-word processing; however, as this slewing was not
accompanied by an advantage for consistent over inconsistent
words, nor by a ~amiliarityx Consistency interaction for
either accuracy or RT, it certainty does not provide any
convincing indication of character-level influences in Kanji
word naming. The manipulation of mixing nowords intoa list
containing inconsistent Kanji words has failed to produce as
effect parallel to that seen in English, in which the addition of
nonwords yields a significantly augmented rate of regularization errors to exception-inconsistentwords (Monsell et al.,
1992), an effectinterpreted in terms of processing at subword
levels.
In all four experiments presented thus far, the critical
stimulus materials havebeen ON-reading Kanji words, composed either of characters with a single unambiguous pronunciation or of charactersthat+ other mtextsÑca take one
or more alternative pronunciations.As explained toward the
end of the introduction, there
reasons for concentrating on ON-reading words; however, as explained there
also, it is just possible that the ON-readingof an inconsistent
character in a multicharacter wordis in some sense die more
dominant pronunciation, making the impactof alternative
but less dominant pronunciation hard todbemq.
Experiments 5 and 6, therefore, we turn our at$&h
to words with
KUN-ieadings.
I&@
In
Experiment 5
In this experiment, single-character KUN-reading words
were used as stimuli. Our rationale was to see whether consistency effects might be revealed by a manipulation of the
number of pronunciation enemies possessed by a given character. The KUN-reading one-character words selected for this
experiment had either a single alternative ON-reading or two
ON-readings:these two word classeswere labeled singlememy
Method.
Participants
Twenty Japanese male and female native speakers(aged between 19
and 43 years), who were brought up in Japan until at least age 18 and
had lived in the United Kingdom for less than 18 months, served as
participants in this experiment All had normal or conectcd-to-nonnal
vision. Each participant was paid a small fee for participating.
The apparatus was the same as in Experiments 1-4.
Stimuli
There were 42 KUN-reading single Kanji nouns: half with single
alternative ON-readings (the single-enemy condition) and half with
two O N - r d b g s (the double-enemy condition). Words from the two
conditions were matched as closely as possible on on item-by-item
basis for initial phoneme, number of morae (ranging between two and
four), rated word frequency, familiarity,and imageability (meanword
frequency, familiarity, and imageability are 3.76, 4.64, and 6.25 for
single-enemyand 3.76,4.98, and 6.27 for double-enemy, respectively).
It was not possible to match the words for character frequency
because, as explained earlier, there is an inevitable confounding in
which characters with multiple pronunciations occur more commonly
than those with few or only one pronunciation.
Procedure
The procedure was straightforward word naming as in Experiment 1.
Results
Table 6 shows mean RTs (in milliseconds) for correct
responses and percentage of error rates for the two conditions.
A one-way ANOVA on mean RTs for correct responses
showed that the difference between conditionswas approach-
Table 6
Mean Reaction Times (in Milliseconds) and Error Rates (ERs; in
9%) in K i i Naming WithSingle Versus Doubk Enemies
inExpemMut5
Correct response
Condition
Inconsistent
Single enemy
Double enemy
Note. Adj. = adjusted.
M
SD
Adj. M
ER
649
56.34
59.93
632
634
5
3
634
tter is almost always
ing significance over subjects, Fi(l,19) = 4.15, MSE = 571,p =
.055, but in the opposite direction to that predicted and was
not significant over items, F i ( l , 40) = 1.59, MSE = 1 , 9 9 1 , ~=
21.
Because the character frequency match between the two
conditions was far from perfect, an ANCOVA was carried out
with character frequency as covariate; however, this also failed
to reveal any difference between conditions, F(1, 36) = 2.63,
MSE = 1,628, p = .11. The ANCOVA-adjusted means for the
two conditions are shown in Table 6, as are the error rates that
were again in the opposite direction to that predicted by a
hypothesis that number of enemies should have an adverse
effect on naming.
Errors
Neither subject nor item analyses on errors were significant
(Fs < 1). The number of errors was too small to warrant
classification into different types.
Discussion
The result of this experiment was quite clear: There was no
difference in RT or error rate between KUN-reading onecharacter Kanji words with a single ON-reading enemy and
those with two ON-reading enemies. Once again, it appears
that character-to-pronunciation ambiguity in Kanji word naming has little effect on naming. There is, however, one final line
of investigation to be explored, in a comparison of ON- and
KUN-reading two-character words, which formed the content
of Experiment 6.
Experiment 6
Experiments 1-4 compared, under various conditions, the
naming of consistent and inconsistent two-character ONreading words. Experiment 5 compared single-enemy and
double-enemy one-character KUN-reading words. None of
these experiments has involved a contrast between ON- and
KUN-reading words. If both readings of an inconsistent
character are activated at some stage in the process of
computing the pronunciation of a word containing it, then a
contrast between words with ON- and KUN-readings might at
long last provide some evidence of consistency effects in Kanji.
For this final experiment, the stimulus words selected were
pairs of two-character words sharing the identical first character, in which one word of each pair takes an ON-reading and
the other a KUN-reading. W e call this a final hunt for
consistency effects because, of course, any word composed of
characters with both ON- and KUN-readings must by definition be an inconsistent word. That is, w e did not compare
consistent and inconsistent words but rather two groups of
inconsistent words; the contrast is therefore something more
like that between regular inconsistent and exception inconsistent words in English or perhaps like exception words characterized as having different degrees of irregularity in English
(Shallice et a]., 1983). As expressed in the general introduction, it is not even entirely clear whether it is the ON- or the
KUN-reading of a character with both readings that should be
not possible to do the
token frequency of frie
of the rnulticharacter worth c
however, one can calculate
applied to the present set of stimu
mean number of two-character wor
nent character that has ON- versus
substantially more ON- than KU
these characters, then despite the
token frequencies, it seems
reading words of the matched pairs woaid had.,<fV9WfS^,.k
higher summed-frequency friends than enentte,."')tte'e&'tte
reverse would be true for the KUN-reading stiinlflil^tfaOte.
This is indeed the =for type frequency: With respUr-to#e
first (matched) c h e w e r t i the 26 pairs of words d i n tkb
experiment, there arc.to average 35.8 two-charactaf: TEdr-ds
containing each of tbeae'charktersthat have ON-r~a&qp.bnt
only 6.8 words with.m-i-dtdipgs. There is a similar hbdance favoring ON-*
m<hesecond characten of our
stimuli.
Participants
Twenty male and female Japanese speakers (aged between 19 and
25 years), who were brought up in Japan until at least age 18 and had
lived in the United Kiagdorn f
a k s &kn 18 months, served as
participants in this experiment. All hadnodortiotrected-to-normal
vision. Each participant was paid a small f& for+mticipating.
Apparatus
T h e apparatus was the same as ia Experiments 1-5-
Stimuli
There were 52 two-character Kanji nouns: half of which were
ON-reading words and the other half were KUN-reading words. These
52 words consisted of 26 word pairs with the identical first character in
common; an example is given below:
t
READING JAPANESE KANJI
ON-reading stimulus word: f91i souba (meaning stock market)
KUN-reading stimulus word: ffl+ aite (meaning opponent).
Both stimulus words have the first character fl in commn. Each pair
k given
of words had the same number of morae (three in the m
above, so-u-ba and a-i-te).The words in each pair are, by definition,
perfectly matched for first-character frequency. Also by doflnlticm, the
words in each pair ace unmatched and unmatchabk for initial phoneme
because thesamefirst character has two differentpronuiKiatiorB(Hthetwo
wordsof each pair. Items in the two conditions were matched as closely
as possible on whole-word rated familiarity (4.5vs. 4.4).
Eqh participant named both ON- and KUN-teadingwords,but the
52 words were divided into two stimulus lists soch that any given
participant named either (he ON- or the KUN-reading word of each
pair but not both. For example, if a participant receivedsouba (from
the example indicated above) as a target word, then this participant
did not see aite. We intended for thts design feature to amid any
possible effect, either positive or negative, of naming two wordswith
the same first character but two different pronunciations. Therefore,
each individual word was named by 10 participant*,and each-paitiupant saw 26 items.
Discussion
This experiment produced essentially identical patterns of
performance for ON- and KUN-reading inconsistent nouns
matched for initial character. If there were a major role for
character-level computation of phonology in Kanji, one would
expect the inconsistent words with more typical component
pronunciations to have an advantage in speed, accuracy, o r
both, of naming. The ON-readings of the constituent characters used in this experiment have substantially higher type
frequencies (so much greater that they certainly also have higher
summed frequencies) and so must be considered more typical; yet
there was absolutely no advantage for the ON- over the KUNreadingwords.Thispattern givesno support to a role for characterlevel translationfrom orthography to phonology in Kanji.
General Discussion
Procedure
The procedure was the same as in Experiment 1, exospt U t h e
participantswere given 30 practice trials before the wrimcntaltriab.
m
Results
Table 7 shows correct mean RTs (in ntiUisecondsJ and
percentage of error rates for the two conditions.
R Ts
A one-way ANOVA on mean RTs for correct responses
showed no reliable differences between the two conditions:
Fi(1, 19) = 2.20, MSE = 880,p = .15; F-, < 1.
A multiple regression analysis revealed thai thà nariable of
rated word familiarity was at borderline sigiuficaace, ((44) =
-1.99, p = .053, and that no other variable approached
significance. An ANCOVA with second-character frequency,
word frequency, familiarity, and imageability as covariates
produced adjusted means for the two conditions that were
virtually identical: 646 ms for the KUN-readingcondition and
642 BIS for the ON-readingcondition.
Errors
A one-way ANOVA on error scores (transformed in the
usual way)also showed no reliable differences between the two
SO) =
conditions: Fi(l, 19) = 2.51, MSE = 0 . 2 2 4 , ~= .lm(l,
Table 7
Mean Reaction Times (in Milliseconds) and Error Rates (ERs; in
%) in ON-Reading Versus Kirn-Reading K
w
i Naming
inExperiment6
Condition
l.O,MSE = 0 . 2 6 , ~= .31. In any case, the nonsignificant numerical
difference in errors was in the direction opposite to our prediction.
There were too few errors for an informative classification.
M
ON-rettdmfc
636
KUN-readin8
650
Note. Ad). = adjusted.
Correct ratponse
Adj. M
81.64
642
SD
86.68
646
% ER
7
4
A simplified but reasonably accurate summary of the six
experiments reported in this article is that we observed no
significant character-to-pronunciation consistency effects in
Kanji word naming. TTlis was true (a) for both RT and accuracy
measures of naming performance; (b) for ON- and KUNreading two-character words and one-character KUN-reading
words; (c) for lower frequency words as well as common words;
and (d) even under several manipulations such as speeded
naming, and the inclusion of nonwords, that have been shown
to enhance consistency effects in English. There were a few
tantalizing hints of the consistency effects that we had expected to observe: In Experiment 3, for example, with blocked
presentation of consistent and inconsistent words and with a
dichotomized manipulation of target-word familiarity, there
was at least a numerical RT disadvantage for the low-familiarity
inconsistent words. However, this "effect" failed to reach significance either as a main effect of consistency or as a Familiarity X
Consistency interaction; the only statistically reliable effect on RT
in this as in other experimentswas familiarity.
These results in Kanji differ from those for English discussed in the introduction (e.g., Glushko, 1979; Jared et al.,
1990, and many others). In computation of phonology for an
English target word, the degree of consistency of pronunciation across words sharing orthographic segments with the
target influences the efficiency of the computation. Moreover,
this impact is appropriately "graded": Inconsistent words with
many high-frequency pronunciation enemies suffer more than
words with few or only uncommon enemies with many friends
to support their particular spelling-to-sound relationships. In
Kanji, however, whether each constituent character has one or
more other pronunciations, and also whether the alternative
pronunciations apply to other two-character Kanji words, had
no significant impact on naming latencies for the stimulus
words. As noteworthy as the absence of an RT effect was the
lack of word-naming errors that would, according to our
predictions, have been a clear signature of consistency effects.
In English word-naming experiments, the difference between
regular and 'irregular' words often reveals itself in error rates
T.WYDELL, B. BUTTERWORTH. AND K. PATTERSON
as well as in RTs. The most common error to an irregular
English word is the assignment of an alternative, more cornmon pronunciation of the inconsistent segment (typically the
body or rime); and the likelihood of such regularization errors
can be augmented by manipulations such as speeded naming
instructions (Strain et al., 1995) or inclusion of nonwords in the
stimulus set (Midgley-West, 1979; Monsell et al., 1992). In
contrast, our experiments yielded almost no Kanji naming
errors that would be the equivalent of assigning an alternative
pronunciation to an inconsistent segment, that is, alternative
ON-KUN reading errors. Across all of the stimulus materials
in all of the experiments that would permit ON-KUN substitutions, such errors accounted for fewer than 1% of responses,
whereas typical rates of regularization errors to low-frequency
inconsistent English words were in the range of 6%-15% (see
for example, Glushko, 1979; Paap & Noel, 1991).
The lack of significant consistency effects for Kanji is
different not only from English but from word-naming data in
other languages as well, for example, Italian (e.g., Colombo,
1992) and, more strikingly, Chinese (Hue, 1992; Yin, Butterworth, & Patterson, 1995). Most Chinese characters are
phonograms with two components: the radical, which represents a broad semantic field (i.e., a clue to the meaning of the
character), and the phonetic component, which typically provides a clue to the pronunciation of the character. Within these
phonograms, one can distinguish-somewhat as in Englishvariables of bath regularity (or congruency) and consistency.
Regular or congruent characters are those for which the
pronunciation of the character as a whole corresponds exactly
to the pronunciation of the phonetic component; for irregular
or incongruent characters, the pronunciation of the whole
character and of its phonetic component do not match.
Consistency, as in other writing systems, refers more to the
characteristics of pronunciation across an orthographic neighborhood. Consistent characters are those for which all characters containing a particular phonetic component have the same
pronunciation; for inconsistent characters, one or more exemplars sharing the same phonetic wmponent have discrepant
pronunciations. There is a further group of characters, sometimes called unique,that do not contain phonetic stems.
Hue (1992, Experiment 2) demonstrated that, within the set
of low-frequency experimental stimuli, regular (congruent)
characters were named more rapidly and more accurately than
either irregular or unique characters. Even more germane to
the present article, both Hue (1992, Experiment 3) and Yin et
al. (1994) obtained marked effects of consistency,with inoonsistent characters yielding slower RTs and more e m than
consistent characters. The errors were mainly pronunciations
appropriate to a different character containing the same
radical. Note that this is a true neighborhood effect-' hi both
experiments, the correct pronunciation of an inconsistent
character was always congruent with that of its phonetic
component, so there was no conflicting alternative pronunciation within the character itself; the conflict is. attribtitfibte
solely to the fact that other characters containing that same
phonetic component have different pronunciations. In other
words, this is an approximate equivalent of a disadvantage for
the naming of regular inconsistent words in English (o~i.,-bone.
which is regular but has the pronunciation enemies done and
eons). The fact that these consistency effects were not only
statistically reliable but of substantial magnitude in both
Chinese eiperiments m$ks the contrast with our nonresults
for Japanese KaG WEJXET noteworthy and surprising. It
must be remembered, howBaer* that characteristics of the
spelLinpmmd ~ l a t ~ I ib j m
i are different not only
from alphabetic writing system but also from Chinese. For
example, although therern1>9
pronunciation ambiguity in
Chinese characterse-tSKT wl6m a character (if the whole
character and its PhOttsrifrfqnpBBent are incongruent) or
across neighborhamla oCditfX^ereaiming a phonetic component, each whole charJ^ter,tenBty one ooxrect pronunciation
that is invariant acm* thb KÇ.Çesatxyw contexts in which
the character may occur.
It should be noted that, hoiH&ask.mtftc nnillim~taicKanji
words used here, the m a w m m c y effects
- ~.
taimli. possibly
in English have used
because consistency to^^iftfaefiçtt4E&iÈct-à respect to
"body" neighborhoods, w l w ^ G ÇDe ç^eraimanipulat
%.
'
'
al., 1992; Strain et al., 1^15)4Çd,tetfa -(
1992). We
acknowledge, however, thato~riaa3{isyUah(c
,.
words
susceptibility to co
to establish in this
theorists have spec
named by a different set
tal evidence including
of the same words (Sa
assert that the difference
but rather in the details of the
codes are computed (see Perfet
similar conclusion).
The obvious interpretation is that -readers
languages learn orthographic-to-
of other
computations
4
We are grateful to an anonywow
:A . .
IT
m
this point:
b
"
READING JAPANESE KANJI
that ficxolfttoe over commonly recurring subword patterns,
readers of Japanese Kanji learn that the only reliable level of
generalization is the whole word, As emphasized throughout
flnsntide, the correct pronunciation of a subword component
to Kanji (the individual character) is highly dependent on
intraword context.Of course, to some extent, the same 3 true
in IsDflliah;but two factors about English (and other alphabetic
scripts) might still encourage generalization at the level of
u~bwordanits. The first is that, for most of the incoasisttot
spelling patterns in English, there is one dominant s p c l b
sound correspondence. Setting weights on connections totah
account of the dominant pattern may hurt the occasional&wd
(pronunciation of caste will be slowed or even in-tei
because the participant computes the pronunciation '*kaBt" ai
well as "cast" as the result of the strength of the c o p p ~ f t :
dence between aste and "aist" across the family of wqrA
including haste, waste, etc.); however, this organ&&m
MI,
across the vocabulary as a whole, provtde more benefif'dfB
harm simply because the dominant s p e l l m g - s o u a d s e s a ~
dence does most often apply. Across single- and
character words, neither the ON nor the KUN pronnneiatfon
of a Kanji character can really be considered' dominant;
therefore, perhaps there would be no comparable beneflr to
subword generalization. The second factor is oneofteneiHphasized in discussions about subword levels of wmputa&ji &I
English (see, for example, Norris & Brown, 1985); The great
majority of irregular English words deviate from a more
typical, rule-governed pronunciation only in one à § two p b
nemes, typically the vowel (e.g., the regularized pomnciations of caste andpint are only one phoneme different f i ' t h c
correct pronunciations). If much, even though not all, ofwimt
the reader computes at the subword level is applica.Weto the
final pronunciation, then once again it may be more. helpful
than harmful. By contrast, the two (or more) <afternative
pronunciations of a Kanji character typically have nophonological elements in common (cf. the ON- and KUN-readingsi~fthe
character <S ,which are "oya" and "shin"). In this case, it may
be more harmful than helpful to compute both pronunciationsof a
component character on the path to a phondotipa^tep~esentation
for the whole word, which is the ultimate goal of thiscomputation.
The fact that several of the naming experiments reported in
this article produced reliable effects of individual-character
frequency, with faster naming latencies to wordt containing
more commonly encountered Kanji characters, might teem to
be at odds with our conclusion that the coinputatiDn of
phonology from orthography is dominated by the word level.
We do not, however, think that this result is in conflict with our
interpretation for the simple reason that there is another
plausible locus of the character frequency effect at the stage of
orthographic analysis. There is only a small number of alphabetic characters (26 in English, even fewer in Italian, for
example), all of which are relatively simple visual forms
and-despite differences in frequency of occurrence (compare
s and z, for example)-every one of which would be encountered in reading almost any text. By contrast, there are
approximately 3,000 commonly used Kanji characters, some
with rather complex visual forms. On the reasonable assumption that the efficiency of early orthographic analysis of a Kanji
word will be modulated by the familiarity and complexity of its
w-
characters as orthographic patterns, the character frequency
effect need have no direct implications for a model of phonological processingof Kanji words.
On& aspect of our results that seems to fit the proposal of a
mainly whole-word basis for computation of Kanji phonology is
the massive lexicality advantage for RTs observed in Experiment 4. It would of course be impossible to offer a single value
for a lexicality effect in experiments with English readers, as
the effectwill clearly vary as a function of the characteristics of
the words, the nonwords, the reading skill of the participants,
and so forth. Several studies in the literature, however, in
which English words and matched nonwords were named in
mixedlists (as in the design of Experiment 4 for Kanji) suggest
a typical R T difference in the region of 30-50 ms (see, for
example, Glushko, 1979; Monsell et al., 1992). In our Experimfint 4, the corresponding difference was about 180 a s . It is
also pertinent that latencies to name English nonwords are
predicted by the degree of pronunciation ambiguity: Seidenberg, PUut, Petersen, McClelland, and McRae (in press) have
demonstrated that nonwords like zak andpi& which are given
one consistent pronunciation by all participants, yield significantly faster RTs than even the most common pronunciation
of nowords given two o r three different readings by various
participants. The Kanji nonwords in Experiment 4, composed
of characters with only a single ON-reading, had unambiguous
pronunciations and yet yielded much slower naming RTs than
comparable words. This result can be interpreted as evidence
that computing the phonology of each component character
independently, as is required to name a Kanji nonword, plays
relatively little role in translating familiar Kanji words from
orthography to phonology.
The idea that processing in Kanji relies primarily on a
whole-word level finds some general support in a study by
Morton, Sasanuma, Patterson, and Sakuma (1992), who investigated recognition units for Kanji words by using a different
paradigm to the current naming experiments. In that study,
target words to be identified tachistoscopicaliy were preceded,
about half an hour earlier and in a different task, by words in
one of several critical conditions: (a) an identity prime (the
prime and target words were identical), (b) a single-character
prime (the target was a two-character word, and the prime was
one of its component characters), and (c) a two-character
prime (in which the target was either a single-character word
corresponding to one of the components of the prime or a
different two-character word sharing a character with the
prime). Relative to appropriate control conditions, Morton et
al. found a strong identity priming effect but no crossfacilitation between compound words sharing a single character or from a single character to a compound word that
contained it or from compound words to their component
characters. From this pattern of data, Morton et al. (1992)
concluded that Kanji words are recognized at the word rather than
the character level.
The transcoding between phonology and meaning must be
dominated by whole-word level representations in all languages because, except for morphological variations, there is
no useful level of subword generalization in the correspondences between phonology and meaning: Words that sound
similar do not have similar meanings. Although we had not
T. WYDELL, B. BUTTERWORTH, AND K'PA7TCRw
predicted these results, our findings for the computation
between orthography and phonology in Kanji suggest a similar
picture to that between phonology and meaning. In English
and other alphabetic writing systems, words that are orthographically similar almost always have similar pronunciations; this is not
true in Kanji, and the lack of consistency effects in Kanji word
naming may be a logical consequence of this difference.
References
Andrews, S. (1982). Phonological receding: Is the regularity effect
consistent? Memory & Cognition, 10,565-575.
Andrews, S. (1989). Frequency and neighborhood effects on lexical
access: Activation or search? Journal of Experimental Psychology:
Learning, Memory, and Cognition, 15, 802-814.
Andrews, S. (1992). Frequency and neighborhood effects on lexical
access: Lexical similarityor orthographic redundancy?Journal of Experimental Psychology: Learning Memory, and Cognition, 18,234-254.
Brown, G. D. A. (1987). Resolving inconsistency: A computational
model of word naming. Journal of Memory and Language, 26, 1-23.
Colombo, L. (1992). Lexical stress effects and its interaction with
frequency in word pronunciation. Journal of Experimental Psychology: Human Perception and Performance, 18, 987-1003.
Coltheart, M. (1978). Lexical access in simple reading tasks. In G.
Underwood (Ed.), Strategies of human information processing (pp.
151-215). London: Academic Press.
Glushko, R. J. (1979). The organization and activation of orthographic
knowledge in reading aloud. Journal of Experimental Psychology:
Human Perception and Performance, 5, 674-69 1.
Gum, T., & Bub, D. (1988). PsychLab software. Montreal, Quebec,
Canada: Montreal Neurological Institute.
Hue, C. W. (1992). Recognition processes in character naming. In
H.-C. Chen & Ovid J. L. Tzeng (Eds.), Language processing in
Chinese (pp. 93-107). Amsterdam: North-Holland.
Jared, D., McRae, K., & Seidenberg, M. S. (1990). The basis of
consistency: Effects in word naming. Journal of Memory and Language, 29, 687-715.
Midgley-West, L. (1979). Strategies in reading. Unpublished doctoral
dissertation, Birkbeck College, London, England.
Monsell, S., Doyle, M. C., & Haggard, P. N. (1989). Effects of
frequency on visual word recognition tasks: Where are they? Journal
of Experimental Psychology: General, 118, 43-71.
Monsell, S., Patterson, K.E., Graham, A., Hughes, C. H., & Milroy, R.
(1992). Lexical and sub-lexical translation of spelling to sound:
Strategic anticipation of lexical status. Journal of Experimental
Psychology: Learning, Memory, and Cognition, 18, 452-467.
Morton, J., Sasanuma, S., Patterson, K., & Sakuma, N. (1992). The
organisation of the lexicon in Japanese: Single and compound Kanji.
British Journal of Psychology, 83, 517-531.
National Language Institute. (1976). Gendai shinbun no Kanji [A
study of uses of Kanji characters in modern newspapers] (Report
No. 56). Tokyo: Shuei Shuppan.
Norris, D., & Brown, G. (1985). Race models and analogy theories: A
dead heat? Cognition, 20, 155-168.
Paap, K. R., & Noel, R. W. (1991). Dual-route models of print to
sound: Still a good horse race. Psychological Research, 53, 13-24.
Parkin, A. J. (1982). Phonological receding in lexical decision: Effects
of spelling-to-sound regularity depend on how regularity is defined.
Memory & Cognition, 10,43-53.
Patterson, K.,& Coltheart, V. (1987). Phonological processes in
reading: A tutorial review. In M. Coltheart (Ed.), Attention and
performance: XII. The psychology of reading (pp. 421-447). Hove,
England: Erlbaum.
Perfetti, C. A., Zhang, S., & Berent, I. (1992). Reading in English and
Chinese: Evidence for a "universal" phonological principle. In R.
Frost & L. Katz (Eds.), Orthography, phonology, morphology and
meaning (pp. 227-248). Amsterdam: N o r t h - H O W .
Plant, D. C., McdeUand, J. L., Seidenberg, M. $., & Patterson, K.
(1994). Understanding normal and impaired word reading: Computatlonal principles in quasi-regular domains (Tech. Rep. No.
PDP.CNS.94.5). Pittsburgh, PA; Carnegie Mellon University, Department of Psychology.
Sampson, G. (1985). Wrong systems. Stanford, CA; Stanford University Press.
Sasanuma, S. (1986). Universal and languageqed& symptomatology
and treatment of aphasia. F& Phoimoica.38, 121-175.
Sasanuma, S., Sakuma, N., & Kitano, K- (1992). Reading Kanji
without semantics: Evidence from a loogituduiakstudy of dementia.
Cognitive Neuropsychology, 9,465-486.
Seidenberg, M. S., & McCleUand, J. (1989). A distributed model of
word recognition and naming. PsychofoacalReview, 96, 523-568.
Seidenberg, M. S., Plant, D. C., Petersen, A. S., McClelland, J. L., &
McRae, K. (in press). Nonword pronunciation and models of word
recognition. Journal of Experimental Psychology: Human Perception
and Performance.
Seidenberg, M. S., Waters, G. S., Barnes, M. A, & Tanenhaus, M. K.
(1984). When does irregular spelling or pronunciation influence
word recognition? Journal of Verbal Learning and Verbal Behavior,
23, 393404.
Shallice, T., Warrington, E. K., & M c w t h y , R. (1983). Reading
without semantics. Quarterly Journal of Experimental Psychabgy,
35A, 111-138.
Stanhope, N., & Parkin, A. J. (1987). Further explorations of the
consistency effect in word and nonword pronunciation. Memory &
Cognition, 15(2), 169-179.
Strain, E., Patterson, K., & Seidenberg, M.S. (1995). Semantic effects
in single-word naming. Journal ofExptrimenta1 Psychology: Learning,
Memory, and Co&Wn, 21,1140-1154,
SweeUAM 4.5 [computer software]. (1990). Tokyo, Japan: A & A
Company, Ltd.
Taraban, R., & McCielland, J. L. (1987). Conspiracy effects in word
pronunciation. Joumalof MemoryandLanguage, 26,608-631.
Toyo Kanji jiten [Toyo Kanji dictionary]. Y. Abe (Ed.). (1962). Tokyo:
Obunsha.
Van Orden, G. C. (1987). A ROWS is a ROSE: SpeIling, sound and
reading. Memory & Cognition, 15, 181-198.
Vitkovitch, M., & Humphreys, G. W. (1991). Perseverant responding
in speeded naming of pictures: It's in the links. Joumalof -Eijmmmtal Psychology: Learning, Meifiaty, and Cognition, 17, 664-680.
Waters, G. S., & Seidenberg, M. S. (1985). Spelling-sound effects in
reading: Time-course and decision criteria. Memory & Cognition, 13,
557-572.
Wydell, T. (1991). Processes in the readme of Japanese: Cornpamtive
studies between English and Japanese odwpphies. Unpublished
doctoral dissertation, University of London, London, England.
Wydell, T. N., PaUerson, K E., & Humphreys, G . W. (1993).
Phonologically mediated access to meaning for Kanji: Is a ROWS
still a ROSE in Japanese Kanji? Journal of Qawrirnental Psychology:
Learning, Memory, and Cognition, 19 491-5-514.
Wydell, T. N., Quinlan, P., & Butterworth, B. (l995). Japanese lexical
database: 2,357 Japanese nouns rated .on frequency, familiar@ and
imageability and other language statistics an Kanji characters/words.
Manuscript submitted for publication.
Yin, W., Butterworth, B., & Patterson, K. (1995). Consistency and
congruency in naming Chinese characters. Manuscript submitted for
publication.
Received April 12,1994
Revision received November 16,1994
Accepted November 23,1994 rn