Language Learning
ISSN 0023-8333
EMPIRICAL STUDY
The Lexical Basis of Second Language
Reading Comprehension: From (Sub)Lexical
Knowledge to Processing Efficiency
Mona G. Alshehri
a
a,b
and Dongbo Zhangb
Taif University b University of Exeter
Abstract: This study compared how distinct lexical competences, including lexical knowledge as well as processing skills at both word/lexical and sublexical/
morphological levels, collectively and relatively predict reading comprehension in adult
learners of English as a foreign language (EFL). The participants were 220 Arabicspeaking EFL learners in a Saudi university. A battery of paper- and computer-based
tests was administered to measure the participants’ lexical competences, reading comprehension ability, and working memory. Hierarchical regression analyses revealed
that over and above working memory, both lexical and sublexical knowledge were
significant and unique predictors of reading comprehension, and sublexical processing efficiency, as opposed to lexical processing efficiency, predicted reading comprehension significantly. In addition, among the measured lexical competences, lexical
knowledge was the strongest predictor, and the two knowledge variables collectively
had a far greater influence on reading comprehension than did the two processing
efficiency variables. These findings are discussed in light of the lexical basis of text
comprehension.
Keywords reading comprehension; English as a foreign language; lexical quality;
lexical knowledge; lexical processing
This study was part of the project for the first author’s PhD dissertation completed at the University
of Exeter. The first author wishes to thank Taif University for sponsoring her study in the United
Kingdom. We thank Laura Ciaccio, Theres Grüter (Language Learning Associate Editor), Emma
Marsden (Language Learning Journal Editor), and anonymous reviewers for their comments and
suggestions that helped improve this paper. Any errors are our own.
Correspondence concerning this article should be addressed to Mona G. Alshehri, Department
of Education, Taif University, Al Hawiyah, Taif 26571, Kingdom of Saudi Arabia.
Email: mona.sh@tu.edu.sa
The handling editor for this article was Theres Grüter.
Language Learning 00:0, xxxx 2021, pp. 1–40
© 2021 Language Learning Research Club, University of Michigan
DOI: 10.1111/lang.12478
1
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Introduction
The lexical quality hypothesis (Perfetti, 2007) contends that high-quality representations of lexical and sublexical features are fundamental for efficient
word recognition and word-to-text integration and, consequently, for text comprehension. This hypothesis underscores the important role of diverse lexical
processes and, accordingly, readers’ lexical competences in reading comprehension. Essentially, efficient text comprehension necessitates not only rich
knowledge of word meanings but also an ability to process printed words
and access their meanings rapidly (i.e., lexical processing or word recognition
efficiency).
Previous studies involving diverse groups of second language (L2) readers have confirmed the importance of lexical knowledge, notably vocabulary
size/breadth, in reading comprehension (Choi & Zhang, 2021; Grabe, 2009;
Zhang, 2012). Yet, less is known about the role of knowledge of sublexical
features, notably morphological knowledge that, in light of the lexical quality hypothesis, should also play an important role in L2 reading comprehension. Limited research has concurrently considered both lexical and sublexical
knowledge in adult L2 readers of English (see Zhang & Koda, 2012, for an
exception; and see Ke, Miller, Zhang, & Koda, 2021, for a review and metaanalysis on the role of morphological awareness in biliteracy development in
children). More important, the literature on L2 reading comprehension has paid
little attention to lexical and sublexical processing efficiency that theoretically
should also be fundamentally important given that efficient comprehension is a
goal of reading (Grabe, 2009; Koda, 2005). Further research is thus warranted
on how diverse lexical competences, which have been defined in this study to
include not only lexical and sublexical knowledge but also lexical and sublexical processing efficiency, contribute to L2 reading comprehension. To this
end, this study measured distinct lexical competences using a battery of paperand computer-based tests in a large group of Arabic-speaking learners of English as a foreign language (EFL) in a Saudi university and compared how the
measured competences collectively and relatively predicted these learners’ L2
reading comprehension.
Background Literature
(Sub)Lexical Knowledge in Reading Comprehension
Reading comprehension can be understood as “the process of simultaneously extracting and constructing meaning through interaction and involvement with written language” (RAND Reading Study Group, 2002, p. 11).
The construction-integration model (Kintsch, 1988) contends that the process
Language Learning 00:0, xxxx 2021, pp. 1–40
2
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
of text comprehension starts with a reader’s accessing and integrating word
meanings for establishing a text model, and then the reader builds a situation
model through activation of background knowledge and various inferencing
processes. Reading comprehension thus arguably necessitates various linguistic processes including, notably, lexical processes that also underpin Perfetti’s
(1985) verbal efficiency theory and the lexical quality hypothesis that Perfetti
(2007) subsequently developed.
The lexical quality hypothesis places lexical representations and processes
at the center of a reading systems framework (Perfetti & Stafura, 2014) and
posits that high-quality representations of lexical and sublexical features are
fundamental to text comprehension (Perfetti, 2007). These representations involve the features of four constituents of word identity: orthography, phonology, semantics, and morphosyntax (Perfetti, 2007). Together, the quality of
these four features and the coherence among them facilitate the rapid, lowresource retrieval of lexical word identities and their integration into a mental
model of a text (Perfetti, 2007; Perfetti & Stafura, 2014).
The above theoretical outline of the lexical underpinnings of reading comprehension is largely situated in the first language (L1) context but should
pertain to L2 reading comprehension as well (Grabe, 2009). Words are the
building blocks of texts. To comprehend a text, L2 readers need to know the
meanings of the words that make up the text. The knowledge of word meanings, defined in this study as lexical knowledge, should thus play a critically
important role in text comprehension. This instrumentalist view (Anderson &
Freebody, 1981) of the importance of lexical knowledge in text comprehension can be well understood from a strand of L2 research that has focused
on lexical coverage and adequate comprehension of texts (e.g., Hu & Nation,
2000; Schmitt, Jiang, & Grabe, 2011). Other studies of L2 readers of English
have revealed strong positive correlations between vocabulary size/breadth of
knowledge (i.e., the number of words whose meanings are known) and reading
comprehension ability (e.g., Farran, Bingham, & Matthews, 2012; Qian, 1999;
Zhang, 2012). Grabe (2009) highlighted that correlation coefficients could be
greater than .90. Jeon and Yamashita’s (2014) meta-analysis showed that vocabulary knowledge is one of the strongest correlates of L2 reading comprehension (only next to grammatical knowledge; on average, r = .79).
Compared to the wide recognition of and strong empirical evidence for the
importance of lexical knowledge (i.e., knowledge of word meanings in the context of this study) in L2 reading comprehension, attention in the L2 literature
has been limited to the important role of knowledge of sublexical features encapsulated in the lexical quality hypothesis (Perfetti, 2007). As we mentioned
3
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
earlier, high-quality representations of lexical and sublexical features are fundamental to efficient word recognition and word-to-text integration (Perfetti,
2007). Additionally, the binding of constituent features also plays an essential
role. Morphology (both inflectional and derivational), in particular, has been
underscored by some scholars as an important constituent binding mechanism,
and morphological representations have a strong implication for reading acquisition (e.g., Ke et al., 2021; Kirby & Bowers, 2017). For example, in addition
to modifying the meaning and (sometimes) the part of speech of the base word
to which a suffix is added, English derivation is often characterized by phonological and/or orthographic change to the base word as well (e.g., apply →
applicable). Theoretically, morphological knowledge (and processing, which
is discussed in the next section) should also play an important role in the comprehension of English texts where multimorphemic words are prevalent (Nagy
& Anderson, 1984).
In fact, Jeon and Yamashita’s (2014) meta-analysis revealed that, on average, morphological knowledge had a correlation of .61 with L2 reading comprehension. It should be noted, however, that the number of effect
sizes/correlation coefficients meta-analyzed (k = 6) for morphological knowledge was notably smaller than the number for vocabulary knowledge (k = 31).
Thus, even though morphological knowledge, like orthographic knowledge
(r = .51) and phonological awareness (r = .48), was categorized in the metaanalysis as a low-evidence predictor of reading comprehension because of the
small number of correlations retrieved from the literature, the moderate average correlation did seem to lend clear support to the importance of morphology
in L2 reading comprehension. In a more recent meta-analysis, Ke et al. (2021)
reported a mean correlation of .52 between L2 morphological awareness and
reading comprehension (k = 17). The issues that wait to be further explored
in the L2 literature, however, are how morphological knowledge is important
for reading comprehension and whether it predicts L2 reading comprehension
over and above lexical knowledge.
Theoretically, morphological knowledge such as knowledge of roots and
affixes can contribute to text comprehension, independently of lexical knowledge, through at least two major mechanisms. On the one hand, the reader
can apply morphological knowledge for more accurate and rapid recognition of morphologically complex words in a text by, for example, dividing
those words into their morphemic constituents; on the other hand, morphological knowledge serves as a reliable strategy for the reader to unlock meanings of unknown words in textual reading, that is, instantaneous resolution of
Language Learning 00:0, xxxx 2021, pp. 1–40
4
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
vocabulary gaps during reading or “on the spot vocabulary learning” (Nagy,
2007, p. 64).
The empirical literature, however, has produced inconsistent findings.
Zhang and Koda (2013), for example, found young Chinese-speaking EFL
learners’ English morphological awareness that covered both derivation and
compounding predicted their reading comprehension, over and above vocabulary knowledge (or lexical knowledge as defined for the purpose of the present
study). Similar findings were also reported in some studies on young bilingual
readers (e.g., Kieffer & Lesaux, 2008; Zhang, 2017). Yet, a significant, unique
effect did not surface in Farran et al. (2012) study of Grades 3 and 5 Arabicspeaking bilingual readers of English in Canada. English morphological awareness barely explained any additional amount of variance in English reading
comprheneison after vocablary knoweldge was also included in the regression
model (vocabulary knowledge was actually the strongest predictor of reading
comprehension; see Table 7, p. 2175). Likewise, in a study of adult Chineseand Korean-speaking learners of English in Canada, Qian (1999) found that
morphological knowledge—knowledge of English affixes and stems that was
intended to be one of the measures for vocabulary depth—did not uniquely
and significantly predict reading comprehension. In Zhang and Koda’s (2012)
study of adult Chinese-speaking EFL learners, derivational knowledge did not
surface as a unique and significant predictor of reading comprehension when
they controlled for vocabulary knowledge.
(Sub)Lexical Processing Efficiency and Reading Comprehension
Although it is essential that readers possess diverse linguistic knowledge for
text comprehension, comprehension would be hampered if readers have not
automatized lower-level linguistic processes. Comprehension requires simultaneous orchestration or execution of a number of processes (Perfetti, 1999);
yet working memory capacity is limited (Baddeley, 2007). A lack of automatized lower-level processes would constrain the participation of higher-order
processes such as textual inferencing for effective construction of a mental
model. From a lexical perspective, because words are intended for use in the
real world, including for reading texts, knowing a word should not be simply
about an ability to “recognize it in connected speech or in print” and “to access
its meaning” but should entail the competence “to do these things within a fraction of a second” (Nagy & Scott, 2000, p. 273). The lexical quality hypothesis
(Perfetti, 2007), and its predecessor the verbal efficiency theory, embodies “a
capacity theory of comprehension” (Just & Carpenter, 1992, p. 122). It underscores high-quality representations of (sub)lexical features because they are
5
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
fundamental to rapid recognition of printed words and word-to-text integration processes. (Sub)lexical processing efficiency is an essential element of the
reading comprehension process (Perfetti & Stafura, 2014). In the L1 English
reading literature, particularly studies of school children or developing readers,
sight word recognition efficiency and word decoding fluency have been found
to be critical determinants of reading comprehension (Garcia & Cain, 2014).
Theoretically, the above emphasis on efficient lexical and sublexical processing should not pertain to L1 or monolingual readers only. In fact, word
recognition efficiency, that is, accurate and rapid recognition of printed words,
has been recognized as essential to L2 reading comprehension (Grabe, 2009;
Koda, 2005). Empirically, however, compared to the L1 reading literature, research that has considered fluency-related lexical competences has appeared
much less often in the literature on L2 English reading, and the existing body
of research has often approached the issue from diverse perspectives and generated mixed findings.
On the one hand, some studies of young ESL learners or bilingual children,
like those of monolingual children, considered the contribution of word decoding fluency to reading comprehension. Proctor, Carlo, August, and Snow
(2005), for example, found that after they had controlled for oral vocabulary,
English decoding fluency was not a unique and significant predictor of fourthgrade Spanish-speaking ESL learners’ reading comprehension in the United
States. Yet, in Pasquarella, Gottardo, and Grant’s (2012) study of adolescent
L2 readers of English in Canada, real and pseudoword decoding fluency significantly predicted reading comprehension after the researchers had controlled
for vocabulary knowledge.
On the other hand, there has been a small number of studies, mostly of
foreign language learners of English, that approached the issue of lexical processing efficiency in light of readers’ rapid lexical/semantic decision. As part
of the NELSON project, van Gelderen et al. (2004), for example, measured
adolescent Dutch-speaking EFL readers’ speed of word recognition with a lexical decision task, that is, a task that asked learners to decide as fast as they
could whether a letter string presented on a computer screen was an existing word (see also Harrington, 2018, where lexical decision tasks were intended to measure L2 lexical facility). Reaction times (RTs) and accuracy of
responses were both recorded. Among the five concurrent predictors of English
reading comprehension, only vocabulary knowledge, in addition to metacognitive knowledge, uniquely and significantly predicted reading comprehension.
A significant, unique effect did not surface for the RTs or for word recognition
speed. Yamashita’s (2013) study of Japanese-speaking university EFL learners,
Language Learning 00:0, xxxx 2021, pp. 1–40
6
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
however, found that reading comprehension was significantly predicted by
learners’ efficiency of decoding (judgement on whether a nonce word could
be read as an English word) and lexical meaning access (judgment on whether
words in a pair were antonyms) measured with a paper-based, timed yes/no decision task. It should be noted, however, that Yamashita, unlike van Gelderen
et al. (2004), did not concurrently consider the students’ lexical knowledge. It
thus remains unclear whether the significant effect identified for the processing efficiency measures would remain had a lexical knowledge measure been
included.
To date, very little research has aimed to test whether sublexical processing efficiency, particularly morphological processing efficiency, would be a
dimension of lexical competence that may uniquely predict L2 reading comprehension along with other dimensions (lexical vs. sublexical/morphological
knowledge, on the one hand, and lexical processing efficiency, on the other
hand). Overall, despite increasing interest in morphological knowledge and L2
reading comprehension (e.g., Ke et al., 2021; Kieffer & Lesaux, 2008; Zhang
& Koda, 2012) and L2 morphological processing and lexical representation
(Clahsen, Felser, Neubauer, Sato, & Silva, 2010; see also Ciaccio & Clahsen,
2020), little effort has been expended to combine the two lines of research and
to examine how morphological processing efficiency may have a unique role to
play during text reading. Logic suggests that if morphological knowledge is important for lexical inferencing and/or word decoding fluency during text comprehension, as some L2 studies have suggested (e.g., Zhang & Koda, 2012),
the use of or access to this knowledge must occur in a rapid manner for comprehension to be smooth and efficient. Zhang and Ke (2020) underscored the
importance of morphological decoding fluency in L2 reading comprehension.
If efficient morphological processing, which entails quick access to morphological features such as morphological structure and meanings of morphemic
constituents, is not in place, fluent morphological decoding would not be possible. In other words, morphological knowledge is necessary but insufficient for
efficient processing or recognition of multimorphemic words in print. Empirically, as in the case of lexical knowledge versus lexical processing efficiency,
studying morphological processing efficiency in conjunction with morphological knowledge would be warranted in order to explore their hypothetically
unique contribution to L2 reading comprehension.
The Present Study
This study set out to address the aforementioned gaps and explore the lexical
basis of L2 reading comprehension in light of the lexical quality hypothesis.
7
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
The overarching question to be answered was: How do distinct lexical competences collectively and relatively predict L2 reading comprehension? Three
sets of research questions were further posed to guide this study. The first set
examined the contribution of lexical versus sublexical predictors; the second
set examined the contribution of knowledge versus processing efficiency predictors; and the last set examined the collective and relative contributions of
the four lexical competences.
1. How does lexical versus sublexical knowledge, on the one hand, and
lexical versus sublexical processing efficiency, on the other hand, relatively predict L2 reading comprehension? How does lexical-level
competence (knowledge and processing efficiency) versus sublexical
competence (knowledge and processing efficiency) relatively predict
L2 reading comprehension?
2. How does lexical knowledge versus processing efficiency, on the one
hand, and sublexical knowledge versus processing efficiency, on the
other hand, relatively predict L2 reading comprehension? How does
knowledge (lexical and sublexical) versus processing efficiency (lexical and sublexical) relatively predict L2 reading comprehension?
3. How do the four lexical competences—lexical and sublexical, on
the one hand, and knowledge and processing efficiency, on the other
hand—collectively and relatively predict L2 reading comprehension?
Method
Participants
This study was conducted at a women’s university in Saudi Arabia. The participants were 268 Arabic-speaking first-year students in the university. For
various random reasons such as absence from class or schedule conflict,
48 students missed one or more of the testing sessions. The analyses for this
study were, therefore, based only on the data from those who attended all the
testing sessions described below (N = 220). The students’ age ranged between
17 and 22 years (Mage = 20 years). The background questionnaire showed that
a large majority of the participants started learning English when they were
about 12 years old.
The participants represented a range of undergraduate majors offered by
the Saudi university, including, for example, media, English, chemistry, nutrition, and computer science. English is generally the medium of instruction
in Saudi universities, particularly for science and engineering majors. Before
proceeding to their discipline studies in English, which typically starts with
Language Learning 00:0, xxxx 2021, pp. 1–40
8
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
the second year, Saudi university students need to go through a whole year
of intensive English learning to enhance their English proficiency, particularly
English for academic purposes. This was the case for the participants of this
study, who were first-year students. They all participated on a voluntary basis.
Measures
We administered a battery of paper- and computer-based tests that are described in detail below, in groups or individually, to measure the participants’
distinct lexical competences, reading comprehension, as well as their working
memory. We piloted all instruments with 30 other first-year students who were
studying at the same university but who did not participate in the study. We subsequently modified some of the instruments, and we also collected feedback
from some students from the pilot testing to help with the modification process. All tasks (Alshehri & Zhang, 2021a, 2021b, 2021c, 2021d, 2021e) except
the reading comprehension task are available on https://www.iris-database.org.
The reading comprehension task is not publicly available because it was
from the Gates-MacGinitie Reading Tests (MacGinitie, MacGinitie, Maria,
& Dreyer, 2000), which are copyright-protected and can be purchased from
https://riversideinsights.com.
All measures showed a fair to high level of reliability (see Table 1; Brown,
2014). In light of recent discussions on and recommendations for instrument
reliability based on internal consistency in the literature on psychoeducational
assessment and applied linguistics (e.g., McNeish, 2018; Plonsky & Derrick,
2016), we have reported McDonald’s omega, which does not assume Tauequivalence and which we calculated using the structural equation modeling
method following Hayes and Coutts’s (2020) recommendations. We have also
reported Cronbach’s alpha because of its wide, albeit increasingly contested,
use in the literature.
Reading Comprehension
We measured reading comprehension with a standardized reading test, Form
S from the Gates-MacGinitie Reading Tests (MacGinitie et al., 2000). We selected this test because it considers different types of texts and assesses literal
as well as inferential comprehension. Another consideration was that, as opposed to any retired standardized tests that target nonnative speakers of English
(e.g., IELTS), the participants were unlikely to have taken this test. Although
this test is more commonly used for L1 populations, it has been widely used
to measure L2 readers’ comprehension as well (e.g., Akamatsu, 2003; Li &
Kirby, 2015). We selected four short reading passages (mean length about 120
9
Language Learning 00:0, xxxx 2021, pp. 1–40
Measure
M
SD
95% CI
αb
ωc
21
8.15
3.47
[7.69, 8.61]
.630
.641
12
10
10
30
7.78
6.29
4.41
18.44
2.92
2.57
2.47
5.62
[7.39, 8.17]
[5.94, 6.63]
[4.09, 4.74]
[18.03, 19.47]
.755
.747
.661
.812
–
1,580.70
343.20
–
2,806.00
1,286.90
24
14.01
3.99
[1,538.60,
1,622.80]
[2,635.10,
2,977.00]
[13.55, 14.59]
–
1,966.50
423.70
–
3,631.80
1,298.60
72
40
38.10
30.27
14.58
7.73
[1,910.20,
2,023.40]
[3,459.20,
3,804.30]
[36.16, 40.04]
[29.67, 31.65]
Skewness
SE
Kurtosis
SE
0.479
0.154
−0.161
0.306
.757
.750
.670
.795
−0.370
−0.392
−0.696
−0.525
0.152
0.152
0.153
0.151
−0.603
−0.638
−0.102
−0.203
0.303
0.303
0.306
0.302
–d
–d
−0.593
0.152
1.126
0.302
–d
–d
1.994
0.164
5.357
0.327
.684
.689
0.087
0.153
−0.424
0.304
–d
–d
−0.829
0.151
1.038
0.302
–d
–d
1.413
0.164
3.720
0.327
.949
.904
.951
.911
0.320
−0.882
0.151
0.151
−0.490
0.182
0.300
0.301
(Continued)
10
Lexical Basis of L2 Reading Comprehension
Reading
comprehension
Affix form
Affix meaning
Affix function
Separability
(accuracy)
Separability (raw
RT)
Separability
(IESe )
Combinability
(accuracy)
Combinability
(raw RT)
Combinability
(IESe )
Vocabulary levels
Lexical decision
(accuracy)
Itemsa
Alshehri and Zhang
Language Learning 00:0, xxxx 2021, pp. 1–40
Table 1 Measures and descriptive statistics for all competency measures
Measure
M
SD
95% CI
αb
ωc
Skewness
SE
–
973.95
186.50
–d
–d
−0.298
0.151
0.050
0.301
–
1,358.30
537.10
–d
–d
1.720
0.165
4.071
0.328
20
14.02
3.73
[951.20,
996.70]
[1,302.10,
1,535.00]
[13.84, 14.76]
.754
.736
−1.110
0.151
1.740
0.300
–
1,988.30
320.70
–d
–d
−0.079
0.152
−0.046
0.302
–
3,039.80
1278.50
–d
–d
2.397
0.164
5.671
0.327
[1,949.00,
2,027.60]
[2,870.00,
3,209.70]
Kurtosis
SE
Note. N = 220. Affix form, affix meaning, and affix function = sublexical knowledge; separability and combinability = sublexical processing;
vocabulary levels = lexical knowledge; lexical decision = lexical processing; RT = reaction time; IES = inverse efficiency score.
a
Number of items in the measure.
b
Cronbach’s α estimate of reliability.
c
McDonald’s ω estimate of reliability, which does not assume Tau-equivalence, was calculated using the structural equation modeling method
in Mplus (Version 8.0; Muthén & Muthén, 1998–2017; see Hayes & Coutts, 2020).
d
Reliability could not be calculated for raw RTs for computer-based measures because participants showed diverse patterns of correct “yes”
reactions across real-word stimuli. Reliability also could not be calculated for IES RTs because there was only one holistic score for each
participant.
e
IES was calculated by dividing participants’ raw mean RT by the percentage of their correct responses.
Lexical Basis of L2 Reading Comprehension
Language Learning 00:0, xxxx 2021, pp. 1–40
Lexical decision
(raw RT)
Lexical decision
(IESe )
Working memory
(accuracy)
Working memory
(raw RT)
Working memory
(IESe )
Itemsa
Alshehri and Zhang
11
Table 1 (Continued)
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
words) from Level 5, based on our expert knowledge about Saudi university
students’ reading proficiency and on our pilot study. Of the four passages that
we selected, we deliberately chose two to be informational and the other two
to be narrative. Each passage was accompanied by five or six multiple-choice
questions (each with four response options), with a total of 21 questions across
the four passages. This test was paper-based and administered in two class sessions, two passages in each session and each session about 25 minutes. The
participants were instructed to read the passages silently and to circle an answer for each question.
Lexical Knowledge
We narrowly defined lexical knowledge as learners’ knowledge of meanings
of individual words. We intended lexical knowledge to represent the participants’ vocabulary breadth and measured it with a modified version of the Vocabulary Levels Test (Schmitt, Schmitt, & Clapham, 2001; Webb, Sasao, &
Ballance, 2017). The test for this study covered only four levels of word frequency: 1,000, 2,000, 3,000, and 5,000 words. From each frequency level, we
randomly sampled six clusters of words; each cluster consisted of a list of six
words and three meaning choices. Different from the original Vocabulary Levels Test, we translated the three meaning choices and presented them in Arabic,
the native language of the participants. The participants were asked to select a
word to match each meaning choice. This test was administered in one class
session; the participants were given 20 minutes to complete it.
Sublexical Knowledge
We were particularly interested in learners’ sublexical knowledge that pertains
to morphology or more specifically derivation. We measured the participants’
morphological knowledge with a task that we modeled on the Word Part Levels
Test (Sasao & Webb, 2017). Although we retained the format, the instructions,
and the scoring method of the original test, we redesigned some items, giving
consideration to Saudi university students’ English learning experience and
knowledge of English prefixes and suffixes. The test consisted of three sections
that assessed knowledge of form, meaning, and use/function of English affixes
(e.g., -less and super-).
The first section consisted of 12 items that measured knowledge of the correct written forms of common English prefixes and suffixes. The participants
were presented with four orthographically similar options, only one of which
was a correct affix and should thus have been circled (e.g., multi-, mul-, mlt-,
tui-). The second section consisted of 10 items that measured knowledge of
Language Learning 00:0, xxxx 2021, pp. 1–40
12
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
meanings of affixes. The participants were asked to select from four choices
a simple English word that conveyed the meaning of a target prefix or suffix.
For each affix (e.g., un-), two words (e.g., unhappy and unfair) were given as
examples to contextualize the use of the affix; the Arabic translation of the
four English word choices (e.g., for un-: again, no, back, and new) was also
provided. The last section included 10 items that measured knowledge of how
an affix indicates the part of speech of a derivational word (i.e., the syntactic
properties of affixes). For each item, a prefix or suffix (e.g., -ish) was presented
together with a derivational word (e.g., selfish) to show its use. The participants
were asked to select “noun,” “verb,” “adjective,” or “adverb” to demonstrate
their understanding about how the target affix indicated the part of speech of
a word to which it was attached. Arabic translations of the terms noun, verb,
adjective, and adverb were also provided. The test was administered to the participants in a separate class session of about 20 minutes.
Lexical Processing Efficiency
Although the lexical knowledge measure described earlier aimed to assess how
many words that the participants knew (specifically, their written receptive vocabulary size), lexical processing efficiency in the context of this study concerned visual word recognition efficiency, that is, how rapidly the participants
could recognize a printed word that they knew. To measure lexical processing
efficiency, we adopted a computer-based lexical decision task.
The lexical decision task consisted of 40 real words as well as 20 decodable
pseudowords (e.g., toag) as fillers. The real words were randomly selected from
the 1,000 word level of the most frequent words based on the Corpus of Contemporary American English (Davies, 2008; and see www.wordfrequency.info
for the relevant information) and should thus have been known to the participants. The order of these real words and pseudowords was randomized. The
participants were asked to indicate whether they knew a word (i.e., knew its
[partial] meaning) displayed on the screen by pressing as quickly as possible
the “yes” or “no” key marked on the keyboard (cf. the literature on measuring
efficiency of vocabulary recognition using computer-based, yes/no tests such
as Harrington, 2018, and Pellicer-Sánchez & Schmitt, 2012). Both RTs and
yes/no responses were recorded. Details on the testing procedure are provided
later in the Data Collection Procedure section.
Sublexical Processing Efficiency
The sublexical processing efficiency measure focused on morphological processing. Two computer-based tasks were included. In the separability task,
13
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
following Koda’s (2000) study, the participants were asked to decide, as quickly
as possible, whether a word presented in the center of the computer screen
could be separated into two or more meaningful components (i.e., stem and
affix). There were 30 stimulus words that we assumed that the participants
would know. Those words were selected from an initial list created by the first
author based on her many years of teaching in the university and familiarity
with the participants’ curriculum. They were also checked by the participants’
English teachers and later piloted with a separate group of students, as we
mentioned earlier. Fifteen of the stimuli were actual derivational words, such
as government and disappear, that can be segmented into govern and -ment
and dis- and appear, respectively. The other 15 words were monomorphemic
words that included a letter or a string of letters resembling an English affix,
for example, power and kitchen. Conversely, the combinability task asked the
participants to decide, as quickly as possible, whether the two word-parts presented on the computer screen could be combined to make a meaningful longer
English word. There were 24 items in this task, including 12 items that were
combinable, such as fear and less, and 12 items that were not, such as un and
home.
Working Memory
Text comprehension necessitates the execution of a number of processes, the
efficiency of which depends heavily on readers’ mental capacity (Just & Carpenter, 1992). Working memory capacity, in particular, is a significant correlate
of L2 reading comprehension (Harrington & Sawyer, 1992). (Sub)lexical processing itself also depends on working memory (Tokowicz, 2014). To obtain
a more accurate understanding of the effect of lexical competences, particularly that of processing efficiency, we also measured the participants’ working memory and later included working memory as a covariate when, in regression analysis, we used different lexical competences to predict reading
comprehension.
We measured working memory with a computerized digit span task, which
is one of the most widely used types of working memory tests (Richardson, 2007). The test for this study consisted of 20 numerical sequences—
10 for forward span and 10 backward span—assessing short-term storage of
the stimulus sequences. For the forward span items, the participants were
asked to decide, as quickly as possible, whether a digit sequence presented
on the computer screen was the one that they had just seen immediately
beforehand and in the same given order. Likewise, for the backward span
items, they were to decide whether a digit sequence was the one that they
Language Learning 00:0, xxxx 2021, pp. 1–40
14
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
had just seen immediately beforehand but that had the order reversed. For
both types of span, there were five sets of random numerical digits increasing in length of sequence (i.e., number of digits in a sequence; it started with
two-digit sequences and ended with six-digit sequences). Each set consisted
of two items: one with the order matched and the other with the order not
matched.
Data Collection Procedure
For the paper-based lexical and sublexical knowledge measures and the reading
comprehension test, the first author negotiated with the participants’ English
language teachers to administer them in groups in four to six class sessions
(each session about 20–30 minutes) as it was convenient for the classes. The
working memory and the lexical and sublexical processing efficiency measures were administered individually on a laptop computer and run on PsychoPy (Version 3.0; Peirce et al., 2019). The computer-based testing was conducted in a quiet space on the university campus in one session that lasted about
15 minutes. Data collection was completed over a period of four weeks. Task
instructions were given in Arabic (or Arabic and English) to ensure the participants’ full understanding of them.
The paper-based tests were carried out first, followed by the computerbased tests. For the computer-based testing, the working memory test was administered first, followed by the (sub)lexical processing efficiency measures.
For all measures, testing began with an on-screen instruction and some practice items. The participants were asked to give a response for an item presented
in the center of the computer screen by pressing as quickly as possible “yes”
(the left arrow key) or “no” (the right arrow key) marked with stickers on the
keyboard. Both RTs and yes/no responses were recorded. RT was calculated
as the interval between the onset of an item and the time of “yes” or “no”
being pressed. For the working memory test, the participants began the test
by seeing a digit sequence for a fixed length of 1,000 ms. Upon the offset of
the stimulus sequence, the question “Is this (a digit sequence) the number you
saw in the given order?” or “Is this (a digit sequence) the number you saw in
the reverse order?” appeared on the screen and was presented in Arabic. For
all the computer-based tests, the pressing of a key automatically activated the
next item. If no key was pressed for an item or no response was detected after
a certain period of time, the item would automatically disappear and the next
item would appear. The time assigned for an item to be answered before disappearing ranged from 1,000 to 2,000 milliseconds, with the baseline time for
each item estimated on the basis of the pilot study.
15
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Scoring and Missing Values
For the paper-based measures, one point was awarded for each correct answer; an incorrect answer or a missing response did not receive any points.
The maximum score possible was thus 21 for the reading comprehension test
and 72 for the lexical knowledge test. For the three sections of the sublexical
(or morphological) knowledge test, the maximum scores possible were 12, 10,
and 10, respectively.
The scoring and handling of missing data for the computer-based tasks
were less straightforward. There are no consistent methods for handling the
data of decision tasks like those in the present study (Jiang, 2012). In the literature on L2 reading comprehension, although some studies have considered
both accuracy and RTs of responses (e.g., Cremer & Schoonen, 2013), others
have incorporated only RTs (e.g., van Gelderen et al., 2004). In the present
study, for all computerized measures, we adopted RTs for our analysis. This
choice was also in line with our purpose of comparing (sub)lexical knowledge
against processing efficiency for which it was of greater interest to examine the
participants’ response or decision latency. We, however, accommodated accuracy rate in the final RT calculation for all computerized measures (see the
discussion on the inverse efficiency score or IES in the next paragraph; see
also the Limitations and Future Research section).
For the lexical processing efficiency task, we focused only on the 40 real
words; for those words, we relied on the RTs of correctly answered items. To
calculate the right RTs for analysis, for each participant, we recoded the RT of
a “no” decision for a real word as missing; and the RT of a missing decision
was also coded as missing. Then we calculated the mean RT for each item. A
RT that was above or below the item mean by two or more standard deviations
was subsequently considered to be an outlier and further recoded as missing.
This was followed by computing the mean RT of correctly answered items for
each participant. Finally, to accommodate the rate of correct responses, a raw
RT was replaced by an inversed value (Ratcliff, 1993). For each participant,
the IES was calculated by dividing the raw mean RT by the percentage of
correct responses (Townsend & Ashby, 1983). In this respect, the participants
with a low RT but a low accuracy rate as well would be penalized for the low
accuracy. The same procedure was largely followed for calculating the adjusted
RTs for the two sublexical/morphological processing efficiency tasks as well.
The only exception was that, unlike the filler items or pseudo words in the
lexical decision task, the monomorphemic items for the separability task and
the items not combinable for the combinability task were not excluded for RT
calculation. This was because a “no” decision on those items was considered
Language Learning 00:0, xxxx 2021, pp. 1–40
16
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
to also show the participants’ attention to morphological features and thus to
reflect their morphological processing efficiency.
Results
Descriptive Statistics, Reliabilities, and Normality
All statistical analyses, unless stated otherwise, were performed with the SPSS
(Version 26) software. Table 1 presents the means, standard deviations, reliabilities (McDonald’s omega and Cronbach’s alpha), and skewness and kurtosis
values of all measured competences (Appendix S1 in the online Supporting Information presents elaborated descriptive statistics that include the minimum
and maximum scores and the range for each variable). The accuracy rates,
raw RTs, as well as IESs/adjusted RTs are shown for the (sub)lexical processing efficiency measures although, for the reason that we mentioned earlier,
only IES RTs were used for the subsequent bivariate correlation and regression analyses. For most measures, the skewness and kurtosis estimates were
generally below the rule-of-thumb values for univariate normality (e.g., ±2 for
both skewness and kurtosis) as well as the critical values that may result in significant deviation from multivariate normality (e.g., ±2 and ±7 for skewness
and kurtosis, respectively; Curran, West, & Finch, 1996). The kurtosis of the
IES RTs was higher than that of the raw RTs and than that of the paper-based
measures. Nonetheless, the kurtosis was within the acceptable range for multivariate normality. Normality of residuals was checked and confirmed through
the examination of histograms and probability-probability plots (cf. Gelman &
Hill, 2007, who argued that normality is one of the least important assumptions
to check when running generalized linear models). Alpha was set at .05 for all
correlations and regression analyses.
Bivariate Correlations
Table 2 shows the bivariate correlations between all the variables. Reading
comprehension correlated positively and significantly with all knowledge variables, notably lexical knowledge (i.e., vocabulary knowledge; r = .643), which
also produced the highest correlation with reading comprehension. Reading comprehension also correlated negatively and significantly with working
memory (r = −.169) and the two sublexical processing efficiency tasks of the
separability and combinability (r = −.183 and −.193, respectively). The correlation between reading comprehension and lexical processing efficiency (i.e.,
lexical decision) was negative as well (r = −.084); however, it did not achieve
statistical significance.
17
Language Learning 00:0, xxxx 2021, pp. 1–40
Competency
1. Reading
comprehension
2. Affix form
3. Affix meaning
4. Affix function
5. Separability
7. Vocabulary levels
8. Lexical decision
9. Working memory
2
r (p)95% CI
.391 (< .001)
[.276, .497]
.369 (< .001)
[.261, .462]
.519 (< .001)
[.401, .613]
−.183 (.007)
[−.290, −.062]
−.193 (.004)
[−.317, −.061]
.643 (< .001)
[.549, .713]
−.084 (.213)
[−.330, .054]
−.169 (.012)
[−.260, −.075]
_
.463 (< .001)
[.365, .557]
.506 (< .001)
[.415, .585]
−.254 (< .001)
[.058, −.374]
−.220 (.001)
[−.333, −.101]
.405 (< .001)
[.286, .514]
−.137 (.043)
[−.328, −.044]
−.182 (.007)
[−.324, −.046]
3
r (p)95% CI
4
r (p)95% CI
5
r (p)95% CI
6
r (p)95% CI
7
r (p)95% CI
8
r (p)95% CI
_
.518 (< .001)
[.428, .592]
−.305 (< .001)
[−.417, −.195]
−.294 (< .001)
[−.402, −.190]
.447 (< .001)
[.333, .548]
−.228 (.001)
[−.398, −.152]
−.266 (< .001)
[−.393, −.130]
_
−.291 (< .001)
[−.385, −.197]
−.331 (< .001)
[−.424, −.234]
.547 (< .001)
[.422, .643]
−.218 (.001)
[−.367, −.149]
−.176 (.009)
[−.273, −.071]
_
.369 (< .001)
[.252, .490]
−193 (.004)
[−.289, −.084]
.247 (< .001)
[.144, .479]
.202 (.003)
[.053, .353]
_
−.206 (.002)
[−.322, −.084]
.273 (< .001)
[.144, .512]
.144 (.033)
[.021, .274]
_
−.227 (.001)
[−.365, −.146]
−.207 (.002)
[−.311, −.066]
_
.087 (.197)
[.013, .228]
Note. N = 220. Affix form, affix meaning, and affix function = sublexical knowledge; separability and combinability = sublexical processing;
vocabulary levels = lexical knowledge; lexical decision = lexical processing. The 95% confidence intervals are based on 1,000 bootstrap
samples. Bolding indicates correlation coefficients that are significant at p < .05.
18
Lexical Basis of L2 Reading Comprehension
6. Combinability
1
r (p)95% CI
Alshehri and Zhang
Language Learning 00:0, xxxx 2021, pp. 1–40
Table 2 Bivariate correlations for all measured competences
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
It is also important to note that the three measures of sublexical knowledge
were all significantly correlated with each other. Knowledge of affix forms significantly positively correlated with knowledge of affix meanings (r = .463)
and knowledge of affix function (r = .506); and knowledge of affix meaning
and knowledge of affix function also showed a significant positive correlation
(r = .518). All three sublexical knowledge measures also significantly positively correlated with lexical knowledge, r = .405, .447, and .557, respectively,
for the affix form, meaning, and function tasks.
Finally, all the (sub)lexical knowledge measures negatively and significantly correlated with all the (sub)lexical processing efficiency measures,
which made sense because the processing efficiency measures had a focus on
speed (i.e., the lower the value, the greater the speed). The two sublexical processing efficiency measures and working memory were also positively and significantly correlated. The correlation between lexical processing efficiency and
working memory was also positive but not statistically significant (r = .087).
Contribution of Lexical Competences to Reading Comprehension
A series of hierarchical regression analyses (Jeon, 2015) was performed to
examine how different dimensions of lexical competences—lexical versus sublexical and knowledge versus processing efficiency—collectively and relatively contributed to L2 reading comprehension over and above working memory. For all analyses, working memory was entered first into the regression
equation as a covariate (it explained about 2.9% of the variance in reading comprehension), followed by different lexical competences entered individually or
as a block. The three sublexical knowledge measures were always entered as a
block; likewise, the RTs for the morphological separability and combinability
tasks were also entered as a block to represent sublexical processing efficiency.
The order of entry was also switched for different predictors to test and compare their unique contribution to reading comprehension. Multicollinearity was
diagnosed for multiple regression analysis through variable inflation factors,
which ranged from 1.108 to 1.864 and were smaller than the lowest bound of
rule-of-thumb values (i.e., 2.5) for indicating the presence of multicollinearity
(Allison, 1999).
Comparing Lexical and Sublexical Predictors
The first set of research questions sought to compare lexical and sublexical
predictors of reading comprehension. We conducted three sets of regression
analyses for this purpose. We first examined how lexical knowledge and sublexical knowledge predictors relatively contributed to reading comprehension,
19
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Table 3 Comparing lexical and sublexical predictors of reading comprehension
Steps
Predictors
R2
95% CI
1
Working memory
.029 [.001, .084]
Lexical vs. Sublexical knowledge
2
Lexical knowledge
.414 [.315, .493]
3
Sublexical knowledge
.459 [.353, .527]
2
Sublexical knowledge
.300 [.193, .380]
3
Lexical knowledge
.459 [.353, .527]
Lexical vs. Sublexical processing
2
Lexical processing
.034 [.000, .087]
3
Sublexical processing
.068 [.008, .125]
2
Sublexical processing
.067 [.011, .130]
3
Lexical processing
.068 [.008, .125]
Lexical vs. Sublexical (knowledge & processing)
2
Lexical knowledge &
.419 [.316, .494]
processing
3
Sublexical knowledge &
.467 [.353, .529]
processing
2
Sublexical knowledge &
.300 [.185, .374]
processing
3
Lexical knowledge
.467 [.353, .529]
&processing
R2 adjusted
R2
p
.024
.029
.012
.409
.446
.287
.446
.386
.044
.271
.159
< .001
.001
< .001
< .001
.025
.050
.055
.050
.005
.034
.039
.000
.297
.021
.012
.876
.411
.390
< .001
.447
.049
.002
.280
.271
< .001
.447
.167
< .001
and then we analyzed how lexical processing efficiency and sublexical processing efficiency relatively contributed to reading comprehension. Finally, we
compared how the two lexical-level competences (i.e., knowledge and processing efficiency together) and the two sublexical competences (also knowledge
and processing efficiency together) relatively predicted reading comprehension. Estimates of the regression coefficients in the final regression model can
be seen in Appendix S2 in the online Supporting Information for each set of
analyses.
The upper panel of Table 3 shows that, when we controlled for working
memory, lexical knowledge additionally significantly explained 38.6% of the
variance in reading comprehension. Over and above working memory and
lexical knowledge, sublexical knowledge also significantly predicted reading
comprehension; it, however, only additionally explained 4.4% of the variance.
When sublexical knowledge was entered into the regression equation as the
second step, it significantly added 27.1% to the variance explained. The unique
Language Learning 00:0, xxxx 2021, pp. 1–40
20
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
effect of lexical knowledge remained significant; over and above working
memory and sublexical knowledge, it explained 15.9% of the variance in reading comprehension. Thus lexical knowledge explained a far greater amount of
unique variance than did sublexical knowledge although the unique effect of
both predictors was significant.
The middle panel of Table 3 shows the unique contribution of lexical versus sublexical processing efficiency. After controlling for working memory,
lexical processing efficiency did not predict reading comprehension significantly, whether it was entered before or after sublexical processing efficiency.
It barely explained any additional variance in reading comprehension when
sublexical processing efficiency was already in the model. However, sublexical processing efficiency uniquely explained a small yet significant proportion of variance in reading comprehension. Specifically, when working memory and lexical processing efficiency were in the regression model, sublexical
lexical processing efficiency additionally significantly explained 3.4% of the
variance.
Last, we compared the effects of the two lexical predictors with those of the
two sublexical predictors. As shown in the bottom panel of Table 3, the lexical predictors (knowledge and processing entered as a block) had a far greater
unique effect on reading comprehension than did the sublexical predictors, although the unique effect of both was statistically significant. Specifically, over
and above working memory and the lexical predictors, the sublexical predictors
additionally and significantly explained about 4.9% of the variance in reading
comprehension. However, the lexical predictors, when entered into the regression model at the last step, significantly explained about 16.7% of the variance
in reading comprehension.
Taken together, the findings suggested that lexical-level competences overall had a stronger effect on reading comprehension than did sublexical competences, and this advantage seemed to be attributed to the large effect of lexical
knowledge on reading comprehension. With respect to processing efficiency,
the effect at the sublexical level, though small, was greater.
Comparing Knowledge and Processing Efficiency Predictors
The second set of research questions aimed to compare the effects of knowledge and processing efficiency predictors. Three sets of regression analyses
again were conducted. We first compared these two types of competence at the
lexical level and then at the sublexical level. Last, we compared the effects of
lexical and sublexical knowledge (i.e., the two levels together) and those of lexical and sublexical processing efficiency (together). Estimates of the regression
21
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Table 4 Comparing knowledge and processing efficiency predictors of reading comprehension
Steps
Predictors
R2
95% CI
1
working memory
.029 [.001, .084]
Lexical knowledge vs. Lexical processing
2
Lexical knowledge
.414 [.315, .493]
3
Lexical processing
.419 [.316, .494]
2
Lexical processing
.034] [.000, .087
3
Lexical knowledge
.419 [.316, .494]
Sublexical knowledge vs. Sublexical processing
2
Sublexical knowledge
.300 [.193, .380]
3
Sublexical processing
.300 [.185, .374]
2
Sublexical processing
.067 [.011, .130]
3
Sublexical knowledge
.300 [.185, .374]
Knowledge vs. Processing (lexical & sublexical)
2
Lexical & sublexical
.459 [.353, .527]
knowledge
3
Lexical & sublexical
.467 [.353, .529]
processing
2
Lexical & sublexical
.068 [.008, .125]
processing
3
Lexical & sublexical
.467 [.353, .529]
knowledge
R2 adjusted
R2
p
.024
.029
.012
.409
.411
.025
.411
.386
.004
.005
.385
< .001
.214
.297
< .001
.287
.280
.055
.280
.271
.000
.039
.232
< .001
.997
.012
< .001
.446
.430
< .001
.447
.009
.341
.050
.039
.032
.447
.400
< .001
coefficients can be seen in Appendix S3 in the online Supporting Information
for each set of analyses.
The upper panel of Table 4 shows the results of the first comparison.
Controlling for working memory and lexical processing efficiency, lexical
knowledge significantly explained a unique proportion of variance in reading comprehension (about 38.5%). Conversely, however, a unique effect did
not surface for lexical processing efficiency when it was entered last into the
model, and minimal additional variance of reading comprehension was explained (0.4%).
The middle panel of Table 4 presents the results of the second comparison. Sublexical knowledge, whether entered in the model before or after sublexical processing efficiency, significantly predicted reading comprehension.
As the last predictor entered in the model, sublexical knowledge uniquely explained about 23.2% of the variance of reading comprehension. Conversely,
Language Learning 00:0, xxxx 2021, pp. 1–40
22
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
although working memory capacity was controlled for, sublexical processing
efficiency significantly predicted reading comprehension (3.9% of the variance
explained), but it failed to significantly predict reading comprehension when
sublexical knowledge was also in the model (0.0% of the variance explained).
Finally, as shown in the bottom panel of Table 4, the two knowledge measures (lexical and sublexical combined) collectively, uniquely, and significantly
explained about 40% of the variance in reading comprehension when working
memory and the two processing efficiency measures (lexical and sublexical)
were also in the model. Conversely, entered after working memory and the two
knowledge predictors, the two processing efficiency measures barely explained
any additional variance in reading comprehension (0.9%).
Taken together, the above findings seemed to suggest that knowledge was
a far stronger predictor of reading comprehension than processing efficiency,
which was true for both the lexical and the sublexical level or disregarding the
level of competence.
Unique Contribution of Each Predictor
Distinct from the first two questions, the last set of research questions focused
on the unique and relative contribution of each predictor. Regression coefficient estimates can be seen in the bottom panel of either Appendix S2 or
Appendix S3 in the online Supporting Information. Table 5 shows that the
four lexical competences collectively explained over 40% of the variance in
reading comprehension. The top section of the table shows the results of the
unique contribution of sublexical knowledge, and sublexical processing efficiency when all the other predictors (working memory included) were in
the model. The unique contribution was significant for sublexical knowledge
(4.0% of the variance explained), but not for sublexical processing efficiency
(0.1% of the variance explained). Likewise, the bottom section of Table 5
shows the unique contribution of lexical knowledge, and lexical processing
efficiency, when all the other predictors were in the model. The unique contribution of lexical knowledge was significant (16.5% of the variance explained);
yet a significant, unique effect did not surface for lexical processing efficiency
(0.8% of the variance explained).
Based on the unique proportion of variance explained for reading comprehension (i.e., R2 ), lexical knowledge appeared to be the strongest unique predictor, followed by sublexical knowledge. With the presence of the knowledge
predictors and working memory in the model, lexical and sublexical processing
efficiency barely contributed to reading comprehension.
23
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Table 5 The unique contribution of each predictor of reading comprehension
Steps
Predictors
R2
95% CI
1
Working memory
.029 [.001, .084]
Unique contribution of sublexical knowledge vs. processing
2
Lexical knowledge
.414 [.315, .493]
3
Lexical processing
.419 [.316, .494]
4
Sublexical processing
.427 [.318, .498]
5
Sublexical knowledge
.467 [.353, .529]
4
Sublexical knowledge
.467 [.358, .532]
5
Sublexical processing
.467 [.353, .529]
Unique contribution of lexical knowledge vs. processing
2
Sublexical knowledge
.300 [.193, .380]
3
Sublexical processing
.300 [.185, .374]
4
Lexical processing
.302 [.184, .373]
5
Lexical knowledge
.467 [.353, .529]
4
Lexical knowledge
.459 [.346, .523]
5
Lexical processing
.467 [.353, .529]
R2 adjusted
R2
p
.024
.029
.012
.409
.411
.413
.447
.452
.447
.386
.004
.008
.040
.048
.001
< .001
.214
.219
.001
< .001
.866
.287
.280
.279
.447
.441
.447
.271
.000
.002
.165
.159
.008
< .001
.997
.413
< .001
< .001
.068
Discussion
The present study set out to investigate how four distinct dimensions of lexical
competence—lexical versus sublexical, on the one hand, and knowledge versus
processing efficiency, on the other hand—collectively and relatively contribute
to reading comprehension in adult learners of English so as to shed light on
the lexical basis of L2 reading comprehension. The four lexical competences
collectively explained over 40% of the variance in the participants’ reading
comprehension. Compared to the processing efficiency predictors, the knowledge predictors had a predominant association with reading comprehension. In
fact, when the effects of the knowledge predictors were taken into consideration, those of the processing efficiency predictors were no longer significant.
Additionally, the lexical predictors, when examined collectively, had a greater
effect on reading comprehension than did the sublexical predictors; yet this
overall effect did not seem to hold specifically for processing efficiency in that
sublexical processing efficiency seemed to have a larger effect on reading comprehension than lexical processing efficiency (nonetheless, the effect of both
processing efficiency predictors was very small). Finally, among the four lexical competences, lexical knowledge was the strongest predictor, followed by
sublexical knowledge and then the processing efficiency predictors.
Language Learning 00:0, xxxx 2021, pp. 1–40
24
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Lexical Versus Sublexical Knowledge in Reading Comprehension
The lexical quality hypothesis (Perfetti, 2007) contends that high-quality representations of lexical and sublexical features are fundamentally important for
text comprehension. The lexical basis of reading comprehension that the hypothesis underscores (Perfetti & Hart, 2001) has been largely supported in the
L2 (as well as L1) reading comprehension literature. Notably, a strong association has consistently been found between vocabulary knowledge and reading
comprehension (Choi & Zhang, 2021; Grabe, 2009; Jeon & Yamashita, 2014).
This relationship was confirmed in the present study. The lexical knowledge
measure that targeted vocabulary size explained nearly 40% of the variance
in reading comprehension (when the effects of working memory and sublexical knowledge were concurrently considered; see Table 4). This finding corroborates previous findings that were derived largely from speakers of first
languages other than Arabic (e.g., Japanese, Chinese, Spanish) and that supported a strong association between vocabulary knowledge and reading comprehension. Taken together, these findings suggest that regardless of learners’
L1 background, lexical knowledge or knowledge of word meanings should not
be neglected when trying to understanding the components that contribute to
L2 reading comprehension.
An issue understudied in the literature pertains to the (unique) importance
of knowledge of sublexical features encapsulated in the lexical quality hypothesis. In the present study, we focused on (derivational) morphological
features because morphology could serve to bind other sublexical features,
including orthography, phonology, semantics, and grammar (see Kirby &
Bowers, 2017). In fact, this study attended to several aspects of morphological
knowledge that touched on orthography (the affix form measure), semantics
(the affix meaning measure), and grammar (the affix function measure). In
the L2 literature, despite an increasing interest in the role of morphology
in reading comprehension, attention to morphology has overall been limited
and most existing studies have focused on young EFL learners or bilingual
children (e.g., Ke et al., 2021; Kieffer & Lesaux, 2008; Zhang & Koda,
2013). Few studies have attended to this issue in adult learners of English
(see Zhang & Koda, 2012, for an exception). In the present study that focused
on adult Arabic-speaking EFL learners, all three measures of morphological
knowledge significantly and positively correlated with reading comprehension, and, collectively, they significantly predicted reading comprehension
over and above lexical knowledge (i.e., vocabulary size), even though the
unique effect was much smaller than that of lexical knowledge (see Table 3).
This finding thus lends support to the emphasis that the lexical quality
25
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
hypothesis places on the importance of sublexical representations for text
comprehension.
The finding also suggests that, for adult learners of English, morphological knowledge is uniquely important for reading comprehension independent
of lexical knowledge. Yet, it seems to differ from the findings of two previous studies that also focused on adult learners. Zhang and Koda (2012) found
morphological knowledge only indirectly contributed to reading comprehension and only through vocabulary knowledge; when they controlled for vocabulary knowledge, the effect of morphological knowledge was not significant.
Likewise, Qian (1999) did not report a unique and significant effect of the
morphological knowledge predictor that was intended to measure an aspect of
vocabulary depth knowledge.
One reason for the discrepancy of findings might be that the lexical/vocabulary measures in both Zhang and Koda (2012) and Qian (1999)
considered aspects of knowledge beyond that of individual word meanings.
Specifically, both studies, in addition to vocabulary size (measured with a Vocabulary Levels Test), concurrently considered word association ability as a
vocabulary depth measure, which was not the case in our study. Another reason might be that, in contrast to the two previous studies, our study had a more
comprehensive consideration of aspects of morphological knowledge, including form, meaning, as well as function. Notably, the affix function task that
targeted the participants’ knowledge of the syntactic properties (that is, partof-speech information of derivational affixes), had the highest, positive correlation with reading comprehension in this study (r = .519; see Table 2). This
aspect of knowledge, which was not specifically considered in the two previous
studies, is particularly underscored by Nagy (2007) as contributing to sentence
parsing and reading comprehension.
Whichever the reason might be, the above discussion suggests that morphological knowledge overall should be an important underpinning of reading
comprehension (see also the size of correlations reported in Jeon & Yamashita,
2014). Yet, whether a unique effect can emerge, over and above lexical knowledge, may depend on what aspects of morphological knowledge are the focus,
on the one hand, and what aspects of knowledge at the lexical level are the
concurrent focus, on the other hand. This issue warrants further research.
(Sub)Lexical Processing Efficiency in Reading Comprehension
The processing efficiency measures generated a few very intriguing findings.
Overall, when working memory and the two (lexical and sublexical) knowledge predictors were concurrently in the model, neither lexical nor sublexical
Language Learning 00:0, xxxx 2021, pp. 1–40
26
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
processing efficiency predicted reading comprehension significantly. This was
a surprising finding because, theoretically, for smooth text comprehension to
happen, efficient word recognition and word-to-text integration are essential
(Perfetti, 2007). In other words, text comprehension necessitates not only rich
knowledge of word meanings and sublexical morphological features, which
we discussed earlier, but also an ability to efficiently process printed words,
including multimorphemic words, and access their meanings during text comprehension. The ability to quickly recognize a word and word parts (that is, the
ease of accessing word knowledge), should, in theory, have an added value for
reading comprehension (Nagy & Scott, 2000; Perfetti & Hart, 2001). Automatized lower-level processing skills are essential for enabling effective participation of higher-order processes for constructing mental models during text
reading. This is in line with a capacity view of discourse comprehension (Just
& Carpenter, 1992) and should pertain to any readers of English, whether English is their L1 or L2 (Grabe, 2009; Koda, 2005).
We speculate that one interpretation for the lack of a unique and significant effect of the processing efficiency measures is that this finding may reflect
what characterizes lexical involvement at the particular developmental stage
of our participants. Although the participants had been learning English for at
least six years (in a foreign language context), their English proficiency tended
to be low. This can be partly seen from their low performance on the reading
comprehension measure (the average score for the participants was about 8 out
of 21 items; see Table 1). Level 5 of the Gates-MacGinitie Reading Tests, from
which the passages and questions were sampled, actually targets fifth graders
in an English-speaking context. In other words, for the participants to comprehend the passages, knowledge of word meaning (and knowledge of morphemic meanings for morphologically complex words) should reasonably be
the dominant influence. But in the L1 reading literature at least, less skilled
comprehenders compared to skilled comprehenders have tended to have problems with word processing or to show less immediate use of word meanings in
the integration process (Nation & Snowling, 2004; Perfetti & Stafura, 2014).
However, as we discuss in the Limitations section, this finding could also be
affected by how RT scores may not adequately represent individual differences
in (sub)lexical processing.
Another aspect for attention might be that the comprehension test was not
administered in a timed condition. Although the participants were asked to
complete a test session within a specified period of time, that is, within 10 to
15 minutes per passage, this time restriction might be too relaxed (considering that each passage was only about 120 words long and followed by only
27
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
five questions) for processing efficiency to make a noticeable additional difference (above and beyond word knowledge), particularly when individual differences in working memory were also considered. The present finding, however,
seems to corroborate findings from the NELSON project that studied adolescent learners of English in the Netherlands (e.g., Fukkink, Hulstijn, & Simis,
2005; van Gelderen et al., 2004). In these studies, word recognition speed was
not found to uniquely and significantly predict reading comprehension. In addition, although word recognition training did improve word recognition speed,
the effect did not transfer so as to benefit reading comprehension.
Despite the weak, unique effects of the two processing efficiency predictors, their relative contribution shown in Table 3 deserves some attention.
Specifically, when lexical processing efficiency was controlled for, sublexical
processing efficiency had a significant, albeit small, effect on reading comprehension. Conversely, however, this significant effect did not surface for lexical
processing. We speculate that this gap might be attributed to the psycholinguistic processes that could be differentially involved in the lexical decision
task and the morphological processing tasks. Specifically, when learners make
a decision about a highly frequent word such as sweet and visit (in a decontextualized task such as the lexical decision task in this study), they may rely only
on orthographic processing with little meaning activation, which would be very
different from processing those words in an actual text-reading situation where
access to meanings is essential. In contrast, for the two morphological processing tasks, though also decontextualized, rapid semantic activation or attention
to stem and affix meanings (e.g., inform and -ation for the stimulus word information) seemed unavoidable. Consequently, the required meaning activation
process that seemed to favor the morphological processing tasks might have
resulted in the relatively larger effect of sublexical processing efficiency than
lexical processing in this study. Such an account also seems to be in line with
that of Fukkink et al.’s (2005) result that was mentioned above, in that the improvement in the speed for recognizing decontextualized words as a result of
the word recognition training might only represent enhanced orthographic (and
phonological) processing and not capture the full lexical access that is required
of reading comprehension.
Limitations and Future Research
We focused only on four major types of lexical competence to explore the
lexical basis of reading comprehension. Although we considered both lexical
and sublexical levels and both knowledge and processing efficiency dimensions and although these predictors explained over 40% of the variance in L2
Language Learning 00:0, xxxx 2021, pp. 1–40
28
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
reading comprehension, efficient reading comprehension does not depend
solely on these dimensions. There are arguably other lexical knowledge and
skills that underpin (L2) reading comprehension. In the L2 literature, there is,
for example, an interest in the role of word or semantic association knowledge,
which has often been studied as a type of vocabulary depth knowledge (Qian,
1999; Zhang, 2012; see Schmitt, 2014, and Zhang & Koda, 2017, for reviews).
Cremer and Schoonen (2013) also distinguished between the availability and
accessibility of semantic association knowledge. In both L1 and L2 reading
literature, there has recently also been some attention given to knowledge of
connectives (e.g., Crosson & Lesaux, 2013; Fraser, Pasquarella, Geva, Gottardo, & Biemiller, 2021) and knowledge of formulaic language or multiword
lexical units (e.g., Kremmel, Brunfaut, & Alderson, 2017; Martinez & Murphy, 2011; Öksüz, Brezina, & Rebuschat, 2021). Collectively, these studies and
ours contribute to a more comprehensive understanding about the lexical basis
of (L2) reading comprehension. Nonetheless, it would seem too ambitious to
accommodate all these dimensions in a single study.
The relative contributions of different dimensions of lexical competence
to reading comprehension may depend on learners’ L2 proficiency. Some researchers have split their sample of readers into proficient and less proficient
subgroups and have aimed to examine whether the subgroups differed for any
relational patterns (e.g., Cremer & Schoonen, 2013; Shiotsu & Weir, 2007).
The present study did not perform this kind of ad hoc grouping because the
participants were from the same learner population. Future research, however,
might consider recruiting and comparing learners with distinct levels of language proficiency or at distinct developmental stages, or measuring and reporting proficiency rigorously (Park, Solon, Dehghan-Chaleshtori, & Ghanbar,
2021) so as to be able to use proficiency as a continuous variable in models,
for example.
Another limitation pertains to the relatively low reliability that we found
for the portion of the Gates-MacGinitie Reading Tests (see Table 1) that we
used with our participants (see the meta-analysis of reliability coefficients in
L2 research by Plonsky & Derrick, 2016, where the median reliability for
reading was .86). Although the Gates-MacGinitie Reading Tests are primarily intended for native English-speaking readers, they have been popular in
research with L2 populations as well, such as Li and Kirby (2015) with adolescent Chinese-speaking EFL readers or Akamatsu (2003) with adult ESL
readers. We thus have not speculated that the relatively low reliability could
be attributed to the inappropriateness of the test for adult EFL learners. One
reason for low reliability might be the relatively low number of passages and
29
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
questions included in this study. We sampled and administered only four passages because the tasks for the purpose of this study already required a commitment of over two hours, not to mention several other L2 reading tasks that
we administered for other studies. Another reason might be the extreme difficulty of a couple of the questions. For example, only 13% of the participants
correctly answered one question. To keep the test intact, we choose not to remove those items so as to augment the reliability for the present sample. Future research should consider adopting more passages with a larger number of
questions.
Finally, to accommodate the trade-off between speed and accuracy for
computer-based tasks, we followed Townsend and Ashby’s (1983) method to
calculate IESs and used those adjusted RTs as predictors of reading comprehension. Nevertheless, we recognize that IES is just one way to adjust
RT; despite its wide use in the literature, including the language learning
literature (e.g., Ke & Koda, 2017), efforts have been taken to explore other
methods for better accommodation of the speed-accuracy interaction and debates on this topic are not uncommon (e.g., Davidson & Martin, 2013; Liesefeld & Janczyk, 2019; Vandierendonck, 2017). In addition, there have been
explorations of and debates on the reliability of measuring and representing
individual differences in nonnative lexical processing. Schmalz (2020), for
example, explored a number of psycholinguistic marker effects. Word frequency, an important psycholinguistic marker, notably could modulate RT performance and its representation for individual differences in lexical processing (see also Brysbaert, Mandera, & Keuleers, 2017). In the present study,
the lexical decision task’s use of (correct responses to) words from the 1,000
most frequent words (unlike in the Vocabulary Levels Test or the lexical
knowledge task, where a wider range of less frequent words was used) could
have reduced the test’s ability to represent the participants’ individual differences in lexical processing efficiency. This is because the words may have
been very efficiently recognized, thus creating a kind of ceiling effect and
consequently limiting the test’s predictive power for reading comprehension.
It is beyond the purpose of the present study to directly investigate these
methodological issues, and as a result, we cannot rule out the possibility that
the approach to handling the interaction between speed and accuracy and
the reliability of the (adjusted) RT-based (sub)lexical processing measures
might have influenced the research findings (e.g., the lack of a significant,
unique effect for the lexical decision task). This can be a direction for future
research.
Language Learning 00:0, xxxx 2021, pp. 1–40
30
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Conclusions
In light of the lexical quality hypothesis, this study explored the lexical basis of L2 reading comprehension in a group of adult Arabic-speaking EFL
readers by studying the collective and relative contributions of four distinct
lexical competences: lexical versus sublexical and knowledge versus processing efficiency. Hierarchical regression analyses revealed that the four lexical
predictors collectively explained over 40% of the variance in the participants’
reading comprehension. Compared to the processing efficiency predictors, the
knowledge predictors had the predominant association with reading comprehension. When the knowledge predictors were not considered, sublexical (i.e.,
derivational morphological) processing efficiency, as opposed to lexical processing efficiency, significantly predicted reading comprehension, over and
above working memory. Overall, among the four lexical competences, lexical knowledge was the strongest predictor, followed by sublexical knowledge
and then the processing efficiency predictors.
This study confirmed strong lexical involvement in L2 reading comprehension. It underscored the critical importance of knowledge of word meanings that had been found in many previous studies. Yet, it also showed that
knowledge of sublexical morphological features can be important. Although
the lexical processing efficiency measures did not significantly predict reading
comprehension when lexical and sublexical knowledge were concurrently in
the model, evidence emerged that the type of processing where meaning activation is mandated (e.g., judging whether word parts can combine) can also
be important when accounting for reading comprehension. To our knowledge,
the present study was the first of its kind that has concurrently considered both
lexical and sublexical knowledge and processing efficiency to study reading
comprehension in L2 learners. The findings enrich the current understanding
about the fundamental role of lexical processes in L2 reading comprehension.
They particularly shed light on how derivational morphological knowledge as
well as processing skills may have a unique role to play for adult L2 learners
of English.
Final revised version accepted 9 September 2021
Open Research Badges
This article has earned an Open Materials badge for making publicly available
the components of the research methods needed to reproduce the reported procedure. All materials that the authors have used and have the right to share are
31
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
available at http://www.iris-database.org. All proprietary materials have been
precisely identified in the manuscript.
References
Alshehri, M., & Zhang, D. (2021a). Digit span test. Materials from “The lexical basis
of second language reading comprehension: From (sub)lexical knowledge to
processing efficiency.” [Working memory test]. IRIS Database, University of York,
UK. https://doi.org/10.48316/q7cd-mg79
Alshehri, M., & Zhang, D. (2021b). Lexical knowledge measure. Materials from “The
lexical basis of second language reading comprehension: From (sub)lexical
knowledge to processing efficiency” [Language test]. IRIS Database, University of
York, UK. https://doi.org/10.48316/qk1w-7b42
Alshehri, M., & Zhang, D. (2021c). Lexical processing efficiency task. Materials from
“The lexical basis of second language reading comprehension: From (sub)lexical
knowledge to processing efficiency” [Language test]. IRIS Database, University of
York, UK. https://doi.org/10.48316/xqtk-mh93
Alshehri, M., & Zhang, D. (2021d). Sub-lexical knowledge measure. Materials from
“The lexical basis of second language reading comprehension: From (sub)lexical
knowledge to processing efficiency” [Language test]. IRIS Database, University of
York, UK. https://doi.org/10.48316/k4cp-jw68
Alshehri, M., & Zhang, D. (2021e). Sub-lexical processing efficiency task. Materials
from “The lexical basis of second language reading comprehension: From
(sub)lexical knowledge to processing efficiency” [Language test]. IRIS Database,
University of York, UK. https://doi.org/10.48316/nnth-9h06
Allison, P. D. (1999). Multiple regression. Thousand Oaks, CA: Pine Forge Press.
Anderson, R. C., & Freebody, P. (1981). Vocabulary knowledge. In J. T. Guthrie (Ed.),
Comprehension and teaching: Research reviews (pp. 77–117). Newark, DE:
International Reading Association.
Akamatsu, N. (2003). The effects of first language orthographic features on second
language in text. Language Learning, 53, 207–231.
https://doi.org/10.1111/1467-9922.00216
Baddeley, A. (2007). Working memory, thought, and action. Oxford, UK: Oxford
University Press.
Brown, J. D. (2014). Classical theory reliability. In A. J. Kunnan (Ed.), The companion
to language assessment (pp. 1165–1181). Oxford, UK: Wiley-Blackwell.
https://doi.org/10.1002/9781118411360.wbcla054
Brysbaert, M., Mandera, P., & Keuleers, E. (2017). The word frequency effect in word
processing: An updated review. Current Directions in Psychological Science, 27,
45–50. https://doi.org/10.1177/0963721417727521
Ciaccio, L. A., & Clahsen, H. (2020). Variability and consistency in first and
second language processing: A masked morphological priming study on prefixation
Language Learning 00:0, xxxx 2021, pp. 1–40
32
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
and suffixation. Language Learning, 70, 103–136.
https://doi.org/10.1111/lang.12370
Choi, Y., & Zhang, D. (2021). The relative role of vocabulary and grammatical
knowledge in L2 reading comprehension: A systematic review of literature.
International Review of Applied Linguistics in Language Teaching, 59, 1–30.
https://doi.org/10.1515/iral-2017-0033
Clahsen, H., Felser, C., Neubauer, K., Sato, M., & Silva, R. (2010). Morphological
structure in native and nonnative language processing. Language Learning, 60,
21–43. https://doi.org/10.1111/j.1467-9922.2009.00550.x
Cremer, M., & Schoonen, R. (2013). The role of accessibility of semantic word
knowledge in monolingual and bilingual fifth-grade reading. Applied
Psycholinguistics, 34, 1195–1217. https://doi.org/10.1017/S0142716412000203
Crosson, A. C., & Lesaux, N. K. (2013). Does knowledge of connectives play a unique
role in the reading comprehension of English learners and English-only students?
Journal of Research in Reading, 36, 241–260.
https://doi.org/10.1111/j.1467-9817.2011.01501.x
Curran, P. J., West, S. G., & Finch, J. F. (1996). The robustness of test statistics to
nonnormality and specification error in confirmatory factor analysis. Psychological
Methods, 1, 16–29. https://doi.org/10.1037/1082-989X.1.1.16
Davidson, D. J., & Martin, A. E. (2013). Modeling accuracy as a function of response
time with the generalized linear mixed effects model. Acta Psychologia, 144,
83–96. https://doi.org/10.1016/j.actpsy.2013.04.016
Davies, M. (2008). The corpus of contemporary American English (COCA). Retrieved
from https://www.english-corpora.org/coca/
Farran, L. K., Bingham, G. E., & Matthews, M. W. (2012). The relationship between
language and reading in bilingual English-Arabic children. Reading and Writing,
25, 2153–2181. https://doi.org/10.1007/s11145-011-9352-5
Fraser, C., Pasquarella, A., Geva, E., Gottardo, A., & Biemiller, A. (2021). English
language learners’ comprehension of logical relationships in expository texts:
Evidence for the confluence of general vocabulary and text-connecting functions.
Language Learning, 71, 872–906. https://doi.org/10.1111/lang.12453
Fukkink, R. G., Hulstijn, J., & Simis, A. (2005). Does training in second-language
word recognition skills affect reading comprehension? An experimental study.
Modern Language Journal, 89, 54–75.
https://doi.org/10.1111/j.0026-7902.2005.00265.x
Gelman, A., & Hill, J. (2007). Data analysis using regression and
multilevel/hierarchical models. Cambridge, UK: Cambridge University Press.
Grabe, W. (2009). Reading in a second language: Moving from theory to practice.
New York, NY: Cambridge University Press.
Harrington, M. (2018). Lexical facility: Size, recognition speed and consistency as
dimensions of second language vocabulary knowledge. London, UK: Palgrave
MacMillan.
33
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Harrington, M., & Sawyer, M. (1992). L2 working memory capacity and L2 reading
skill. Studies in Second Language Acquisition, 14, 25–38.
https://doi.org/10.1017/S0272263100010457
Hayes, A. H., & Coutts, J. J. (2020). Use omega rather than Cronbach’s alpha for
estimating reliability. But…. Communication Methods and Measures, 14, 1–24.
https://doi.org/10.1080/19312458.2020.1718629
Hu, H. M., & Nation, P. (2000). What vocabulary size is needed to read unsimplified
texts. Reading in a Foreign Language, 8, 689–696.
Jeon, E. H. (2015). Multiple regression. In L. Plonsky (Ed.), Advancing quantitative
methods in second language research (pp. 131–158). New York, NY: Routledge.
Jeon, E. H., & Yamashita, J. (2014). L2 reading comprehension and its correlates: A
meta-analysis. Language Learning, 64, 160–212.
https://doi.org/10.1111/lang.12034
Jiang, N. (2012). Conducting reaction time research in second language studies. New
York, NY: Routledge.
Just, M. A., & Carpenter, P. A. (1992). A capacity theory of comprehension:
Individual differences in working memory. Psychological Review, 99, 122–149.
https://doi.org/10.1037/0033-295x.99.1.122
Ke, S., & Koda, K. (2017). Contribution of morphological awareness to adult L2
Chinese word meaning inferencing. The Modern Language Journal, 101, 742–755.
https://doi.org/10.1111/modl.12428
Ke, S., Miller, R. T., Zhang, D., & Koda, K. (2021). Crosslinguistic sharing of
morphological awareness in biliteracy development: A systematic review and
meta-analysis of correlation coefficients. Language Learning, 71, 8–54.
https://doi.org/10.1111/lang.12429
Kieffer, M. J., & Lesaux, N. K. (2008). The role of derivational morphology in the
reading comprehension of Spanish-speaking English language learners. Reading
and Writing, 21, 783–804. https://doi.org/10.1007/s11145-007-9092-8
Kintsch, W. (1988). The construction-integration model of text comprehension.
Psychological Review, 95, 163–182. https://doi.org/10.1037/0033-295X.95.2.163
Kirby, J. R., & Bowers, P. N. (2017). Morphological instruction and literacy: Binding
phonological, orthographic, and semantic features of words. In K. Cain, D. L.
Compton, & R. K. Parrila (Eds.), Theories of reading development (pp. 437–462).
Amsterdam, The Netherlands: John Benjamins.
https://doi.org/10.1075/swll.15.24kir
Koda, K. (2000). Cross-linguistic variations in L2 morphological awareness. Applied
Psycholinguistics, 21, 297–320. https://doi.org/10.1017/S0142716400003015
Koda, K. (2005). Insights into second language reading: A cross-linguistic approach.
New York, NY: Cambridge University Press.
Kremmel, B., Brunfaut, T., & Alderson, J. C. (2017). Exploring the role of
phraseological knowledge in foreign language reading. Applied Linguistics, 38,
848–870. https://doi.org/10.1093/applin/amv070
Language Learning 00:0, xxxx 2021, pp. 1–40
34
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Liesefeld, H. R., & Janczyk, M. (2019). Combining speed and accuracy to control for
speed-accuracy trade-offs(?). Behavior Research Methods, 51, 40–60.
https://doi.org/10.3758/s13428-018-1076-x
Li, M., & Kirby, J. R. (2015). The effects of vocabulary breadth and depth on English
reading. Applied Linguistics, 36, 611–634. https://doi.org/10.1093/applin/amu007
MacGinitie, W. H., MacGinitie, R. K., Maria, K., & Dreyer, L. G. (2000).
Gates-MacGinitie Reading Tests–Directions for Administration (levels 7/9 & 10/12)
(4th Ed.). Rolling Meadows, IL: Riverside.
Martinez, R., & Murphy, V. A. (2011). Effect of frequency and idiomaticity on second
language reading comprehension. TESOL Quarterly, 45, 267–290.
https://doi.org/10.5054/tq.2011.247708
McNeish, D. (2018). Thanks coefficient alpha, we’ll take it from here. Psychological
Methods, 23, 412–433. https://doi.org/10.1037/met0000144
Muthén, L. K., & Muthén, B. O. (1998–2017). Mplus user’s guide (8th ed.). Los
Angeles, CA: Author.
Nagy, W. (2007). Metalinguistic awareness and the vocabulary-comprehension
connection. In R. K. Wagner, A. E. Muse, & K. R. Tannenbaum (Eds.), Vocabulary
acquisition: Implications for reading comprehension (pp. 52–77). New York, NY:
Guilford Press.
Nagy, W. E., & Anderson, R. C. (1984). How many words are there in printed school
English? Reading Research Quarterly, 19, 304–330. https://doi.org/10.2307/747823
Nagy, W. E., & Scott, J. A. (2000). Vocabulary processes. In M. L. Kamil, P. B.
Mosenthal, P. D. Pearson, & R. Barr (Eds.), Handbook of reading research (Vol. 3,
pp. 269–284). Mahwah, NJ: Lawrence Erlbaum.
Nation, K., & Snowling, M. J. (2004). Beyond phonological skills: Broader language
skills contribute to the development of reading. Journal of Research in Reading, 27,
342–356. https://doi.org/10.1111/j.1467-9817.2004.00238.x
Öksüz, D., Brezina, V., & Rebuschat, P. (2021). Collocational processing in L1 and L2:
The effects of word frequency, collocational frequency, and association. Language
Learning, 71, 55–98. https://doi.org/10.1111/lang.12427
Park, H. I., Solon, M., Dehghan-Chaleshtori, M., & Ghanbar, H. (2021). Proficiency
reporting practices in research on second language acquisition: Have we made any
progress? Language Learning, Retrieved from https://onlinelibrary.wiley.com/
action/showCitFormats?doi=10.1111%2Flang.12475
Pasquarella, A., Gottardo, A., & Grant, A. (2012). Comparing factors related to
reading comprehension in adolescents who speak English as a first (L1) or second
(L2) language. Scientific Studies of Reading, 16, 474–503.
https://doi.org/10.1080/10888438.2011.593066
Peirce, J. W., Gray, J. R., Simpson, S., MacAskill, M. R., Höchenberger, R., Sogo, H.,
… Lindeløv, J. (2019). PsychoPy2: Experiments in behavior made easy. Behavior
Research Methods, 51, 195–203. https://doi.org/10.3758/s13428-018-01193-y
35
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Pellicer-Sánchez, A., & Schmitt, N. (2012). Scoring yes-no vocabulary tests: Reaction
time vs. nonword approaches. Language Testing, 29, 489–509.
https://doi.org/10.1177/0265532212438053
Perfetti, C. A. (1985). Reading ability. Oxford, UK: Oxford University Press.
Perfetti, C. A. (1999). Comprehending written language: A blueprint for the reader. In
C. Brown & P. Hagoort (Eds.), Neurocognition of language (pp. 167–208). Oxford,
UK: Oxford University Press.
Perfetti, C. A. (2007). Reading ability: Lexical quality to comprehension. Scientific
Studies of Reading, 11, 357–383. https://doi.org/10.1080/10888430701530730
Perfetti, C. A., & Hart, L. (2001). The lexical basis of comprehension skill. In D. S.
Gorfein (Ed.), On the consequences of meaning selection: Perspectives on resolving
lexical ambiguity (pp. 67–86). Washington, DC: American Psychological
Association. https://doi.org/10.1037/10459-004
Perfetti, C. A., & Stafura, J. (2014). Word knowledge in a theory of reading
comprehension. Scientific Studies of Reading, 18, 22–37.
https://doi.org/10.1080/10888438.2013.827687
Plonsky, L., & Derrick, D. (2016). A meta-analysis of reliability coefficients in second
language research. The Modern Language Journal, 100, 538–553.
https://doi.org/10.1111/modl.12335
Proctor, C. P., Carlo, M., August, D., & Snow, C. (2005). Native Spanish-speaking
children reading in English: Toward a model of comprehension. Journal
of Educational Psychology, 97, 246–256.
https://doi.org/10.1037/0022-0663.97.2.246
Richardson, J. T. E. (2007). Measures of short-term memory: A historical review.
Cortex, 43, 635–650. https://doi.org/10.1016/S0010-9452(08)70493-3
Qian, D. (1999). Assessing the roles of depth and breadth of vocabulary knowledge in
reading comprehension. Canadian Modern Language Review, 56, 282–308.
https://doi.org/10.3138/cmlr.56.2.282
RAND Reading Study Group. (2002). Reading for understanding: Toward an R&D
program in reading comprehension. Santa Monica, CA: RAND.
Ratcliff, R. (1993). Methods for dealing with reaction time outliers. Psychological
Bulletin, 114, 510–532. https://doi.org/10.1037/0033-2909.114.3.510
Sasao, Y., & Webb, S. A. (2017). The Words Parts Levels Test. Language Teaching
Research, 21, 12–30. https://doi.org/10.1177/1362168815586083
Schmalz, X. (2020). How (not) to measure individual differences: The case of
statistical learning and lexical processing efficiency. Presentation given at the Clinic
and Polyclinic for Child and Adolescent Psychiatry, Psychosomatics and
Psychotherapy, University of Munich. Retrieved from
https://osf.io/ctyqn/?show=revision
Schmitt, N. (2014). Size and depth of vocabulary knowledge: What the research
shows. Language Learning, 64, 913–951. https://doi.org/10.1111/lang.12077
Language Learning 00:0, xxxx 2021, pp. 1–40
36
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Schmitt, N., Jiang, X., & Grabe, W. (2011). The percentage of words known in a text
and reading comprehension. The Modern Language Journal, 95, 26–43.
https://doi.org/10.1111/j.1540-4781.2011.01146.x
Schmitt, N., Schmitt, D., & Clapham, C. (2001). Developing and exploring the
behaviour of two new versions of the Vocabulary Levels Test. Language Testing, 18,
55–88. https://doi.org/10.1177/026553220101800103
Shiotsu, T., & Weir, C. J. (2007). The relative significance of syntactic knowledge and
vocabulary breadth in the prediction of reading comprehension test performance.
Language Testing, 24, 99–128. https://doi.org/10.1177/0265532207071513
Tokowicz, N. (2014). Lexical processing and second language acquisition. New York,
NY: Routledge.
Townsend, J. T., & Ashby, F. G. (1983). Stochastic modeling of elementary
psychological processes. Cambridge, UK: Cambridge University Press.
Vandierendonck, A. (2017). A comparison of methods to combine speed and accuracy
measures of performance: A rejoinder on the binning procedure. Behavior Research
Methods, 49, 653–673. https://doi.org/10.3758/s13428-016-0721-5
van Gelderen, A., Schoonen, R., De Glopper, K., Hulstijn, J., Simis, A., Snellings, P.,
& Stevenson, M. (2004). Linguistic knowledge, processing speed, and
metacognitive knowledge in first-and second-language reading comprehension: A
componential analysis. Journal of Educational Psychology, 96, 19–30.
https://doi.org/10.1037/0022-0663.96.1.19
Webb, S., Sasao, Y., & Ballance, O. (2017). The updated Vocabulary Levels Test:
Developing and validating two new forms of the VLT. ITL - International Journal
of Applied Linguistics, 168, 34–70. https://doi.org/10.1075/itl.168.1.02web
Yamashita, J. (2013). Word recognition subcomponents and passage level reading in a
foreign language, Reading in a Foreign Language, 25, 52–71.
Zhang, D. (2012). Vocabulary and grammar knowledge in second language reading
comprehension: A structural equation modeling study. The Modern Language
Journal, 96(4), 558–575. https://doi.org/10.1111/j.1540-4781.2012.01398.x
Zhang, D. (2017). Derivational morphology in reading comprehension of
Chinese-speaking learners of English. Applied Linguistics, 38, 871–895.
https://doi.org/10.1093/applin/amv072
Zhang, D., & Ke, S. (2020). The simple view of reading made complex by
morphological decoding fluency in bilingual fourth-grade readers of English.
Reading Research Quarterly, 55, 311–329. https://doi.org/10.1002/rrq.287
Zhang, D., & Koda, K. (2012). Contribution of morphological awareness and lexical
inferencing ability to L2 vocabulary knowledge and reading comprehension among
advanced EFL learners: Testing direct and indirect effects. Reading and Writing,
25, 1195–1216. https://doi.org/10.1007/s11145-011-9313-z
Zhang, D., & Koda, K. (2013). Morphological awareness and reading comprehension
in a foreign language: A study of young Chinese EFL learners. System, 41,
901–913. http://doi.org/10.1016/j.system.2013.09.009
37
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
Zhang, D., & Koda, K. (2017). Assessing L2 learners’ vocabulary depth with word
associates format tests: Issues, findings, and suggestions. Asian-Pacific Journal of
Second and Foreign Language Education, 2, 1–30.
https://doi.org/10.1186/s40862-017-0024-0
Supporting Information
Additional Supporting Information may be found in the online version of this
article at the publisher’s website:
Appendix S1. Elaborated Descriptive Statistics and Reliability Estimates for
Competency Measures.
Appendix S2. Parameter Estimates of Regression Models for Lexical Versus
Sublexical Predictors of Reading Comprehension.
Appendix S3. Parameter Estimates of Regression Models for Knowledge Versus Processing Predictors of Reading Comprehension.
Appendix: Accessible Summary (also publicly available at
https://oasis-database.org)
The importance of lexical competence for reading comprehension in a
second language
What this research was about and why it is important
Words are the building blocks of texts. For good reading comprehension, second language (L2) learners need to know the meaning of a large number of
words. Yet, this knowledge, often known as vocabulary breadth (or size), is insufficient. Written English is replete with affixed words where suffixes such as
-ness and -ive are added to a word to modify its meaning (and often its grammatical status as well). This means that learners also need to have good affix
or word part knowledge for recognizing those words during text reading. In
addition, readers need to be able to access these types of knowledge efficiently
(at speed). This study measured four distinct aspects of lexical competence in
university Arabic-speaking learners of English as a foreign language in Saudi
Arabia and tested the relative importance of these aspects of lexical competence for L2 reading comprehension. Vocabulary breadth had the strongest
association with reading comprehension, followed by affix knowledge and
then the speed-related lexical skills. Understanding the relative importance of
different aspects of lexical competence is essential to appropriate instructional
emphasis in L2 reading classrooms.
Language Learning 00:0, xxxx 2021, pp. 1–40
38
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
What the researchers did
r Participants were tested (in a group setting) on their vocabulary breadth,
affix knowledge, and reading comprehension.
r Participants were also tested (on an individual basis) on their speed of recognizing whole words and on their speed of segmenting words into meaningful
parts or combining parts to make a word.
r The reading comprehension test involved answering a total of 21 multiple
choice questions about four short passages.
r Statistical relations were calculated for the associations of the four distinct
aspects of lexical competence with reading comprehension, and the strength
of the associations was compared.
What the researchers found
r Vocabulary breadth was the most important of the four aspects of lexical
competence in explaining the reading comprehension scores.
r Affix knowledge was additionally important for accounting for reading comprehension scores.
r Knowledge of words (vocabulary breath) and of affixes played a stronger
role than did the two speed measures.
r The speed of segmenting words and speed of combining word parts was also
found to explain reading comprehension scores, though only to a relatively
small extent.
r The speed of word recognition was not found to play a notable role in comprehension scores.
Things to consider
r Diverse aspects of lexical competence can have distinct roles during text
reading.
r Knowing the meaning of a large number of words is important for reading
comprehension, but knowledge of word parts such as affixes is also important.
r It may help for L2 vocabulary instruction to go beyond simple meaning
definitions, though the current study did not test this.
r Further research should check whether the speed measures used in the current study reliably captured individual differences in learners’ access to
words.
Materials, data, open access article: Materials are publicly available at https:
//www.iris-database.org.
39
Language Learning 00:0, xxxx 2021, pp. 1–40
Alshehri and Zhang
Lexical Basis of L2 Reading Comprehension
How to cite this summary: Alshehri, M. G., & Zhang, D. (2022). The importance of lexical competence for reading comprehension in a second language. OASIS Summary of Alshehri and Zhang (2022) in Language Learning.
https://oasis-database.org
This summary has a CC BY-NC-SA license.
Language Learning 00:0, xxxx 2021, pp. 1–40
40