Introduction

Substantial evidence from research with alphabetic scripts indicates flexibility in the spatial encoding of orthographic information (Grainger, Dufau, & Ziegler, 2016). Much of this research comes from studies assessing the influence of brief, masked presentations of transposed-letter primes on lexical decisions (i.e. word/nonword judgements) for subsequent target words (e.g. Perea & Lupker, 2003). These show that, for a target word like ‘judge’, primes containing letter transpositions (e.g. ‘jugde’) produce responses that are faster than following primes with letter substitutions (e.g. ‘jupbe’), and often not much slower than responses following correctly spelled primes (e.g. ‘judge’). As masked primes are assumed to influence an early stage of word recognition (Forster, Mohan, & Hector, 2003), the findings suggest that letter position, but not letter identity, is processed flexibly early in word recognition. Moreover, such findings have been central to the development of models of visual word recognition (see Norris, 2013).

Other research shows that transposing letters in words is associated with longer reading times for text (Rayner et al., 2006; White et al., 2008). However, eye-movement studies using the boundary paradigm show transposed-letter effects during parafoveal processing similar to those in masking priming (e.g. Johnson, Perea, & Rayner, 2007). In this paradigm, an invisible boundary is placed in front of a target word in a sentence. Prior to the reader’s gaze crossing the boundary, the target word is shown normally (i.e. a valid preview, e.g. ‘clerk’), or with letters transposed (e.g. ‘clrek’) or replaced by other letters (e.g. ‘cohrk’) to create an invalid preview. Once the reader’s gaze crosses the boundary, the display reverts to the target word, within the time it takes the reader to make an eye movement, so that readers are unaware a change has been made. In this way, preview effects on subsequent target word processing can be examined. Studies using this paradigm show larger benefits for transposed- compared to substituted-letter previews, suggesting letter-position information is processed flexibly during parafoveal processing. This research has primarily been conducted in alphabetic scripts like English. However, recent studies have examined whether similar effects are observed in Chinese reading. Chinese is created from box-like logograms called characters, with most words comprising two (or more) characters (Hoosain, 1992), although word boundaries are not visually demarcated in text (Li et al., 2015). Studies have therefore investigated whether a word’s character order is processed strictly in Chinese reading.

An early study by Taft, Zhu and Peng (1999) used a paradigm in which participants made lexical decisions for two-character words (and nonwords) that could form a word, or not, when their characters were transposed. Lexical decisions were slower for transposable words and nonwords relative to non-transposable controls, which Taft et al. took as evidence for flexible character-order processing. They argued that presentation of a transposable word activates lexical entries for both the correct word and its transposed-character counterpart (i.e. 领带 activates both 领带, meaning ‘tie’, and 带领, meaning ‘lead’), with competition between these slowing lexical decisions. Similarly, they argued that, as a transposable nonword presentation can activate the lexical entry for its base word, this will interfere with the decision to reject the nonword presentation, slowing lexical decisions for transposable nonwords relative to non-transposable controls.

Other evidence from studies using conventional masked priming methods (Gu, Li, & Liversedge, 2015: Experiment 1) supports this view, by showing faster responses to target words (e.g. 吝啬) following a transposed-character nonword prime (e.g. 啬吝) than unrelated nonword prime (e.g. 菠汞). The explanation here is that the brief, masked presentation of a transposable nonword activates the lexical entry for its base word, facilitating the recognition of that word when subsequently it is presented as a target. Similarly, eye-movement studies using the boundary paradigm show that target word processing benefits more from parafoveal preview of a transposed- than substituted-character nonword (Gu et al., 2015: Experiment 2; Gu & Li, 2015). In this paradigm, an invisible boundary is placed immediately prior to a target word and, before the reader’s gaze crosses this boundary, the target word is displayed normally (i.e. a valid preview) or replaced by a nonword created by either transposing or replacing its two characters (invalid transposed-character or substituted-character previews). As soon as the reader’s gaze crosses the boundary, the invalid displays revert to the target word and the experimenter can examine the effects of the different previews on the subsequent processing of the target word. Finding that a transposed-character preview facilitates processing relative to a substituted-character preview suggests that preview of a transposed-character nonword can activate the lexical entry for its base word, helping lexical identification of that word following the display change. Yang (2013) additionally showed that transposed-character previews that formed words with either a similar or a different meaning to the target produced similar benefits relative to a control preview, indicating that the effect is due to flexible character order encoding rather than semantic-relatedness. Studies to date therefore provide good evidence for flexible parafoveal processing of character order in Chinese reading.

An outstanding question concerns whether contextual expectations can influence parafoveal processing of character-order information during Chinese reading. Such effects are theoretically important, as they may provide novel evidence that context influences an early stage of word identification associated with parafoveal processing of linguistic information. Numerous boundary paradigm studies in alphabetic languages show that predictability effects when previews are identical to the target word (e.g. cake) are reduced for orthographically similar, substituted-letter previews (e.g. cahc), and eliminated for orthographically dissimilar previews (e.g. pies; Balota, Pollatsek, & Rayner, 1985; see Staub, 2015), with similar effects for Chinese (Chang et al., 2020). These findings suggest that, in high-predictability contexts, readers assess the fit of parafoveal information with their expectations. As parafoveal processing is degraded due to limitations in retinal acuity (Hilz & Cavonius, 1974), coarse perceptual information that fits those expectations (i.e. a visually similar preview) may facilitate subsequent processing of target word information, wherease parafoveal information that does not fit with predictions (i.e. a visually dissimilar preview) will not. An important question, therefore, is whether word predictability effects also are observed for transposed- relative to substituted-character previews, as this would reveal that contextual predictability and flexible character order processing can conspire to influence parafoveal processing. One possibility is that predictability might facilitate the processing of transposed-character relative to substituted-character previews. However, a study by Luke and Christianson (2012), using a novel paradigm combining self-paced word-by-word alphabetic reading and masked priming, found an advantage for transposed-letter over substituted-letter primes in non-constraining contexts was eliminated when the target word was strongly predicted. This was taken to show that contextual prediction can be highly lexically specific, including predicting the specific identity and order of letters in words (Laszlo & Federmeier, 2009). However, this requires further investigation as the study used a non-standard method and manipulated context across experiments (low- and high-constraint materials were used in different experiments).

Accordingly, the present experiment assessed the influence of word predictability on the transposed-character effect in Chinese reading. Participants read sentences where an invisible boundary was placed immediately before a two-character target word that was either highly predictable or less predictable from the prior sentence context (Table 1). Before the reader’s gaze crossed this boundary, the target was shown normally, as a valid preview, or with its characters either transposed or substituted by orthographically dissimilar characters to create invalid nonword previews, following the same procedure as previous research (Gu et al., 2015). We expected highly predictable words to yield shorter reading times than less predictable words, in line with research on word predictability effects in Chinese reading (e.g. Rayner et al., 2005; Zhao et al., 2019). We also expected to replicate a larger preview benefit for transposed- relative to substituted-character nonwords, consistent with flexible parafoveal processing of character order in previous research (Gu et al., 2015). The crucial question was whether predictability would modulate this effect. Following Luke and Christianson (2012), we might expect a transposed-character effect for less predictable words to be reduced or eliminated for highly predictable words. Alternatively, if lexical prediction is supported by flexible parafoveal encoding of character order, predictability effects may be larger following transposed- than substituted-character previews.

Table 1 An example stimulus. Preview stimuli and target words are underlined and vertical lines indicate the invisible boundary

Method

Eethics statement

The study was approved by the research ethics committee in the Academy of Psychology and Behavior at Tianjin Normal University and conducted in accordance with the principles of the Declaration of Helsinki.

Participants

Sixty young adults aged 18–24 years (M = 20) from Tianjin Normal University participated. An additional three participants were replaced as they noticed a large proportion of display changes. All were native Mandarin readers, screened for normal acuity (greater than 20/40 in Snellen values) using a Tumbling E eye chart (Taylor, 1978), and naïve to the purpose of the experiment. A power analysis based on the gaze-duration effects in Experiment 2 of Gu et al. (2015), using the SIMR package (Green & McLeod, 2016) in R (R Development Core Team, 2016), suggested the experiment was appropriately powered to detect a transposed-character effect (power > 80%). Note that previous research has not reported effect sizes for word-skipping, precluding power analysis based on this measure.

Stimuli and design

Stimuli were 90 Chinese sentence frames, each containing a two-character target word. Following procedures similar to those used in other studies of predictability effects in alphabetic languages and Chinese (Rayner et al., 2005; Rayner & Well, 1996; Zhao et al., 2019), words were selected to have high or low predictability from the prior sentence context using a cloze procedure with sentences truncated immediately before the target word. Fifteen young adults who did not take part in the experiment provided the next word in the sentence. A word was considered highly predictable if more than 75% of participants guessed it to be the next word, and as less predictable if fewer than 20% guessed it to be the next word. See Table 2 for selected words characteristics. High- and low-predictability words were matched for word frequency and first- and second-character frequency using the SUBTLEX-CH database (Cai & Brysbaert, 2010), and for visual complexity in terms of number of character strokes. Ten participants evaluated sentence naturalness (using a 7-point scale, ranging from 1 = entirely unnatural to 7 = entirely natural). Ratings were high with no significant difference between sentences with high- and low-predictability target words. Sentences averaged 22 characters long (range = 18–31 characters) and target words always appeared near their center.

Table 2 Mean target word properties

Sentences were presented using the boundary paradigm (Rayner, 1975), with an invisible boundary placed immediately before the target word. Valid previews were the high- and low- predictability words. Following the same procedure as previous research (Gu et al., 2015), tranposed-character and substituted-character nonword previews were created by either transposing a word’s characters or replacing both characters with orthographically dissimilar characters. Characters in substituted-character previews could not form a word with characters to the left or right and were matched with the target words for first- and second-character frequency (ts < 1.4, ps > .18), and number of strokes (t(89) = 1.00, p = .32).

The experiment used a within-subjects design with the factors word predictability (high, low) preview type (valid, invalid transposed-character, invalid substituted-character) as independent variables and eye-movement measures as dependent variables. We adopted a counterbalanced design in which each participant read one version of each sentence frame and equal numbers of sentences in each condition.

Apparatus and procedure

An SR Eyelink 1000 plus eye tracker tracked right-eye movements during binocular viewing at 1,000 Hz. Stimuli were displayed in Song 34-point font as black-on-white text on a high-resolution (1,920 × 1,080 pixels) monitor with a fast refresh rate (144 Hz). At 58-cm viewing distance, each character subtended 1.2° and so was of normal size for reading.

Participants took part individually and were instructed to read normally and for comprehension. At the start of the experiment, a three-point horizontal calibration procedure was performed across the same line as each sentence presentation (ensuring .30° or better spatial accuracy for all participants). Calibration accuracy was checked before each trial and the eye tracker recalibrated as required to maintain high spatial accuracy. At the start of each trial, a fixation square equal in size to one character was presented on the left side of the screen. Once the participant fixated this location, a sentence was presented with the first character replacing the square. Participants pressed a response key once they finished reading each sentence. This was replaced on 30% of trials by a comprehension question requiring a yes/no button-press response. At the end of the experiment, participants were asked if they noticed display changes. Three reported seeing more than 10%. Data for these participants were excluded from analyses and the participants replaced. All other participants reported seeing at most 5% display changes. The experiment lasted approximately 30 min for each participant.

Results

Accuracy answering comprehension questions was high for all participants (> 90%). Following standard procedures, short (< 80ms) and long (> 1,200 ms) fixations were removed (affecting 4.4% of fixations). Trials with track-loss or error were also excluded (affecting 21 (<1%) trials), as were trials for sentences receiving fewer than than six fixations (affecting 355 (6.6%) trials; following Zhang et al., 2019, although the same pattern of effects was observed when this exclusion criterion was not applied), trials in which an eye-blink was made within the boundary region or when fixating the target word (affecting 94 (1.7%) trials), and trials where display changes were delayed (i.e. implemented > 12 ms into the next fixation, affecting 656 (12%) fixations). We also excluded trials in which a saccade triggered a display change but terminated to the left of the larget word (i.e. a ‘j-hook’, affecting 145 (2.7%) trials). In total, 1,271 (23.5%) trials were removed. The remaining data were analysed by linear mixed-effects models (LMEs, Baayen, Davidson, & Bates, 2008) for continuous variables and generalized mixed-effects models for binomial variables, using the lme4 package (Version 1.1-21; Bates, Mächler, Bolker, & Walker, 2015) in R (R Development Core Team, 2016). For all measures, models with maximum random-effects structure were used (Barr, Levy, Scheepers, & Tily, 2013), with predictability, preview and their interaction as fixed factors, and participants and stimuli as crossed random effects. If models did not converge, the random-effects structure was reduced by first trimming this for stimuli, starting with removal of random-effect correlations, then random slopes. Contrasts of main effects and contrasts to examine interactions were defined using sliding contrasts (the contr.sdif function) in the MASS package (Version 7.3-47; Venables & Ripley, 2002). Log-transformed fixation-time effects are reported alongside untransformed means. Following convention, t/z values > 1.96 were considered significant.

We report word-level analyses for the target words, including measures informative about first-pass reading, i.e. the initial processing of a target word prior to a fixation to its right or a regression. These comprised word-skipping (SKIP, probability of not fixating a word during first-pass reading), first-fixation duration (FFD, duration of the first fixation on a word during first-pass reading), single-fixation duration (SFD, duration of the first fixation on a word receiving only one first-pass fixation), gaze duration (GD, sum of all first-pass fixations on a word) and regressions-out (RO, probability of a first-pass regression from a word). We report total reading time (TRT, sum of all fixations on a word) and regressions-in (RI, probability of a regression back to a word) as measures of later processing. These analyses were based on 2,438 observations for FFD, 2,254 observations for SFD (wth 42% of target words receiving a single first-pass fixation), 2,438 observations for GD, and 3,074 observations for TRT.

Target word means are shown in Table 3 and statistical effects summarized in Table 4. Predictability effects, marginal in FFD and significant in GD and TRT, were due to longer reading times for low- than high-predictability words. Marginal effects in regressions suggested readers made more regressions to the less predictable words. Preview effects were significant in target-word reading times (FFD, SFD, GD, TRT; see Fig. 1). Sliding contrasts showed these were longer for substituted-character compared to transposed-character previews, and for transposed-character compared to valid previews, revealing a transposed-character effect similar to previous research (Gu et al., 2015). A preview effect in word-skipping was due to reduced skipping following substituted-character compared to transposed-character previews, with no difference between transposed-character and valid previews, indicating less skipping when the preview was uninformative about target-word identity. Finally, readers made more regressions back to target words following transposed-character compared to valid previews, with no difference between transposed-character and substituted-character previews, indicating that invalid previews caused readers to re-inspect words more often.

Table 3 Means for target word measures
Table 4 Summary of statistical effects (continuous variables were log-transformed)
Fig. 1
figure 1

Pirate plots creating using the Yarrr package in R (Version 0.1.5, Phillips, 2017) depicting effects of preview type on fixation times for high- and low-predictability target in (A) first-fixation durations, (B) single-fixation durations, (C) gaze durations and (D) total reading times

Interactions between word predictability and the contrast between transposed-character and substituted-character previews in FFD and SFD were due to significant word predictability effects in these early fixation-time measures following transposed-character but not substituted-character previews (FFD, transposed-character preview = 12-ms effect, substituted-character preview = −13-ms effect; SFD, transposed-character preview = 12-ms effect, substituted-character preview = −13-ms effect; see Fig. 1). The same pattern of effect was observed in GD, although not significantly. A similar effect in word-skipping was due to lower skipping following substituted-character than transposed-character previews in high- than low-predictability contexts, most likely because substituted-character previews mismatched with contextual expectations in the high predictability condition. By comparison with these effects, no interactions between word predictability and the contrast between the transposed-character previews and valid previews in FFD and SFD were observed (FFD, valid preview = 10-ms effect; SFD, valid preview = 9-ms effect). The findings therefore indicate that transposed-character previews, but not substituted-character previews, supported word predictability effects during early processing of target words. No other interaction effects were significant.

Bayes factors (Kass & Raftery, 1995) were calculated to assess interaction effects in fixation time measures (note this technique is not applicable to discontinuous variables like word-skipping). These were computed separately for models examining contrasts between identical and transposed-character previews, and transposed-character and substituted-character previews. The analyses were conducted using the lmBF function within the BayesFactor package (Version 0.9.12-2; Morey, Roudner, & Jamil, 2015) in R (R Core Team, 2016), assessing support for H1 over H0 (i.e. BF10; see, e.g., Morey & Rouder, 2011) by comparing models incorporating both main effects and an interaction between preview and predictability against models containing only main effects of these variables. Analyses were conducted with scaling factor for g-priors set to 0.5, using 100,000 Monte Carlo iterations, and participants and items specified as random factors. Following Vandekerckhove, Matzke, and Wagenmakers (2015; derived from Jeffreys, 1961), Bayes factors (BFs) > 3 were taken to provide weak to moderate support for a model including an interaction effect, and BFs > 10 to provide strong support for this model, while BFs < 1 provided evidence against a model including an interaction effect and in favor of a model with only main effects.

FFD and SFD are early measures of target word processing that in LME analyses produced an interaction effect for the transposed-character versus substituted-character contrast but not the valid versus transposed-character contrast. This was consistent with our prediction that transposed-character, but not substituted-character, previews can support predictability effects during early processing. BFs for these measures provided strong evidence for a model without an interaction (i.e. main effects only) for the valid versus transposed-character contrast (FFD, BF10 = .08; SFD, BF10 = .11), and weak to moderate evidence for a model with an interaction effect for the transposed-character versus substituted-character contrast (FFD, BF10 = 3.64; SFD, BF10 = 1.62). For effects in later measures (GD, TRT), we observed no interaction affects in LME analyses for valid versus transposed-character or transposed-character versus substituted-character contrasts. BFs confirmed these effects by favoring models with main effects and no interaction (all BF10 < 1). Consequently, BFs provide partial, weak-to-moderate support for interaction effects in early processing measures, and confirmatory evidence for the absence of interaction effects in later processing.

Discussion

With the present experiment, we used the boundary paradigm to investigate whether transposed-character nonword previews can support word predictability effects in Chinese reading. The results showed a standard word predictability effect in reading times for target words, with shorter reading times (in GD and TRT) for more predictable words, in line with previous Chinese findings (Rayner et al., 2005; Zhao et al., 2019). However, unlike in these previous studies, we did not observe a predictability effect in word-skipping. This may be explained by comparatively high skipping rates in our experiment compared to these previous studies. The presence of effects in reading times nevertheless shows that our manipulation of word predictability was effective.

We also replicated the transposed-character effect reported previously (Gu et al., 2015), by showing that target-word identification was facilitated by transposed-character relative to substituted-character previews. Our findings therefore provide further evidence that character order is encoded flexibly early during Chinese reading. However, as readers made more regressions to invalid than valid previews, it seems a cost for invalid previews is not entirely eliminated by this flexibility in character order processing.

Crucially, we also observed interactions between word predictability and parafoveal preview, such that effects of word predictability during early target word processing (i.e. in FFD and SFD) were obtained for transposed- but not substituted-character previews. This is theoretically important as it provides a novel demonstration that contextual information can guide parafoveal processing of character order, implicating contextual knowledge in this very early aspect of the spatial encoding of orthographic information during Chinese reading. This finding contrasts with those reported by Luke and Christianson (2012) in experiments using a novel paradigm combining self-paced alphabetic reading with masked priming. In these experiments, transposed-nonword priming effects in low-constraining contexts were eliminated in high-constraining contexts, which Luke and Christianson attributed to prediction of the specific identity and order of letters in high-constraining contexts conflicting with prime information. Such findings are consistent with the claim that contextual prediction can be highly lexically-specific (e.g. Laszlo & Federmeier, 2009). One possibility is that this difference in findings represents a cross-script difference in the specificity of contextual prediction, perhaps because context does not strongly predict specific character identity or order in Chinese reading due to the large variety of characters and synonyms in this language (Campbell, 1997). However, as already noted, Luke and Christianson used non-standard methods and compared context effect across experiments. Consequently, further research that investigates contextual influences on the parafoveal processing of spatial information using standard eye-movement methods is required to confirm whether effects differ for alphabetic compared to character-based scripts.