Hostname: page-component-857557d7f7-zntvd Total loading time: 0 Render date: 2025-12-04T19:36:32.306Z Has data issue: false hasContentIssue false

Unpacking word segmentation processes by L2 Chinese readers: Evidence from eye movements

Published online by Cambridge University Press:  27 November 2025

Lin Li
Affiliation:
Academy of Psychology and Behavior, Faculty of Psychology, Tianjin Normal University, Tianjin, China
Yaning Ji
Affiliation:
Academy of Psychology and Behavior, Faculty of Psychology, Tianjin Normal University, Tianjin, China
Sha Li
Affiliation:
School of Psychology, Fujian Normal University, Fujian, China
Jingyi Liu
Affiliation:
Academy of Psychology and Behavior, Faculty of Psychology, Tianjin Normal University, Tianjin, China
Shan Wang
Affiliation:
School of Psychology, Fujian Normal University, Fujian, China
Sarah Gunn
Affiliation:
School of Psychology and Vision Sciences, University of Leicester, Leicester, UK
Kevin B. Paterson*
Affiliation:
School of Psychology and Vision Sciences, University of Leicester, Leicester, UK
*
Corresponding author: Kevin B. Paterson; Email: kbp3@leicester.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Sentences written in Chinese are composed of continuous sequences of characters, without spaces or other visual cues to mark word boundaries. While skilled L1 readers can efficiently segment this naturally unspaced text into words, little is known about the word segmentation capabilities of L2 readers, including whether they employ the same strategies to process temporary segmental ambiguities. Accordingly, we report two eye movement experiments that investigated the processing of sentences containing temporarily ambiguous “incremental” three-character words (e.g., “体育馆,” meaning “stadium”) whose first two characters could also form a word (“体育,” meaning “sport”), comparing the performance of 48 skilled L1 Chinese readers and 48 high-proficiency L2 Chinese readers in each experiment. Our findings reveal that both groups can process this ambiguity efficiently, employing similar word segmentations strategies. We discuss our findings in relation to models of eye movement control and word recognition in Chinese reading.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open data
Copyright
© The Author(s), 2025. Published by Cambridge University Press

1. Introduction

Most sentence processing research focuses on monolingual reading, yet there is growing interest in how second-language (L2) users interpret sentences in real time. A central question is whether L2 readers rely on the same or different cognitive mechanisms for sentence processing and ambiguity resolution (Hopp, 2022).

Current L2 reading research spans lexical access (Kroll et al., Reference Kroll, Bice, Botezatu, Zirnstein, Papafragou, Trueswell and Gleitman2022), syntactic processing and ambiguity resolution (Brothers et al., Reference Brothers, Hoversten and Traxler2021; Cunnings, Reference Cunnings2017; Roberts & Felser, Reference Roberts and Felser2011) and predictive processing (Kaan & Gruter, Reference Kaan, Grüter, Kaan and Grüter2021; Schlenter, Reference Schlenter2023). Evidence suggests L2 readers, like native (L1) readers, process sentences incrementally, integrating each new word as part of their current interpretation of text. Moreover, both groups experience difficulty when this new information conflicts with prior context (Lee & Witzel, Reference Lee and Witzel2023; Williams, Reference Williams2006). L2 readers, however, are typically slower and more error-prone in processing syntactic ambiguities, possibly reflecting weaker lexical–syntactic representations and higher cognitive load effects (Cop et al., Reference Cop, Drieghe and Duyck2015; Conklin et al., Reference Conklin, Alotaibi, Pellicer-Sánchez and Vilkaitė-Lozdienė2020).

L2 readers’ ability to process ambiguities appears to depend strongly on their L2 proficiency, while L2 processing also appears to be shaped by first-language (L1) knowledge. For example, studies on interlingual homographs, words with similar spelling but different meanings across languages (e.g., coin, meaning “corner” in French, and “metal currency” in English), shows that L1–L2 orthographic overlap affects meaning activation (Cop et al., Reference Cop, Dirix, Van Assche, Drieghe and Duyck2017; Fernandez et al., Reference Fernandez, Scheepers and Allen2021; Libben & Titone, Reference Libben and Titone2009; Schwartz & Kroll, Reference Schwartz and Kroll2006; Titone et al., Reference Titone, Libben, Mercier, Whitford and Pivneva2011; Tiffin-Richards, Reference Tiffin-Richards2024a,Reference Tiffin-Richardsb; Van Assche et al., Reference Van Assche, Drieghe, Duyck, Welvaert and Hartsuiker2011). Similarly, L1 syntax can influence L2 sentence processing, with facilitative or inhibitory effects depending on the cross-language structural similarity (Bardovi-Harlig & Sprouse, Reference Bardovi-Harlig and Sprouse2018; Choi & Ionin, Reference Choi and Ionin2021; Schwartz & Sprouse, 1996). When L1 and L2 share syntactic patterns, learners process sentences more efficiently (Kidd et al., Reference Kidd, Chan and Chiu2015), whereas structural differences can cause misinterpretations (Hatzidaki et al., Reference Hatzidaki, Branigan and Pickering2011; Roberts & Felser, Reference Roberts and Felser2011; Sasson et al., Reference Sasson, Schiff and Zluf2024).

This study extends this research into L2 sentence processing by examining how L2 readers segment and identify words during Chinese reading, focusing on processing of segmental ambiguities that occurs when a sentence segment supports competing lexical interpretations. Resolving such ambiguities is essential for fluent reading. We therefore investigated whether native (L1) and proficient L2 Chinese readers use similar or different segmental processing strategies to resolve ambiguities. To test this, we manipulated the contextual plausibility of alternative interpretations of a segmental ambiguity. Therefore, as well as providing insights into L2 segmental processing, this approach was likely to be informative about how L2 readers handle conflicts between new and prior contextual information (Lee & Witzel, Reference Lee and Witzel2023; Williams, Reference Williams2006).

1.1. Word segmentation in Chinese reading

Chinese uses a logographic script composed of equally spaced monomorphemic (and monosyllabic) characters (see Li et al., Reference Li, Zang, Liversedge, Pollatsek, Pollatsek and Treiman2015; Shen, Reference Shen and Ke2018; Zang et al., Reference Zang, Liversedge, Bai, Yan, Liversedge, Gilchrist and Everling2011). While some words are single characters, most are multicharacter compounds, with around 70% of common words being two-character compounds (Beijing Language Institute, 1986). As Chinese lacks spaces between words, readers must infer word boundaries. However, skilled L1 readers are not hindered by this (Bai et al., Reference Bai, Yan, Liversedge, Zang and Rayner2008; Zang et al., Reference Zang, Liang, Bai, Yan and Liversedge2013), and read at about 250 words per minute, comparable to reading rates for spaced alphabetic scripts like English (Liversedge et al., Reference Liversedge, Drieghe, Li, Yan, Bai and Hyönä2016). This suggests that L1 readers possess efficient strategies for grouping characters into words and establishing word boundaries without relying on spaces or other visual cues.

Little is known about the segmentation capabilities of L2 Chinese readers, however; including whether they employ the same segmental processing strategies as L1 readers. Beginning L2 learners typically use textbooks that employ intercharacter spaces to mark word boundaries in sentences (Cui et al., Reference Cui, Drieghe, Yan, Bai, Chi and Liversedge2012), with studies showing that these visual cues help L2 readers process text more efficiently (Bai et al., Reference Bai, Liang, Blythe, Zang, Yan and Liversedge2013; Bassetti & Lu, Reference Bassetti and Lu2016; Cui, Reference Cui2023; Ma et al., Reference Ma, Li and Zhuang2019; Shen et al., Reference Shen, Liversedge, Tian, Zang, Cui, Bai, Yan and Rayner2012; Yu, Reference Yu2022; Zhou, Ma, et al., Reference Zhou, Ma, Li and Taft2018; Zhou, Wang, et al., Reference Zhou, Ma, Li and Taft2018, Reference Zhou, Ye and Yan2020; but see Bassetti, Reference Bassetti2009; Bassetti & Masterson, Reference Bassetti and Masterson2012). However, less is known about how L2 readers process words in unspaced text. Offline studies suggest that L2 readers make frequent segmentation errors that can impair comprehension when text is unspaced (Shen & Jiang, Reference Shen and Jiang2013; Shen & Dai, Reference Shen and Dai2024; Yang, Reference Yang2021). However, little work to date has examined how L2 readers segment words in naturally unspaced text during online sentence processing.

Accordingly, we used eye-movement measures reading to compare the online segmental processing strategies of L1 and L2 Chinese readers. Eye movements are highly informative about word identification processes during reading (Liversedge & Findlay, Reference Liversedge and Findlay2000; Rayner, Reference Rayner1998, Reference Rayner2009) and widely used in L2 reading research (see Frenck-Mestre et al., Reference Frenck-Mestre, Kroll and De Groot2005; Roberts & Siyanova-Chanturia, Reference Roberts and Siyanova-Chanturia2013). They also are informative about L1 Chinese readers’ processing of segmental ambiguities (Huang & Li, Reference Huang and Li2020; Li et al., Reference Li, Bao, Li, Li, Liu, Wang, Warrington, Gunn and Paterson2024, Reference Li, Rayner and Cave2009; Reference Li, Huang, Yao and Hyönä2022; Ma et al., Reference Ma, Li and Rayner2014; Wang et al., Reference Wang, Angele, Ma and Li2021; Zhou & Li, Reference Zhou and Li2021; Zhou et al., Reference Zhou, Ma, Li and Taft2018; Yao et al., Reference Yao, Jiang, Chen and Li2025). This includes research comparing serial and parallel accounts of how readers process incremental segmental ambiguities in multicharacter sequences such as 体育馆, where all three characters form a word meaning “stadium,” but the first two characters also constitute a standalone embedded word, 体育, meaning “sport.”

1.2. Theoretical accounts of Chinese word segmentation

Serial accounts propose that Chinese readers process characters in strict linear order from left to right, making incremental decisions about how they combine to form words. A version of this account by Perfetti and Tan (Reference Perfetti, Tan, Wang, Inhoff and Chen1999) suggests that skilled readers rely on their knowledge that most words are two characters long to guide segmentation decisions, by first grouping adjacent characters into two-character units. Their account therefore predicts that an incremental three-character word such as 体育馆 (“stadium”) will initially be processed as the two-character standalone word 体育 (“sport”) followed by an additional character, leading readers to first identify the embedded word and then backtrack to reanalyze the sequence as a three-character compound, resulting in a processing cost.

Parallel accounts, implemented by the Chinese Reading Model (CRM; Li & Pollatsek, Reference Li and Pollatsek2020), assume that alternative segmental analyses are considered simultaneously (Inhoff & Wu, Reference Inhoff and Wu2005; Li et al., Reference Li, Rayner and Cave2009). Other eye movements models such as E-Z Reader (Reichle et al., Reference Reichle, Pollatsek, Fisher and Rayner1998, Reference Reichle, Rayner and Pollatsek2003), SWIFT (Engbert et al., Reference Engbert, Nuthmann, Richter and Kliegl2005) and OB1-reader (Snell & Grainger, Reference Snell and Grainger2019) depend on interword spaces for eye-movement guidance and word identification and are difficult to adapt for Chinese (Zhou & Li, Reference Zhou and Li2021; Reichle & Yu, Reference Reichle and Yu2018; Yu et al., Reference Yu, Liu and Reichle2021), whereas the CRM was designed specifically to account for word segmentation and identification in unspaced Chinese. The CRM assumes that, on each fixation, readers process a small group of characters within their perceptual span (the region of visual information that a reader can effectively process during a single fixation, McConkie & Rayner, Reference McConkie and Rayner1975, Reference McConkie and Rayner1976; Rayner et al., Reference Rayner, McConkie and Zola1980). This comprises about one character to the left and two or three characters to the right of fixation for skilled L1 Chinese readers (Inhoff & Liu, Reference Inhoff and Liu1998), but is likely to be smaller for L2 readers (see Fernandez et al., Reference Fernandez, Bothe and Allen2023; Jordan et al., Reference Jordan, Almabruk, Gadalla, McGowan, White, Abedipour and Paterson2014; Leung et al., Reference Leung, Sugiura, Abe and Yoshikawa2014; Paterson et al., Reference Paterson, McGowan, White, Malik, Abedipour and Jordan2014; Whitford & Titone, Reference Whitford and Titone2015). Within the CRM, all possible character groupings within this region are assumed to be processed in parallel, with alternative segmental analyses competing for selection via mechanisms inspired by models of visual word recognition (e.g., McClelland & Rumelhart, Reference McClelland and Rumelhart1981).

When reading a multicharacter sequence like 体育馆, it is assumed that lexical entries for both whole-word (“stadium”) and embedded word (“sport”) analyses are activated simultaneously and compete for selection. However, the whole-word form usually wins because it receives stronger bottom-up input from more characters. Once selected, this word is identified and segmented from the text, while activation of its embedded word declines rapidly. The CRM therefore predicts holistic processing of the ambiguity without reanalysis costs. The model further assumes that, though multiple candidates are activated in parallel, only one word ultimately is recognized at a time, preserving serial word recognition (Reichle et al., Reference Reichle, Warren and McConnell2009; White et al., Reference White, Palmer and Boynton2018, Reference White, Palmer and Boynton2020). The model also incorporates eye-guidance mechanisms which assume readers subconsciously estimate how many characters they can identify on each fixation and program their next eye movement to land beyond these characters (see Li et al., Reference Li, Liu and Rayner2011; Wei et al., Reference Wei, Li and Pollastsek2013).

Zhou and Li (Reference Zhou and Li2021) compared these accounts in two eye-movement experiments with L1 readers. In the first, participants read sentences containing either an incremental word (e.g., 体育馆 “stadium”) or a control target identical to its embedded word (体育 “sport”). While the incremental word was always plausible, the control target word (and therefore the embedded word) could be plausible or implausible depending on the characteristics of a preceding verb. The control word should therefore elicit longer fixations when implausible (the word plausibility effect, see Staub, Reference Staub2015), and a similar effect may also be observed for the embedded word if it is segmented first during processing. As Zhou and Li observed plausibility effects for control but not embedded words, this suggested that readers did not first identify the embedded word. A second experiment independently manipulated the plausibility of incremental and embedded word analyses of an ambiguity. Plausibility effects were observed only for the incremental words, further indicating that embedded words are not initially identified during sentence processing.

1.3. This study

Segmental ambiguities are frequent in Chinese, making it vital to understand how L1 and L2 readers resolve them. This study therefore extends Zhou and Li (Reference Zhou and Li2021) by investigating whether skilled L1 and high-proficiency L2 Chinese readers use the same or different segmental processing strategies. If they use the same strategies, both groups should replicate Zhou and Li’s findings. However, if they use different strategies, L1 readers may replicate Zhou and Li’s findings, while L2 readers exhibit processing differences.

One possibility is that higher cognitive load in L2 reading (Hopp, 2010) reduces the perceptual span (Fernandez et al., Reference Fernandez, Bothe and Allen2023; Jordan et al., Reference Jordan, Almabruk, Gadalla, McGowan, White, Abedipour and Paterson2014; Leung et al., Reference Leung, Sugiura, Abe and Yoshikawa2014; Paterson et al., Reference Paterson, McGowan, White, Malik, Abedipour and Jordan2014; Whitford & Titone, Reference Whitford and Titone2015), causing fewer characters to be processed per fixation. L2 readers may also have reduced familiarity with three-character words, and these factors together may lead L2 readers to adopt a more serial segmentation strategy in which they initially identify embedded words. In this case, embedded word plausibility effects may be observed for L2 but not L1 readers. It is also possible that L2 readers evaluate plausibility less efficiently (Lee & Witzel, Reference Lee and Witzel2023; Williams, Reference Williams2006), leading to later or slower detection of implausible segmentations and greater difficulty recovering from errors.

Following Zhou and Li (Reference Zhou and Li2021), we examined fixation times for target regions containing the incremental or control target words to test sensitivity to embedded word plausibility. If both groups process incrementally and holistically, plausibility effects should be observed for both groups for control word but not for embedded words. However, if L2 readers process more serially, fixation times may reflect both embedded and incremental word plausibility effects, while differences in the timing and magnitude of these effects may offer insights into differences in L1–L2 plausibility processing.

We also analyzed fixation times for the pretarget verb used to manipulate plausibility. One possibility is that, while fixating this verb, readers begin acquiring semantic information from the forthcoming target word, allowing them to start evaluating this word’s contextual plausibility. This would represent an instance of parafoveal processing, where linguistic information is obtained from a word in the reader’s peripheral vision before it is fixated (see Cutter et al., Reference Cutter, Drieghe, Liversedge, Pollatsek and Treiman2015; Schotter et al., Reference Schotter, Angele and Rayner2012). Zhou and Li (Reference Zhou and Li2021; also Yao et al., Reference Yao, Jiang, Chen and Li2025) observed no such effects, suggesting their readers did not process upcoming word meaning in advance. However, other work suggests readers can extract parafoveal meaning information, including about a word’s contextual plausibility (Yan et al., Reference Yan, Richter, Shu and Kliegl2009; Yang et al., Reference Yang, Staub, Li, Wang and Rayner2012; Pan et al., Reference Pan, Laubrock and Yan2016). Accordingly, examining this issue further could clarify whether parafoveal processing influences segmentation. Crucially, L1 and L2 readers may differ in parafoveal processing ability. While some studies show proficient L2 readers use parafoveal information efficiently (Tiffin-Richards, Reference Tiffin-Richards2024a,Reference Tiffin-Richardsb), others report limited use by L2 Chinese readers (Cong & Chen, Reference Cong and Chen2022; Xiao et al., Reference Xiao, Jia and Wang2021). Accordingly, we report analyses comparing the parafoveal processing of segmental ambiguity across L1 and L2 groups.

2. Experiment 1

Experiment 1 compared eye movements of skilled L1 and high-proficiency L2 Chinese participants during sentence reading. The sentences included an incremental three-character target word that was always contextually plausible, which included a two-character embedded word that was either contextually plausible or implausible. These sentences were compared with control sentences where the embedded word replaced the incremental word as the target word.

Using a similar design, Zhou and Li (Reference Zhou and Li2021) found that skilled L1 readers showed plausibility effects for control but not embedded words, as reflected in total reading times at the target region. We therefore examined whether a similar effect would be observed for L1 and L2 readers in out experiment, while also running exploratory analyses using additional eye-movement measures to identify possible group processing differences.

Von der Malsburg and Angele (Reference von der Malsburg and Angele2017) highlight that using multiple dependent variables in eye-movement research substantially increases the risk of Type I errors. To mitigate this, they recommend specifying hypotheses in advance, interpreting effects from a single measure with caution unless predicted a priori, and applying Bonferroni or similar corrections. However, they also caution that Bonferroni adjustments may be overly conservative for highly correlated fixation time variables. As an alternative, they suggest considering an effect reliable if it reaches significance in two or more related measures.

This study addressed this increased risk of Type I error by clearly distinguishing between hypothesis-driven and exploratory analyses. This enabled testing of specific theoretical predictions while exploiting the richness of eye-movement data. For hypothesis-driven tests, we applied Von der Malsburg and Angele’s (Reference von der Malsburg and Angele2017) criteria, using Bonferroni-corrected thresholds for multiple comparisons and a convergence heuristic deeming effects reliable if significant in two or more dependent measures. These criteria guided interpretation without serving as absolute cut-offs.

Ethics statement. The study was approved by the ethics committee of the Academy of Psychology and Behavior, Tianjin Normal University, and conducted in accordance with the Declaration of Helsinki (World Medical Association, 2013).

2.1. Method

Participants. Participants were 48 L1 Chinese readers aged 20–29 years (M = 22; 30 females), all undergraduates at Tianjin Normal University, and 48 L2 Chinese readers aged 18–32 years (M = 24; 27 females) enrolled in language courses at Tianjin Normal University or Fujian Normal University. All L1 readers were native Mandarin speakers with some English knowledge, as English is taught from primary school in mainland China. L2 readers were native users of diverse mainly alphabetic languages differing in word-spacing conventions (see Supplementary Table S5). All L2 participants had at least 2 years of Chinese learning experience (M = 7.6 years, SD = 4.7) and had achieved at least HSK level 4 (the standard six-level Chinese proficiency test administered by the Ministry of Education of China). Their average proficiency was HSK-5 (SD = 1), mean total score 226/300 (SD = 32) and reading score 75/100 (SD = 13), indicating moderate–high proficiency.

L2 participants completed the LHQ 3.0 (Language History Questionnaire; Li et al., Reference Li, Zhang, Yu and Zhao2019), which assesses second-language proficiency (Zhang et al., Reference Zhang, Huang, Jiang, Xu, Rao and Xu2023) and collects self-report data on learning experience, usage and skill levels via its web interface (LHQ 2.0; Li et al., Reference Li, Zhang, Tsai and Puls2014). L2 readers self-rated moderate to high overall and reading proficiency (M = 0.59, SD = 0.13; reading M = 4, SD = 0.94).

Prior eye-movement studies provide limited effect size data for L1 Chinese segmental processing and none for L2 readers. To ensure adequate power, we used larger samples and stimuli than previous work, increasing observations per condition from 400 (Zhou & Li, Reference Zhou and Li2021) to 960. We employed two strategies to ensure our study was adequately powered effects. First, we increased the participant sample size and stimulus set relative to previous research, raising the number of observations per condition from 400 in Zhou and Li (Reference Zhou and Li2021) to 960 in this study. We also ran a priori power analyses (powerSim, powerCurve; Green & MacLeod, Reference Green and MacLeod2016, in R: R Core Team, 2022) using data from a 30-participant pilot later included in the final sample (Huang & Li, Reference Huang and Li2023). Full details are in the Supplemental File (Section S1). Analyses confirmed sufficient power to detect key effects reported previously.

Stimuli and design. Stimuli consisted of 80 sentence sets, each containing a region of interest with either a two-character control word or a three-character incremental target. Twenty sets were adapted from Zhou and Li (Reference Zhou and Li2021) for L2 suitability and 60 new sets were created using the same procedures. For each set, the verb preceding the target was manipulated to yield four experimental conditions (see Table 1). We ensured all three-character incremental words appeared in a current standard dictionary, confirming their formal recognition in Chinese. A complete list of incremental and embedded words with English translations is provided in the Supplemental File (Section S6).

Table 1. Example sentence stimuli for Experiment 1

Note. In Experiment 1, characters in pretarget verbs are underlined and bold ( 关注/修建 ), characters in target words are italicized and bold ( 体育/体育馆 ).

a This refers to the plausibility of the embedded words in incremental target word contexts and the plausibility of two-character target words in the control sentences. Note that the control words were identical to the corresponding embedded word. In Experiment 2, characters in pretarget verbs are underlined and bold ( 关注/修建/促进/查阅 ), characters in target words are italicized and bold( 体育馆 ).

In one condition (plausible embedded), the verb (e.g., 关注, “pay attention to”) was followed by a three-character incremental target (e.g., 体育馆, “stadium”) that was always plausible in the sentence context, and where its first two characters formed an embedded two-character word (e.g., 体育, “sport”) that also was plausible. In a second condition (implausible embedded), the verb was changed (e.g., to 建设, “build”) so that the incremental word stayed plausible, but the embedded word became implausible. Two further control conditions used the two-character embedded word itself as a control target word, producing plausible (e.g., 关注体育, “pay attention to sport”) and implausible (e.g., 建设体育, “build sport”) variants. Details of stimulus characteristics appear in the Supplemental File (Section S7).

Stimuli were placed in four lists. Each began with four practice sentences and included 80 experimental and 20 plausible filler sentences. Each list contained 20 items per condition, assigned via a Latin square so each sentence appeared once per list with balanced condition representation. The experimental and filler sentences were presented in random order. Equal numbers of L1 and L2 participants were pseudo-randomly assigned to each list.

Apparatus and procedure. An EyeLink 1000 Plus eye-tracker (SR Research, Canada) recorded each participant’s right-eye gaze during binocular viewing, with <.01° RMS spatial and 1000 Hz temporal resolution. A headrest and chinrest minimized movement. Sentences appeared on a 24-inch high-resolution LCD monitor (1920 × 1080 pixels, 150 Hz) in Song font as black text (RGB 0,0,0) on a light gray background (RGB 220,220,220). At a 75 cm viewing distance, each character subtended ≈1° horizontally, typical for reading (Xu & Jordan, Reference Xu and Jordan2009). Participants read individually and were instructed to read normally for comprehension. At the start, a 3-point horizontal calibration was performed along the same line as each sentence presentation, ensuring ≤.35° spatial accuracy. Calibration was checked before each trial and repeated as necessary to ensure the same high spatial accuracy throughout the experiment. Each trial began with a fixation square equal to one character on the left side of the screen. Once fixated, the sentence appeared, with its first character replacing the square. Participants pressed a key when they finished reading. On 25% of trials, a yes/no comprehension question followed, with responses recorded. The experiment lasted about 40 minutes per participant.

2.2. Results

Accuracy answering comprehension questions was high for all participants >80%, native, M = 95.5%; L2, M = 88.1%, p < .001), indicating that though the L1 group had higher accuracy, both groups understood the sentences well.

Data analysis procedure. Following standard procedures, fixations less than 80 ms and longer than 1200 ms were removed (affecting 4.9% of fixations for L1 participants and 3.1% of fixations for L2 participants). Trials in which participants had less than 5 fixations were removed, resulting in 0.3% loss of trials for L1 readers and 0.1% for L2 readers, attributable to track loss or failure to read sentences sufficiently carefully. The remaining data were analyzed by linear mixed effects models (LMEMs; Baayen et al., Reference Baayen, Davidson and Bates2008) using R (R Core Team, 2022) and the lme4 package (Bates et al., Reference Bates, Maechler and Bolker2011). The p values were estimated using the lmerTest package (Kuznetsova et al., Reference Kuznetsova, Brockhoff and Christensen2014). For binomial variables, generalized LMEMs were conducted with the Laplace approximation. A maximal random effects structure was used (Barr et al., Reference Barr, Levy, Scheepers and Tily2013), with participants and stimuli as crossed random effects. If models did not converge, the random effects structure was trimmed, first by removing correlations then slopes for stimuli, then correlations and slopes for participants, until the model converged. The models are listed in the Supplemental File (Section S2).

Sentence-level analyses. In addition to word-level analyses reported next, we examined sentence-level eye movements for the L1 and L2 groups. Full details are provided in the Supplemental File (Section S3). The results show that L2 readers exhibited longer sentence reading times, made more and longer fixations, shorter forward saccades and a higher number of regressions, consistent with greater reading difficulty in the L2 group.

Word-level analyses. We conducted word-level analyses for a target region comprising either the incremental word or control word and pretarget region comprising the preceding verb. Target word analyses were informative about fixational processing of the target words, while pretarget analyses were informative about both parafoveal processing of these words and subsequent re-reading effects.

Following Zhou and Li (Reference Zhou and Li2021), we used linear mixed effects models to test theoretically motivated hypotheses, using a single variable encompassing all the experimental conditions as a fixed effect. Customized contrasts (Schad et al., Reference Schad, Vasishth, Hohenstein and Kliegl2020) were used to test effects of (1) target word type (incremental word vs. control word); (2) control word plausibility (plausible vs. implausible) and (3) embedded word plausibility (plausible vs. implausible). These are reported separately for L1 and L2 groups, following which we report further customized contrasts that include the variable group to compare L1 versus L2 effects. Continuous variables (fixation times) were log-transformed (Wagenmakers et al., Reference Wagenmakers, Krypotos, Criss and Iverson2012). Contrasts were defined using the contr.sdif function in the MASS package (Venables & Ripley, Reference Venables and Ripley2002). As each variable had two levels, these produced main effect coding equivalent to other methods (e.g., contr.sum function). We coded contrasts using deviation coding (−1/1), though sum-to-zero coding (−0.5/0.5) produced the same patterns of effects.

Word-level analyses used standard eye movement measures informative about first-pass reading; that is, the initial processing of a word prior to a fixation to its right or a regression (e.g., Rayner, Reference Rayner2009). These comprised word skipping (probability of not fixating a word during first pass), first-fixation duration (FFD; duration of the first progressive fixation on a word) and gaze duration (GD; sum of all first-pass fixations on a word). Note that single-fixation durations (duration of the first progressive fixation for a word receiving only one first-pass fixation) are not reported as L2 readers rarely made only one first-pass fixation on words. We additionally report word-level measures informative about later processing; namely, total reading time (TRT; sum of all fixations on a word) and regressions-in (RI; probability of a regression back to a word). Tables 2 and 3 present means for the target and pretarget regions and summarize their statistical effects, respectively.

Table 2. Mean eye movement measures for target and pretarget regions in Experiment 1

Note. The standard error of the mean is given in parentheses. Fixation time measures are shown in ms.

a These labels indicate whether the embedded words were plausible or implausible.

Table 3. Summary of statistical effects for Experiment 1

Note. Asterisks indicate p < .05. FFD = First-fixation duration, GD = gaze duration, TRT = total reading time, RI = probability of a regression-in.

Target region analyses. Target word type (incremental vs. control) influenced gaze durations and total reading times for both groups, with longer reading times for the incremental words (gaze duration: L1, 329 vs. 279 ms; L2, 892 vs. 592 ms; total reading time: L1, 495 vs. 440 ms; L2, 1418 vs. 947 ms). L1 first fixations also were longer for incremental words (238 vs. 227 ms), though no L2 effect was observed (298 vs. 302 ms). L2 regressions-in probabilities were higher for the incremental than control words (.31 vs. .26), with no L1 effect (.25 vs. .26). Significant Group × Word Type interactions reflected larger L2 differences, likely due to their reduced familiarity with three-character words. Assuming three main dependent measures (gaze duration, total reading time, regressions-in), Bonferroni correction set the threshold at p < .017. Gaze-duration and total-time effects exceeded this (p < .001) and appeared in two measures, confirming robustness.

We next examined control word plausibility effects to test whether L1 and L2 groups showed total reading time effects, as in Zhou and Li (Reference Zhou and Li2021). L1 readers showed longer total reading times for implausible targets (463 vs. 415 ms); while L2 readers showed a similar but nonsignificant trend (978 vs. 916 ms). An L2 plausibility effect appeared in regressions-in (.29 vs. .22), with no L1 counterpart (.28 vs. .24). As plausibility effects were absent in first-pass measures, these total reading time and regression effects likely reflect later, postinitial processing. Bonferroni correction (p < .017) confirmed that the L1 total reading time and L2 regressions-in effects were reliable. Overall, the findings replicate Zhou and Li (Reference Zhou and Li2021) for L1 readers and show that L2 readers show have greater difficulty computing word plausibility during online sentence processing. Finally, we examined embedded word plausibility effects in sentences with plausible incremental targets. No effects were found for either group, consistent with Zhou and Li (Reference Zhou and Li2021). This suggests that neither group processed the embedded word interpretation during fixations on incremental target words.

Pretarget region analyses. The interchangeable verbs used to manipulate plausibility were matched for word and character frequency and visual complexity (stroke count). To account for any residual variation, pretarget region models included log-transformed and centered word frequency plus centered frequencies and stroke counts for both characters. Models with and without these covariates yielded the same fixed effects, with no group differences, confirming that verb characteristics did not influence pretarget eye movements.

We first examined target word type effects (incremental vs. control), which appeared only in late measures. Total reading times were longer for verbs followed by a control than an incremental word (L1: 467 vs. 428 ms; L2: 1025 vs. 977 ms). L1 regressions-in were also higher for verbs preceding a control than incremental target (.34 vs. .26), with no L2 effect. As these occurred only in late measures, they reflect re-reading effects. Bonferroni correction (p < .017) confirmed that the L1 effects were robust (p < .001), though the L2 total reading time effect was not. These re-reading effects therefore appear stronger in L1 than L2.

Next, we assessed control word plausibility effects. Consistent with Zhou and Li (Reference Zhou and Li2021), both groups showed total reading time and regressions-in effects. Total reading times were longer (L1: 498 vs. 435 ms; L2: 1077 vs. 972 ms) and regressions-in were higher (L1: .38 vs. .29; L2: .32 vs. .26) for verbs followed by an implausible than plausible control word. All effects exceeded p < .01 and Bonferroni p < .017, indicating robustness. No gaze-duration effects emerged, suggesting control word plausibility was not processed parafoveally. The total-time and regression effects therefore reflect re-reading effects following both L1 and L2 readers experiencing difficulty integrating an implausible control target word.

We next considered embedded word plausibility for sentences with incremental target words (which were always plausible). Both groups showed longer total reading times for verbs when the embedded word was implausible (L1: 439 vs. 417 ms; L2: 1011 vs. 942 ms). These exceeded p < .01 and Bonferroni p < .017, indicating robust effects. L1 gaze durations also were longer for verb followed by implausible embedded words (303 vs. 285 ms), with a marginal L2 trend (643 vs. 604 ms). The L1 effect did not reach the Bonferroni-corrected threshold, but appeared in both gaze duration and total time, satisfying the convergence heuristic, and therefore, was considered reliable. It is noteworthy that Zhou and Li (Reference Zhou and Li2021) did not report a comparable parafoveal processing effect. Figure 1 illustrates these effects.

Figure 1. Embedded word plausibility effects for the pretarget region in Experiment 1, in (A) gaze durations and (B) total reading times.

Further exploratory analyses examining effects of L2 native language characteristics and L2 Chinese reading proficiency are reported in the Supplemental File (Section S5).

2.3. Discussion

Experiment 1 replicated and extended findings reported by Zhou and Li (Reference Zhou and Li2021), while providing new evidence about how L1 and L2 readers process segmental ambiguity. Participants read sentences containing a three-character incremental target word whose first two characters could form a standalone embedded word, which was either plausible or implausible in the sentence context. Eye movements for these sentences were compared with control sentences in which the embedded word served as a control target word, replacing the incremental word. We examined effects for both a target word region and a pretarget region containing a two-character verb.

At the target region, both groups showed increased gaze durations and total reading times for incremental versus control target words, with larger effects for L2 readers. These likely reflect greater difficulty in processing three-character incremental words, especially among L2 readers, who may be less familiar with such lexical forms. The robustness of these findings, confirmed across multiple measures, underscores the cognitive cost of processing segmentally ambiguous words during reading. In line with Zhou and Li (Reference Zhou and Li2021), we also observed an L1 plausibility effect for control target words in both total reading times and regressions-in, indicating the effectiveness of our plausibility manipulation, and suggesting that L1 readers re-evaluate contextual fit with earlier sentence content when encountering an implausible continuation. The L2 group showed a plausibility effect in regressions-in only, suggesting more limited contextual sensitivity. Crucially, neither group exhibited an embedded word plausibility effect at the target region for sentences with incremental target words, suggesting neither group gained lexical access to the embedded word’s lexical representation during fixational ambiguity processing. These findings mirror those of Zhou and Li for L1 readers. Zhou and Li interpreted their results as supporting a parallel activation account of segmental ambiguity resolution, as implemented by the CRM (Li & Pollatsek, Reference Li and Pollatsek2020). According to this account, embedded and whole-word analyses are activated in parallel, with the whole-word interpretation usually winning the competition for selection, without providing full lexical access for the embedded word. Our target region findings align with this explanation, while similarity between L1 and L2 patterns suggests the two groups exhibit a similar whole-word preference when resolving segmental ambiguity.

Pretarget region analyses also revealed word plausibility effects. Implausible control target words led to increased regressions and longer total reading times for both groups, suggesting re-reading due to integration difficulty. Crucially, we observed similar pretarget region effects of embedded word plausibility in total reading time for sentences containing incremental target words. This was consistent with both groups re-reading the verb when the embedded word analysis was contextually implausible. The L1 group showed a similar effect in gaze durations, indicating that the effect also occurred during first-pass processing for the pretarget verb. This suggests that early, parafoveal access to embedded word meaning additionally produced contextual mismatch effects for L1 readers.

The re-reading effects for both groups at the pretarget region indicate lexical access to embedded word representations during sentence processing by both L1 and L2 readers, with this causing integration difficulty, similarly to effects observed for control words. L1 gaze-duration effects additionally suggest these readers lexically access the embedded word representation during an early stage of reading associated with the parafoveal processing of target words. Crucially, this pattern challenges the interpretation of segmental ambiguity processing offered by Zhou and Li (Reference Zhou and Li2021; also Li & Pollatsek, Reference Li and Pollatsek2020), who propose that embedded words are activated but not fully lexically accessed during ambiguity resolution. We consider the implications of these findings in the General Discussion.

3. Experiment 2

A potential limitation of Experiment 1 was that the incremental target words were always plausible. Readers likely take account of plausibility when segmenting words and may be more likely to segment an embedded word when the incremental word does not fit plausibly with the prior context. To take account of this, Experiment 2 manipulated the contextual plausibility of incremental target words and their embedded words orthogonally.

As in Experiment 1, we sought to minimize the likelihood of false positive effects by focusing hypothesis-driven tests on measures that previously showed L1 plausibility effects in Zhou and Li’s (Reference Zhou and Li2021) study. For the target word region, hypothesis testing assessed whether: (1) whole-word plausibility influenced total reading times and (2) embedded word plausibility affected total reading times when the whole word was plausible, but not when it was implausible. We also conducted exploratory analyses across a broader range of eye movement measures to examine L1 and L2 differences in ambiguity processing. For the pretarget verb region, we conducted hypothesis-driven tests to assess parafoveal effects of embedded or target word ambiguity, as reflected in gaze durations. As with the target region, these exploratory analyses were performed to gain a fuller understanding of processing differences between L1 and L2 groups.

3.1. Method

Participants. Participants were 48 L1 Chinese readers aged 19–25 years (M = 22; 29 females) and 48 L2 Chinese readers aged 18–30 years (M = 24; 29 females) from Tianjin Normal University and Fujian Normal University. The L1 group were native Mandarin speakers with some English knowledge from schooling. No participants took part in Experiment 1. L2 readers were native users of diverse alphabetic languages differing in word-spacing conventions (Supplementary Table S5). They had studied Chinese for 2 years minimum (M = 6.4, SD = 3.7), having passed HSK-4 on average at level 5 (SD = 1), with total M = 233/300 (SD = 31) and reading M = 76/100 (SD = 14). The L2 group also completed the LHQ, confirming moderate to high proficiency (overall M = 0.58, SD = 0.10; reading M = 4, SD = 0.8).

As in Experiment 1, we employed two strategies to ensure adequate power. First, participant and stimulus numbers were increased relative to previous work, raising observations per condition from 400 (Zhou & Li, Reference Zhou and Li2021) to 960. Second, a priori power analyses were conducted using data from a 30-participant pilot later included in the final sample. Full details are provided in the Supplemental File (Section S1). Results confirmed sufficient power to detect key effects observed previously.

Stimuli and design. Eighty sentence sets were used in Experiment 2, each containing an incremental word target (see Table 2). Following Zhou and Li (Reference Zhou and Li2021), the preceding verb was manipulated to yield four experimental conditions. In two, the target word was plausible, and its embedded word was either plausible or implausible depending. In the remaining two, the target was implausible while its embedded word varied in plausibility. Detailed stimulus characteristics are reported in the Supplemental File (Section S7).

Sentences were distributed across four presentation lists. Each began with four practice sentences and included 80 experimental and 20 plausible filler sentences. Each list contained 20 items per condition, assigned via a Latin square so every sentence appeared once per list with balanced condition numbers Sentences were presented in random order, with equal numbers of L1 and L2 participants pseudo-randomly assigned to each list.

Apparatus and procedure. This was the same as Experiment 1.

3.2. Results

Accuracy on comprehension questions was high for all participants (>80%; L1: M = 94.6%, L2: M = 86.9%, p < .001), though significantly higher for L1 readers, confirming that both groups understood the sentences.

Data analysis procedures. Following standard procedures, fixations shorter than 80 ms or longer than 1200 ms were removed (3.8% for L1 and 2.6% for L2). Trials with fewer than five fixations or track loss were excluded (0.7% L1; 0.3% L2). The remaining data were analyzed with LMEMs in R using the lme4 package, with p values from lmerTest (Kuznetsova et al., Reference Kuznetsova, Brockhoff and Christensen2014).

Sentence-level analyses. We conducted sentence-level analyses of eye movements for sentences containing plausible or implausible incremental targets to assess reading of globally plausible versus implausible sentences. L2 readers had longer reading times, and made more and longer fixations, shorter saccades and more regressions. Consistent with Experiment 1, these findings indicate greater reading difficulty for L2 readers. Reading times were also longer for globally implausible than plausible sentences, with this effect larger for L2 readers. Full details appear in the Supplemental File (Section S3).

Word-level analyses. As in Experiment 1, we report word-level eye-movement measures for two regions of interest: the target region (three-character incremental word) and the pretarget verb region. Effects at the target region indicate fixational processing of incremental words, whereas pretarget effects reflect parafoveal processing or re-reading.

We used linear mixed effects models (LMEMs) to test theoretically motivated hypotheses, with a single condition variable as a fixed effect and customized contrasts (Schad et al., Reference Schad, Vasishth, Hohenstein and Kliegl2020) assessing: (1) main effects of incremental word plausibility; (2) embedded word plausibility for plausible incremental words and (3) embedded word plausibility for implausible incremental words. These were reported separately for L1 and L2 groups, followed by contrasts including group to compare L1 versus L2.

We distinguished between hypothesis-driven and exploratory analyses to limit Type I error risk from multiple measures. For hypothesis-driven tests, we evaluated robustness under conservative criteria, applying Bonferroni correction and the convergence-based heuristic of von der Malsburg and Angele (Reference von der Malsburg and Angele2017). As in Experiment 1, these criteria assessed reliability but were not treated as strict cutoffs.

Continuous variables (fixation times) were log-transformed (Wagenmakers et al., Reference Wagenmakers, Krypotos, Criss and Iverson2012). Contrasts were defined using the contr.sdif function in MASS (Venables & Ripley, Reference Venables and Ripley2002), yielding effect coding equivalent to contr.sum. We used deviation coding (−1/1), though sum-to-zero coding (−0.5/0.5) produced identical results. We employed maximal random-effects structures where possible (Barr et al., Reference Barr, Levy, Scheepers and Tily2013), including participants and stimuli as crossed effects. If models failed to converge, random structures were simplified stepwise, first removing correlations, then slopes for stimuli and participants, until convergence was achieved. The Supplemental File (Section S2) lists the final models used. Tables 4 and 5 report mean eye movements and statistical results; Figures 2 and 3 illustrate key effects at target and pretarget regions.

Table 4. Mean eye movements for pretarget and target regions in Experiment 2

Note. The standard error of the mean is shown in parentheses. Fixation time measures are shown in ms.

Table 5. Summary of statistical effects for Experiment 2

Note. Asterisks indicate p < .05. FFD = first-fixation duration, GD = gaze duration, TRT = total reading time, RI = probability of a regression-in.

Figure 2. Embedded word plausibility effects for plausible incremental target words in Experiment 2, in (A) gaze durations for the pretarget region and (B) total reading times for the target region.

Figure 3. Embedded word plausibility effects for implausible incremental target words in Experiment 2, in (A) gaze durations for the pretarget region and (b) total reading times for the target region.

Target region analyses. Hypothesis-driven contrasts testing main effects of incremental word plausibility showed longer total reading times for implausible than plausible incremental words for both groups (L1: 679 vs. 558 ms; L2: 1616 vs. 1533 ms), and a higher probability of regressions-in (L1: .40 vs. .33; L2: .34 vs. .30). Using three measures (gaze duration, total reading time, regressions-in) and Bonferroni correction (p < .017), all effects were significant (p < .01) and appeared in at least two measures, confirming robustness. These results replicate Zhou and Li (Reference Zhou and Li2021), confirming the effectiveness of the incremental word plausibility manipulation. However, as effects emerged only in late measures, readers likely detected implausibility during re-reading rather than first-pass processing. We also found a significant Group × Plausibility interaction in total reading time, driven by a larger L1 effect, indicating greater L1 sensitivity to word plausibility.

Contrasts of embedded word plausibility for plausible incremental words showed no significant effects. However, when incremental words were implausible, both groups showed embedded word plausibility effects in total reading time (L1: 710 vs. 648 ms; L2: 1649 vs. 1584 ms). These findings indicate that both groups considered the alternative embedded word analysis when the preferred whole-word interpretation was implausible. Effects exceeded the Bonferroni-corrected threshold (p < .017) and appeared in multiple measures, suggesting robustness.

In sum, both L1 and L2 readers not only showed plausibility effects consistent with whole-word processing but also activated embedded word meanings when the whole-word interpretation failed. These effects appeared late during processing, in total reading times, with larger L1 effects suggesting higher L1 sensitivity to contextual plausibility.

Pretarget region analyses. As in Experiment 1, statistical models for the pretarget region included each verb’s log-transformed and centered lexical frequency, first- and second-character frequencies and stroke counts as covariates. Including or excluding these did not change fixed effect patterns. Moreover, no group differences appeared for these covariates, unsurprising given that verbs were closely matched across groups.

Analyses revealed main effects of incremental word plausibility in total reading times and regressions-in for both groups. Verbs preceding implausible incremental words elicited longer total reading times and more regressions-in (L1: 663 vs. 487 ms; L2: 1249 vs. 1087 ms; L1 regressions-in: .50 vs. .31; L2: .33 vs. .28). These effects suggest both groups reinspected the pretarget verb when encountering an implausible incremental word.

An L2-only gaze-duration effect also was observed (704 vs. 661 ms) but was absent for L1 (331 vs. 318 ms), indicating L2 parafoveal detection of upcoming incremental word plausibility. L2 gaze durations were generally longer, allowing more parafoveal processing time. These effects exceeded the Bonferroni-corrected threshold (p < .017) and appeared in multiple measures, confirming robustness.

When incremental words were plausible, embedded word plausibility produced no L1 effects but yielded L2 effects in gaze duration (680 vs. 642 ms) and total time (1130 vs. 1045 ms). This indicated L2 sensitivity to embedded word plausibility even when the whole-word was appropriate, with the gaze-duration effect implying that this effect occurred during parafoveal processing. The L2 total-time effect exceeded p < .017; the gaze-duration effect met it exactly, indicating strong but partly ambiguous reliability.

When incremental words were implausible, L1 readers showed embedded word plausibility effects in gaze duration (350 vs. 312 ms), but L2 readers did not (703 vs. 704 ms). Both groups, however, showed effects in total reading time (L1: 726 vs. 559 ms; L2: 1303 vs. 1196 ms) and regressions-in (L1: .58 vs. .42; L2: .35 vs. .30). These results imply increased re-reading for implausible embedded and whole-word interpretations, with only L1 readers showing parafoveal sensitivity. Additionally, a significant Group × Embedded Word Plausibility interaction in total time and regressions-in reflected greater L1 reanalysis effort. The L1 effects exceeded p < .017 and appeared across all three measures. For L2 readers, only total-time effects met this criterion. Thus, L1 readers showed both parafoveal and re-reading effects, whereas L2 readers showed only later re-reading effects.

In summary, L1 gaze-duration effects of embedded word plausibility appeared only when the whole-word interpretation was implausible, whereas L2 readers showed little comparable parafoveal processing. Both groups showed embedded word effects in re-reading, but L1 effects occurred only when the whole-word was implausible, whereas L2 effects appeared regardless of plausibility. Overall, these findings suggest more limited L2 parafoveal processing and greater difficulty using context to resolve segmental ambiguity.

3.3. Discussion

Experiment 2 compared L1 and L2 readers’ processing of sentences containing a three-character incremental word, manipulating the plausibility of both whole-word and embedded word interpretations. L2 readers showed slower and more effortful reading, with longer fixations, shorter saccades and more regressions.

At the target region, both groups showed incremental word plausibility effects. For L1 readers, the effect emerged in early (gaze duration) and late (regressions-in, total-time) measures, indicating sensitivity to word plausibility during lexical processing and integration. For L2 readers, effects appeared only later, suggesting delayed or less efficient contextual integration. No embedded word plausibility effects appeared in first-pass reading. However, L1 readers showed late effects when the incremental word was implausible, implying reanalysis when the preferred interpretation conflicted with context. This pattern indicates lexical access to the embedded word during reanalysis, matching Experiment 1 evidence that embedded words influence processing when context or preview supports activation.

Findings from the pretarget region also showed sensitivity to word plausibility. Both groups had longer total times and more regressions-in for implausible parafoveal words, reflecting integration difficulty. Notably, L2 readers additionally showed a gaze-duration effect, suggesting parafoveal plausibility processing, which may have been enabled by these readers making generally longer fixations. Group differences were clearer for embedded word plausibility. L2 readers showed reliable effects even when the incremental word was plausible, including a gaze-duration effect implying parafoveal access to embedded meaning (again potentially supported by their generally longer fixations). In contrast, L1 readers showed such effects only when the whole word was implausible, consistent with context-driven reanalysis. Under these conditions, L1 readers displayed both early (gaze duration) and late (total-time, regressions-in) effects, indicating rapid detection of ambiguity and flexible access to the embedded word when context disfavored the whole-word meaning. This parallels Experiment 1, where L1 readers showed parafoveal sensitivity to embedded word plausibility before direct fixation. Together, these findings support dynamic L1 engagement with embedded meanings, which may be activated early, following parafoveal preview, and later when integration fails.

Overall, both L1 and L2 readers were sensitive to contextual plausibility and re-read when expectations were violated. However, L1 readers showed greater flexibility and efficiency in accessing embedded meanings, especially when context contradicted whole-word interpretations. In contrast, L2 readers appeared to maintain multiple candidate segmentations longer, suggesting weaker suppression of implausible alternatives.

4. General discussion

Two eye-movement experiments investigated L1 and L2 processing of segmental ambiguities in Chinese reading involving incremental three-character words (e.g., 体育馆, “stadium”) whose first two characters form an embedded word (e.g., 体育, “sport”). In Experiment 1, experimental sentences contained a plausible incremental word, while the plausibility of its embedded word was manipulated; with these sentences compared with control sentences where the incremental word was replaced by its embedded word. This allowed us to investigate whether L1 and L2 readers show similar plausibility effects, by making longer fixations for implausible control target words, and perhaps also implausible embedded words, across conditions. Experiment 2 further investigated these effects by manipulating the plausibility of whole and embedded words independently. Prior work (Zhou & Li, Reference Zhou and Li2021) showed holistic L1 processing of segmental ambiguity, observing control word plausibility effects in their Experiment 1 and whole-word plausibility effects in their Experiment 2, but an absence of embedded word plausibility effects in either experiment. They took these findings to support the parallel activation account of segmental processing implemented in the Chinese Reading Model (CRM; Li & Pollatsek, Reference Li and Pollatsek2020).

Our goal was to establish whether these effects could be replicated for L1 readers and whether high-proficiency L2 readers adopt comparable segmentation strategies. L2 reading was slower in both experiments, consistent with prior evidence of slower L2 reading without word boundaries (Bai et al., Reference Bai, Guo, Cao, Gu, Yan and Zang2012; Bassetti & Lu, Reference Bassetti and Lu2016; Cui, Reference Cui2023; Ma et al., Reference Ma, Li and Zhuang2019; Shen et al., Reference Shen, Liversedge, Tian, Zang, Cui, Bai, Yan and Rayner2012; Yu, Reference Yu2022; Zhou et al., Reference Zhou, Ma, Li and Taft2018, Reference Zhou, Ye and Yan2020). Following Zhou and Li, we reported effects for both a region containing the target word and a pretarget region comprising the verb used to manipulate contextual plausibility.

4.1. Target region effects

Results from both experiments indicate that L1 and L2 readers favor holistic interpretations of segmentally ambiguous words. In Experiment 1, plausibility effects were observed for the control target words, especially among L1 readers, but not for the embedded words in incremental targets, as would be expected if these were segmented first during sentence processing. These findings replicated Zhou and Li (Reference Zhou and Li2021) and so supported the CRM’s assumption that whole-word interpretations dominate when context allows. In Experiment 2, embedded word plausibility effects emerged only when the whole-word interpretation was implausible, particularly for L1 readers, suggesting context-driven reanalysis via embedded word access. Thus, while both groups appear to rely primarily on the holistic processing of segmental ambiguity, it also appears that they can lexically access the embedded words during reanalysis.

Both groups processed sentences incrementally, integrating input with context and showing difficulty when this input conflicted with contextual expectations (Lee & Witzel, Reference Lee and Witzel2023; Williams, Reference Williams2006). Word plausibility effects were weaker for L2 readers, suggesting less efficient integration despite these readers’ high language proficiency. Nonetheless, the L2 readers used segmentation strategies broadly similar to L1 readers, demonstrating native-like resolution of segmental ambiguities.

This research is among the first to use eye movements to examine L2 processing of segmental ambiguity (see also Yao et al., Reference Yao, Jiang, Chen and Li2025). Crucially, the evidence suggests that both L1 and L2 readers employ holistic segmentation strategies but can reanalyze ambiguities flexibly when this is required by context, supporting the view that high-proficiency L2 readers can achieve near-native word segmentation. Future work should explore how these strategies develop across proficiency levels.

One possibility is that L2 readers draw on their L1 experience. Previous studies show cross-linguistic influences in sentence processing, such as orthographic overlap (Libben & Titone, Reference Libben and Titone2009) and syntactic transfer (Bardovi-Harlig & Sprouse, Reference Bardovi-Harlig and Sprouse2018; Schwartz & Sprouse, Reference Schwartz and Sprouse1996). In the Supplemental File, we reported analyses comparing L2 readers whose L1 employs interword spaces with those whose L1 does not, but found no subgroup differences. Similarly, analyses comparing effects in terms of HSK or self-rated language proficiency revealed no clear proficiency effects. These null results suggest that once high proficiency is achieved, segmentation strategies converge. However, further work is required as early-stage learners may still exhibit L1-driven segmentation differences.

4.2. Pretarget verb region effects

Analyses for the pretarget verb region enabled assessment of both parafoveal processing effects and re-reading effects. Parafoveal processing is critical for fluent reading, allowing extraction of information from upcoming words before fixating them (Cutter et al., Reference Cutter, Drieghe, Liversedge, Pollatsek and Treiman2015; Schotter et al., Reference Schotter, Angele and Rayner2012). Prior research shows orthographic and phonological information can be accessed parafoveally. However, whether higher level semantic information, including contextual plausibility, can be parafoveally processed remains debated (for discussion, see Pan et al., Reference Pan, Frisson, Federmeier and Jensen2024; Rayner et al., Reference Rayner, Schotter and Drieghe2014; Schotter, Reference Schotter2013; Schotter et al., Reference Schotter, Milligan and Estevez2023). Some studies suggest that parafoveal processing of semantic information is possible in compact logographic scripts like Chinese (Yan et al., Reference Yan, Richter, Shu and Kliegl2009; Yan & Sommer, Reference Yan and Sommer2015; Pan et al., Reference Pan, Laubrock and Yan2016). Moreover, the studies additionally suggest that this semantic preview information can influence processing of the currently fixated word, that is, a parafoveal-on-foveal effect. However, such findings are controversial (see Li et al., Reference Li, Zhang, Tsai and Puls2014; Drieghe, Reference Drieghe, Liversedge, Gilchrist and Everling2011).

In Experiment 1, an L1 gaze-duration effect of embedded word plausibility suggested parafoveal semantic access during first-pass processing of the verb. A similar L2 trend did not reach significance. Both groups also showed total reading time effects, while L1 readers showed regressions-in effects. These late effects reflect re-reading of the verb after fixating the incremental word, indicating later lexical access to the embedded word interpretation during reanalysis. The regressions-in effect confirms that this lexical access triggered reanalysis. The findings therefore indicate that embedded words were lexically and semantically accessed during segmental ambiguity processing and that this influenced subsequent processing, despite the holistic analysis ultimately being strongly preferred.

In Experiment 2, both groups again showed embedded word plausibility effects at the pretarget region, but with distinct profiles. L2 readers showed early (gaze duration) and late (total-time) effects even when the incremental word was plausible, reflecting parafoveal access, which may have been facilitated by long L2 fixation times on the pretarget verb. In contrast, L1 readers showed such effects only when the incremental word was implausible, with both early and late measures affected. This pattern suggests that L1 readers selectively accessed embedded word candidates when the whole-word analysis conflicted with context, whereas L2 readers maintained multiple interpretations even without conflict. Together, these results confirm parafoveal access to embedded word meanings in both groups, which may differ with how efficiently context guides segmentation and reanalysis.

These group-level differences raise broader questions about L2 parafoveal processing. Both L1 and L2 readers accessed embedded word meanings, including parafoveally. High-proficiency L2 readers therefore appear capable of semantic parafoveal processing, possibly supported by their longer first-pass fixations at the verb. These results nevertheless align with findings showing proficiency in L2 readers parafoveal processing (Tiffin-Richards, Reference Tiffin-Richards2024a,Reference Tiffin-Richardsb), and contrast with other reports of more limited L2 parafoveal processing in Chinese reading (Cong & Chen, Reference Cong and Chen2022; Xiao et al., Reference Xiao, Jia and Wang2021). This may reflect variation in L2 proficiency across studies, and future research should assess how parafoveal processing develops with L2 proficiency.

It is also possible that some parafoveal effects reflect mislocated fixations, which occur when eye movements either undershoot or overshoot the intended target word (Drieghe, Reference Drieghe, Liversedge, Gilchrist and Everling2011; Drieghe et al., Reference Drieghe, Rayner and Pollatsek2008). In our data, some fixations on the pretarget verb may have occurred while readers were processing initial characters of the incremental word. This could mimic parafoveal preview while actually reflecting foveal processing. Crucially, as Chinese lacks interword spaces, readers are less likely to use word boundaries to guide eye movements, so that readers are less likely to fixate optimally on words and likely to fixate a broad range of fixation locations (see Li et al., Reference Li, Rayner and Cave2009; Yan et al., Reference Yan, Kliegl, Richter, Nuthmann and Shu2010). As a result, characters from upcoming words sometimes may fall within foveal vision, with the potential to create apparent parafoveal-on-foveal effects (Yin et al., Reference Yin, Chen, Wen, Zhao, He and Liu2025). Such effects may be especially common among L2 readers, who exhibit less precise saccade-targeting and greater fixation variability (Gnetov & Kuperman, Reference Gnetov and Kuperman2024). While this does not undermine the present evidence for embedded word activation, it suggests such activation may sometimes arise from foveal rather than parafoveal processing. Nevertheless, whether through parafoveal preview or mislocated fixation, our results clearly show embedded word meanings are accessed during segmental processing.

4.3. Implications for the Chinese reading model

The present findings have implications for the CRM. As embedded word meanings appear to be accessed during both early and late processing, we propose that the model should be refined to include a mechanism for transient, context-sensitive activation of embedded constituents. This may arise from parafoveal preview, mislocated fixation or variability in fixation location. As noted earlier, fixation position during word recognition is likely more variable in Chinese than in alphabetic scripts, due to the absence or reduction of word-based eye-guidance effects (Li et al., Reference Li, Liu and Rayner2011; Yan et al., Reference Yan, Kliegl, Richter, Nuthmann and Shu2010; see also Li et al., Reference Li, Li, Wang, McGowan and Paterson2018). As a result, readers sometimes may fixate near the end of the preceding word or within different parts of an incremental word, bringing its embedded portion into foveal or parafoveal vision depending on fixation location. These spatial factors likely influence when and whether embedded word meanings are activated during early processing and this influence of fixation location should be investigated further.

However, early activation alone cannot explain the late embedded word plausibility effects we observed. These suggest that embedded meanings can also be reactivated during postlexical reanalysis, particularly when a whole-word interpretation is implausible. We therefore propose a dual-route CRM: one route supporting early, fixation dependent activation of embedded words and another supporting later, context-driven reactivation. This dual-route approach may also clarify some L1–L2 reading differences, as between-group variability in eye-movement control, parafoveal efficiency and contextual use could modulate both early and late access to embedded words meanings.

4.3. Conclusions

In conclusion, our findings show that both L1 and high-proficiency L2 readers resolve segmental ambiguity in Chinese using broadly similar processing strategies. We also provide novel evidence that embedded word meanings are accessed both parafoveally and during reanalysis by both groups. While the underlying mechanisms appear similar, the timing and strength of effects suggest that L2 readers engage in less efficient or less selective word segmentation processing. Future research should examine how word segmentation mechanisms and parafoveal processing develop during L2 Chinese learning and are shaped by proficiency and reading experience.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/S136672892510076X.

Data availability statement

Datasets, analytic code, stimuli and a Supplemental File are available via the University of Leicester Figshare repository: https://figshare.com/s/50fc0db1beea674be8b0 (Li et al., Reference Li, Ji, Li, Liu, Wang, Gunn and Paterson2025).

Acknowledgements

Lin Li and Kevin Paterson are joint corresponding authors. Lin Li and Kevin Paterson designed the experiments with help from the other authors. Lin Li designed the stimuli and analyzed the data. Jingyi Liu, Shan Wang and Yaning Ji collected the data. Yaning Ji and Lin Li revised the statistical analysis and data reporting. Kevin Paterson and Lin Li wrote the manuscript with critical comments from Sha Li and Sarah Gunn. The ideas and data appearing in the manuscript have not been disseminated before (e.g., at a conference or meeting, posted on a listserv, shared on a website).

Funding statement

This work was supported by a grant from the Annual Project of Philosophy and Social Sciences Planning in Tianjin to Lin Li (TJXL24-002) and a Haihe Invited Professorship awarded to Kevin Paterson.

Competing interests

The authors declare no competing interests related to this research.

Footnotes

This research article was awarded Open Data badge for transparent practices. See the Data Availability Statement for details.

References

Baayen, R. H., Davidson, D. J., & Bates, D. M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language, 59(4), 390412. http://doi.org/10.1016/j.jml.2007.12.005.CrossRefGoogle Scholar
Bai, X., Yan, G., Liversedge, S. P., Zang, C., & Rayner, K. (2008). Reading spaced and unspaced Chinese text: Evidence from eye movements. Journal of Experimental Psychology. Human Perception and Performance, 34(5), 12771287. https://doi.org/10.1037/0096-1523.34.5.1277.CrossRefGoogle ScholarPubMed
Bai, X., Guo, Z., Cao, Y., Gu, J., Yan, G., & Zang, C. (2012). Effect of word segmentation on Hindi-English bilinguals’ reading: Evidence from eye movements. Journal of Psychological Science, 35(3), 544549.Google Scholar
Bai, X. J., Liang, F. F., Blythe, H. I., Zang, C. L., Yan, G. L., & Liversedge, S. P. (2013). Interword spacing effects on the acquisition of new vocabulary for readers of Chinese as a second language. Journal of Research in Reading, 36, 417.10.1111/j.1467-9817.2013.01554.xCrossRefGoogle Scholar
Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J. (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68(3), 255278. https://doi.org/10.1016/j.jml.2012.11.001.CrossRefGoogle ScholarPubMed
Bardovi-Harlig, K., & Sprouse, R. A. (2018). Negative versus positive transfer. The TESOL Encyclopedia of English Language Teaching, 16. https://doi.org/10.1002/9781118784235.eelt0084.Google Scholar
Bassetti, B. (2009). Effects of adding interword spacing on Chinese reading: A comparison of Chinese native readers and English readers of Chinese as a second language. Applied PsychoLinguistics, 30(4), 757775. https://doi.org/10.1017/S0142716409990105.CrossRefGoogle Scholar
Bassetti, B., & Lu, M. H. (2016). Effects of interword spacing on native English readers of Chinese as a second language. International Review of Applied Linguistics in Language Teaching, 54(1), 122. https://doi.org/10.1515/iral-2016-0014.CrossRefGoogle Scholar
Bassetti, B., & Masterson, J. (2012). Effects of removing morphemic information and adding interword spacing on reading in Chinese experienced and inexperienced readers. Reading and Writing, 25(9), 22912314. https://doi.org/10.1007/s11145-012-9358-7.CrossRefGoogle Scholar
Bates, D., Maechler, M., & Bolker, B. (2011). LME4: Linear mixed-effects models using S4 classes. R package version 0.999375-42. http://CRAN.R-project.org/package=lme4Google Scholar
Beijing Language Institute. (1986). Modern Chinese frequency dictionary (Xian Dai Han Yu ci pin ci Dian). Beijing Language Institute Press.Google Scholar
Brothers, T., Hoversten, L. J., & Traxler, M. J. (2021). Bilinguals on the garden-path: Individual differences in syntactic ambiguity resolution. Bilingualism: Language and Cognition, 24(4), 612627. https://doi.org/10.1017/S1366728920000711.CrossRefGoogle ScholarPubMed
Choi, S. H., & Ionin, T. (2021). Plural marking in the second language: Atomicity, definiteness, and transfer. Applied PsychoLinguistics, 42(3), 549578. https://doi.org/10.1017/S0142716420000569.CrossRefGoogle Scholar
Conklin, K., Alotaibi, S., Pellicer-Sánchez, A., & Vilkaitė-Lozdienė, L. (2020). What eye-tracking tells us about reading-only and reading-while-listening in a first and second language. Second Language Research, 36(3), 257276. https://doi.org/10.1177/0267658320921496.CrossRefGoogle Scholar
Cong, F., & Chen, B. (2022). Parafoveal orthographic processing in bilingual reading. International Journal of Bilingual Education and Bilingualism, 25(10), 36983710. https://doi.org/10.1080/13670050.2022.2073786.CrossRefGoogle Scholar
Cop, U., Dirix, N., Van Assche, E., Drieghe, D., & Duyck, W. (2017). Reading a book in one or two languages? An eye movement study of cognate facilitation in L1 and L2 reading. Bilingualism: Language and Cognition, 20(4), 747769. https://doi.org/10.1017/S1366728916000213.CrossRefGoogle Scholar
Cop, U., Drieghe, D., & Duyck, W. (2015). Eye movement patterns in natural reading: A comparison of monolingual and bilingual reading of a novel. PLoS One, 10(8), e0135896.10.1371/journal.pone.0134008CrossRefGoogle Scholar
Cui, L., Drieghe, D., Yan, G. L., Bai, X. J., Chi, H., & Liversedge, S. P. (2012). Parafoveal processing across different lexical constituents in Chinese reading. The Quarterly Journal of Experimental Psychology, 66, 403416.10.1080/17470218.2012.720265CrossRefGoogle ScholarPubMed
Cui, Y. (2023). Eye movements of second language learners when reading spaced and unspaced Chinese texts. Frontiers in Psychology, 13(14), 783960. https://doi.org/10.3389/fpsyg.2023.783960.CrossRefGoogle Scholar
Cunnings, I. (2017). Parsing and working memory in bilingual sentence processing. Bilingualism: Language and Cognition, 20(4), 659678. https://doi.org/10.1017/S1366728916000675.CrossRefGoogle Scholar
Cutter, M., Drieghe, D., & Liversedge, S. P. (2015). How is information integrated across fixations in reading? In Pollatsek, A., & Treiman, R. (Eds.), The Oxford handbook of Reading (pp. 245260). Oxford University Press. https://doi.org/10.1093/oxfordhb/9780199324576.013.11Google Scholar
Drieghe, D. (2011). Parafoveal-on-foveal effects on eye movements during reading. In Liversedge, S. P., Gilchrist, I. D., & Everling, S. (Eds.), The Oxford handbook of eye movements, Oxford University Press, (pp. 839856).Google Scholar
Drieghe, D., Rayner, K., & Pollatsek, A. (2008). Mislocated fixations can account for parafoveal-on-foveal effects in eye movements during reading. Quarterly Journal of Experimental Psychology, 61(8), 12391249. https://doi.org/10.1080/17470210701467953.CrossRefGoogle ScholarPubMed
Engbert, R., Nuthmann, A., Richter, E. M., & Kliegl, R. (2005). SWIFT: A dynamical model of saccade generation during reading. Psychological Review, 112, 777813.10.1037/0033-295X.112.4.777CrossRefGoogle Scholar
Fernandez, L. B., Bothe, R., & Allen, S. E. M. (2023). The role of L1 reading direction on L2 perceptual span: An eye-tracking study investigating Hindi and Urdu speakers. Second Language Research, 39(2), 447469. https://doi.org/10.1177/02676583211049742&nbsp.CrossRefGoogle Scholar
Fernandez, L. B., Scheepers, C., & Allen, S. E. M. (2021). Cross-linguistic differences in parafoveal semantic and orthographic processing. Attention, Perception, & Psychophysics, 83, 31833200. https://doi.org/10.3758/s13414-021-02329-7.CrossRefGoogle ScholarPubMed
Frenck-Mestre, C., Kroll, J. F., & De Groot, A. M. B. (2005). Ambiguities and anomalies: What can eye movements and event-related potentials reveal about second language sentence processing. In Handbook of bilingualism: Psycholinguistic approaches (pp. 268281). Oxford University Press.Google Scholar
Gnetov, D., & Kuperman, V. (2024). Reading proficiency predicts spatial eye-movement control in the first and second language. Journal of Experimental Psychology: Learning, Memory, and Cognition, 50(8), 13151328. https://doi.org/10.1037/xlm0001325.Google ScholarPubMed
Green, P., & MacLeod, C. J. (2016). SIMR: an R package for power analysis of generalized linear mixed models by simulation. Methods in Ecology and Evolution, 7(4), 493498. Portico. https://doi.org/10.1111/2041-210x.12504.CrossRefGoogle Scholar
Hatzidaki, A., Branigan, H. P., & Pickering, M. J. (2011). Co-activation of syntax in bilingual language production. Cognitive Psychology, 62(2), 123150. https://doi.org/10.1016/j.cogpsych.2010.10.002.CrossRefGoogle ScholarPubMed
Huang, L., & Li, X. (2020). Early, but not overwhelming: The effect of prior context on segmenting overlapping ambiguous strings when reading Chinese. Quarterly Journal of Experimental Psychology: Human Experimental Psychology, 73(9), 13821395. https://doi.org/10.1177/1747021820926012.CrossRefGoogle Scholar
Huang, L., & Li, X. (2023). The effects of lexical- and sentence-level contextual cues on Chinese word segmentation. Psychonomic Bulletin & Review. https://doi.org/10.3758/s13423-023-02336-9.Google ScholarPubMed
Inhoff, A. W., & Liu, W. (1998). The perceptual span and oculomotor activity during the reading of Chinese sentences. Journal of Experimental Psychology: Human Perception and Performance, 24(1), 2034. https://doi.org/10.1037/0096-1523.24.1.20.Google ScholarPubMed
Inhoff, A. W., & Wu, C. (2005). Eye movements and the identification of spatially ambiguous words during Chinese sentence reading. Memory & Cognition, 33(8), 13451356. https://doi.org/10.3758/BF03193367.CrossRefGoogle ScholarPubMed
Institute of Linguistics, Chinese Academy of Social Sciences. (2024). Xiandai Hanyu Cidian [Modern Chinese dictionary] (7th ed.). The Commercial Press.Google Scholar
Jordan, T. R., Almabruk, A. A., Gadalla, E. A., McGowan, V. A., White, S. J., Abedipour, L., & Paterson, K. B. (2014). Reading direction and the central perceptual span. Evidence from Arabic and English. Psychonomic Bulletin & Review, 21, 505511.10.3758/s13423-013-0510-4CrossRefGoogle ScholarPubMed
Kaan, E., & Grüter, T. (2021). Prediction in second language processing and learning: Advances and directions. In Kaan, E. and Grüter, T. (Eds.), Prediction and second language processing (pp. 124). John Benjamins. https://doi.org/10.1075/bpa.12.01kaa.CrossRefGoogle Scholar
Kidd, E., Chan, A., & Chiu, J. (2015). Cross-linguistic influence in simultaneous Cantonese–English bilingual children’s comprehension of relative clauses. Bilingualism: Language and Cognition, 18(3), 438452. https://doi.org/10.1017/S1366728914000649.CrossRefGoogle Scholar
Kroll, J. F., Bice, K., Botezatu, M. R., & Zirnstein, M. (2022). On the dynamics of lexical access in two or more languages. In Papafragou, A., Trueswell, J. C., & Gleitman, L. R. (Eds.), The Oxford handbook of the mental lexicon,Oxford handbooks. Academic. https://doi.org/10.1093/oxfordhb/9780198845003.013.26.Google Scholar
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2014). lmerTest: Tests for random and fixed effects for linear mixed effect models (lmer objects of lme4 package; R package Version 2.0–6) [Computer software]. http://CRAN.Rproject.org/package=lmerTestGoogle Scholar
Lee, J., & Witzel, J. (2023). Plausibility and structural reanalysis in L1 and L2 sentence comprehension. Quarterly Journal of Experimental Psychology, 76(2), 319337. https://doi.org/10.1177/17470218221092400.CrossRefGoogle ScholarPubMed
Leung, C. Y., Sugiura, M., Abe, D., & Yoshikawa, L. (2014). The perceptual span in second language reading: An eye-tracking study using a gaze-contingent moving window paradigm. Open Journal of Modern Linguistics, 4(05), 585.10.4236/ojml.2014.45051CrossRefGoogle Scholar
Libben, M. R., & Titone, D. A. (2009). Bilingual lexical access in context: Evidence from eye movements during reading. Journal of Experimental Psychology. Learning, Memory, and Cognition, 35(2), 381390. https://doi.org/10.1037/a0014875.CrossRefGoogle ScholarPubMed
Li, L., Ji, Y., Li, S., Liu, J., Wang, S., Gunn, S., & Paterson, K. B. (2025). Unpacking word segmentation processes by L2 Chinese readers: Evidence from eye movements [data]. University of Leicester. https://figshare.le.ac.uk/articles//_/27139821Google Scholar
Li, L., Bao, L., Li, Z., Li, S., Liu, J., Wang, P., Warrington, K. L., Gunn, S., & Paterson, K. B. (2024). Efficient word segmentation is preserved in older adult readers: Evidence from eye movements during Chinese reading. Psychology and Aging, 39(3), 215230. https://doi.org/10.1037/pag0000813.CrossRefGoogle ScholarPubMed
Li, P., Zhang, F., Tsai, E., & Puls, B. (2014). Language history questionnaire (LHQ 2.0): A new dynamic web-based research tool. Bilingualism: Language and Cognition, 17(3), 673680. http://doi.org/10.1017/S1366728913000606.CrossRefGoogle Scholar
Li, P., Zhang, F., Yu, A., & Zhao, X. (2019). Language history questionnaire (LHQ3): An enhanced tool for assessing multilingual experience. Bilingualism: Language and Cognition, 23(5), 938944. http://doi.org/10.1017/S1366728918001153.CrossRefGoogle Scholar
Li, S., Li, L., Wang, J., McGowan, V. A., & Paterson, K. B. (2018). Effects of word length on eye guidance differ for young and older Chinese readers. Psychology and Aging, 33(4), 685692. https://doi.org/10.1037/pag0000258.CrossRefGoogle Scholar
Li, X., Huang, L., Yao, P., & Hyönä, J. (2022). Universal and specific reading mechanisms across different writing systems. Nature Reviews Psychology, 1, 133144. https://doi.org/10.1038/s44159-022-00022-6.CrossRefGoogle Scholar
Li, X., Liu, P., & Rayner, K. (2011). Eye movement guidance in Chinese reading: Is there a preferred viewing location? Vision Research, 51(10), 11461156. https://doi.org/10.1016/j.visres.2011.03.004.CrossRefGoogle Scholar
Li, X., & Pollatsek, A. (2020). An integrated model of word processing and eye-movement control during Chinese reading. Psychological Review, 127(6), 11391162. http://doi.org/10.1037/rev0000248.CrossRefGoogle ScholarPubMed
Li, X., Rayner, K., & Cave, K. R. (2009). On the segmentation of Chinese words during reading. Cognitive Psychology, 58(4), 525552. http://doi.org/10.1016/j.cogpsych.2009.02.003.CrossRefGoogle ScholarPubMed
Li, X., Zang, C., Liversedge, S. P., Pollatsek, A. (2015). The role of words in Chinese reading. In Pollatsek, A. & Treiman, R. (Eds.), The Oxford handbook of Reading (pp. 232244). Oxford University Press.Google Scholar
Liversedge, S. P., Drieghe, D., Li, X., Yan, G., Bai, X., & Hyönä, J. (2016). Universality in eye movements and reading: A trilingual investigation. Cognition, 147, 120. https://doi.org/10.1016/j.cognition.2015.10.013.CrossRefGoogle Scholar
Liversedge, S. P., & Findlay, M. (2000). Saccadic eye movements and cognition. Trends in Cognitive Sciences, (1), 614.10.1016/S1364-6613(99)01418-7CrossRefGoogle ScholarPubMed
Ma, G., Li, X., & Rayner, K. (2014). Word segmentation of overlapping ambiguous strings during Chinese reading. Journal of Experimental Psychology: Human Perception & Performance, 40, 10461059.Google ScholarPubMed
Ma, G., Li, D., & Zhuang, X. (2019). Do visual word segmentation cues improve reading performance in Chinese reading? Ergonomics, 62(8), 10861097. https://doi.org/10.1080/00140139.2019.1608315.CrossRefGoogle ScholarPubMed
McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation model of context effects in letter perception: I. An account of basic findings. Psychological Review, 88(5), 375407. https://doi.org/10.1037/0033-295X.88.5.375.CrossRefGoogle Scholar
McConkie, G. W., & Rayner, K. (1975). The span of the effective stimulus during a fixation in reading. Perception & Psychophysics, 17(6), 578586.10.3758/BF03203972CrossRefGoogle Scholar
McConkie, G. W., & Rayner, K. (1976). Asymmetry of the perceptual span in reading. Bulletin of the Psychonomic Society, 8(5), 365368. https://doi.org/10.3758/bf03335168.CrossRefGoogle Scholar
Pan, J., Laubrock, J., & Yan, M. (2016). Parafoveal processing in silent and oral Reading: Reading mode influences the relative weighting of phonological and semantic information in Chinese. Journal of Experimental Psychology: Learning, Memory, and Cognition, 42(8), 12571273. https://doi.org/10.1037/xlm0000242.Google ScholarPubMed
Pan, Y., Frisson, S., Federmeier, K. D., & Jensen, O. (2024). Early parafoveal semantic integration in natural reading, eLife, 12, RP91327. https://doi.org/10.7554/eLife.91327.4.CrossRefGoogle ScholarPubMed
Paterson, K. B., McGowan, V. A., White, S. J., Malik, S., Abedipour, L., & Jordan, T. R. (2014). Reading direction and the central perceptual span in Urdu and English. PLoS One, 9(2), e88358.10.1371/journal.pone.0088358CrossRefGoogle ScholarPubMed
Perfetti, C. A., & Tan, L. H. (1999). The constituency model of Chinese word identification. In Wang, J., Inhoff, A. W., & Chen, H.-C. (Eds.), Reading Chinese script: A cognitive analysis (pp. 115134). Lawrence Erlbaum Associates Publishers.Google Scholar
R Core Team (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.Google Scholar
Rayner, K. (1998). Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, 124(3), 372422. https://doi.org/10.1037/0033-2909.124.3.372.CrossRefGoogle ScholarPubMed
Rayner, K., McConkie, G. W., & Zola, D. (1980). Integrating information across eye movements. Cognitive Psychology, 12(2), 206226. https://doi.org/10.1016/0010-0285(80)90009-2.CrossRefGoogle ScholarPubMed
Rayner, K. (2009). The thirty-fifth sir Frederick Bartlett lecture: Eye movements and attention in reading, scene perception, and visual search. Quarterly Journal of Experimental Psychology, 62(8), 14571506. https://doi.org/10.1080/17470210902816461.CrossRefGoogle Scholar
Rayner, K., Schotter, E. R., & Drieghe, D. (2014). Lack of semantic parafoveal preview benefit in reading revisited. Psychonomic Bulletin & Review, 21(4), 10671072. https://doi.org/10.3758/s13423-014-0582-9.CrossRefGoogle ScholarPubMed
Reichle, E. D., Pollatsek, A., Fisher, D. L., & Rayner, K. (1998). Toward a model of eye movement control in reading. Psychological Review, 105(1), 125157. https://doi.org/10.1037/0033-295X.105.1.125.CrossRefGoogle Scholar
Reichle, E. D., Rayner, K., & Pollatsek, A. (2003). The E-Z reader model of eye-movement control in reading: Comparisons to other models. Behavioral and Brain Sciences, 26(4), 455476. https://doi.org/10.1017/S0140525X03000104.CrossRefGoogle Scholar
Reichle, E. D., & Yu, L. (2018). Models of Chinese reading: Review and analysis. Cognitive Science, 42(S4), 11541165. http://doi.org/10.1111/cogs.12564.CrossRefGoogle Scholar
Reichle, E. D., Warren, T., & McConnell, K. (2009). Using EZ reader to model the effects of higher level language processing on eye movements during reading. Psychonomic Bulletin & Review, 16(1), 121. http://doi.org/10.3758/PBR.16.1.1.CrossRefGoogle ScholarPubMed
Roberts, L., & Felser, C. (2011). Plausibility and recovery from garden paths in second language sentence processing. Applied PsychoLinguistics, 32(2), 299331. https://doi.org/10.1017/S0142716410000421.CrossRefGoogle Scholar
Roberts, L., & Siyanova-Chanturia, A. (2013). Using eye-tracking to investigate topics in L2 acquisition and L2 processing. Studies in Second Language Acquisition, 35(2), 213235. https://doi.org/10.1017/S0272263112000861.CrossRefGoogle Scholar
Sasson, A., Schiff, R., & Zluf, B. (2024). Syntactic knowledge in a foreign language: Examining cross-language transfer effects in L2 noun phrase comprehension. Reading and Writing, 27, 126. https://doi.org/10.1007/s11145-024-10569-w.Google Scholar
Schad, D. J., Vasishth, S., Hohenstein, S., & Kliegl, R. (2020). How to capitalize on a priori contrasts in linear (mixed) models: A tutorial. Journal of Memory and Language, 110, 104038. https://doi.org/10.1016/j.jml.2019.104038.CrossRefGoogle Scholar
Schlenter, J. (2023). Prediction in bilingual sentence processing: How prediction differs in a later learned language from a first language. Bilingualism: Language and Cognition, 26(2), 253267. https://doi.org/10.1017/S1366728922000736.CrossRefGoogle Scholar
Schotter, E. R. (2013). Synonyms provide semantic preview benefit in English. Journal of Memory and Language, 69, 619633. https://doi.org/10.1016/j.jml.2013.09.002.CrossRefGoogle ScholarPubMed
Schotter, E. R., Angele, B., & Rayner, K. (2012). Parafoveal processing in reading. Attention, Perception & Psychophysics, 74(1), 535. https://doi.org/10.3758/s13414-011-0219-2.CrossRefGoogle ScholarPubMed
Schotter, E. R., Milligan, S., & Estevez, V. M. (2023). Event-related potentials show that parafoveal vision is insufficient for semantic integration. Psychophysiology, 60(7), e14246. https://doi.org/10.1111/psyp.14246.CrossRefGoogle ScholarPubMed
Schwartz, A. I., & Kroll, J. F. (2006). Bilingual lexical activation in sentence context. Journal of Memory and Language, 55(2), 197212. https://doi.org/10.1016/j.jml.2006.03.004.CrossRefGoogle Scholar
Schwartz, B. D., & Sprouse, R. A. (1996). L2 cognitive states and the Full Transfer/Full Access model. Second Language Research, 12(1), 4072. https://doi.org/10.1177/026765839601200103CrossRefGoogle Scholar
Shen, H. H., & Dai, D. (2024). Word segmentation and reading comprehension among advanced learners of Chinese. Reading in a Foreign Language, 36(1), 127. https://doi.org/10.64152/10125/67467CrossRefGoogle Scholar
Shen, D. L., Liversedge, S. P., Tian, J., Zang, C. L., Cui, L., Bai, X. J., Yan, G. L., & Rayner, K. (2012). Eye movements of second language learners when reading spaced and unspaced Chinese text. Journal of Experimental Psychology: Applied, 18(2), 192202.Google ScholarPubMed
Shen, H. H. (2018). Chinese as a second language reading: Lexical access and text comprehension. In Ke, C. (Ed.), The Routledge handbook of Chinese second language acquisition. (1st ed.). Routledge.Google Scholar
Shen, H. H., & Jiang, X. (2013). Character reading fluency, word segmentation accuracy, and reading comprehension in L2 Chinese. Reading in a Foreign Language, 25(1), 125.10.64152/10125/66679CrossRefGoogle Scholar
Snell, J., & Grainger, J. (2019). Readers are parallel processors. Trends in Cognitive Sciences, 23, 537546.10.1016/j.tics.2019.04.006CrossRefGoogle ScholarPubMed
Staub, A. (2015). The effect of lexical predictability on eye movements in reading: Critical review and theoretical interpretation. Language and Linguistics Compass, 9(8), 311327. https://doi.org/10.1111/lnc3.12151.CrossRefGoogle Scholar
Tiffin-Richards, S. P. (2024a). Bilingual parafoveal processing: Children and adults preprocess orthographic information of the upcoming word during sentence reading in their first and second language. Journal of Experimental Psychology. Learning, Memory, and Cognition, 50(11), 18441861. https://doi.org/10.1037/xlm0001346.CrossRefGoogle Scholar
Tiffin-Richards, S. P. (2024b). Cognate facilitation in bilingual reading: The influence of orthographic and phonological similarity on lexical decisions and eye-movements. Bilingualism: Language and Cognition, 118. https://doi.org/10.1017/S1366728923000949.Google Scholar
Titone, D., Libben, M., Mercier, J., Whitford, V., & Pivneva, I. (2011). Bilingual lexical access during L1 sentence reading: The effects of L2 knowledge, semantic constraint, and L1-L2 intermixing. Journal of Experimental Psychology: Learning, Memory, and Cognition, 37(6), 14121431. https://doi.org/10.1037/a0024492.Google ScholarPubMed
Van Assche, E., Drieghe, D., Duyck, W., Welvaert, M., & Hartsuiker, R. J. (2011). The influence of semantic constraints on bilingual word recognition during sentence reading. Journal of Memory and Language, 64(1), 88107. https://doi.org/10.1016/j.jml.2010.08.006.CrossRefGoogle Scholar
Venables, W. N., & Ripley, B. D. (2002). Modern applied statistics with S (4th ed.). Springer. https://doi.org/10.1007/978-0-387-21706-2CrossRefGoogle Scholar
von der Malsburg, T., & Angele, B. (2017). False positives and other statistical errors in standard analyses of eye movements in reading. Journal of Memory and Language, 94, 119133. https://doi.org/10.1016/j.jml.2016.10.003.CrossRefGoogle ScholarPubMed
Wagenmakers, E. J., Krypotos, A. M., Criss, A. H., & Iverson, G. (2012). On the interpretation of removable interactions: A survey of the field 33 years after Loftus. Memory & Cognition, 40(2), 145160. https://doi.org/10.3758/s13421-011-0158-0.CrossRefGoogle ScholarPubMed
Wang, J., Angele, B., Ma, G., & Li, X. (2021). Repetition causes confusion: Insights to word segmentation during Chinese reading. Journal of Experimental Psychology. Learning, Memory, and Cognition, 47(1), 147156. https://doi.org/10.1037/xlm0000817.CrossRefGoogle ScholarPubMed
Wang, A., Yeon, J., Zhou, W., Shu, H., & Yan, M. (2016). Cross-language parafoveal semantic processing: Evidence from Korean–Chinese bilinguals. Psychonomic Bulletin & Review, 23, 285290. https://doi.org/10.3758/s13423-015-0876-6.CrossRefGoogle ScholarPubMed
Wei, W., Li, X., & Pollastsek, A. (2013). Word properties of a fixated region affect outgoing saccade length in Chinese reading. Vision Research, 80, 16. http://doi.org/10.1016/j.visres.2012.11.015.CrossRefGoogle ScholarPubMed
White, A. L., Palmer, J., & Boynton, G. M. (2018). Evidence of serial processing in visual word recognition. Psychological Science, 29, 10621071.10.1177/0956797617751898CrossRefGoogle ScholarPubMed
White, A. L., Palmer, J., & Boynton, G. M. (2020). Visual word recognition: Evidence for a serial bottleneck in lexical access. Attention, Perception & Psychophysics, 82, 20002017. https://doi.org/10.3758/s13414-019-01916-z.CrossRefGoogle ScholarPubMed
Whitford, V., & Titone, D. (2012). Second-language experience modulates first- and second-language word frequency effects: Evidence from eye movement measures of natural paragraph reading. Psychonomic Bulletin & Review, 19(1), 7380. https://doi.org/10.3758/s13423-011-0179-5.CrossRefGoogle ScholarPubMed
Whitford, V., & Titone, D. (2015). Second-language experience modulates eye movements during first-and second-language sentence reading: Evidence from a gaze-contingent moving window paradigm. Journal of Experimental Psychology: Learning, Memory, and Cognition, 41(4), 11181129. http://doi.org/10.1037/xlm0000093.Google ScholarPubMed
Williams, J. N. (2006). Incremental interpretation in second language sentence processing. Bilingualism: Language and Cognition, 9(1), 7188. https://doi.org/10.1017/S1366728905002385.CrossRefGoogle Scholar
World Medical Association. (2013). WMA Declaration of Helsinki: Ethical principles for medical research involving human subjects. http://www.wma.net/en/30publications/10policies/b3/Google Scholar
Xiao, X. Z., Jia, G., & Wang, A. (2021). Semantic preview benefit of Tibetan-Chinese bilinguals during Chinese reading. Language Learning and Development, 19(1), 115. https://doi.org/10.1080/15475441.2021.2003198.CrossRefGoogle Scholar
Xu, M., & Jordan, T. R. (2009). Assessing effects of viewing distance on normal Chinese reading: Some methodological and theoretical implications. Behavioral Research Methods, 41(4), 971976. https://doi.org/10.3758/BRM.41.4.971.CrossRefGoogle ScholarPubMed
Yan, M., Kliegl, R., Richter, E. M., Nuthmann, A., & Shu, H. (2010). Flexible saccade-target selection in Chinese reading. Quarterly Journal of Experimental Psychology, 63(4), 705725. https://doi.org/10.1080/17470210903114858.CrossRefGoogle ScholarPubMed
Yan, M., & Sommer, W. (2015). Parafoveal-on-foveal effects of emotional word semantics in reading Chinese sentences: Evidence from eye movements. Journal of Experimental Psychology: Learning, Memory, and Cognition, 41(4), 1237. https://doi.org/10.1037/xlm0000095.Google ScholarPubMed
Yan, M., Richter, E. M., Shu, H., & Kliegl, R. (2009). Readers of Chinese extract semantic information from parafoveal words. Psychonomic Bulletin & Review, 16, 561566. https://doi.org/10.3758/PBR.16.3.561.CrossRefGoogle ScholarPubMed
Yang, S. (2021). Investigating word segmentation of Chinese second language learners. Reading and Writing, 34, 12731293. https://doi.org/10.1007/s11145-020-10113-6.CrossRefGoogle Scholar
Yang, J., Staub, A., Li, N., Wang, S., & Rayner, K. (2012). Plausibility effects when reading one- and 2-character words in Chinese: Evidence from eye movements. Journal of Experimental Psychology: Learning, Memory, and Cognition, 38(6), 18011809. http://doi.org/10.1037/a0028478.Google ScholarPubMed
Yao, P., Jiang, X., Chen, X., & Li, X. (2025). Exploring the processing unit of L2 Chinese learners in on-line Chinese reading. Second Language Research, 41(1), 319. https://doi.org/10.1177/02676583231202606.CrossRefGoogle Scholar
Yin, Z., Chen, S., Wen, N., Zhao, B., He, Z., & Liu, Y. (2025). Repeated previews elicit an inhibitory parafoveal-on-foveal effect in Chinese reading: Implications for attention allocation. Journal of Experimental Psychology: Learning, Memory, and Cognition. Advance online publication. https://doi.org/10.1037/xlm0001471Google ScholarPubMed
Yu, Q. (2022). Chinese text presentations and reading efficiency. Reading in a Foreign Language, 34(1), 142164. http://hdl.handle.net/10125/67416 10.64152/10125/67416CrossRefGoogle Scholar
Yu, L., Liu, Y., & Reichle, E. D. (2021). A corpus-based versus experimental examination of word-and character-frequency effects in Chinese reading: Theoretical implications for models of reading. Journal of experimental psychology: General, 150(8), 16121641. https://doi.org/10.1037/xge0001014CrossRefGoogle Scholar
Zang, C., Liang, F., Bai, X., Yan, G., & Liversedge, S. P. (2013). Interword spacing and landing position effects during Chinese reading in children and adults. Journal of Experimental Psychology: Human Perception and Performance, 39(3), 720734. http://doi.org/10.1037/a0030097.Google ScholarPubMed
Zang, C., Liversedge, S. P., Bai, X., Yan, G. (2011). Eye movements during Chinese reading. In Liversedge, S. P., Gilchrist, I. D., & Everling, S. (Eds.), The Oxford handbook of eye movements (pp. 961978). Oxford University Press. https://doi.org/10.1093/oxfordhb/9780199539789.001.0001.Google Scholar
Zhang, J., Huang, Y., Jiang, C., Xu, Y., Rao, H., & Xu, H. (2023). Dynamic brain responses to Russian word acquisition among Chinese adult learners: An event-related potential study. Human Brain Mapping, 44(9), 37173729. https://doi.org/10.1002/hbm.26307.CrossRefGoogle ScholarPubMed
Zhou, J., & Li, X. (2021). On the segmentation of Chinese incremental words. Journal of Experimental Psychology: Learning, Memory, and Cognition, 47(8), 13531368. https://doi.org/10.1037/xlm0000984.Google ScholarPubMed
Zhou, J., Ma, G., Li, X., & Taft, M. (2018). The time course of incremental word processing during Chinese reading. Reading and Writing, 31(3), 607625. https://doi.org/10.1007/s11145-017-9800-y.CrossRefGoogle Scholar
Zhou, W., Ye, W., & Yan, M. (2020). Alternating-color words facilitate reading and eye movements among second-language learners of Chinese. Applied PsychoLinguistics, 41(3), 685699. https://doi.org/10.1017/S014271642000021.CrossRefGoogle Scholar
Figure 0

Table 1. Example sentence stimuli for Experiment 1

Figure 1

Table 2. Mean eye movement measures for target and pretarget regions in Experiment 1

Figure 2

Table 3. Summary of statistical effects for Experiment 1

Figure 3

Figure 1. Embedded word plausibility effects for the pretarget region in Experiment 1, in (A) gaze durations and (B) total reading times.

Figure 4

Table 4. Mean eye movements for pretarget and target regions in Experiment 2

Figure 5

Table 5. Summary of statistical effects for Experiment 2

Figure 6

Figure 2. Embedded word plausibility effects for plausible incremental target words in Experiment 2, in (A) gaze durations for the pretarget region and (B) total reading times for the target region.

Figure 7

Figure 3. Embedded word plausibility effects for implausible incremental target words in Experiment 2, in (A) gaze durations for the pretarget region and (b) total reading times for the target region.

Supplementary material: File

Li et al. supplementary material

Li et al. supplementary material
Download Li et al. supplementary material(File)
File 133 KB