The neural processing of the interaction between accentuation and lexical prediction during spoken sentence comprehension

Yan Yuan; Zhiren Zheng; Yu-Fu Chien; Chunhai Gao; Weijun Li

doi:10.1017/langcog.2025.12

The neural processing of the interaction between accentuation and lexical prediction during spoken sentence comprehension

Published online by Cambridge University Press: 22 May 2025

Yan Yuan ,

Chunhai Gao and

Yan Yuan: Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, China
Zhiren Zheng: Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, China
Yu-Fu Chien: Affiliation:
Department of Chinese Language and Literature, Fudan University, Shanghai, China
Chunhai Gao*: Affiliation:
Faculty of Education, Shenzhen University, Shenzhen, China
Weijun Li*: Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, China
*: Corresponding authors: Weijun Li and Chunhai Gao; Emails: liwj@lnnu.edu.cn; chunhaigao@hotmail.com
Corresponding authors: Weijun Li and Chunhai Gao; Emails: liwj@lnnu.edu.cn; chunhaigao@hotmail.com

Article contents

Abstract
Introduction
Methods
Discussion
Conclusion
Data availability statement
Competing interests
References

Rights & Permissions

Abstract

Language comprehension requires integration of multiple cues, but the underlying mechanisms of how accentuation, as a significant prosodic feature, influences the processing of words with different levels of cloze probability remains unclear. This study exploits event-related potentials (ERPs) to examine the processing of accented and unaccented words with high-, medium-, and low-cloze probabilities embedded in the final position of highly constrained contexts during spoken sentence comprehension. Our results indicate that accentuation and cloze probability interact across the N400 and post-N400 positivity (PNP) time windows. Under the accented condition, N400 amplitudes gradually increased as cloze probability decreased. Conversely, under the unaccented condition, PNP amplitudes gradually increased as cloze probability decreased with a frontal distribution. These results suggest that the effect of predictability is influenced by accentuation, which is likely due to the processing speed and depth of the critical words, modulated by the amount of attentional resources allocated to them.

Keywords

Accentuation Cloze probability ERPs Spoken comprehension

Information

Type: Article
Information: Language and Cognition , Volume 17 , 2025 , e48

DOI: https://doi.org/10.1017/langcog.2025.12 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

1. Introduction

During spoken language comprehension, one of the challenging tasks that listeners need to tackle is to extract information from rapidly unfolding acoustic signals. To do so, listeners utilize linguistic and non-linguistic contextual cues to predict forthcoming information. Speech processing would be facilitated if the incoming information aligns with these predictions. Word predictability is commonly operationalized as ‘cloze probability’, meaning the probability of words being used in a non-speeded, offline sentence completion test (DeLong et al., Reference DeLong, Urbach and Kutas2005; Kutas & Hillyard, Reference Kutas and Hillyard1984; Wlotko & Federmeier, Reference Wlotko and Federmeier2012). In addition to contextual cues, prosody has also been shown to influence spoken sentence comprehension (Allbritton et al., Reference Allbritton, McKoon and Ratcliff1996; Kjelgaard & Speer, Reference Kjelgaard and Speer1999; Lehiste, Reference Lehiste1973; Price et al., Reference Price, Ostendorf, Shattuck-Hufnagel and Fong1991). It is therefore crucial to investigate how the brain copes with the situation in which various levels of predictive violations are encountered, and it is also important to understand how and to what extent prosody interacts with cloze probability during language comprehension.

1.1. Semantic prediction in language comprehension

In natural conversation, interlocutors can successfully ‘take over’ and complete each other’s sentences immediately (Pickering & Garrod, Reference Pickering and Garrod2004). This implies that language comprehension is not a passive process but an active anticipation that progresses with the context heard/read. Prediction is a core and ubiquitous mechanism of the brain function (Friston, Reference Friston2010). During the process of language comprehension, probabilistic predictions across multiple levels of representations enable rapid understanding of the content we read or hear, leading to more efficient comprehension (Kuperberg & Jaeger, Reference Kuperberg and Jaeger2016). The strength and precision of prediction may be influenced by various factors such as memory capacity (Ding et al., Reference Ding, Zhang, Liang and Li2023), world knowledge (Hagoort et al., Reference Hagoort, Hald, Bastiaansen and Petersson2004), age (Federmeier et al., Reference Federmeier, McLennan, De Ochoa and Kutas2002; Wlotko et al., Reference Wlotko, Federmeier and Kutas2012), etc.

There is clear evidence indicating that at least within highly constrained sentence contexts, comprehenders are able to predict the semantic features of upcoming words. Eye tracking studies consistently show that when a word has higher predictability within a given context, readers tend to spend less time fixating on that word, and these words are also more likely to be skipped (Clifton et al., Reference Clifton, Ferreira, Henderson, Inhoff, Liversedge, Reichle and Schotter2016; Kliegl et al., Reference Kliegl, Dambacher, Dimigen, Jacobs and Sommer2012). Furthermore, previous studies have shown that the N400, an event-related potential (ERP) component reflecting semantic processing, is reduced in response to words that match the semantic predictions generated by highly predictable (relative to less predictable) contexts (DeLong & Kutas, Reference DeLong and Kutas2016; Kutas & Federmeier, Reference Kutas and Federmeier2011). For instance, when reading/hearing ‘The terrorists planted a bomb in the airport and four people were killed in the…’, comprehenders can easily predict that the last word is ‘explosion’ (Thornhill & Van Petten, Reference Thornhill and Van Petten2012). In other words, comprehenders are able to access a unique lexical-semantic representation (e.g., <explosion>) distinct from any other word (e.g., <terminal>) ahead of its availability from the bottom-up input. Therefore, compared to ‘explosion’, ‘terminal’ would elicit a larger N400 amplitude. Some researchers argue that the N400 reflects the magnitude of prediction error (DeLong et al., Reference DeLong, Urbach and Kutas2005; Kutas & Federmeier, Reference Kutas and Federmeier2011; Nieuwland & Van Berkum, Reference Nieuwland and Van Berkum2006). Indeed, the correlation between cloze probability and N400 amplitude has been consistently observed (Kutas & Federmeier, Reference Kutas and Federmeier2011), with some studies reporting correlations of 0.8 or higher, indicating a strong association.

In addition, several studies have reported differential modulation of brain activities preceding the predicted occurrence of words in highly predictable versus less predictable sentence contexts. These include larger negative ERP effects (Freunberger & Roehm, Reference Freunberger and Roehm2017; Grisoni et al., Reference Grisoni, Miller and Pulvermüller2017), increased θ power (Dikker & Pylkkänen, Reference Dikker and Pylkkänen2013; Piai et al., Reference Piai, Anderson, Lin, Dewar, Parvizi, Dronkers and Knight2016), and suppression of α/β power (Piai et al., Reference Piai, Roelofs and Maris2014; Piai et al., Reference Piai, Roelofs, Rommers and Maris2015; Rommers et al., Reference Rommers, Dickson, Norton, Wlotko and Federmeier2017; Wang et al., Reference Wang, Hagoort and Jensen2018). These predictive effects are neuroanatomically localized to the neocortex and subcortical regions (Dikker & Pylkkänen, Reference Dikker and Pylkkänen2013; Piai et al., Reference Piai, Roelofs, Rommers and Maris2015; Wang et al., Reference Wang, Hagoort and Jensen2018). They are attributed to the processes of generating predictions and/or accessing lexical semantic representations corresponding to the predicted words themselves.

Furthermore, in the past three decades, a substantial body of research on prediction-related phenomena has not only revealed N400 but also shown isolated late positivities or biphasic effects, where larger N400s are followed by larger late positive waves, also known as the post-N400 positivity (PNP) (e.g., DeLong et al., Reference DeLong, Quante and Kutas2014; DeLong & Kutas, Reference DeLong and Kutas2016; Van Petten & Luka, Reference Van Petten and Luka2006; Van Petten & Luka, Reference Van Petten and Luka2012). Van Petten and Luka (Reference Van Petten and Luka2012) noted in their review that there are two distinct topographical distributions of the PNP. One exhibits a parietal distribution, more prominent in studies comparing semantic congruent versus incongruent sentence completions (e.g., Daltrozzo et al., Reference Daltrozzo, Wioland and Kotchoubey2007; Diaz & Swaab, Reference Diaz and Swaab2007; Pijnacker et al., Reference Pijnacker, Geurts, Van Lambalgen, Buitelaar and Hagoort2010), while the other demonstrates a frontal distribution, more prevalent in comparisons of high- versus low-cloze probability (e.g., Federmeier & Kutas, Reference Federmeier and Kutas2005; Kutas, Reference Kutas1993; Moreno et al., Reference Moreno, Federmeier and Kutas2002). Kuperberg (Reference Kuperberg, Miller, Cutting and McCardle2013) presented a slightly different contrast, suggesting that they are errors in event or structural predictions that trigger posterior PNPs (P600s), while errors in lexical predictions trigger more anterior PNPs. However, a common thread is that anterior/frontal PNPs reflect some form of cost associated with prediction violations. Researchers interpret these two PNP distributions based on different functions corresponding to different brain regions. The parietal distribution of PNP bears high similarity to the topographical distribution of syntactic/semantic P600, hence attributed to reprocessing, repair, and retrieval of problematic sentences (Friederici et al., Reference Friederici, Hahne and Mecklinger1996; Hahne & Friederici, Reference Hahne and Friederici1999; O’Rourke & Van Petten, Reference O’Rourke and Van Petten2011). In contrast, although several functional interpretations of the frontal PNP have been proposed, there is currently no consensus. Thornhill and Van Petten (Reference Thornhill and Van Petten2012) as well as Kuperberg (Reference Kuperberg, Miller, Cutting and McCardle2013) posited that it indexes sensitivity to specific lexical forms rather than conceptual expectancies. Other proposals included inhibiting expected but unencountered words (Kutas, Reference Kutas1993) and arguments linking it to learning/adaptation mechanisms (Davenport & Coulson, Reference Davenport and Coulson2013; Kuperberg & Jaeger, Reference Kuperberg and Jaeger2016), where mental models are updated to reflect probabilities in the current environment. Kuperberg and Jaeger (Reference Kuperberg and Jaeger2016) further suggested that the PNP may index a form of ‘model-switching’, reflecting resource reallocation to models corresponding more directly to statistical patterns.

In recent years, the hierarchical predictive coding framework has been employed to explain predictive processing in language comprehension (Heilbron et al., Reference Heilbron, Armeni, Schoffelen, Hagoort and De Lange2022; Ryskin & Nieuwland, Reference Ryskin and Nieuwland2023). In this framework, individuals continuously generate top-down expectations based on world knowledge and long-term memory (Eddine et al., Reference Eddine, Brothers, Wang, Spratling and Kuperberg2024; Huettig, Reference Huettig2015; Ryskin & Nieuwland, Reference Ryskin and Nieuwland2023; Spratling, Reference Spratling2017). When the bottom-up input fails to meet these expectations, prediction errors arise, reflected in the amplitude of the N400. The detection of prediction errors triggers further cognitive processes to resolve this mismatch. These cognitive processes may include attention adjustment, working memory updating, semantic re-evaluation, and inhibitory control. In the predictive framework, the resolution of prediction errors is reflected in late-stage brain activity. For instance, Wang et al. (Reference Wang, Schoot, Brothers, Alexander, Warnke, Kim and Kuperberg2023) utilized MEG and ERPs to track the temporal dynamics and localization of brain activity elicited by expected, unexpected plausible, and implausible words during incremental language comprehension. The results demonstrated that, within the 300- to 500-ms time window, the three conditions produced progressively larger responses within left temporal cortex (prediction error). In the 600- to 1000-ms time window, unexpected plausible words elicited significant neural activity in the left inferior frontal and middle temporal cortices, which may indicate the resolution process of prediction errors, including the retrieval of new patterns and the generation of new predictions.

1.2. Prosodic facilitation in spoken language comprehension

Speech comprehension requires the integration of multiple cues, such as syntax, semantics, prosody, and others. Accentuation is a kind of prosodic feature in the speech signal that reflects the relative prominence of specific syllables, words, or phrases within a rhythmic structure through modulation of pitch or syllable duration (Shattuck-Hufnagel & Turk, Reference Shattuck-Hufnagel and Turk1996).

A lot of psycholinguistic research on accentuation primarily focuses on its correspondence with information structure. Previous behavioral studies have found that speech processing is facilitated when new information (or focused information) is accented, and old information is unaccented (Bock & Mazzella, Reference Bock and Mazzella1983; Dahan et al., Reference Dahan, Tanenhaus and Chambers2002; Terken & Nooteboom, Reference Terken and Nooteboom1987; Yang & Li, Reference Yang and Li2004). ERP studies have found that unaccented new information or accented old information can lead to processing difficulties and increased neural activities, such as the N400 or P600 components (Bögels et al., Reference Bögels, Schriefers, Vonk and Chwilla2011; Hruska et al., Reference Hruska, Alter, Steinhauer, Steube, Cave, Guaitella and Santi2001; Ito & Garnsey, Reference Ito and Garnsey2004; Wang et al., Reference Wang, Bastiaansen, Yang and Hagoort2011). These studies prove that accentuation plays a crucial role in spoken language comprehension.

Furthermore, accentuation can modulate selective attention, which in turn influences speech processing. Some researchers have found that accentuation can regulate listeners’ selective attention during speech processing (Astheimer & Sanders, Reference Astheimer and Sanders2009; Cutler et al., Reference Cutler, Dahan and Van Donselaar1997; Ito & Speer, Reference Ito and Speer2008). For instance, Cutler (Reference Cutler1976), using a phoneme monitoring task, observed an accelerated phoneme monitoring speed at accented positions, and speculated that the result likely stemmed from the listener’s focused attention. Sanford et al. (Reference Sanford, Sanford, Molle and Emmott2006) employed a change detection task in which participants were auditorily presented with discourse twice and were asked to determine whether there was anything changed between the two presentations. Critical words were produced with either a noncontrastive or a contrastive accent. Their results showed that participants exhibited superior detection to word changes in the contrastive accent condition compared to the noncontrastive accent condition, suggesting that accentuation can modulate listeners’ selective attention during language processing.

In addition, accentuation can modulate general cognitive processes, which in turn influence speech processing. For example, relative to unaccented counterparts, accented counterparts elicited a positive deflection between 200 and 500 ms (Dimitrova et al., Reference Dimitrova, Stowe, Redeker and Hoeks2012), and accented words within discourse increased the N400 amplitude (Li et al., Reference Li, Hagoort and Yang2008; Li & Ren, Reference Li and Ren2012; Wang et al., Reference Wang, Bastiaansen, Yang and Hagoort2011, Reference Wang, Bastiaansen, Yang and Hagoort2012). Studies employing single-sentence paradigms have reported broadly distributed N400 effects, with central maxima observed for words with unpredictable accentuation, but fronto-lateral expectancy negativity observed for words with predictable accentuation (Heim & Alter, Reference Heim and Alter2006). These findings suggest that in online spoken language comprehension (Li & Yang, Reference Li and Yang2013), accentuation interacts with long-term memory and directs listeners’ attention to salient constituents of discourse, leading to more detailed and comprehensive processing. In contrast, unaccented information undergoes relatively shallow analysis (Baumann & Schumacher, Reference Baumann and Schumacher2012; Wang et al., Reference Wang, Bastiaansen, Yang and Hagoort2011). This is consistent with the neuroimaging findings in Kristensen et al. (Reference Kristensen, Wang, Petersson and Hagoort2013), indicating that accentuated language activates a general attention network.

Taken together, some previous studies have shown that accented information attracts more attentional resources, facilitating faster and deeper processing (e.g. Li et al., Reference Li, Deng, Yang and Wang2018; Sun et al., Reference Sun, Sommer and Li2022; Wang et al., Reference Wang, Hagoort and Yang2009; Wang et al., Reference Wang, Bastiaansen, Yang and Hagoort2011), and that as the cloze probability of critical words decreases during sentence comprehension, the level of prediction error increases, which requires additional cognitive resources to process these novel details that deviate from the context of the sentence (Federmeier & Kutas, Reference Federmeier and Kutas1999; Van Berkum et al., Reference Van Berkum, Brown, Zwitserlood, Kooijman and Hagoort2005). However, it is still unclear how accentuation and predictability of words interact during language comprehension.

1.3. The current study

We aimed to investigate how accentuation influences the processing of lexical items with varying levels of cloze probability. Specifically, we auditorily presented a high-constrain sentence context with disyllabic words varying in three cloze probability levels embedded at the sentence-final position. Additionally, we manipulated the accentuation of the critical words to examine the neural activities involved in processing these words at different cloze probability levels.

Thereby, we address the following two research questions: (1) whether the modulation of attention resources introduced by accentuation interacts with the predictive error and (2) to what extent the processing of highly predicted and less predicted words influenced by accentuation.

We expect that accentuation modulates selective attention, thereby influencing the speed and depth of sentence processing. That is, we predict that accented critical words capture more attentional resources, enabling the rapid detection of predictive errors and providing additional cognitive resources for deep semantic processing (Li et al., Reference Li, Ren, Zheng and Chen2020; Wang et al., Reference Wang, Bastiaansen, Yang and Hagoort2011). In other words, accentuation should interact with cloze probability, with accented critical words of low cloze probabilities yielding the greatest N400 and PNP amplitudes. The data and methods used in the study are presented in the following sections.

2. Methods

2.1. Participants

The current study used a 2 (accentuation: accented, unaccented) × 3 (cloze probability: high, medium, low) within-participants design. An a priori power analysis conducted via G*Power 3.1.9.7 (Faul et al., Reference Faul, Erdfelder, Lang and Buchner2007) showed that 19 participants were required to observe a significant (α = 0.05) interaction at .80 power. To be on the safe side, twenty-six college students were recruited as participants for the study, with ages ranging from 19 to 25 years (male = 7; M±SD = 22.5±1.5). All participants were native Chinese speakers. They all had normal or corrected-to-normal vision and had no hearing impairments, reading difficulties, or neurological disorders. The research was approved by the Ethics Committee of Liaoning Normal University. Prior to the experiment, participants provided informed consent. After the experiment, they received monetary compensation for their participation.

2.2. Materials

A total of 240 sets of sentence contexts were created, with 6 kinds of sentence continuation in each set, varying in predictability and accentuation of the disyllabic critical words embedded in the sentence-final position. The predictability of the critical words was determined by their cloze probabilities obtained in a rating experiment. In this experiment, twenty volunteers (8 males, aged between 19 and 25) who did not participate in the main experiment provided cloze probability ratings for the critical words on a seven-point scale. Ratings between 6 and 7 were regarded as high-cloze words, between 3 and 5 as medium-cloze words, and between 1 and 2 as low-cloze words. Critical words with high-cloze probabilities were highly predictable and semantically congruent with the preceding context. Critical words with medium-cloze probabilities were less predictable but semantically congruent with the preceding context. Critical words with low-cloze probabilities were impossible to predict by the preceding context and semantically incongruent with the preceding context. A one-way ANOVA (analysis of variance) was conducted on the ratings of the critical words. Results showed that the ratings of the three level of cloze probability differed significantly, F (1,719) =1356, p<.001, η_p² = 0.89 (see Table 2 Column 1 for details).

Additionally, we recruited 20 participants (4 males, aged between 19 and 28) to provide 7-point Likert ratings for lexical frequency, concreteness, and imaginability for each word in the three-level cloze probability. The results of the one-way repeated measures ANOVA conducted on the ratings across the three dimensions indicated that there were no significant differences between the three cloze probabilities: Lexical frequency: F (2,38) = 1.25, p = 0.29, η_p ²= 0.06; Concreteness: F (2,38) = 0.23, p = 0.69, η_p ²= 0.01; and Imaginability: F (2,38) = 2.35, p = 0.13, η_p ²= 0.11 (see Table 2 Columns 2–4 for details).

In addition to the manipulation of cloze probabilities, all sentences were produced with two types of accentuation. For the accented condition, the disyllabic critical nouns in the sentence-final position were accented; for the unaccented condition, a disyllabic noncritical noun in the sentence fragments preceding the final critical words was accented (see Table 1 for material examples). In total, 1440 test sentences (240 constraint contexts × 3 cloze conditions × 2 types of accentuation) were recorded by a phonetically trained male native Chinese speaker in a soundproof booth, at a sampling rate of 44.1 kHz and 16-bit resolution.

Table 1. Examples of stimuli

Table 2. Ratings for critical words’ cloze probability, lexical frequency, concreteness, and imageability (M±SD)

Using Praat software (Boersma & Weenink, Reference Boersma and Weenink2022) with publicly available scripts (Feinberg, Reference Feinberg2018; Puts & Cardenas, Reference Puts and Cardenas2018), the average sound pressure level (SPL) of each sentence was normalized to a uniform level of 70 dB based on previous studies (Li et al., Reference Li, Deng, Yang and Wang2018; Sun et al., Reference Sun, Sommer and Li2022), to avoid specific responses to general loudness differences. Each sentence was divided into two parts: the sentence fragment preceding the critical word and the critical word itself. To ensure that the speaker successfully and correctly accented the critical words, paired-sample t-tests were performed on the mean syllable duration, maximal pitch, and SPL between the critical words and the sentence fragment preceding the critical words in the accented and unaccented conditions (see Table 3 for details). On average, relative to unaccented critical words, accented critical words showed significantly higher F0 maxima, SPL, and longer duration. Overall, the acoustic features of the current accentuation pattern align with previous studies (Chen & Gussenhoven, Reference Chen and Gussenhoven2008; Li et al., Reference Li, Deng, Yang and Wang2018). For the sentence fragments preceding the critical words, the mean duration was longer and the F0 maxima were higher under the unaccented condition compared to the accented condition. The SPL values were not significantly different between the two conditions.

Table 3. Acoustic parameters of critical words (CWs) and the preceding sentence fragments under the two accent conditions

Note: *p<.05; **p<.01; ***p<.001; Acc. = accented, Un-acc. = unaccented.

Additionally, we statistically analyzed the acoustic parameters (duration, pitch, and intensity) of critical words with different cloze probabilities under accented and unaccented conditions using a two-way repeated measures ANOVA (see Table 4 for details). Duration results indicated a significant main effect of accent, F (1, 239) = 1873.80, p < .001, η_p² = 0.89; accented words had a longer duration than unaccented words. Pitch results revealed a significant main effect of accent, F (1, 239) = 327.65, p < .001, η_p² = 0.58; accented words had a higher pitch compared to unaccented words. A significant main effect of cloze probability was also found, F (2, 478) = 11.85, p < .001, η_p² = 0.05; high-cloze words had the highest pitch, followed by low-cloze words, with medium-cloze words having the lowest pitch. Intensity results showed a significant main effect of accent, F (1, 239) = 5844.78, p < .001, η_p² = 0.58; accented words were louder than unaccented words. A significant main effect of cloze probability was again found here, F (2, 478) = 22.71, p < .001, η_p² = 0.09; high-cloze words had the highest intensity, followed by medium-cloze words, and low-cloze words had the lowest intensity. There was a significant interaction between accent and cloze probability, F (2, 478) = 15.66, p = .001, η_p² = 0.06. Further simple effects analysis indicated that under the accented condition, only high- and low-cloze words showed a significant difference, F (2, 238) = 7.42, p < .001, while under the unaccented condition, all three cloze levels differed significantly from one another, F (2, 238) = 25.99, p < .001.

Table 4. Acoustic parameters of critical words (CWs) in the two accent conditions

2.3. Procedure

The overall experimental materials comprised 1440 test sentences (240 constraint contexts × 3 cloze conditions × 2 types of accentuation) and 90 filler sentences. The filler sentences differed from the critical sentences in length and structure to prevent participants from predicting the sentence-final words (e.g., 小明最近读了一篇论文。Xiao Ming read a paper recently.).

To ensure that participants would not hear the same sentence context under different conditions more than once, a Latin Square design was employed to generate six lists of stimuli, such that each participant heard only one of the lists. Each list contained an equal number of items (40 sentences) for each condition, resulting in a total of 240 sentences per list. Sentences of each list were separated into three blocks, with each block consisting of 135 sentences (120 experimental sentences and 15 filler sentences) and lasting approximately 20 minutes. There were brief intervals between blocks. Prior to the formal experiment, participants conducted an initial practice session of 20 trials to acquaint themselves with the experimental procedures. The list order was counterbalanced across participants. The sentences within each list were presented in a random order.

The experiment took place in a softly lit, quiet, and comfortable room. Participants sat in front of a 23-inch LCD monitor and wore headphones with volume adjusted to their preference. In a given trial, participants first saw a fixation cross and simultaneously heard a beep sound for 500 ms. Then, participants heard a sentence while the fixation cross stayed on the screen. They were asked to keep looking at the fixation cross when hearing the sentence. As soon as the auditory signal ended, the fixation cross disappeared, and a 400-ms blank interval followed. After the interval, a probe word appeared on the screen, and participants had to determine whether the probe word appeared in the sentence they had just heard by pressing either ‘J’ or ‘F’ within 3000 ms. Half of the participants pressed the ‘J’ key for ‘yes’ and the ‘F’ key for ‘no’. The other half pressed the ‘J’ key for ‘no’ and the ‘F’ key for ‘yes’. Probe words were two-character nouns (content words) that can either be literal repetitions of the two-character nouns from the preceding sentence, regardless of their position, or any two-character nouns that are semantically unrelated to the preceding sentence. The whole experiment consisted of 50% yes responses and 50% no responses. Finally, participants saw a 400-ms blank screen before the commencement of the next trial. The experimental procedure is shown in Figure 1.

Figure 1. A single trial of the experimental procedure.

2.4. EEG acquisition and analysis

The EEG data were recorded from 64 cap-mounted Ag/AgCl electrodes (ANT Neuro EEGO Inc., Germany), placed according to the extended international 10–20 system. During recordings, a 100 Hz low-pass filter was applied; the sampling rate was 500 Hz. CPz was used as the online reference, and offline analysis involved re-referencing by subtracting the average from bilateral mastoids (M1, M2) from the EEG data in each channel. Impedances were kept below 5 KΩ for all electrodes. The collected EEG data were preprocessed using the EEGLAB toolbox (version 2023.0) in MATLAB software (R2018b). The preprocessing steps included bandpass filtering (0.1 to 30 Hz), segmenting the EEG data into epochs from 200 ms before to 800 ms after the onset of the critical words, corrected with a 200-ms prestimulus baseline. Eye movements were corrected using the ‘Independent Ocular Component Correction’ model in EEGLAB. Epochs with signals exceeding ±80 μV in any given channel were excluded. After artifact rejection, there was an average of 36 valid trials per condition (40 trials under each condition originally).

Combined with visual inspection of the data, and previous relevant research (DeLong et al., Reference DeLong, Quante and Kutas2014; Li & Ren, Reference Li and Ren2012; Nieuwland et al., Reference Nieuwland, Barr, Bartolozzi, Busch-Moreno, Darley, Donaldson, Ferguson, Fu, Heyselaar, Huettig, Husband, Ito, Kazanina, Kogan, Kohút, Kulakova, Mézière, Politzer-Ahles, Rousselet and Zu Wolfsthurn2020; Thornhill & Van Petten, Reference Thornhill and Van Petten2012; Van Berkum et al., Reference Van Berkum, Brown, Zwitserlood, Kooijman and Hagoort2005), the average ERP amplitudes obtained from various regions of interest (ROIs) were used as dependent variables. Two two-way repeated measures ANOVAs were conducted on the average amplitudes of the evoked brain potentials for the 300- to 450- and 500- to 700-ms time windows, respectively. The factors examined were the type of accentuation (accented, unaccented) and cloze probability level (high, medium, low). For the N400, we selected the electrodes CP1, CPz, CP2, P1, Pz, and P2 as ROIs. For the PNP, although previous research has reported two distinct topographical distributions, considering the current experimental design involving different cloze probabilities in conjunction with Van Petten and Luka (Reference Van Petten and Luka2012)’s review and the observed topographical differences (Figure 3B), we selected F1, Fz, F2, FC1, FCz, and FC2 electrodes as the ROIs. The p-values in all ANOVAs were adjusted using the Greenhouse–Geisser correction for nonsphericity. The results are given in the following sections.

2.5. Results

2.5.1. Behavioral results

To ensure that accentuation and cloze probability processing were not influenced by keypresses, participants were required to make behavioral judgments 400 ms after stimulus presentation. Therefore, reaction times were not analyzed; only accuracy rates were considered.

A two-way repeated measures ANOVA was conducted on the accuracy rates (ARs) of the behavioral data, with accentuation type (accented, unaccented) and cloze probability level (high, medium, low) treated as independent variables. The results did not show any significant effects, all ps > .05 (see Table 5).

Table 5. Accuracy rates under different conditions (M±SD)

2.5.2. ERP results

For the results of the N400 analysis, there was a significant main effect of cloze probability level, F (2,21) = 3.39, p = 0.043, η_p² = 0.13. Pairwise comparisons showed a trend of the medium-cloze condition inducing larger N400 amplitudes compared to the high-cloze condition, p = 0.054. There was also a significant interaction between accentuation type and cloze probability level, F (1,22) = 3.55, p = 0.037, η_p² = 0.14. Further simple effects analysis revealed that under the accented condition, the medium-cloze condition induced larger N400 amplitudes compared to the high-cloze condition, F (1,22) = 4.75, p = 0.02, and the low-cloze condition produced larger N400 amplitudes relative to the high-cloze condition, F (1,22) = 4.75, p = 0.023. However, under unaccented conditions, there were no significant differences observed among medium-cloze, low-cloze, and high-cloze conditions, all ps > .62. (see Figures 2A and 3A).

Figure 2 A. Average waveform of the N400 component at Pz with different cloze probability levels under the accented condition (left) and unaccented condition (right). B. Average waveform of the PNP component at Fz with different cloze probability levels under the accented condition (left) and unaccented condition (right).

Figure 3 A. Topographical maps of low-cloze minus high-cloze, medium-cloze minus high-cloze, low-cloze minus medium-cloze under both accented and unaccented conditions within the 300-450 ms time window. B. The topographical maps of low-cloze minus high-cloze, medium-cloze minus high-cloze, low-cloze minus medium-cloze under both accented and unaccented conditions within the 500-700 ms time window.H: high-cloze; M: medium-cloze; L: low-cloze.

For the PNP analysis, results showed a significant main effect of accentuation type, F (1,22) = 6.47, p = 0.018, η_p² = 0.23, suggesting that the unaccented condition induced larger PNP amplitudes compared to the accented condition. Moreover, although no main effect of cloze probability level was found, F (2,21) = 0.70, p = 0.490, η_p² = 0.03, a significant interaction between accentuation type and cloze probability level was observed, F (2,21) = 27.92, p = 0.003, η_p² = 0.24. Simple effects analysis revealed that under the unaccented condition, the medium-cloze condition induced larger PNP amplitudes than the high-cloze condition, F (2,21) = 4.83, p = 0.039, and the low-cloze condition yielded larger PNP amplitudes than the high-cloze condition, F (2,21) = 10.21, p = 0.004. Under the accented condition, there were no significant differences observed among medium-, low-, and high-cloze conditions, all ps > .22 (see Figures 2B and 3B).

3. Discussion

This study employed the ERP technique to investigate the neural processing of disyllabic words varying in levels of cloze probability and types of accentuation during spoken sentence comprehension. Our ERP results showed that the neural processing of disyllabic words is not only influenced by their cloze probabilities but also by their degrees of accentuation during spoken sentence comprehension. These findings will be discussed in more below.

Regarding the ERP results in the 300- to 450-ms time window, the main effect of cloze probability was observed, with larger N400 amplitudes yielded by lower cloze words, consistent with the findings from numerous previous ERP studies (DeLong et al., Reference DeLong, Quante and Kutas2014; Federmeier, Reference Federmeier2007; Kutas, Reference Kutas1993; Thornhill & Van Petten, Reference Thornhill and Van Petten2012). The current N400 effect can be explained within the predictive framework. Compared to highly predictable words, unexpected words involved greater processing demands, leading to larger prediction errors, which caused increased N400 amplitudes.

More importantly, a significant interaction between cloze probability and accentuation was also found, with the accented low- and medium-cloze words eliciting larger N400 amplitudes compared to the accented high-cloze words. This result revealed that an influence of cloze probability on the N400 amplitudes only emerged in the accented condition. During online speech processing, accentuation can modulate listeners’ time-selective attention, influencing the speed and depth of semantic processing (Li & Ren, Reference Li and Ren2012). According to the “good enough” language comprehension strategy (Ferreira et al., Reference Ferreira, Bailey and Ferraro2002), readers and listeners do not process all information carried by a sentence to the same extent for reasons of processing efficiency. Specifically, the depth of information processing (i.e., the level of fine-grained processing) typically depends on the importance and salience of the information (Cooper et al., Reference Cooper, Eady and Mueller1985; Eady et al., Reference Eady, Cooper, Klouda, Mueller and Lotts1986; Pierrehumbert, Reference Pierrehumbert1980), and the allocation of attention resources plays a crucial role. Accentuation, a form of focal information, has been shown to induce attentional bias (Cutler & Fodor, Reference Cutler and Fodor1979; Dahan & Tanenhaus, Reference Dahan and Tanenhaus2005). In the current study, listeners may have selectively allocated more attention to the accented critical words and engaged themselves in more in-depth processing, thus rapidly detecting different levels of semantic incongruence. Conversely, when the critical words were unaccented, listeners may have paid relatively less attention to them, and adopted the shallow processing approach, leading to an inability to immediately detect the semantic incongruences caused by the critical words of medium- and low-cloze probabilities. Therefore, no N400 effect was obtained for the unaccented words in the medium- and low-cloze conditions.

An alternative explanation for the N400 interaction is also possible. In high-constrain contexts, participants can generate specific semantic and phonological predictions for the final critical words, with semantic access to the high-cloze words being rapid and efficient without excessive semantic processing (Wang et al., Reference Wang, Bastiaansen, Yang and Hagoort2011). When medium-cloze and low-cloze words were accented, since they were not highly predicted, participants not only needed to adopt additional attentional resources but they also had to put more cognitive efforts into fine-grained semantic processing to integrate the medium- and low-cloze words into the given contexts, leading to larger N400 amplitudes. Furthermore, the additional recruitment of attentional resources may have been so great that they overwhelmed the differences of cognitive efforts spent on the medium- and low-cloze words. Thus, the N400 difference between the medium- and low-cloze words under the accented condition disappeared.

For the ERP results in the 500- to 700-ms time window, the main effect of accentuation was observed, with larger PNP amplitudes elicited over the frontal regions under the unaccented condition compared to the accented condition, which is consistent with previous studies (Baumann & Schumacher, Reference Baumann and Schumacher2012; Li et al., Reference Li, Zhang and Yang2017; Li et al., Reference Li, Deng, Yang and Wang2018). This result may be due to the fact that when the sentence-final words are highly predictable, listeners tend to expect them to be accented. Therefore, when the words are unaccented, greater PNP amplitudes are yielded, reflecting the difficulty of integrating unaccented information into the given context.

Moreover, a significant interaction between cloze probability and accentuation was also observed, with the unaccented low- and medium-cloze words eliciting larger PNP amplitudes compared to the unaccented high-cloze words, mirroring the N400 interaction effect. In addition, the PNP effects were more anteriorly distributed, which is dissimilar to the centro-parietally distributed N400 effects in topographical distribution. The absence of an N400 effect for the unaccented critical words suggests that participants did not immediately detect their semantic incongruence. Therefore, the PNP effects elicited by the unaccented words can be considered, to some extent, a result of shallow processing due to the absence of accentuation on the focal information (critical words), leading to delayed detection of semantic incongruences, which manifests in the PNP time window. In fact, Wang et al. (Reference Wang, Hagoort and Yang2009) investigated the impact of information structure on the depth of semantic processing, revealing that focus position affects semantic processing. In their study, participants quickly detected whether semantic violations occurred at the focus position, as reflected by the N400 component, while no N400 difference was observed for semantic violations in the nonfocus position. This result also supports the view that shallow processing may occur at times during language comprehension.

As expected, the PNP effect showed a frontal distribution, different from the typical late positive component observed in the central-parietal region. Previous studies investigating words with varying levels of cloze probability interpreted the frontally distributed PNP effect as reflecting uncertainty in semantic predictions within the sentence or discourse context (Delong et al., Reference Delong, Urbach, Groppe and Kutas2011; Federmeier, Reference Federmeier, Wlotko, De Ochoa-Dewald and Kutas2007; Otten & Van Berkum, Reference Otten and Van Berkum2008). Additionally, Kutas (Reference Kutas1993) proposed that the fronto-central PNP reflects inhibition of predicted but unfulfilled words. Based on Kutas (Reference Kutas1993), Federmeier et al. (Reference Federmeier, McLennan, De Ochoa and Kutas2002) speculated about the inhibitory interactions between frontal and temporal regions during language comprehension. They proposed that the successful generation of the predicted words is modulated by inhibitory regulation from the left frontal cortex over the activated, stored word-form networks in the temporal regions. In the current study, participants formed predictions about the upcoming sentence-final critical words based on the sentence context. Although the appearing medium- and low-cloze critical words evidently contradicted these predictions, the participants still needed to integrate the current words into the existing sentence context. Therefore, the frontal cortex had to inhibit the activated representations initiated by the sentence context to support the semantic processing of the medium- and low-cloze critical words. This inhibitory process was reflected by the late positive responses with a fronto-central distribution. Furthermore, from Figures 2B and 3B (even if statistical significance was not reached), it can be seen that although both low- and medium-cloze conditions elicited larger PNPs than the high-cloze condition, the former elicited a greater effect. This may indicate that listeners expended more cognitive resources in inhibiting the appropriate words predicted by the semantic context under the low-cloze condition. Given that there is no unified explanation for the frontally distributed late positive component, future studies should investigate the neurocognitive functions of the late positive component.

As a caution, we cannot completely rule out the possibility that the acoustic properties of the sentence fragments preceding the critical words may signal the position of accented (versus unaccented) words. Accordingly, consistent with previous research (Cutler & Fodor, Reference Cutler and Fodor1979), the prosody of the preceding context, such as the duration of the sentence fragments (Table 3), may predict the position and nature of the upcoming critical words, facilitating their rapid processing. Furthermore, the critical words in this study appear after continuous spoken sentences. Consequently, there is significant overlap between the ERP and the preceding words, eliciting auditory evoked responses to each new phoneme, as well as ERPs to the critical words, which may contribute to the noisy EEG signals (Figure 2).

4. Conclusion

The present study extends previous research by testing the neural processing of accented and unaccented words varying with cloze probabilities. The current design enables us to systematically investigate the processing of accentuation on lexical predictability during spoken sentence comprehension. Our data revealed that under highly constrained sentence contexts, accented and unaccented words of different cloze probabilities produced different patterns of N400 and PNP amplitudes, reflecting a gradation of prediction violation modulated by accentuation of the critical words. The pattern of the ERP results is likely due to differences of attention allocation in the processing of the accented and unaccented words with different degrees of predictability.

Data availability statement

Data will be made available on request.

Acknowledgements

This research was supported by the Shenzhen Science and Technology Innovation Bureau Project (20220811005233001). Additional funding was provided by the Scientific Research and Innovation Team of Liaoning Normal University. We thank Weijing Xing for her assistance with data collection. Special thanks to Prof. Werner Sommer for offering his suggestions and providing support.

Competing interests

No potential conflict of interest was reported by the authors.

References

Allbritton, D. W., McKoon, G., & Ratcliff, R. (1996). Reliability of prosodic cues for resolving syntactic ambiguity. Journal of Experimental Psychology: Learning, Memory, and Cognition, 22(3), 714.Google Scholar PubMed

Astheimer, L. B., & Sanders, L. D. (2009). Listeners modulate temporally selective attention during natural speech processing. Biological Psychology, 80(1), 23–34.10.1016/j.biopsycho.2008.01.015CrossRef Google Scholar PubMed

Baumann, S., & Schumacher, P. B. (2012). (De-) accentuation and the processing of information status: Evidence from event-related brain potentials. Language and Speech, 55(3), 361–381.10.1177/0023830911422184CrossRef Google Scholar PubMed

Bock, J. K., & Mazzella, J. R. (1983). Intonational marking of given and new information: Some consequences for comprehension. Memory & Cognition, 11, 64–76.10.3758/BF03197663CrossRef Google Scholar PubMed

Boersma, P., & Weenink, D. (2022). Praat: A system for doing phonetics by computer, 2000. Software available at www.praat.org, 4(2).Google Scholar

Bögels, S., Schriefers, H., Vonk, W., & Chwilla, D. J. (2011). Pitch accents in context: How listeners process accentuation in referential communication. Neuropsychologia, 49(7), 2022–2036.10.1016/j.neuropsychologia.2011.03.032CrossRef Google Scholar PubMed

Chen, Y., & Gussenhoven, C. (2008). Emphasis and tonal implementation in Standard Chinese. Journal of Phonetics, 36(4), 724–746.10.1016/j.wocn.2008.06.003CrossRef Google Scholar

Clifton, C. Jr, Ferreira, F., Henderson, J. M., Inhoff, A. W., Liversedge, S. P., Reichle, E. D., & Schotter, E. R. (2016). Eye movements in reading and information processing: Keith Rayner’s 40 year legacy. Journal of Memory and Language, 86, 1–19.10.1016/j.jml.2015.07.004CrossRef Google Scholar

Cooper, W. E., Eady, S. J., & Mueller, P. R. (1985). Acoustical aspects of contrastive stress in question–answer contexts. The Journal of the Acoustical Society of America, 77(6), 2142–2156.10.1121/1.392372CrossRef Google Scholar PubMed

Cutler, A. (1976). Phoneme-monitoring reaction time as a function of preceding intonation contour. Perception & Psychophysics, 20, 55–60.10.3758/BF03198706CrossRef Google Scholar

Cutler, A., Dahan, D., & Van Donselaar, W. (1997). Prosody in the comprehension of spoken language: A literature review. Language and Speech, 40(2), 141–201.10.1177/002383099704000203CrossRef Google Scholar PubMed

Cutler, A., & Fodor, J. A. (1979). Semantic focus and sentence comprehension. Cognition, 7(1), 49–59.10.1016/0010-0277(79)90010-6CrossRef Google Scholar PubMed

Dahan, D., & Tanenhaus, M. K. (2005). Looking at the rope when looking for the snake: Conceptually mediated eye movements during spoken-word recognition. Psychonomic Bulletin & Review, 12(3), 453–459.10.3758/BF03193787CrossRef Google Scholar PubMed

Dahan, D., Tanenhaus, M. K., & Chambers, C. G. (2002). Accent and reference resolution in spoken-language comprehension. Journal of Memory and Language, 47(2), 292–314.10.1016/S0749-596X(02)00001-3CrossRef Google Scholar

Daltrozzo, J., Wioland, N., & Kotchoubey, B. (2007). Sex differences in two event-related potentials components related to semantic priming. Archives of Sexual Behavior, 36, 555–568.10.1007/s10508-006-9161-0CrossRef Google Scholar PubMed

Davenport, T., & Coulson, S. (2013). Hemispheric asymmetry in interpreting novel literal language: An event-related potential study. Neuropsychologia, 51(5), 907–921.10.1016/j.neuropsychologia.2013.01.018CrossRef Google Scholar PubMed

DeLong, K. A., & Kutas, M. (2016). Hemispheric differences and similarities in comprehending more and less predictable sentences. Neuropsychologia, 91, 380–393.10.1016/j.neuropsychologia.2016.09.004CrossRef Google Scholar PubMed

DeLong, K. A., Quante, L., & Kutas, M. (2014). Predictability, plausibility, and two late ERP positivities during written sentence comprehension. Neuropsychologia, 61, 150–162.10.1016/j.neuropsychologia.2014.06.016CrossRef Google Scholar PubMed

Delong, K. A., Urbach, T. P., Groppe, D. M., & Kutas, M. (2011). Overlapping dual ERP responses to low cloze probability sentence continuations. Psychophysiology, 48(9), 1203–1207.10.1111/j.1469-8986.2011.01199.xCrossRef Google Scholar PubMed

DeLong, K. A., Urbach, T. P., & Kutas, M. (2005). Probabilistic word pre-activation during language comprehension inferred from electrical brain activity. Nature Neuroscience, 8(8), 1117–1121.10.1038/nn1504CrossRef Google Scholar PubMed

Diaz, M. T., & Swaab, T. Y. (2007). Electrophysiological differentiation of phonological and semantic integration in word and sentence contexts. Brain Research, 1146, 85–100.10.1016/j.brainres.2006.07.034CrossRef Google Scholar PubMed

Dikker, S., & Pylkkänen, L. (2013). Predicting language: MEG evidence for lexical preactivation. Brain and Language, 127(1), 55–64.10.1016/j.bandl.2012.08.004CrossRef Google Scholar PubMed

Dimitrova, D. V., Stowe, L. A., Redeker, G., & Hoeks, J. C. (2012). Less is not more: Neural responses to missing and superfluous accents in context. Journal of Cognitive Neuroscience, 24(12), 2400–2418.10.1162/jocn_a_00302CrossRef Google Scholar

Ding, J., Zhang, Y., Liang, P., & Li, X. (2023). Modulation of working memory capacity on predictive processing during language comprehension. Language, Cognition and Neuroscience, 38(8), 1133–1152.10.1080/23273798.2023.2212819CrossRef Google Scholar

Eady, S. J., Cooper, W. E., Klouda, G. V., Mueller, P. R., & Lotts, D. W. (1986). Acoustical characteristics of sentential focus: Narrow vs. broad and single vs. dual focus environments. Language and Speech, 29(3), 233–251.10.1177/002383098602900304CrossRef Google Scholar PubMed

Eddine, S. N., Brothers, T., Wang, L., Spratling, M., & Kuperberg, G. R. (2024). A predictive coding model of the N400. Cognition, 246, 105755.10.1016/j.cognition.2024.105755CrossRef Google Scholar

Faul, F., Erdfelder, E., Lang, A. G., & Buchner, A. (2007). G* Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods, 39(2), 175–191.10.3758/BF03193146CrossRef Google Scholar

Federmeier, K. D. (2007). Thinking ahead: The role and roots of prediction in language comprehension. Psychophysiology, 44(4), 491–505.10.1111/j.1469-8986.2007.00531.xCrossRef Google Scholar PubMed

Federmeier, K. D., & Kutas, M. (1999). A rose by any other name: Long-term memory structure and sentence processing. Journal of Memory and Language, 41(4), 469–495.10.1006/jmla.1999.2660CrossRef Google Scholar

Federmeier, K. D., & Kutas, M. (2005). Aging in context: Age-related changes in context use during language comprehension. Psychophysiology, 42(2), 133–141.10.1111/j.1469-8986.2005.00274.xCrossRef Google Scholar PubMed

Federmeier, K. D., McLennan, D. B., De Ochoa, E., & Kutas, M. (2002). The impact of semantic memory organization and sentence context information on spoken language processing by younger and older adults: An ERP study. Psychophysiology, 39(2), 133–146.10.1111/1469-8986.3920133CrossRef Google Scholar PubMed

Federmeier, K. D., Wlotko, E. W., De Ochoa-Dewald, E., & Kutas, M. (2007). Multiple effects of sentential constraint on word processing. Brain Research, 1146, 75–84.10.1016/j.brainres.2006.06.101CrossRef Google Scholar PubMed

Feinberg, D. (2018). Praat scripts.10.4324/9780429476686-9CrossRef Google Scholar

Ferreira, F., Bailey, K. G., & Ferraro, V. (2002). Good-enough representations in language comprehension. Current Directions in Psychological Science, 11(1), 11–15.10.1111/1467-8721.00158CrossRef Google Scholar

Freunberger, D., & Roehm, D. (2017). The costs of being certain: Brain potential evidence for linguistic preactivation in sentence processing. Psychophysiology, 54(6), 824–832.10.1111/psyp.12848CrossRef Google Scholar PubMed

Friederici, A. D., Hahne, A., & Mecklinger, A. (1996). Temporal structure of syntactic parsing: Early and late event-related brain potential effects. Journal of Experimental Psychology: Learning, Memory, and Cognition, 22(5), 1219.Google Scholar PubMed

Friston, K. (2010). The free-energy principle: A unified brain theory? Nature Reviews Neuroscience, 11(2), 127–138.10.1038/nrn2787CrossRef Google Scholar PubMed

Grisoni, L., Miller, T. M., & Pulvermüller, F. (2017). Neural correlates of semantic prediction and resolution in sentence processing. Journal of Neuroscience, 37(18), 4848–4858.10.1523/JNEUROSCI.2800-16.2017CrossRef Google Scholar PubMed

Hagoort, P., Hald, L., Bastiaansen, M., & Petersson, K. M. (2004). Integration of word meaning and world knowledge in language comprehension. Science, 304(5669), 438–441.10.1126/science.1095455CrossRef Google Scholar PubMed

Hahne, A., & Friederici, A. D. (1999). Electrophysiological evidence for two steps in syntactic analysis: Early automatic and late controlled processes. Journal of Cognitive Neuroscience, 11(2), 194–205.10.1162/089892999563328CrossRef Google Scholar PubMed

Heilbron, M., Armeni, K., Schoffelen, J.-M., Hagoort, P., & De Lange, F. P. (2022). A hierarchy of linguistic predictions during natural language comprehension. Proceedings of the National Academy of Sciences, 119(32), e2201968119.10.1073/pnas.2201968119CrossRef Google Scholar PubMed

Heim, S., & Alter, K. (2006). Prosodic pitch accents in language comprehension and production: ERP data and acoustic analyses. Acta neurobiologiae experimentalis, 66(1), 55–68.10.55782/ane-2006-1587CrossRef Google Scholar PubMed

Hruska, C., Alter, K., Steinhauer, K., & Steube, A. (2001). Misleading dialogues: Human’s brain reaction to prosodic information. In Cave, C., Guaitella, I., & Santi, S. (Eds.), Oralite et Gestualite: Interactions et Comportements Multimodaux Dans la Communication, Aix-en-Provence (pp. 425–430). Paris: L’Harmattan.Google Scholar

Huettig, F. (2015). Four central questions about prediction in language processing. Brain Research, 1626, 118–135.10.1016/j.brainres.2015.02.014CrossRef Google Scholar PubMed

Ito, K., & Garnsey, S. M. (2004). Brain responses to focus-related prosodic mismatch in Japanese. In Speech prosody 2004, international conference. 10.21437/SpeechProsody.2004-140CrossRef Google Scholar

Ito, K., & Speer, S. R. (2008). Anticipatory effects of intonation: Eye movements during instructed visual search. Journal of Memory and Language, 58(2), 541–573.10.1016/j.jml.2007.06.013CrossRef Google Scholar PubMed

Kjelgaard, M. M., & Speer, S. R. (1999). Prosodic facilitation and interference in the resolution of temporary syntactic closure ambiguity. Journal of Memory and Language, 40(2), 153–194.10.1006/jmla.1998.2620CrossRef Google Scholar

Kliegl, R., Dambacher, M., Dimigen, O., Jacobs, A. M., & Sommer, W. (2012). Eye movements and brain electric potentials during reading. Psychological Research, 76, 145–158.10.1007/s00426-011-0376-xCrossRef Google Scholar PubMed

Kristensen, L. B., Wang, L., Petersson, K. M., & Hagoort, P. (2013). The interface between language and attention: Prosodic focus marking recruits a general attention network in spoken language comprehension. Cerebral Cortex, 23(8), 1836–1848.10.1093/cercor/bhs164CrossRef Google Scholar PubMed

Kuperberg, G. R. (2013). The pro-active comprehender: What eventrelated potentials tell us about the dynamics of reading comprehension. In Miller, B., Cutting, L., & McCardle, P. (Eds.), Unraveling theBehavioral, Neurobiological, and Genetic Components of Reading Comprehension (pp. 176–192). Baltimore: Paul Brookes Publishing.Google Scholar

Kuperberg, G. R., & Jaeger, T. F. (2016). What do we mean by prediction in language comprehension? Language, Cognition and Neuroscience, 31(1), 32–59.10.1080/23273798.2015.1102299CrossRef Google Scholar PubMed

Kutas, M. (1993). In the company of other words: Electrophysiological evidence for single-word and sentence context effects. Language and Cognitive Processes, 8(4), 533–572.10.1080/01690969308407587CrossRef Google Scholar

Kutas, M., & Federmeier, K. D. (2011). Thirty years and counting: Finding meaning in the N400 component of the event-related brain potential (ERP). Annual Review of Psychology, 62, 621–647.10.1146/annurev.psych.093008.131123CrossRef Google Scholar PubMed

Kutas, M., & Hillyard, S. A. (1984). Brain potentials during reading reflect word expectancy and semantic association. Nature, 307(5947), 161–163.10.1038/307161a0CrossRef Google Scholar PubMed

Lehiste, I. (1973). Phonetic disambiguation of syntactic ambiguity. The Journal of the Acoustical Society of America, 53(1_Supplement), 380–380.10.1121/1.1982702CrossRef Google Scholar

Li, W., Deng, N., Yang, Y., & Wang, L. (2018). Process focus and accentuation at different positions in dialogues: An ERP study. Language, Cognition and Neuroscience, 33(2), 255–274.10.1080/23273798.2017.1387278CrossRef Google Scholar

Li, W., Zhang, J., & Yang, Y. (2017). The cognitive processing of contrastive focus and its relationship with pitch accent. Acta Psychologica Sinica, 49(9), 1137.10.3724/SP.J.1041.2017.01137CrossRef Google Scholar

Li, X., Hagoort, P., & Yang, Y. (2008). Event-related potential evidence on the influence of accentuation in spoken discourse comprehension in Chinese. Journal of Cognitive Neuroscience, 20(5), 906–915.10.1162/jocn.2008.20512CrossRef Google Scholar PubMed

Li, X., Ren, G., Zheng, Y., & Chen, Y. (2020). How does dialectal experience modulate anticipatory speech processing? Journal of Memory and Language, 115, 104169.10.1016/j.jml.2020.104169CrossRef Google Scholar

Li, X., & Yang, Y. (2013). How long-term memory and accentuation interact during spoken language comprehension. Neuropsychologia, 51(5), 967–978.10.1016/j.neuropsychologia.2012.12.016CrossRef Google Scholar PubMed

Li, X.-q., & Ren, G.-q. (2012). How and when accentuation influences temporally selective attention and subsequent semantic processing during on-line spoken language comprehension: An ERP study. Neuropsychologia, 50(8), 1882–1894.10.1016/j.neuropsychologia.2012.04.013CrossRef Google Scholar PubMed

Moreno, E. M., Federmeier, K. D., & Kutas, M. (2002). Switching languages, switching palabras (words): An electrophysiological study of code switching. Brain and Language, 80(2), 188–207.10.1006/brln.2001.2588CrossRef Google Scholar PubMed

Nieuwland, M. S., Barr, D. J., Bartolozzi, F., Busch-Moreno, S., Darley, E., Donaldson, D. I., Ferguson, H. J., Fu, X., Heyselaar, E., Huettig, F., Husband, E. M., Ito, A., Kazanina, N., Kogan, V., Kohút, Z., Kulakova, E., Mézière, D., Politzer-Ahles, S., Rousselet, G., …, Zu Wolfsthurn, S. V. G. (2020). Dissociable effects of prediction and integration during language comprehension: Evidence from a large-scale study using brain potentials. Philosophical Transactions of the Royal Society B, 375(1791), 20180522.10.1098/rstb.2018.0522CrossRef Google Scholar PubMed

Nieuwland, M. S., & Van Berkum, J. J. (2006). When peanuts fall in love: N400 evidence for the power of discourse. Journal of Cognitive Neuroscience, 18(7), 1098–1111.10.1162/jocn.2006.18.7.1098CrossRef Google Scholar

O’Rourke, P. L., & Van Petten, C. (2011). Morphological agreement at a distance: Dissociation between early and late components of the event-related brain potential. Brain Research, 1392, 62–79.10.1016/j.brainres.2011.03.071CrossRef Google Scholar

Otten, M., & Van Berkum, J. J. (2008). Discourse-based word anticipation during language processing: Prediction or priming? Discourse Processes, 45(6), 464–496.10.1080/01638530802356463CrossRef Google Scholar

Piai, V., Anderson, K. L., Lin, J. J., Dewar, C., Parvizi, J., Dronkers, N. F., & Knight, R. T. (2016). Direct brain recordings reveal hippocampal rhythm underpinnings of language processing. Proceedings of the National Academy of Sciences, 113(40), 11366–11371.10.1073/pnas.1603312113CrossRef Google Scholar PubMed

Piai, V., Roelofs, A., & Maris, E. (2014). Oscillatory brain responses in spoken word production reflect lexical frequency and sentential constraint. Neuropsychologia, 53, 146–156.10.1016/j.neuropsychologia.2013.11.014CrossRef Google Scholar PubMed

Piai, V., Roelofs, A., Rommers, J., & Maris, E. (2015). Beta oscillations reflect memory and motor aspects of spoken word production. Human Brain Mapping, 36(7), 2767–2780.10.1002/hbm.22806CrossRef Google Scholar PubMed

Pickering, M. J., & Garrod, S. (2004). Toward a mechanistic psychology of dialogue. Behavioral and Brain Sciences, 27(2), 169–190.10.1017/S0140525X04000056CrossRef Google Scholar

Pierrehumbert, J. B. (1980). The phonology and phonetics of English intonation Massachusetts Institute of Technology.Google Scholar

Pijnacker, J., Geurts, B., Van Lambalgen, M., Buitelaar, J., & Hagoort, P. (2010). Exceptions and anomalies: An ERP study on context sensitivity in autism. Neuropsychologia, 48(10), 2940–2951.10.1016/j.neuropsychologia.2010.06.003CrossRef Google Scholar

Price, P. J., Ostendorf, M., Shattuck-Hufnagel, S., & Fong, C. (1991). The use of prosody in syntactic disambiguation. The Journal of the Acoustical Society of America, 90(6), 2956–2970.10.1121/1.401770CrossRef Google Scholar PubMed

Puts, D., & Cardenas, R. (2018). Voice scripts. December 3 https://doi.org/10.17605/OSF.IO/K2BHS.CrossRef Google Scholar

Rommers, J., Dickson, D. S., Norton, J. J., Wlotko, E. W., & Federmeier, K. D. (2017). Alpha and theta band dynamics related to sentential constraint and word expectancy. Language, Cognition and Neuroscience, 32(5), 576–589.10.1080/23273798.2016.1183799CrossRef Google Scholar PubMed

Ryskin, R., & Nieuwland, M. S. (2023). Prediction during language comprehension: What is next? Trends in Cognitive Sciences.10.1016/j.tics.2023.08.003CrossRef Google Scholar PubMed

Sanford, A. J., Sanford, A. J., Molle, J., & Emmott, C. (2006). Shallow processing and attention capture in written and spoken discourse. Discourse Processes, 42(2), 109–130.10.1207/s15326950dp4202_2CrossRef Google Scholar

Shattuck-Hufnagel, S., & Turk, A. E. (1996). A prosody tutorial for investigators of auditory sentence processing. Journal of Psycholinguistic Research, 25, 193–247.10.1007/BF01708572CrossRef Google Scholar

Spratling, M. W. (2017). A review of predictive coding algorithms. Brain and Cognition, 112, 92–97.10.1016/j.bandc.2015.11.003CrossRef Google Scholar PubMed

Sun, Y., Sommer, W., & Li, W. (2022). How accentuation influences the processing of emotional words in spoken language: An ERP study. Neuropsychologia, 166, 108144.10.1016/j.neuropsychologia.2022.108144CrossRef Google Scholar PubMed

Terken, J., & Nooteboom, S. G. (1987). Opposite effects of accentuation and deaccentuation on verification latencies for given and new information. Language and Cognitive Processes, 2(3–4), 145–163.10.1080/01690968708406928CrossRef Google Scholar

Thornhill, D. E., & Van Petten, C. (2012). Lexical versus conceptual anticipation during sentence processing: Frontal positivity and N400 ERP components. International Journal of Psychophysiology, 83(3), 382–392.10.1016/j.ijpsycho.2011.12.007CrossRef Google Scholar PubMed

Van Berkum, J. J., Brown, C. M., Zwitserlood, P., Kooijman, V., & Hagoort, P. (2005). Anticipating upcoming words in discourse: Evidence from ERPs and reading times. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31(3), 443.Google Scholar PubMed

Van Petten, C., & Luka, B. J. (2006). Neural localization of semantic context effects in electromagnetic and hemodynamic studies. Brain and Language, 97(3), 279–293.10.1016/j.bandl.2005.11.003CrossRef Google Scholar PubMed

Van Petten, C., & Luka, B. J. (2012). Prediction during language comprehension: Benefits, costs, and ERP components. International Journal of Psychophysiology, 83(2), 176–190.10.1016/j.ijpsycho.2011.09.015CrossRef Google Scholar PubMed

Wang, L., Bastiaansen, M., Yang, Y., & Hagoort, P. (2011). The influence of information structure on the depth of semantic processing: How focus and pitch accent determine the size of the N400 effect. Neuropsychologia, 49(5), 813–820.10.1016/j.neuropsychologia.2010.12.035CrossRef Google Scholar PubMed

Wang, L., Bastiaansen, M., Yang, Y., & Hagoort, P. (2012). Information structure influences depth of syntactic processing: Event-related potential evidence for the Chomsky illusion. PLoS One, 7(10), e47917.10.1371/journal.pone.0047917CrossRef Google Scholar PubMed

Wang, L., Hagoort, P., & Jensen, O. (2018). Language prediction is reflected by coupling between frontal gamma and posterior alpha oscillations. Journal of Cognitive Neuroscience, 30(3), 432–447.10.1162/jocn_a_01190CrossRef Google Scholar PubMed

Wang, L., Hagoort, P., & Yang, Y. (2009). Semantic illusion depends on information structure: ERP evidence. Brain Research, 1282, 50–56.10.1016/j.brainres.2009.05.069CrossRef Google Scholar PubMed

Wang, L., Schoot, L., Brothers, T., Alexander, E., Warnke, L., Kim, M., … & Kuperberg, G. R. (2023). Predictive coding across the left fronto-temporal hierarchy during language comprehension. Cerebral Cortex, 33(8), 4478–4497.10.1093/cercor/bhac356CrossRef Google Scholar PubMed

Wlotko, E. W., & Federmeier, K. D. (2012). So that’s what you meant! Event-related potentials reveal multiple aspects of context use during construction of message-level meaning. Neuroimage, 62(1), 356–366.10.1016/j.neuroimage.2012.04.054CrossRef Google Scholar PubMed

Wlotko, E. W., Federmeier, K. D., & Kutas, M. (2012). To predict or not to predict: Age-related differences in the use of sentential context. Psychology and Aging, 27(4), 975.10.1037/a0029206CrossRef Google Scholar PubMed

Yang, Y., & Li, X. (2004). The role of accentuation in spoken discourse comprehension. Acta Psychologica Sinica, 36(04), 393.Google Scholar

Table 1. Examples of stimuli

Table 2. Ratings for critical words’ cloze probability, lexical frequency, concreteness, and imageability (M±SD)

Table 3. Acoustic parameters of critical words (CWs) and the preceding sentence fragments under the two accent conditions

Table 4. Acoustic parameters of critical words (CWs) in the two accent conditions

Figure 1. A single trial of the experimental procedure.

Table 5. Accuracy rates under different conditions (M±SD)

Article contents

The neural processing of the interaction between accentuation and lexical prediction during spoken sentence comprehension

Abstract

Keywords

Information

1. Introduction

1.1. Semantic prediction in language comprehension

1.2. Prosodic facilitation in spoken language comprehension

1.3. The current study

2. Methods

2.1. Participants

2.2. Materials

2.3. Procedure

2.4. EEG acquisition and analysis

2.5. Results

2.5.1. Behavioral results

2.5.2. ERP results

3. Discussion

4. Conclusion

Data availability statement

Acknowledgements

Competing interests

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests