1. Introduction
There is a vast amount of literature showing that, in languages with postnominal relative clauses, such as English, Hebrew, Spanish, and Italian, object relative (OR) clauses are acquired late by children (e.g., Arosio et al., Reference Arosio, Adani, Guasti, Brucart, Gavarró and Solà2009; Cilibrasi et al., Reference Cilibrasi, Adani and Tsimpli2019; Kidd & Arciuli, Reference Kidd and Arciuli2016) and are relatively difficult to process even for adults (e.g., del Río et al., Reference del Río, López-Higes and Martín-Aragoneses2012; Holmes & O’Regan, Reference Holmes and O’Regan1981). In the present study, we examine whether providing young school-aged Spanish children who have not yet mastered this construction with limited albeit enhanced additional experience will result in an improvement in comprehension. In addition, we investigate the potential relationship between grammatical development and general cognitive ability by assessing whether the comprehension of ORs before and after training is related to the children’s vocabulary size and non-verbal intelligence quotient (IQ).
1.1. Why are ORs difficult?
ORs are relative clauses in which the head is the object of the verb in the subordinate clause (e.g., the man that my friend saw). In subject relatives (SRs), in contrast, the head is the subject of the verb (e.g., the man that saw my friend). The relative difficulty of ORs has been attributed to their structural properties. ORs are thought to be more marked than SRs because objects are lower on the Noun Phrase Accessibility Hierarchy (cf. Keenan & Comrie, Reference Keenan and Comrie1977) than subjects. They also involve longer dependencies between the filler and the gap,Footnote 1 and the subject structurally intervenes between the head noun and the gap. These properties place relatively high demands on working memory, which results in processing difficulties even for adults. They also make the construction more difficult for children to acquire, since children are known to have limited working memory/computational capacity (Adani, Reference Adani2011; Arosio et al., Reference Arosio, Panzeri, Molteni, Magazù and Guasti2017; Cilibrasi et al., Reference Cilibrasi, Adani and Tsimpli2019; Contemori & Belletti, Reference Contemori and Belletti2014; Friedmann et al., Reference Friedmann, Belletti and Rizzi2009).
The late acquisition of ORs could also be attributed to the properties of the input. According to experience-based accounts, including usage-based approaches (e.g., Kidd et al., Reference Kidd, Brandt, Lieven and Tomasello2007; Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007), ORs are more difficult to acquire and process because their schematic syntactic structure is less prototypical and frequent than that of SRs, especially when the head is a full Noun Phrase (NP) with an animate referent (Betancort et al., Reference Betancort, Carreiras and Sturt2009; Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007). Although SRs are similar in frequency to ORs and have in fact been found to be slightly less frequent in spoken texts (Reali & Christiansen, Reference Reali and Christiansen2007; Roland et al., Reference Roland, Dick and Elman2007), they have the same word order (SVO) as ordinary transitive sentences in both English and Spanish. Sentences with the canonical SVO structure can be interpreted using a simple processing heuristic, namely, that the first NP corresponds to the agent. This same heuristic works against comprehending ORs and could account for the fact that children sometimes interpret them as if they were SRs (Arosio et al., Reference Arosio, Adani, Guasti, Brucart, Gavarró and Solà2009; Diessel, Reference Diessel, Givón and Shibatani2009; Labelle, Reference Labelle1990).
The accounts discussed above make different predictions about the acquisition of ORs. For those based on syntactic structure and processing demands, the late acquisition of ORs is related to maturational and cognitive constraints that are in place until learners reach a particular point in development (Cilibrasi et al., Reference Cilibrasi, Adani and Tsimpli2019; Contemori & Belletti, Reference Contemori and Belletti2014; Friedmann et al., Reference Friedmann, Belletti and Rizzi2009). Crucially, according to this position, such constraints stem ultimately from the maturation of the brain and are largely independent of linguistic experience (Borer & Wexler, Reference Borer, Wexler, Roeper and Williams1987) – although, of course, proponents of these approaches do acknowledge that some relevant experience is required in order to trigger development. For experience-based accounts (Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007), in contrast, the acquisition of ORs depends on learners’ experience with the construction. Therefore, according to the latter view, the acquisition of ORs could be accelerated by providing learners with additional experience. In the present study, we tested this prediction by exposing Spanish children who had not yet acquired ORs to instances of this construction while explicitly contrasting ORs with their corresponding SRs as well as providing explicit feedback on accuracy. If children’s difficulties were due to development-related maturational and/or processing constraints, we would not expect their comprehension of the critical structure to improve after a brief training session. By contrast, if what was hindering comprehension was insufficient amount or quality of experience with the construction, training should result in observable improvements.
1.2. Grammatical comprehension, vocabulary, and non-verbal IQ
One fundamental question in the study of language acquisition concerns the mechanisms underlying the acquisition of the grammatical system of the native language. Usage-based models of language acquisition (Bybee, Reference Bybee2010; Goldberg, Reference Goldberg2006; Tomasello, Reference Tomasello2000) assume that grammar, in the same way as vocabulary, is represented as a set of form–meaning pairings. Partial support for this claim can be found in the existence of robust correlations between grammatical and lexical development in first language acquisition (Bates et al., Reference Bates, Bretherton and Snyder1988; Marchman et al., Reference Marchman, Martínez‐Sussmann and Dale2004) and in ultimate attainment of grammar and vocabulary in adult L1 speakers (Dąbrowska, Reference Dąbrowska2018; Llompart & Dąbrowska, Reference Llompart and Dąbrowska2020). Furthermore, usage-based models also argue that the acquisition of both grammar and vocabulary relies on the same domain-general mechanisms (albeit not necessarily to the same extent). Critically, this contrasts with modular models (e.g., Pinker, Reference Pinker1999; Ullman, Reference Ullman2001; Ullman et al., Reference Ullman, Corkin, Coppola, Hickok, Growdon, Koroshetz and Pinker1997), which assume that grammatical knowledge is stored separately from lexical knowledge and depends on procedural/implicit memory, unlike lexical acquisition, which is thought to rely on declarative/explicit memory.
Still, research to date has not yet been able to provide a clear picture of the learning mechanisms underlying the acquisition of grammar. There is indeed some evidence outlining a close relationship between grammar and implicit learning and memory. Several studies have shown correlations between grammatical development in typically developing children and implicit learning abilities (Hamrick et al., Reference Hamrick, Lum and Ullman2018; Kidd, Reference Kidd2012; Kidd & Arciuli, Reference Kidd and Arciuli2016; Lum et al., Reference Lum, Conti-Ramsden, Page and Ullman2012). Furthermore, a number of studies of children with specific language impairment (SLI; now mostly referred to as developmental language disorder [DLD]) have shown persistent deficits in implicit learning in this population (Gabriel et al., Reference Gabriel, Maillart, Stefaniak, Lejeune, Desmotte and Meulemans2013; Lum et al., Reference Lum, Conti-Ramsden, Page and Ullman2012; Ullman & Pierpont, Reference Ullman and Pierpont2005).
However, there are also grounds to argue that explicit learning abilities and problem-solving abilities may play a role in grammatical development. For example, within the usage-based framework, Dąbrowska (Reference Dąbrowska2010, Reference Dąbrowska2018) proposes that acquiring a constructional schema effectively requires solving a proportional analogy of the form: MEANING1: form 1:: MEANING2: form 2. In order to do this, learners must be able to encode features, induce abstract relations, and maintain information in working memory. Abilities of this kind are believed to be involved in explicit reasoning, in that they are, for example, routinely targeted by intelligence tests.
In fact, there is also empirical evidence supporting a relationship between IQ and explicit learning abilities, on the one hand, and grammar learning, on the other. First, children with learning impairments consistently perform well below their chronological age on virtually all aspects of language (e.g., van der Schuit et al., Reference van der Schuit, Segers, van Balkom and Verhoeven2011). This is also true for individuals with Williams’ syndrome: while some researchers claimed that morphosyntactic abilities are selectively preserved in this group (see, e.g., Clahsen & Almazan, Reference Clahsen and Almazan1998), later studies have shown that this is not the case; that is to say, the performance of children and adolescents with Williams’ syndrome on language measures is fully in line with their mental age (Stojanovik et al., Reference Stojanovik, Perkins and Howard2004). Furthermore, in children with cognitive impairments, robust correlations have been reported between non-verbal IQ and grammatical abilities (Facon et al., Reference Facon, Facon-Bollengier and Grubar2002; van der Schuit et al., Reference van der Schuit, Segers, van Balkom and Verhoeven2011). Similar correlations have also been reported in typically developing children (Roth et al., Reference Roth, Speece and Cooper2002; West et al., Reference West, Vadillo, Shanks and Hulme2018), although some studies have failed to find such a relationship (Kidd & Arciuli, Reference Kidd and Arciuli2016; van der Schuit et al., Reference van der Schuit, Segers, van Balkom and Verhoeven2011). Finally, correlations between L1 grammatical abilities on the one hand and executive function, language aptitude, and need for cognition (i.e., the extent to which an individual enjoys effortful cognitive activities) on the other hand have been observed in typically developing children (Brooks & Kempe, Reference Brooks and Kempe2019) as well as in adults (Dąbrowska, Reference Dąbrowska2018; Llompart & Dąbrowska, Reference Llompart and Dąbrowska2023; Street & Dąbrowska, Reference Street and Dąbrowska2010). In the present study, we contribute to this ongoing discussion by examining the extent to which L1 children’s grammatical knowledge, assessed through their comprehension of ORs, is related to (i) their vocabulary size at the time of testing and (ii) their non-verbal IQ.
1.3. Spanish relatives
Both SRs and ORs in Spanish are normally introduced by the subordinating conjunction que, although less common variants with the pronouns cual ‘which’ and quien ‘who’ are also possible. SRs share the word order of simple transitive clauses, that is, SVO (1). In ORs, where the NP referring to the patient is the head, the subject of the relative clause can appear either before (2a) or after the verb (2b).
Spanish is a language with differential object marking (DOM): when the direct object is human and specific, it must be marked with the preposition a ‘to’, as in (1).Footnote 2 The preposition a can also be used to mark the subordinator in ORs, as in (3).
Cross-linguistically, the vast majority of ORs modify inanimate heads (see Mitsugi, Reference Mitsugi2016 and footnote 3). Since a only occurs in DOM contexts, it follows that the prepositional variant of the OR, as in (3), is less frequent than the ‘plain’ variant in (2). This is confirmed by corpus analyses: in a sample of educated spoken Spanish in the Madrid region analyzed by Butler (Reference Butler1992), ORs with a accounted for 5.8% of all ORs; in Reali’s (Reference Reali2014) corpus, also of spoken Peninsular Spanish, the proportion was even smaller (less than 5%). However, in DOM contexts, the variant a is strongly preferred: in fact, the proportion of ORs with the a marker is close to the overall frequency of ORs with animate heads attested across a number of corpus studies, suggesting a near-categorical preference for a relatives in such contexts.Footnote 3
Furthermore, the a variant is arguably easier to process, especially in spoken language, for two reasons. First, it is unambiguously marked as an OR from the onset of the relative clause. This is in opposition to the plain variant, especially when the subject appears in the postverbal position, as in (2b): the latter sentence is identical to the SR in (1) until the final Prepositional Phrase (PP)/NP. Given that there is evidence that sentences like (2b) are initially interpreted as SRs and need to be reanalyzed on reaching the second NP (Betancort et al., Reference Betancort, Carreiras and Sturt2009), the prepositional variant is expected to reduce processing difficulty. Secondly, in spoken Spanish, sentences like (1) and (2b), both with masculine subjects and objects, differ only in the vowel of an unstressed function word (i.e., [el] vs. [al]), which makes the comprehension of the sentence dependent on hearing this small acoustic difference. With feminine referents, SRs and ORs are often quasi-indistinguishable in perception because, if the preceding word ends in -a, the preposition a is merged with that vowel: thus, (4a) is almost identical to (4b). As a result, comprehension errors involving the ‘plain’ variant could be in many cases due to perceptual factors.
For these reasons, in the grammatical comprehension task reported here, we used the prepositional variant of the OR with the verb in the final position, that is, the variant corresponding to (3a).
It is important to observe here that ORs such as the ones used in this study and the vast majority of psycholinguistic research on ORs, which have animate heads and full NP subjects, are rare in discourse. As mentioned above, most attested ORs have inanimate heads, and they also tend to have pronominal or, in pro-drop languages, elided subjects (e.g., el libro que (él) escribió ‘the book that he wrote’). This is because the function of ORs is to anchor the head NP in ongoing discourse by linking it to a previously mentioned animate referent (Fox & Thompson, Reference Fox and Thompson1990). Note that such relative clauses can be processed by relying on semantic and/or pragmatic cues: a person can write a book, but a book cannot write a person. Thus, in order to be able to assess whether speakers are able to process ORs by relying on syntactic cues alone, it is necessary to use non-canonical clauses like those in examples (2) and (3).
1.4. The acquisition of relative clauses
Corpus studies of children acquiring Spanish (Barreña, Reference Barreña, Leow and Sanz2000; Hernández Pina, Reference Hernández Pina1984; both cited in Ezeizabarrena, Reference Ezeizabarrena2012) suggest that both SRs and ORs emerge early in development, at around age 2;6, and Pérez-Leroux (Reference Pérez-Leroux1993) reports that Spanish-speaking children as young as 3 occasionally produce ORs under experimental conditions. However, Pérez-Leroux also notes that young children tend to avoid such structures, and Ezeizabarrena (Reference Ezeizabarrena2012) observed very high error rates in ORs, which persisted even in the oldest children in this study (aged 7).
We are not aware of any experimental studies examining the comprehension of ORs by Spanish-speaking children. However, studies investigating the comprehension of ORs by preschool children acquiring a number of other languages, including English, Greek, Italian, Hebrew, Portuguese, and Catalan, typically report comprehension at around chance (Arosio et al., Reference Arosio, Adani, Guasti, Brucart, Gavarró and Solà2009, Reference Arosio, Panzeri, Molteni, Magazù and Guasti2017; Belletti et al., Reference Belletti, Friedmann, Brunato and Rizzi2012; Friedmann & Costa, Reference Friedmann and Costa2010; Gavarró et al., Reference Gavarró, Cunill, Muntané and Reguant2012; Guasti et al., Reference Guasti, Stavrakaki and Arosio2008; Kidd & Arciuli, Reference Kidd and Arciuli2016). One exception to this is Adani (Reference Adani2011), who found that 6-year-olds chose the target response on 85% of the trials. In Adani’s experiment, the subject and object NPs differed in number (one was singular and the other was plural), so it is possible that the children were able to achieve such high levels of performance because they were relying on agreement cues – although the presence of such cues did not help the children tested by Arosio et al. (Reference Arosio, Adani, Guasti, Brucart, Gavarró and Solà2009), Gavarró et al. (Reference Gavarró, Cunill, Muntané and Reguant2012), or Guasti et al. (Reference Guasti, Stavrakaki and Arosio2008).
Furthermore, other studies indicate that ORs continue to pose difficulties even for school-aged children. Kidd and Arciuli (Reference Kidd and Arciuli2016) report a mean comprehension rate of 55% in English-speaking children aged from 6 to 8, while Cilibrasi et al. (Reference Cilibrasi, Adani and Tsimpli2019), who tested slightly older children (7;5–11;7) found an overall comprehension rate of 68%. Two other studies examined the comprehension of ORs by Italian and Hebrew 7- and 9-year-olds. Arosio et al. (Reference Arosio, Panzeri, Molteni, Magazù and Guasti2017) observed accuracy rates of 73% for 7-year-olds and 77% for 9-year-olds for Italian-speaking children. For Hebrew-speaking children, the corresponding figures were 78% and 92%, respectively (Friedmann & Costa, Reference Friedmann and Costa2010, experiment 5). Importantly, in all of these studies, children performed much better – typically at or close to the ceiling – on SRs. Thus, while there are some differences between studies, it is clear that the acquisition of ORs continues well into the school years.
1.5. The present study
In the present study, we assess school-aged Spanish children’s comprehension of ORs using a picture selection task. The children completed the task three times: before they received training (pre-test), immediately after training (post-test 1), and a few days later (post-test 2). Performance on the pre-test was compared to performance on the post-tests in order to assess whether participants had improved after training. Performance on post-test 1 was further compared to performance on post-test 2 to assess whether the improvement (if any) was indeed lasting and to test potential consolidation effects. In addition, we also administered measures of non-verbal intelligence (Raven’s Colored Progressive Matrices (CPM); Raven et al., Reference Raven, Court and Raven1990) and receptive vocabulary (the Spanish version of the Peabody Picture Vocabulary Test (PPVT); Dunn et al., Reference Dunn, Dunn and Arribas2006) in order to determine the extent to which individual differences in these cognitive abilities modulate grammatical comprehension.
2. Methods
2.1. Participants
Forty children (17 males and 23 females) aged from 5;11 to 7;1 (mean 6;7) took part in this study. All children were native speakers of Spanish and were recruited from a school located in the Seville area (i.e., Colegio San Antonio María Claret). Informed consent was obtained from the parents, and the study was conducted in accordance with the Declaration of Helsinki. Ethics approval was obtained from the Ethics Committee of the Friedrich Alexander University Erlangen-Nuremberg.
2.2. Materials and procedure
Participants were tested individually in a quiet room at the school by an experimenter who was a native speaker of Spanish. There were two experimental sessions of approximately 50 and 30 minutes. The first session included the pre-test, training, and immediate post-test as well as the non-verbal intelligence measure; in the second session, the children completed the delayed post-test and the vocabulary measure. On average, the sessions were 7.3 days apart (SD = 2; range: 1–13). The tasks are described in the following subsections in the order in which they were conducted.
2.2.1. Grammatical comprehension: picture selection task
Materials for this task were 16 pairs of pictures (32 pictures in total) and 64 Spanish sentences. The pairs of pictures depicted transitive actions with two animate participants (e.g., a man and a boy). The pictures in each pair differed in who carried out the action (e.g., a man helping a boy versus a boy helping a man). Corresponding to each picture were two sentences, one containing a SR and the other containing an OR. Both the nouns (e.g., niño ‘boy’, hombre ‘man’, abuela ‘grandma’) and verbs (e.g., ayudar ‘to help’, besar ‘to kiss’, despertar ‘to wake up’) referring to the participants and actions depicted in the pictures were basic Spanish vocabulary and were expected to be known by our participant group. An example of a test item is provided in Figure 1.
The sentences were assigned to two versions of the test, each containing 16 SRs and 16 ORs. This means that, in our example, the two sentences requiring that participants point to the boy (i.e., one SR and one OR) would be included in version 1, while those where the correct response required pointing to the man would be included in version 2. Version 1 was used during pre-test and training, and version 2 was used during post-test 1 and post-test 2. Thus, in the post-tests, the children saw the same picture pairs as in the pre-test and training but heard a different prompt (with the roles reversed): for example, if the pre-test prompt was Señala el niño al que el hombre ayuda (‘Point to the boy that the man is helping’), the corresponding prompt in the post-tests would be Señala el hombre al que el niño ayuda (‘Point to the man that the boy is helping’). Note that this feature of the design means that, if participants simply learned to select a particular picture during training, this would result in decreased performance in the post-tests, where they were required to choose the other picture.
Each part of the picture selection task had a total of 32 trials, 16 with SRs and 16 with ORs. Sentences were presented in a pseudorandom order with the constraint that the SR and OR involving the same action and pair of participants were at least two trials apart.
The procedure was the same for the pre-test, post-test 1, and post-test 2. On each trial, participants were shown a pair of pictures and were orally instructed to point to one specific referent that appeared in one of the pictures (e.g., Señala el niño al que el hombre ayuda ‘Point to the boy that the man is helping’). As mentioned above, the pictures in each pair shared the same referents but with different roles. Once an answer had been given, the experimenter moved on to the next trial. No feedback was provided during the test stage.
The training part of the task was divided into two blocks. In the first block, which consisted of 16 trials, the child was shown a pair of pictures and the experimenter described each picture in turn using a SR and an OR with the same head noun (Éste es el niño que ayuda al hombre y éste es el niño al que el hombre ayuda ‘This is the boy that is helping the man, and this is the boy that the man is helping’) while pointing to the relevant referent. Immediately afterward, the child was asked to repeat the two sentences while pointing to the matching pictures. Thus, in this phase, the SR and ORs were explicitly contrasted. The experimenter always described the picture on the left first; since the position of the referent of the SR and OR was counterbalanced, this meant that in half of the trials the SR came first, and in the other half, the OR came first.
In the second training block, which also contained 16 trials, the procedure was the same as in the pre-test, except that children were given feedback on their responses. Thus, the child was shown the pictures and asked to point to the correct referent (e.g., Señala el niño al que el hombre ayuda ‘Point to the boy that the man is helping’). If they selected the matching picture, the experimenter confirmed their choice as correct (e.g., Sí, éste es el niño al que el hombre ayuda ‘Yes, this is the boy that the man is helping’). If they selected the wrong picture, the experimenter provided negative feedback (e.g., No, éste es el niño que ayuda al hombre ‘No, this is the boy that is helping the man’) and then pointed to the target picture and repeated the correct description (e.g., Éste de aquí es el niño al que el hombre ayuda ‘This one here is the boy that the man is helping’).
2.2.2. Non-verbal IQ test
Raven’s CPM test (Raven et al., Reference Raven, Court and Raven1990) was used as a measure of non-verbal IQ. The CPM is designed for children of ages 5 through 12. Each item in the test contains a large colored pattern with a missing part; the participant’s task is to choose the missing part from an array of six possible responses. There are 36 items in total.
2.2.3. Vocabulary test
The vocabulary test used in the present study was the Spanish PPVT-III (Dunn et al., Reference Dunn, Dunn and Arribas2006). The PPVT-III is a normed test of receptive vocabulary in which participants must identify the visual referents of words from an array of pictures. On each trial, participants were shown a grid containing four pictures and were asked to point toward the picture depicting a specific word (e.g., lápiz ‘pencil’).
3. Results
The dataset analyzed in this article and the code to reproduce the analyses reported are available at https://osf.io/83abx/ (Open Science Framework). Data from one participant for the delayed post-test and the vocabulary task were missing due to that participant not being able to attend the second testing session. Data from this participant for the IQ test and the pre-test and immediate post-test parts of picture selection were retained in all analyses. The age of one of the participants also failed to be recorded. Table 1 shows the descriptive statistics for the picture selection task, with accuracies provided separately for SRs and ORs for all parts. The mean score for PPVT-III was 87 (SD = 17.9); the mean for CPM was 22.4 (SD = 5.8). Note that the PPVT-III mean of this group is substantially higher than the 50th percentile for children aged 6;7 (a score of 71), which is the mean age of our participants. A total of 33 out of the 39 participants (85%) who completed the vocabulary test were at or above that percentile, and 10 (26%) were above the 75th percentile for that same age (a score of 99). Similar figures are also found for the CPM (Court, Raven et al., Reference Raven, Court and Raven1996). Out of the 40 participants who completed the task, 32 (80%) were above the 50th percentile (a score of 17), and 21 (53%) above the 75th percentile (a score of 22) for children aged 6 to 7 (Raven et al., Reference Raven, Court and Raven1996). The correlation between PPVT-III and CPM scores was not significant (r(37) = .04, p = .79).
Note: Standard deviations are in parentheses.
Regarding the picture selection task, as shown in Table 1, the children were extremely accurate with SRs in all three tests. Performance on ORs, in contrast, was very poor in the pre-test (well below chance) but improved substantially after training, with scores on the delayed post-test (i.e., post-test 2) being slightly higher than on the immediate post-test. Figure 2 showcases the distribution of individual scores in each part of the task. As observable in this figure, there were only four children (10% of the sample) who were clearly above chance already in the pre-test. This number then rose dramatically for the two test parts after the training.
To systematically examine differences in accuracy as well as the potential effects of participants’ non-verbal IQ and vocabulary scores on performance, we fitted a mixed-effects logistic regression model (lme4 package 1.1–31, Bates et al., Reference Bates, Mächler, Bolker and Walker2015) in R (Version 4. 2. 2, R Core Team, 2017). The dependent variable was Response in the OR trials (coded 1 for correct and 0 for incorrect). The SR trials were excluded because of the clear ceiling effects in this condition. The independent variables of interest were Task Part (Pre-test/Post-test 1/Post-test 2), IQ, and Vocabulary. Task Part was Helmert-coded as two contrasts to which we will henceforth separately refer as Pre-Post and Post1-Post2, which targeted two different comparisons of interest (see Llompart, Reference Llompart2021, for a similar approach). ‘Pre-Post’ was coded to capture differences in accuracy between the pre-test (i.e., before training) and the two post-tests (i.e., after the training). Hence, the pre-test was coded as −0.5, and post-test 1 and post-test 2 were both coded as 0.25 (see Brehm & Alday, Reference Brehm and Alday2022, for a discussion of the importance of providing the numeric values of contrast-coding procedures in psycholinguistic research). With this coding, a significantly positive regression coefficient would indicate that post-test accuracy was higher than pre-test accuracy. ‘Post1-Post2’ was coded to quantify differences in accuracy between post-test 1 and post-test 2 (i.e., approximately a week after post-test 1). Post-test 1 was coded as −0.5, post-test 2 was coded as 0.5, and pre-test was left at 0. Therefore, a significantly positive regression coefficient would in this case mean that accuracy in post-test 2 was higher than in post-test 1. IQ scores and Vocabulary scores were entered as centered and scaled numeric predictors. In addition, participants’ age was included as a covariate. Age was entered in months and was also centered and scaled.
The Task Part contrasts (i.e., Pre-Post and Post1-Post2), IQ, Vocabulary, and Age, as well as the interactions between the two Task Part contrasts and IQ and those between the two Task Part contrasts and Vocabulary, were included in the model as fixed effects. Regarding random effects, the model included random intercepts for Participants and Items. Given that the items in the pre-test differed from those in the two post-tests in the target sentences but had the same picture pairs, items were re-coded by picture pair before entering the analyses as random factor for Item. The random-effects structure was chosen by a model fitting procedure using log-likelihood ratio tests such that random slopes were only included if they improved the model’s fit. This was done because the maximal random-effects structure led to convergence issues.Footnote 4 The only random slope that was included was for Pre-Post over participants (χ 2(3) = 57.43, p < .001). Random slopes for Post1-Post2 over Participants and for Pre-Post, Post1-Post2, IQ, Vocabulary and Age over Items were not included because they did not improve the fit of the model (all p > .25). The model was run using the optimizer ‘bobyqa’ in the glmer control options to obtain model convergence. Odds ratios and 95% confidence intervals for each predictor, as well as the marginal and conditional R2 of the model, were estimated using the tab_model function from the sjPlot package (version 2.8.14; Lüdecke, Reference Lüdecke2018). The results are provided in Table 2.
The model revealed significant effects of Pre-Post and Post1-Post2, no effects of IQ, Vocabulary, or Age, and no significant interactions. These results indicate that children chose the correct picture more often after training (i.e., in post-test 1 and post-test 2) than before training (i.e., in the pre-test) and that they were more accurate in the delayed post-test than immediately after training. IQ and Vocabulary were not found to be significant predictors of performance in the comprehension task, and their effects did not significantly interact with Task Part. Coefficients for the correlations between IQ and Vocabulary, on the one hand, and accuracy with ORs in each of the three test parts (all non-significant), on the other hand, are provided in Supplementary Table S1.
Since participants experienced different delays between post-test1 and post-test2, we additionally assessed whether the number of days between sessions had an effect on participants’ accuracy with OR in post-test 2. We ran another mixed-effects logistic regression model on the data from post-test 2 only with Response as the dependent variable and Age, IQ, Vocabulary, and Delay (i.e., number of days between sessions), all centered and scaled, as predictors. This analysis showed that the effect of Delay was not significant (b = 0.33; z = 1.36; p = .17), thus indicating that the number of days between the two post-tests did not appear to have an impact on participants’ performance in post-test 2.
4. Discussion
In this study, we examined the comprehension of Spanish SRs and ORs by 6- to 7-year-old Spanish-speaking children. In particular, we aimed to assess whether children who had not yet mastered the grammatical structure of Spanish ORs would improve in their comprehension of this grammatical construction if they were provided with short but intensive training focusing on the semantic contrast between ORs and SRs. Furthermore, we investigated the extent to which individual differences in the comprehension of sentences containing ORs were related to vocabulary size and non-verbal IQ. In the following subsections, we discuss (i) the effect of training on the comprehension of ORs, (ii) the role of vocabulary size and non-verbal IQ in grammatical development, and (iii) the role of experience in grammatical development.
4.1. The effect of training on the comprehension of ORs
Our results show that a short training session involving the presentation of just 16 ORs resulted in substantial improvement in children’s comprehension of this structure. The 6- and 7-year-olds who participated in the study showed very high accuracies in picture selection when faced with SRs in all phases of the task, but their accuracy on ORs in the pre-test was very low and clearly below chance (27% correct). Accuracy improved to 60% correct immediately after training and further improved to 66% correct on the delayed post-test conducted several days afterward.Footnote 5
The fact that children improved so dramatically after training, that is, from pre-test to post-test 1, provides strong evidence against maturational and processing accounts of the late acquisition of ORs. If the difficulties observed during the pre-test were due to excessive syntactic complexity and/or limited processing resources at this specific point in development, then the relatively brief training could not have had such a sizable effect on their ability to comprehend ORs. By contrast, our results are more consistent with experience-based accounts.
In addition to the massive improvement in the comprehension of ORs which occurred between the pre-test and the immediate post-test, we also observed a smaller improvement in performance between the immediate post-test and the delayed post-test. There are two potential explanations for this. First, this improvement could be the result of sleep consolidation effects. A number of studies have shown that performance in learning tasks improves further when learning is reinforced by intervening sleep phases. This is the case for tasks targeting knowledge associated with procedural memory, such as motor sequences (e.g., Barakat et al., Reference Barakat, Carrier, Debas, Lungu, Fogel, Vandewalle, Hoge, Bellec, Karni, Ungerleider, Benali and Doyon2013), as well as with declarative memory, like, for example, novel word learning (Dumay & Gaskell, Reference Dumay and Gaskell2007). Therefore, in the present scenario, it could be the case that the intervening sleep periods between post-test 1 and post-test 2 reinforced the gains observed for the targeted grammatical structure. Alternatively, the additional improvement observed in post-test 2 could be due to children’s additional experience with relatives outside of the laboratory between post-test 1 and post-test 2: in other words, once children became sensitized to the difference between SRs and ORs during the training session, they were able to further entrench the structure through exposure to more ambiguous exemplars in naturalistic settings. Irrespective of the reason for this additional improvement, the fact that children were not only not worse in post-test 2 than in post-test 1 – or just as accurate, given that the materials were the same – but actually slightly better, suggests that the knowledge they acquired during training became integrated into their developing linguistic systems.
4.2. The role of vocabulary size and non-verbal IQ in grammatical development
The regression analysis did not reveal any significant effects of children’s vocabulary size, as measured by the Spanish PPVT, or non-verbal IQ, which was assessed by means of Raven’s CPM, neither on their own nor in interaction with the test part. While this outcome was unexpected to some extent given the findings of previous research (e.g., Bates et al., Reference Bates, Bretherton and Snyder1988; Roth et al., Reference Roth, Speece and Cooper2002; West et al., Reference West, Vadillo, Shanks and Hulme2018), we believe that it may have to do with the characteristics of our sample. As discussed in Section 3, the children that we tested were, as a group, well above the expected mean scores for their age for both the vocabulary and the IQ measures. Even though there is variation in the scores for both measures, this may not have been enough for relationships with grammatical proficiency with ORs to arise. Alternatively, it could be that the fact that there were so many participants with high scores and so few scoring below the expected mean obscured the relationship of interest.
In spite of this, it is worth noting that, for accuracy with ORs in the pre-test, the correlation with vocabulary approached significance (r(37) = .29, p = .07; see Supplementary Table S1) and then the correlation became smaller in the later parts, once participants had undergone the training phase. This suggests that vocabulary may still have played a role in influencing how well children could comprehend the OR sentences before receiving any instruction. This could be because vocabulary size can be considered a proxy for the quantity and quality of the input the learner has received (cf. Hurtado et al., Reference Hurtado, Marchman and Fernald2008) and children who had been exposed to more input and/or higher quality input may have had more opportunities to acquire both new words (i.e., a larger vocabulary) and new grammatical constructions, including ORs. This explanation is compatible with an experience-based account of children with ORs (e.g., Kidd et al., Reference Kidd, Brandt, Lieven and Tomasello2007; Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007) and aligns with the broader usage-driven perspective that additional experience should crucially facilitate the acquisition of the target construction.
4.3. The role of language experience and instruction in grammatical development
Above, we concluded that our results appear to support experience-based accounts of the difficulties that children have with ORs (e.g., Kidd et al., Reference Kidd, Brandt, Lieven and Tomasello2007; Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007). However, a key question for these accounts is how exactly one should operationalize language experience. Broadly speaking, relevant experience is usually quantified as the frequency of occurrence of the target construction in the input. Yet, in this case, can frequency alone account for the late acquisition of ORs (vs. SRs)?
Previous research on English (Reali & Christiansen, Reference Reali and Christiansen2007; Roland et al., Reference Roland, Dick and Elman2007) has shown that in spoken language, ORs are actually slightly more frequent than SRs. To obtain similar data for Spanish, we conducted a small-scale corpus study using data from the BecaCESNo corpus (Benedet et al., Reference Benedet, Cruz, Carrasco and Snow2004) in TalkBank. We excerpted 15-minute samples from transcripts of conversations with 20 Spanish children aged from 5;9 to 7;5 recorded in a home setting (i.e., 5 hours of speech in total) and automatically extracted all adult utterances containing the subordinating conjunction que from this subcorpus. These were then manually coded as SRs, ORs, or ‘other’. This procedure yielded 20 ORs and 16 SRs. Extrapolating from these figures, we estimate that Spanish children hear, on average, about 4 ORs and 3.2 SRs per hour. Thus, it is clear that differences in the frequency of these two constructions alone cannot explain the asymmetry between SRs and ORs.
Another apparent problem for an account based on sheer frequency is that, while relative clauses are rather infrequent in comparison with basic structures such as interrogatives or argument structure constructions, they are not that rare in absolute terms. If we assume that children are exposed to language for 8 hours per day, which is a rather conservative estimate, this means that between the ages of 2;7, when the development of complex syntax may be said to begin in earnest,Footnote 6 and 6;7 (the mean age of our participants), they will have heard about 47000 ORs (4 years × 365 days × 8 hours × 4 tokens) and about 37000 SRs. The number of exemplars that they heard during training, and even during the experiment as a whole, is minuscule compared to these figures. Why, then, had they not acquired ORs earlier? Or to phrase the question differently, what was it about the additional experience during training that triggered learning while preceding exposure did not?
Let us begin with the reasons for the later acquisition of ORs compared to SRs. As mentioned in the introduction, one important difference between SRs and ORs is that the former have the same word order as ordinary transitive sentences. Thus, on a more general level, the match between the SR and the canonical and well-entrenched SVO pattern automatically increases the frequency counts of structures like SRs and likely boosts performance on this construction. Conversely, this advantage for SRs may result in detrimental interference on performance with ORs (Arosio et al., Reference Arosio, Adani, Guasti, Brucart, Gavarró and Solà2009; del Río et al., Reference del Río, López-Higes and Martín-Aragoneses2012; Labelle, Reference Labelle1990). Secondly, as discussed in the Introduction, even if ORs were encountered as often, or even more often than SRs, ORs in spoken language tend to be fairly stereotypical in that they usually have inanimate heads and pronominal or null subjects (Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007; Roland et al., Reference Roland, Dick and Elman2007). This is likely to facilitate the acquisition of low-level schemas for this specific configuration (e.g., la INANIMATE (a la) que (PRON) TR.VB and el INANIMATE (al) que (PRON) TR.VB) but, at the same time, may slow down the acquisition of a more general construction that would also subsume sentences of the type we used in our task, which had full NPs designating animate participants in both the subject and the object position.
These observations lead us to an important conclusion for experience-based accounts of OR acquisition, as well as for usage-based models of language acquisition in a more general sense: to explain the order of acquisition, we need to consider not just the frequency of the constructions that we are interested in, but also the frequency of other constructions that may facilitate or interfere with the acquisition of the target construction and the lexical properties of the exemplars of the construction in the input (see Diessel, Reference Diessel, Givón and Shibatani2009, or similar observations).
Turning to the second question, namely, what was so special about the experience provided during the training session, three possible explanations come to mind. One possibility is that the experimental input and naturalistic input differ with regard to the distribution of exemplars in time. In the training phase of our experiment, the children heard a number of exemplars during a very short period (about 10 minutes), while in real life, the relevant exemplars are distributed more evenly over longer stretches of time. However, this is unlikely to be the reason for the different outcomes, as the available evidence suggests that, for linguistic and non-linguistic material alike, spaced exposure leads to more efficient learning than massed exposure (cf. Ambridge et al., Reference Ambridge, Theakston, Lieven and Tomasello2006; Miles, Reference Miles2014) – in other words, our training is likely to have been even more effective had it been spread out over a number of days.
A more plausible explanation is that, as discussed above, the knowledge that our participants had acquired from naturalistic input was relatively specific: in other words, it comprised a construction or set of constructions which would enable them to process ORs with inanimate heads and pronominal or null subjects, but not a fully general construction which would apply to non-canonical sentences of the kind that they encountered in the experiment. There is considerable evidence that children are conservative learners in the sense that they ‘hug the data’; that is to say, they stick closely to patterns that they have heard (Goldberg, Reference Goldberg2006; Tomasello, Reference Tomasello2000, Reference Tomasello, Gentner and Kuczaj2003) and consequently acquire a set of low-level patterns rather than a single construction that captures their commonality. In some cases, such low-level patterns appear to persist into adulthood (cf. Dąbrowska, Reference Dąbrowska2008a, Reference Dąbrowska2008b; Dąbrowska et al., Reference Dąbrowska, Rowland and Theakston2009).
A third possibility is that the experimental training was more effective because it explicitly contrasted SRs and ORs and invited children to attend to how the difference in form corresponds to a difference in meaning in its first phase (i.e., when they repeated after the experimenter) and provided feedback on accuracy in its second phase. Explicit instruction and attention to form are known to facilitate the acquisition of grammar in adult L2 learners (Goo et al., Reference Goo, Granena, Yilmaz, Novella and Rebuschat2015; Norris & Ortega, Reference Norris and Ortega2001; Spada & Tomita, Reference Spada and Tomita2010), but are generally assumed to be irrelevant for language learning in children, which is thought to rely on implicit learning mechanisms. However, there are reasons to consider that attention and explicit reasoning are also relevant for child language learning. As pointed out earlier, correlations have been observed between grammatical proficiency and IQ in both adults and children (Dąbrowska, Reference Dąbrowska2018; Houwen et al., Reference Houwen, Visser, van der Putten and Vlaskamp2016; Roth et al., Reference Roth, Speece and Cooper2002; West et al., Reference West, Vadillo, Shanks and Hulme2018), as well as between L1 grammatical proficiency and (foreign) language aptitude (Dąbrowska, Reference Dąbrowska2018; Llompart & Dąbrowska, Reference Llompart and Dąbrowska2023; Skehan & Ducroquet, Reference Skehan and Ducroquet1988). Furthermore, several artificial language learning studies have shown that the acquisition of some (but not all) grammatical patterns appears to be related to conscious awareness (e.g., Kenanidis et al., Reference Kenanidis, Dąbrowska, Llompart and Pili-Moss2023) and that children in the same age group as our participants benefit from explicit instruction just like adults (Ferman & Karni, Reference Ferman and Karni2014; Lichtman, Reference Lichtman2016).
Since our training involved increased experience with the more complex and less frequent structure as well as explicit attention to form by means of contrast and explicit feedback, we cannot discriminate between these two explanations on the basis of the data reported here. However, there is some indirect evidence that seems to argue in favor of the second account. First, although the vast majority of the ORs that children have heard in naturalistic contexts are most likely instances of the more specific construction, they are likely to have heard some tokens with less canonical structures, and yet they did not acquire a fully general OR construction. Furthermore, training studies with adult participants critically point toward the idea that explicit attention to the form–function mapping can result in rapid gains in comprehension (e.g., Street & Dąbrowska, Reference Street and Dąbrowska2010), while extensive exposure without such focus can speed up processing but does not necessarily improve comprehension accuracy (Wells et al., Reference Wells, Christiansen, Race, Acheson and MacDonald2009).
5. Conclusion
In sum, the present study showed that a brief training session with additional experience including explicit contrast and feedback resulted in a dramatic improvement in the comprehension of Spanish ORs by 6- to 7-year-old Spanish children. This suggests that the difficulties associated with this construction and the late acquisition thereof should be attributed to the quality and quantity of children’s experience with it rather than to maturational factors. Thus, our findings are compatible with experience-based accounts of OR acquisition (e.g., Kidd et al., Reference Kidd, Brandt, Lieven and Tomasello2007; Reali, Reference Reali2014; Reali & Christiansen, Reference Reali and Christiansen2007) and more broadly with usage-based models of language acquisition (Bybee, Reference Bybee2010; Goldberg, Reference Goldberg2006; Tomasello, Reference Tomasello2000).
Our findings also suggest that learning situations in which children are invited to explicitly attend to details of form and meaning may be particularly beneficial for acquisition, at least for relatively difficult structures such as ORs. The learning that occurs in this kind of setting involves conscious reflection about form and effortful reasoning – processes that one associates with instructed second language learning rather than first language acquisition, which is generally assumed to rely entirely on implicit learning, especially when it comes to grammar. While we do not wish to deny that implicit learning plays a central role in child language acquisition, our results, combined with earlier findings that linked individual differences in grammatical development to IQ, executive functioning, and language aptitude, suggest that explicit learning abilities also play a role. Further research will be necessary to determine exactly how and when such mechanisms are involved.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/langcog.2023.69.
Data availability statement
The dataset analyzed in this article and the code to reproduce the analyses reported are available at https://osf.io/83abx/ (Open Science Framework).
Acknowledgments
We thank Balma Laseo for her help with participant testing, Colegio San Antonio María Claret for their willingness to collaborate with us, and the participating children and their parents for their interest and for contributing to research in language acquisition.
Funding statement
This project was funded by an Alexander von Humboldt Professorship (ID-1195918) awarded to the second author.
Competing interest
The authors have no competing interests to declare.