INTRODUCTION
Narrative skills can be defined as the ability to tell a cohesive and structured story (either about something that has happened, or describing a book or plot) that gets the main points across so the listener can understand. In preschool years, this ability is considered to be an important predictor of later development of literacy, as documented by longitudinal studies (Snow, Tabors, Nicholson & Kurland, Reference Snow, Tabors, Nicholson and Kurland1995; Wellman, Lewis, Freebairn, Avrich, Hansen & Stein, Reference Wellman, Lewis, Freebairn, Avrich, Hansen and Stein2011). Specifically, narrative skills are thought to be related to reading comprehension (Cain & Oakhill, Reference Cain and Oakhill1996); there are common underlying processes involved in both tasks, such as understanding of a story grammar, but also indexes of broader oral language skills, including lexical and grammatical knowledge. The analysis of the development of narrative skills has recently been examined more thoroughly in studies that targeted bilingual populations (Roch & Florit, Reference Roch and Florit2013; Roch, Florit & Levorato, Reference Roch, Florit and Levorato2016; Squires, Lugo-Neris, Peña, Bedore, Bohman & Gillam, Reference Squires, Lugo-Neris, Peña, Bedore, Bohman and Gilliam2014). According to Hudson and Shapiro (Reference Hudson, Shapiro, McCabe and Peterson1991), narrative skills require the following: conceptual knowledge, i.e. a general knowledge of the world; linguistic knowledge, which allows one to produce a correct text from a lexical, syntactic, and semantic point of view; structural knowledge, that is, to know the structural components suitable for each narrative genre; and, finally, pragmatic knowledge, which refers to the point of view of the narrator and his/her knowledge about the story, in the specific context in which it takes place. The complex multidirectional interactions among various levels of linguistic competence underlying the narrative skills makes it hard to disentangle the contribution of each component in typically developing monolingual children. On this matter, bilingualism offers several advantages as an entry point to the study of narrative skills, because, although a number of studies have demonstrated that bilingualism is not a risk factor for the development of language delay (Paradis, Genesee & Crago, Reference Paradis, Genesee and Crago2011), low scores are often obtained for vocabulary measures in L2 by bilinguals, although, compared to monolinguals, they may develop better metalinguistic skills that represent an important resource for general language and cognitive development (Bialystok, Reference Bialystok2009).
Measures of narrative skills are highly ecological tasks that address how language is expressed in everyday life, and thus represent an intriguing research area for the comprehension of the relationship between different levels of linguistic processing. This becomes particularly interesting in children who speak more than one language. Moreover, the structure of children's narratives is relatively invariant across languages (Fiestas & Peña, Reference Fiestas and Peña2004; Gutiérrez-Clellen, Reference Gutiérrez-Clellen2002; Pearson, Reference Pearson, Oller and Eilers2002; Squires et al., Reference Squires, Lugo-Neris, Peña, Bedore, Bohman and Gilliam2014). Narratives allow for a parallel assessment of linguistic skills that are not susceptible to the disadvantages of standardized tests, namely, scoring based on typical monolingual performance (Simon-Cereijido & Gutiérrez-Clellen, Reference Simon-Cereijido and Gutiérrez-Clellen2009).
Narrative skills have been widely investigated in school-aged children from both monolingual (e.g. Berman & Slobin, Reference Berman and Slobin1994; Peterson & McCabe, Reference Peterson and McCabe1983) and bilingual (Gutiérrez-Clellen, Reference Gutiérrez-Clellen and Goldstein2004; Pearson, Reference Pearson, Oller and Eilers2002) populations, while fewer studies focussed on bilingual preschool children (Fiestas & Peña, Reference Fiestas and Peña2004; Uccelli & Páez, Reference Uccelli and Páez2007; Iluz-Cohen & Walters, Reference Iluz-Cohen and Walters2012; Roch & Florit, Reference Roch and Florit2013). Considering the multi-faceted nature of narratives, the present paper is aimed at comparing linguistic skills and the quality of narrative production in bilingual and monolingual children. In order to better understand the underlying idea of the current work, the existing literature on the development of narrative skills in monolingual, and, subsequently, in bilingual, children is reported.
Research on the development of narrative competence in monolingual children
Between four and ten years of age, children's ability to produce and comprehend complex texts increases progressively (e.g. Florit, Roch & Levorato, Reference Florit, Roch and Levorato2014; Lever & Sénéchal, Reference Lever and Sénéchal2011; Pearson, Reference Pearson, Oller and Eilers2002), and story-telling is a task that is often used by language specialists and researchers because it allows them to detect a variety of information regarding children's development in language, lexical knowledge, and level of acquisition of grammatical and morphosyntactic structures (Botting, Reference Botting2002; Cleave, Girolametto, Chen & Johnson, Reference Cleave, Girolametto, Chen and Johnson2010; Fiestas & Peña, Reference Fiestas and Peña2004; Iluz-Cohen & Walters, Reference Iluz-Cohen and Walters2012), thus providing valuable information about children's linguistic and metalinguistic knowledge (Miller, Heilmann, Nockerts, Iglesias, Fabiano & Francis, Reference Miller, Heilmann, Nockerts, Iglesias, Fabiano and Francis2006; Squires et al., Reference Squires, Lugo-Neris, Peña, Bedore, Bohman and Gilliam2014).
Studies that have assessed developmental trajectories of narrative skills have often used picture series such as, for example: Frog, where are you? (Mayer, Reference Mayer1969), the Bus Story Test (Renfrew, Reference Renfrew1997), and The Picnic Story (Tavano & Biancuzzi, Reference Tavano and Biancuzzi2008). Using picture elicitation tasks to investigate narrative skills encourages the child to talk, thus allowing the observer to analyze vocabulary and grammatical structures that are present in children's discourse from an early age. In addition, the cognitive load is reduced, thanks to the presence of the pictures: children are not required to recall past events, as in the case of personal narratives, but they can produce stories by simply connecting and describing different images, or they can make the story richer and more sophisticated by adding details and structural elements. With this method it is possible to obtain information as to the level of language acquisition and the use of linguistic and discursive forms that are typical of the child's own language (Ervin-Tripp, Reference Ervin-Tripp1989).
Monolingual children between the ages of three and four have been found to produce a higher number of simple than complex clauses, with the latter being primarily causal clauses (D'Amico, Albano, Marano & Devescovi, Reference D'Amico, Albano, Marano and Devescovi2008). This provides interesting insight, in the light of the importance of causality in narrative macro-structure. At the end of preschool age, typically developing children have acquired a basic knowledge or schema of stories (Trabasso, Stein & Johnston, Reference Trabasso, Stein, Johnson and Bower1981) and are able to produce narratives that include more than two sequentially connected elements. They tend to place too much emphasis on specific details depicted in pictures, however, lacking a general sense of discourse organization; furthermore, their stories often end without proper conclusions (Peterson & McCabe, Reference Peterson and McCabe1983). In other words, the narratives at this stage lack a general coherent representation. At about six years of age, children begin to produce narratives containing a sequence of events that are purpose oriented, and the story grammar of narratives improves by the inclusion of initiating events, goal-directed actions, and consequences (Berman, Reference Berman1988). During the school-age years, narrative skills continue to develop, and children include elements related to decontextualized language in their narrative productions (Stein & Glenn, Reference Stein, Glenn and Freedle1979), with older children being able to include more events, mental states, and temporal and causal relations between events (Curenton & Justice, Reference Curenton and Justice2004). According to D'Amico et al. (Reference D'Amico, Albano, Marano and Devescovi2008), at around nine to eleven years of age the number of simple and complex clauses used tend to become stable and, at this time, children attain mature narrative skills, which will be further refined during the next few years.
Some studies have investigated the relationship between linguistic knowledge and narrative skills, showing that lexical and grammatical skills influence children's acquisition of narrative proficiency (Berman & Slobin, Reference Berman and Slobin1994; Bishop & Donlan, Reference Bishop and Donlan2005). Hipfner-Boucher, Milburn, Weitzman, Greenberg, Pelletier, and Girolametto (Reference Hipfner-Boucher, Milburn, Weitzman, Greenberg, Pelletier and Girolametto2014) investigated the relationship between measures of oral narrative structure and measures of phonological awareness (blending and elision) in preschoolers, and they found an association between narrative structure and phonological awareness, and between vocabulary and phonological awareness. The authors explain these relationships by suggesting common structural (relating parts to a whole) and processing (representational analysis and control of attention) demands that underlie both narrative skills and phonological awareness. There is, however, a paucity of research that specifically addresses the relationship between linguistic knowledge and narrative skills in preschool monolingual children.
Research on linguistic and narrative competence in bilingual children
Bilingual children, here defined, for the need of synthesis, as those children who speak or are exposed to two languages in everyday life (De Lamo White & Jin, Reference De Lamo White and Jin2011), may differ in linguistic competence in their second language (L2), based on age and quality of exposure (Kovelman, Baker & Petitto, Reference Kovelman, Baker and Petitto2008; Paradis et al., Reference Paradis, Genesee and Crago2011). Children who are exposed at school to a language that is different from their home language may show delays in some aspects of their L2 linguistic knowledge compared to their monolingual peers (August & Shanahan, Reference August and Shanahan2006), particularly in vocabulary knowledge (Lindsey, Manis & Bailey, Reference Lindsey, Manis and Bailey2003; Manis, Lindsey & Bailey, Reference Manis, Lindsey and Bailey2004), but may be as competent as their monolingual peers in other cognitive and linguistic skills (Bonifacci, Giombini, Bellocchi & Contento, Reference Bonifacci, Giombini, Bellocchi and Contento2011; Bonifacci & Tobia, Reference Bonifacci and Tobia2016). International guidelines (ASHA, 1985, 2004) on clinical assessment in multilingual contexts strongly suggest that the assessment of bilingual performance in L2 should be accompanied by an evaluation of L1 linguistic skills, because highlighting weaknesses in L2 does not necessarily mean that the same pattern stands for L1 linguistic competencies. Considering the difficulties in assessing L1 competencies, particularly for language-minority children, a vast body of literature has been aimed at studying linguistic competencies in L2, in order to achieve information regarding a typical bilingual developmental trajectory. However, to date, the evidence collected is contextualized to L2 proficiency and cannot be extended to general (L1 and L2) linguistic development. Nevertheless, the analysis of the sole L2 developmental trajectory is useful in that it provides information about the instructional language; therefore, identifying patterns of strengths and weaknesses in L2 may help to identify potential risk factors for language delays. In the study run by Paradis, Schneider, and Sorenson Duncan (Reference Paradis, Schneider and Sorenson Duncan2013), a combination of measures in L2, excluding vocabulary, discriminated between bilingual children with and without Specific Language Impairment (SLI); nonword repetition and tense morphology were the strongest discriminators, together with the parental questionnaire that addressed first-language development. Furthermore, as specified by Paradis (Reference Paradis2016) when assessing competencies in L2, story-telling tasks and measures of cognitive processing such as nonword repetition might be more sensible tasks to use when identifying language delays in bilinguals than norm referenced linguistic assessments.
Based on the assumption that narrative skills are influenced, but not limited to, linguistic knowledge, a number of studies have specifically addressed the issue of quality of narratives and development of narrative skills, both in L1 and L2, in bilingual populations.
As in monolinguals, the ability to produce narratives in both L1 and L2 develops from preschool to school age (Gutiérrez-Clellen, Reference Gutiérrez-Clellen2002; Pearson, Reference Pearson, Oller and Eilers2002; Squires et al., Reference Squires, Lugo-Neris, Peña, Bedore, Bohman and Gilliam2014; Uccelli & Páez, Reference Uccelli and Páez2007). Gutiérrez-Clellen (Reference Gutiérrez-Clellen2002) highlights that even children with limited expertise in one of the two languages may be able to produce stories in their weaker language with an adequate narrative structure and a good global quality. Squires et al. (Reference Squires, Lugo-Neris, Peña, Bedore, Bohman and Gilliam2014), found that, at a macro-structural level, bilingual children with typical development become progressively more able to produce structured and complex narratives in their L2, with L1 macro-structure predicting macro-structure in L2. However, the authors suggest that children had to independently learn the nuances of each language in order to be able to use them, with a minor degree of language-transfer in micro-structure. An additional suggestion made by the authors is that only children who reached a good level of knowledge in narrative structure would be able to include more advanced literate language forms in their productions.
Uccelli and Páez (Reference Uccelli and Páez2007) analyzed vocabulary and narrative skills of low socioeconomic status (SES) Spanish–English bilingual children, who were asked to tell a story in Spanish (L1) and English (L2). They found that bilinguals showed, both in their L1 and L2, a poorer vocabulary compared to monolinguals (Spanish and English), although the bilinguals' vocabulary significantly improved between the last year of kindergarten and the first year of primary school. A similar effect of poorer vocabulary in both L1 and L2 was found in a study by Páez, Tabors, and López (Reference Páez, Tabors and López2007), and, as reviewed by Bialystok (Reference Bialystok2009), this pattern seems to be consistent across the lifespan, even for simultaneous bilinguals. The weaknesses in bilinguals' vocabulary scores have recently been challenged by some studies conducted on French–English bilinguals in Canada. There, the bilingual disadvantage in vocabulary scores was found neither in L1 nor in L2, suggesting that a favourable language-learning environment (Thordardottir, Reference Thordardottir2011) and socio-cultural context (Smithson, Paradis & Nicoladis, Reference Smithson, Paradis and Nicoladis2014) might significantly modulate vocabulary development in bilingual populations. Despite conflicting results in vocabulary skills assessed in either L1 or L2, it is acknowledged that bilinguals may equal or exceed monolinguals' knowledge when considering the ‘total conceptual vocabulary’, that is, the total number of words that a child knows in one and/or the other language (e.g. Pearson, Fernández & Oller, Reference Pearson, Fernández and Oller1995). Concerning narrative level, Uccelli and Páez (Reference Uccelli and Páez2007) showed that bilingual children progressively improved, from kindergarten to first grade in their L1 (story scores) and in their L2 (story scores and language scores). The story score measure was mainly intended as a macro-structural parameter, and it included three parameters: story elements, story sequence, and perspective. On the other hand, the language score was principally aimed at assessing micro-structural indexes related to linguistic aspects, and it included syntactic and lexical measures.
Fiestas and Peña (Reference Fiestas and Peña2004) analyzed the stories produced by preschool bilingual children who speak Spanish at home and English at school; they used a measure of story complexity by counting how many story elements (setting, initiating event, internal response, plan, attempt, consequence, and ending) were included in the children's narratives. The productivity score included number of clauses, number of words, and Mean Length of Utterances (MLU). The authors showed that children produced, in both English and Spanish, a similar pattern of story complexity and productivity. The differences concerned the inclusion of specific elements of the story grammar: in Spanish, the children included mainly the trigger events and attempts, while in English the consequences were also present. The authors interpreted these differences with reference to cultural influences on narrative contextualization and also to the different exposure to the vocabulary of story-telling in school and at home.
Iluz-Cohen and Walters (Reference Iluz-Cohen and Walters2012) collected narratives by asking children to tell stories (either in English or in Hebrew) after looking at picture stimuli from English and Hebrew children's books; the authors observed that the morphosyntactic and lexical measures were related to each other, but not to measures of the story grammar, demonstrating the independence of narrative structure and linguistic skills used for producing the narratives. In the study by Gagarina et al. (Reference Gagarina, Klop, Kunnari, Tantele, Välimaa, Balčiūnienė, Bohnacker and Walters2012), conducted for the development of the Multilingual Assessment Instrument for Narratives (MAIN), it was found that bilingual children had similar skills in the story structure of narratives in both languages. According to the authors, this result might be explained as an effect of a transfer process and sharing of competencies between L1 and L2, especially because story grammar skills can be considered as common underlying competencies with an interface to both cognition and language processing. Similar conclusions were proposed by Kupersmitt and Berman (Reference Kupersmitt, Berman, Verhoeven and Stromqvist2001), Schwartz and Shaul (Reference Schwartz and Shaul2013), Viberg (Reference Viberg, Verhoeven and Stromqvist2001), and Pearson (Reference Pearson, Oller and Eilers2002), who suggested that mental representations of story grammar could transfer from one language to the other, but that this transfer is primarily related to a conceptual level, which is available in the processing of L2, once a linguistic threshold has been reached.
In summary, previous studies showed that in both bilingual and monolingual children the macro-structure of the stories in narrative tasks improves as children progress from kindergarten to primary school, therefore highlighting an improvement in the overall quality of the narrative. On the other hand, lexical and morphological skills, if observed in bilinguals' L2, might show differential trajectories compared to those of monolingual children, with bilingual children often lagging behind their monolingual peers even after more than three years of exposure to the L2 (Paradis, Reference Paradis2016). There are, however, only a few studies that have analyzed the relationship between L2 linguistic competence and narrative productions in bilingual populations.
The present study
Rationale: The main contribution of the current work is to provide additional evidence on the assessment methods of bilingual children, particularly as far as narrative competence is concerned. Although best practice would suggest, when possible, assessing bilingual children in both languages, very often this is not possible because of the great variety of L1 spoken by bilinguals. Therefore, it is relevant to identify specific valid measures in which bilinguals might reach monolingual-like levels of linguistic proficiency, and more specifically, to bear in mind that narratives represent ecologically valid and rich linguistic data while avoiding monolingual test norms, which might also suffer from cultural biases. The main focus of the present paper is to investigate whether bilingual children, as compared to their monolingual peers, might show similar narratives as far as the macro-structure is concerned, even in the case of lower-level lexical and morphosyntactic skills. The main hypothesis is that bilinguals who have poorer linguistic abilities (in their weaker language) than monolinguals of the same age may show similar abilities to monolinguals in higher-order measures, such as the ability to produce a story using story grammar elements. The second aim was to investigate the predictive role of linguistic knowledge in the instructional language in macro-structural parameters of narrative skills in the same language, separately for the two populations considered, including both standardized measures and parameters obtained directly from narrative production.
The specific research questions addressed in the present study were the following:
- 
                        
                        1. At preschool age, are there differences between monolingual and bilingual children in the structure of the narratives produced, namely in the macro-structure and in the linguistic skills used to produce a narrative, that is, the micro-structure? Considering the literature reviewed, we hypothesized a difference between bilinguals and monolinguals in L2 lexical skills, and therefore in indexes of lexical variety. We did not expect bilingual and monolingual children to show any differences in macro-structural indexes of linguistic production. 
- 
                        
                        2. Which are the best linguistic predictors of narrative production in bilingual and monolingual populations? This question refers to an exploratory aim of the study, since few data are available in the literature in relation to this. The main hypothesis is that in bilinguals, thanks to transfer processes of mental representation and knowledge of story structure that are shared by the two languages, macro-structure parameters should be relatively independent from linguistic knowledge in L2, whereas this relationship should be more consistent for the group of monolinguals. 
METHOD
Participants
Screening phase
The study battery was initially administered, for screening purposes, to a group of 290 children whose parents signed the informed consent. The children were recruited from mainstream kindergartens where the Laboratory for the Assessment of Learning Disabilities (LADA) of Bologna University's Department of Psychology (Italy) was running the LOGOS project, which is aimed at early identification and intervention in indexes of learning difficulties.
A total of 128 children (mean age: 4;8, SD: 0;3) who fulfilled the selection criteria described below were selected from this larger group, and took part in the study.
Based on the inclusion criteria for the bilingual sample, 61 of the original 290 participants were excluded because they did not fulfil the requirements (i.e. they had one Italian parent). For both the bilingual and monolingual samples, a further inclusion criterion was adopted, that is, only the children who correctly rearranged the story picture used in the narrative task (see below for details) were included. A total of 101 children were excluded from the initial larger group based on this criterion, with no significant differences between the monolingual (n = 48) and bilingual group (n = 53) (χ 2(1) = 0·138, p = ·71).
In the final sample of 128 participants, children were divided into two groups:
- 
                           
                           1. A group of 64 bilingual children (Bs) (mean age: 4;8, SD: 0;3, Range: 4;2–5;3; 31 females, 33 males), speaking minority languages at home. Mean Age Of Exposure (AoE), defined as the age when a bilingual child first begins to receive intensive, systematic, and maintained exposure to his/her new language (Kovelman et al., Reference Kovelman, Baker and Petitto2008), was 1;6 (SD: 0;8). All of the children started kindergarten in Italy, and 72% of them also attended nursery school. All of the children could be considered as early bilinguals, because they were all exposed to the Italian language before age three (Kovelman et al., Reference Kovelman, Baker and Petitto2008; Bellocchi, Bonifacci & Burani, Reference Bellocchi, Bonifacci and Burani2016), as established by consulting their entry into the school system and by collecting information from class teachers. There were twelve main native language groups: Tagalog (10·9%), Urdu (10·9%), Arabic (15·6%), Romanian (28·1%), Bengali (9·4%), Spanish (9·4%), Chinese (4·7%), Russian and Hindi (3·1% each), Albanian, French and Polish (1·6% each). The selection criteria were: - 
                                    
                                    • Exposure to an L1 other than Italian (L2) within the family context; 
- 
                                    
                                    • Having both parents speaking an L1 other than Italian at home; 
- 
                                    
                                    • Being born in Italy or having arrived in the first year after birth; 
- 
                                    
                                    • Having been exposed to Italian before age three; 
- 
                                    
                                    • Not having been referred to neuropsychiatric units for any range of developmental disorder, or sensory or neurological impairment. 
 2. One group of 64 Italian Monolinguals (Ms) (mean age: 4;8, SD: 0;3, Range: 4;1–5;3; 35 females, 29 males) was matched with the bilinguals for chronological age (t(126) = –1·6, p = ·11), sex (χ 2(1) = 0·5, p = ·49), nursery programme attendance (χ 2(1) = 2·2, p = ·14). Where possible, children were selected from the same classes or, alternatively, from schools in the same district. The matching for nursery programme attendance (which in Italy is a service that is provided through payment of a fee) permitted moderate control over the socioeconomic status of the group, whereas the fact that participants were recruited within the same classes or school district provided a marginal control of educational exposure, considering, among other aspects, that all the teachers were enrolled in the LOGOS Project, which provides teacher training and sharing of didactic strategies. For the monolingual group inclusion criteria were: being born in Italy from Italian-speaking parents and not being exposed to any other foreign language at home. None of the children had been referred for any range of developmental disorder, or sensory or neurological impairment. 
- 
                                    
                                    
The Italian preschool programme is a three-year programme that involves children from three to six years. During these preschool years formal instruction regarding literacy skills is not provided, although children are involved in activities that are aimed at improving socialization and cognitive and linguistic development.
Materials
Learning Difficulties Indexes battery (IDA; Bonifacci, Pellizzari, Giuliano & Serra, Reference Bonifacci, Pellizzari, Giuliano and Serra2015). This test battery has been developed to assess a wide range of linguistic skills in preschoolers and is composed of six tasks that measure articulation and vocabulary, phonological awareness, morphosyntactic comprehension, phonological memory, story sequencing, and letter knowledge. The internal consistency reliability values reported refer to the normative sample (N = 1416). Normative data are reported for four 6-month age intervals (4;0 to 5;11) and, for each subscale, z-scores were calculated based on mean values reported by the test's manual. Reliability indexes, as reported in the test's manual, refer either to Cronbach's alpha or to Kuder–Richardson Formula 20 (KR-20) value. The battery is composed of the following subscales:
Vocabulary
Children were asked to name thirty-six images selected for decreasing frequency in the spoken language (Barca, Burani & Arduino, Reference Barca, Burani and Arduino2002). The accuracy score, ranging from 0 to 36, was considered. The scale's KR-20 was ·85. This subtest also allows for an evaluation of speech sound skills, testing the fifty-two main sounds of the Italian language (single phonemes or consonant groups).
Phonological awareness
The battery included four different subtests aimed at assessing phonological awareness: syllable blending (e.g. to-po → topo ‘mouse’) (6 items); syllable segmentation (carota → ca-ro-ta ‘carrot’) (6 items); first syllable recognition ( cane– casa ‘dog–house’) (4 items); and rhymes (porta–torta ‘door–cake’) (4 items). The stimuli for the first two tasks were presented orally and children were required to give a verbal answer. For the second two pairs of tasks children were presented with a target picture and were required to choose, from among four pictures, which one started or ended with the same sounds (i.e. syllable). Each item received a score of 1 for a correct response and a score of 0 for an incorrect answer, for a maximum total score of 20. The scale's KR-20 was ·84.
Morphosyntactic comprehension
Children were presented with three pictures representing three different scenarios. For each picture, they were asked to identify or manipulate elements of the scene by comprehending different types of sentences pronounced by the examiner (e.g. the child had to correctly place a card depicting a book after hearing a sentence such as “The book is under the pillow”). The morphosyntactic structures investigated were: singular/plurals (4 items), locatives (7 items), active/passive (4 items), and relative clauses (3 items). A total of eighteen sentences were presented, and for each of them a score of 2 (correct answer at first attempt), 1 (correct answer at second attempt), or 0 (wrong answer), was given. The total score, ranging from 0 to 36, was considered. The scale's Cronbach's alpha was ·70.
Phonological memory
This is a nonword repetition task in which children are presented with eight nonwords: two 2-syllable, two 3-syllable, two 4-syllable, and two 5-syllable words. Children are told that the researcher is going to say some ‘magic words’, and are asked to repeat them exactly. A score of 2 was given for perfectly repeated words, while a score of 0 was given for repetition errors in any of the sounds (vowels and consonants). Only in the case of mispronunciation of a sound due to a speech-sound articulation error was a score of 1 given. For example, if a child pronounced the word rana ‘frog’ as lana in the vocabulary task, showing a difficulty with the phoneme ‘r’, a repetition of the nonword fimedura as fimedula would be scored 1. If, however, a child said fimedula, but in the previous task he/she had shown the capacity to articulate the sound ‘r’ correctly, the score was 0. The total score ranged from 0 to 16, and the scale's Cronbach's alpha was ·72.
Letter knowledge
Children were presented with a picture of a train with one letter (from a to z) in each coach. The experimenter had to choose four letters within the child's name or from the first few letters of his/her surname, thus considered to be familiar letters, and four random letters that were not part of the name of the child, considered to be unfamiliar letters. Then the experimenter randomly indicated these letters in the train picture and the child was required to say the sound or the name of the letter. A score of 1 was given for each correct response, for a maximum score of 8. The scale's Cronbach's alpha was ·70.
Story sequencing
The task is composed of five pictures depicting a brief tale of a little dinosaur, named Dino, who is preparing a cake. The five cards contain the following main elements:
- 
                           
                           1. Image of Dino with an empty cart in front of a supermarket; 
- 
                           
                           2. Image of Dino inside the supermarket with cake ingredients in the cart; 
- 
                           
                           3. Image of Dino in the kitchen preparing the cake; 
- 
                           
                           4. Image of the cake near the oven; 
- 
                           
                           5. Image of Dino sitting at the table with the cake ready and another chair nearby. 
Each participant was presented with four pictures presented in the wrong order (fixed and predetermined: 4-2-5-3). Image number 1 was given as a prompt, explaining that Dino wanted to prepare a surprise for his friend Ida, and that he went to the supermarket. Then the child was asked to arrange the pictures in the correct order and tell the story aloud. In order to reduce the memory load when producing narratives, the pictures were left on the table during the story-telling.
A score of 1 was given for each picture that was in the correct place (maximum score: 4). The scale's Cronbach's alpha was ·82. Then, as specified in the next paragraph, narratives produced by the children were coded and scored to analyze linguistic expressive abilities. Some examples of narratives produced by monolingual and bilingual children are included in the ‘Discussion’ section. As previously mentioned, only the 128 children who rearranged the story correctly were included in the study, in order to consider only those narratives that were derived from the same story sequence; the number of children excluded (48 monolinguals and 53 bilinguals) was not statistically different (χ 2(1) = 0·138, p = ·71). The rationale that guided the exclusion of children who did not rearrange the story correctly was based on different considerations: the story production that followed an incorrect arrangement of the story would refer to a different story plot, and consequently would not be suitable to be encoded according to the macro-structural parameters used in the present study.
Narratives: coding and scoring
Based on literature analysis, a scheme for coding narrative productions was derived; most criteria were derived from the MAIN scoring rubric (Gagarina et al., Reference Gagarina, Klop, Kunnari, Tantele, Välimaa, Balčiūnienė, Bohnacker and Walters2012). The conceptual framework that guided the development of the MAIN was to include both macro-structural and micro-structural scoring parameters, the first based on story grammar, the second referring to linguistic features used in the construction of a coherent discourse in a specific language (Gagarina, Klop, Tsimpli & Walters, Reference Gagarina, Klop, Tsimpli and Walters2016).
MACRO-STRUCTURAL MEASURES
Number of macro-structural elements
Each story elicitation was coded based on six macro-structural elements. For each one, two independent experts identified a set of main points that received a score of 1 if included in the story-telling.
- 
                           
                           • Goal: celebrate (birthday), having the cake ready; 
- 
                           
                           • Attempt: shopping, preparing the cake, cooking; 
- 
                           
                           • Outcome: cake ready; 
- 
                           
                           • Mental states: waiting for Ida 
- 
                           
                           • Setting: supermarket, home, kitchen; 
Each macro-structural element produced by the child was scored 1, and this gave a maximum macro-structure element score of 10 for the story elicitation set. The maximum score consists of all possible elements that can potentially be elicited by the picture set.
Level of macro-structural complexity
Alongside the count of macro-structural elements, which represents a general and single quantitative measure of story production, the MAIN-scoring rubric allows researchers to score the complexity of story production. The complexity score ranges from low to high, based on the relationships between the macro-structural elements of Goal (G), Attempt (A), and Outcome (O): (i) low complexity implies the presence of two elements, namely one Attempt and one Outcome within the episodes of a story; (ii) medium complexity implies the presence of two elements, namely one Goal and one Attempt (GA) or one Goal and one Outcome (GO); this is considered more complex than low complexity because the characters' goals are mentioned by the children; and (iii) high complexity implies the presence of a complete sequence of three relevant elements, namely one Goal, one Attempt, and one Outcome (GAO). When the story production did not include at least one Attempt and one Outcome its complexity was classified as 0 (absence of complexity). For each participant, the highest level of complexity reached was identified. Absence, low, medium, and high complexity levels were respectively scored as 0, 1, 2, and 3.
MICRO-STRUCTURE MEASURES
Lexical level
To analyze the lexical complexity of narrative productions the following set of parameters was used:
- 
                           
                           • Total number of words included in the production; 
- 
                           
                           • Number of lexical errors: sum of paraphasias, nonsense words, unintelligible words, simplifications, repetitions. Lexical errors represent mistakes in vocabulary use (irrespective of the correct morphology of the words). Errors and omissions in morphemes were counted, irrespective of their vocabulary count. For example, attributing the determinate instead of indeterminate article is a morpheme error but not a lexical error. Saying ‘the bird’ (first mention) is a morpheme use error but not a lexical error. This measure is considered an index of lexical immaturity, that is, the difficulty in finding the appropriate word in the discourse, thus leading to more errors and hesitations; 
- 
                           
                           • Token: total number of correct words, thus excluding fillers (‘ehm’), false starts (‘she … she … ’), unintelligible words, paraphasia (talovo/tavolo ‘table/talbe’); 
- 
                           
                           • Type: number of different words within the production. This is calculated by subtracting all repeated words from the number of tokens; 
- 
                           
                           • Type/Token Ratio (TTR): This refers to the range of vocabulary for size of speech sample and is considered an index of lexical diversity, independently of sample size (Richards, Reference Richards1987). It has been used as a predictor of foreign language learning aptitude and attainment (Skehan, Reference Skehan1986). This measure was used because texts were of similar/the same length and therefore was considered to be sufficient for the evaluation of lexical diversity. It is calculated by the ratio between the number of correct words (Token) and the number of non-repeated words (Type). 
Morphosyntactic level
To analyze the quality of production at the morphosyntactic level the following indexes were considered:
- 
                           
                           • Proportion of errors and omission in free and bound morphemes on tokens; 
- 
                           
                           • Mean Length of Utterances (MLU). Calculated by dividing the total number of tokens by the number of sentences, defined, as suggested by Marini, Andreetta, del Tin, and Carlomagno (Reference Marini, Andreetta, del Tin and Carlomagno2011), on the basis of four hierarchical criteria: acoustic (the sentence is defined by the presence of breaks); semantic (the sentence is a semantic unit); grammatical (the sentence is a complete grammatical sentence in which its subordinate clauses are also included); and phonological (the sentence is delimited by the presence of a word that is interrupted, indicating a false start). 
Text cohesion level
Finally, three indexes were considered as referring to text cohesion:
- 
                           
                           • Number of principal clauses; 
- 
                           
                           • Number of coordinate clauses; 
- 
                           
                           • Number of subordinate clauses; 
- 
                           
                           • Proportion between the number of principal and complex (subordinate and coordinate) clauses. 
transcription and coding reliability. Two postgraduate psychologists, Italian monolinguals, were trained in the transcription and coding procedures by the first and last author. During task administration, the narrative productions were recorded manually by transcribing them accurately on the record form, and scored by one of the authors. All complete and intelligible utterances were included in the analyses. Uncertainties in coding were discussed and resolved by the research group. Nearly 30% of narratives (story-tellings), were randomly selected to be coded independently by another author.
Inter-rater agreement for transcriptions was expressed through Cronbach's alpha for each task and show an adequate level of reliability, as follows: macro-structural level, Cronbach's alpha = ·78; lexical level (Type, Token, Lexical errors, Total number of words), Cronbach's alpha = ·86; Morphosyntactic level (MLU), Cronbach's alpha = ·96; Text cohesion, Cronbach's alpha = ·99).
For the narrative task, mean raw scores were considered, as reported in Table 2.
Procedure
Children were familiarized with the examiners during ordinary school activities for approximately one hour. They were then tested individually in a quiet room in their school, in one session that lasted for approximately 30 minutes; breaks were allowed if the child showed signs of fatigue. The IDA battery was administered and scored by trained developmental psychology master's degree students and one PhD student. One author, who is a qualified clinical psychologist, supervised the scoring procedure. All parents gave informed consent prior to the beginning of the research. Ethical approval was obtained by the Ethics Committee of the University of Bologna for the LOGOS project, which involved the administration of the IDA Battery in a wide sample of children within the City Council of Bologna.
RESULTS
Issues related to chronological age and age of exposure to Italian language
Although the two groups did not differ significantly in age, we controlled for a possible effect of age because it is known that narratives undergo a significant period of development at the ages under scrutiny. Thus, all analyses were run including age as a covariate; age was generally non-significant as a covariate, thus we reported data from these analyses only when they differed from univariate analyses. Then, only for the bilingual group, further consideration was given to the Age of First Bilingual Exposure (AoE-L2), defined by Kovelman et al. (Reference Kovelman, Baker and Petitto2008) as the age when a bilingual child first began to receive intensive, systematic, and maintained exposure to his/her new language. Correlation analyses were run for the bilingual group, between AoE and IDA subscales (linguistic assessment), and between AoE and parameters from narratives. In the first set of analyses it emerged that AoE was significantly related to Morphosyntactic comprehension (R = 0·3; p = ·015) and, considering parameters from narratives, the only significant correlation was found between AoE and the number of utterances (R = –0·3, p =.012).
Linguistic assessment
Table 1 reports mean and SD values (z-scores) together with Cohen's d effect sizes for each linguistic variable. A Manova was run with Group (Bs, Ms) as main factor and z-scores of IDA subscales (vocabulary, phonological awareness, morphosyntactic comprehension, phonological memory, letter knowledge) as dependent variables. A main effect of Group emerged (F(5,122) = 16·7, p < ·001, η 2 = ·41), with bilinguals performing globally worse than monolingual peers. Univariate analyses highlighted the fact that monolinguals outperformed bilinguals in vocabulary, phonological awareness, and morphosyntactic comprehension (all ps < ·001), but the two groups did not differ in phonological memory (p = ·33) or in letter knowledge (p = ·052).
Table 1. Descriptive statistics (mean and SD of z-scores) and effect size (Cohen's d) for bilingual and monolingual groups in linguistic abilities (IDA Battery)

Narratives
macro-structural level. The different parameters referring to the quality of narratives were either investigated in the fixed factor Group (Ms vs. Bs) with a single t-test, or with multivariate analyses, depending on the number of variables considered within each factor. In the case of structural complexity, given that this was a categorical variable, a chi-square analysis was run.
In Table 2, descriptive statistics for mean raw scores are reported, together with Cohen's d, for each parameter considered in the analysis of narratives.
The first MANOVA was run on the number of macro-structural elements. There were no differences between groups (F(5,122) = 2·04, p = ·08, η 2 = ·08). Considering the Mancova with age as a covariate, a marginal effect of group emerged (F(5,121) = 2·32, p = ·047, η 2 = ·09), with bilinguals reporting a relatively smaller number of macro-structure events. Univariate analyses highlighted a main difference in Attempt (p < ·05) and Settings (p < ·05), but when adjusted for Bonferroni correction for multiple comparisons (p < ·01) none of the differences remained significant. The two groups did not differ for Goal (p = ·27), Outcome (p = ·73), or Mental State (p = ·68) subscales. A chi-square analysis revealed that Bilinguals and Monolinguals did not differ in their level of macro-structural complexity (χ 2(1, N = 128) = 1·67, p = ·64): bilinguals reported 54·7% of GAO in their narratives, and monolinguals 50% of GAO structures. This can be considered a high level of complexity since Goal, Attempt, and Outcome were all included in the story-telling.
Table 2. Descriptive statistics (mean and SD of raw scores) and effect size (Cohen's d) for bilingual and monolingual groups in the parameters of narratives

notes: * Max = 10; **MLU = tokens/number of sentences.
MICRO-STRUCTURAL LEVEL
Lexical level
t-tests were run on single parameters of the lexical level. It emerged that bilinguals produced a lower number of words (t(126) = 2·57, p < ·05, d = ·45), as well as a lower number of Tokens (t(126) = 3·36, p < ·01, d = ·59), and Types (t(126) = 4·42. p < ·001, d = ·78). They also showed a higher level of lexical immaturity, committing more errors than monolinguals (t(126) = 2·62, p < ·05, d = ·49). However, bilinguals did not differ from monolinguals in the Type/Token ratio (t(126) = 1·09, p = ·28, d = ·2). When adjusted for Bonferroni correction for multiple comparisons (p < ·01), only group differences in Tokens and Types remained significant. Nonetheless, the only difference between the two groups that shows a high effect size is indeed the difference in the number of Types, while the difference in the Tokens showed only a medium effect size. Age, included as a covariate, proved to be marginally significant only in lexical maturity (F(1,127) = 3·9, p = ·05, η 2 = ·03), with an overall effect of group (F(1,127) = 8·4, p < ·01, η 2 = ·06), which remained significant after Bonferroni correction.
Morphosyntactic level
t-tests on the proportion of free morphemes (out of the number of tokens) highlighted the fact that bilinguals made more morphosyntactic mistakes than monolinguals in free morphemes (t(126) = –3·32, p < ·01, d = ·71), with a medium effect size, but not in bound morphemes (t(126) = –1·55, p = ·12, d = ·29). The two groups did not differ either in the number of utterances (t(126) = 1·36, p = ·17, d = ·25) or in the Mean Length of Utterance (MLU) (t(126) = 0·97, p = ·33, d = ·18).
Text cohesion
t-tests run on parameters of text cohesion showed that the two groups did not differ in any measure: number of principal clauses (t(126) = 0·71, p = ·48, d = ·12), number of coordinate clauses (t(126) = 1·03, p = ·30, d = ·18), number of subordinate clauses (t(126) = 1·36, p = ·17, d = ·24), total number of complex clauses (t(126) = 1·47, p = ·14, d = ·26), percentage of complex clauses out of the total number of clauses (t(126) = 0·75, p = ·45, d = ·13).
Relationship between linguistic knowledge and narrative skills
The first multiple linear regression analysis (see Table 3) was run separately for the two groups (Bilinguals, Monolinguals), with z-scores of IDA battery subscales (vocabulary, phonological awareness, morphosyntactic comprehension, phonological memory, letter knowledge) as independent variables, and total macro-structural elements as a dependent variable. For monolinguals, the model was not significant (R2 adj = –0·001, p = ·43), whereas it was near to significance for the bilingual group (R2 adj = 0·09, p = ·05). In this model, none of the single IDA subscales was a significant predictor of the narratives' total macro-structural score.
Table 3. Summary of linear regression analyses, separately for bilingual and monolingual groups, for linguistic measures predicting Macro-structural Level (total number of elements)

The second linear regression analysis (see Table 4), run separately for the bilingual and monolingual groups, was carried out with measures of micro-structure, namely, MLU and Type/Tokens as independent variables, and total macro-structural elements as a dependent variable. It emerged that the model was significant for the monolingual group (R2 adj = 0·13, p < ·01), with MLU a significant (β= ·4, p < ·01) predictor of macro-structural richness. The model was not significant for the bilingual group (R2 adj = –0·03, p = ·99).
Table 4. Summary of linear regression analyses, separately for bilingual and monolingual groups, of micro-structural variables predicting Macro-structural Level (total number of elements)

DISCUSSION
In the present study, bilingual preschool children were compared to their monolingual peers in measures of linguistic knowledge (vocabulary, phonological awareness, morphosyntactic comprehension, phonological memory, and letter knowledge) and narrative skills (telling of a story sequence composed of five images), taking into account both macro-structural and micro-structural parameters of story production. The aims of the study were twofold: first, to assess differences in L2 linguistic competence and narrative skills between bilingual minority children and monolingual children. The second aim was to investigate the predictive role of linguistic knowledge in the instructional language in macro-structural parameters of narrative skills in the same language, separately for the two populations considered, including both standardized measures and parameters obtained directly from narrative production.
Based on the existing literature reviewed in the ‘Introduction’, we hypothesized a difference between bilinguals and monolinguals in L2 linguistic knowledge, with a particular weakness in vocabulary scores. We also expected poor L2 competence to possibly influence narrative indexes of lexical variety, but not macro-structural parameters. In line with this, we expected that linguistic competence in Italian (instructional language) would better predict narrative skills in monolinguals than in bilinguals. One important contribution of the current study is that linguistic competence, although collected only in L2, was obtained not only from standardized tests but also, directly, from narratives. Previous works showed that standardized tests for language assessment might not reflect the real linguistic skills of bilinguals since their performance is compared to expected performance by monolinguals at a certain age. Linguistic parameters obtained directly from narratives represent an additional, more ecological, measure of linguistic skills in which children may be stimulated to use everyday language. Therefore, a narrative task provides the possibility to investigate the child's ability to use the language. Integrating the information regarding L2 proficiency from standardized measures with that from a narrative task should provide a complete picture of the linguistic profile of bilingual children in L2, the language they use in the educational setting.
Considering the comparison between bilinguals and monolinguals in the linguistic measures battery (in L2 for bilinguals), there was a difference between groups in vocabulary, phonological awareness, and morphosyntactic comprehension, as expected. The two groups did not differ in letter knowledge and phonological memory (tested through a nonword repetition task). These results indicated a general difficulty experienced by bilinguals in the L2 language level expected for their age when compared to monolinguals, for all measures. Our results on vocabulary size are perfectly in line with a vast number of studies documenting lower L2 vocabulary scores in bilinguals compared to monolinguals (e.g. Bialystok. Reference Bialystok2009) and, together with data on morphosyntactic comprehension and phonological awareness, suggest that bilingual minority children at this stage of development (preschool years) show a delay in L2 compared to their monolingual peers, probably due to a still modest exposure to L2. However, a similar performance between the two groups was found in measures that were more specifically related to processing skills, such as phonological memory. Thus, bilinguals seem to show a circumscribed disadvantage in L2 linguistic knowledge but seem to have adequate processing and cognitive skills.
As far as narrative skills are concerned, it emerged that in L2 narratives bilinguals showed a tendency to underperform in the overall number of macro-structural elements. The detailed analysis of subscales showed that bilinguals and monolinguals did not differ in Goal, Outcome, and Mental States, whereas bilinguals reported slightly fewer elements as far as Settings and Attempts were concerned, possibly due to their limited vocabulary. However, none of these differences held for Bonferroni corrections and their effect sizes were generally low, thus suggesting an absence of disadvantage in the ability to report macro-structural elements in their narratives. The two groups did not differ in the level of macro-structural complexity; that is, both groups reported an equal number (54·7% Bs and 50% Ms) of Goal, Attempt, and Outcome elements within their narratives. This suggests that, although bilinguals reported fewer elements according to some parameters, they were equally able as monolinguals as far as the elements of story grammar are concerned. This finding suggests that, although bilinguals have lower linguistic skills in their L2, as highlighted in preceding analyses, when compared to their monolingual peers, they are able to tell stories of similar structural complexity, showing that they have the ability to use the language for communication purposes. This is in line with other studies that highlighted the fact that bilinguals did not differ from monolinguals at the macro-structural level of narratives (Iluz-Cohen & Walters, Reference Iluz-Cohen and Walters2012). Data from both groups also showed that about half the children reported stories that lacked the three main components, thus supporting the idea that, in preschool years, children's stories still lack a complete discourse organization (Peterson & McCabe, Reference Peterson and McCabe1983). Accordingly, it is also worth underlining that a high percentage of children included in the screening phase were excluded for not being able to rearrange the story correctly, and this could suggest that at this age children might not have a stable representation of story grammar.
Regarding the components of other narrative skills analyzed in the present study, it emerged that bilinguals' L2 narratives did not differ from those of monolinguals as far as text cohesion was concerned; that is, the number of principal, coordinate, and subordinate clauses, as well as the proportion between the number of principal and complex (subordinate and coordinate) clauses, did not differ. In the same vein, at a morphosyntactic level, the two groups did not differ in the number of utterances, in the MLU and in the proportion of errors in bound morphemes. The only significant difference was in the proportion of free morphemes, although mean values were generally very low for both groups. These trends were nevertheless coupled with deficiencies at a lexical level, where bilinguals' narratives remained characterized, compared to those of monolinguals, by more lexical errors, a lower number of total words, and fewer types and tokens. We will provide some examples here so as to better illustrate the nature of the task and the crucial differences between the two groups. Two examples of monolingual productions follow: (a) “Dino va a prendere delle cose al supermercato per fare una torta alla sua amica, dopo entra nel supermercato e quando ha finito di prendere le cose va a casa e prepara la torta, poi mette in forno la torta poi la tira fuori e la taglia” (47 words) – ‘Dino goes to get some things at the supermarket to make a cake for his friend. Then he goes into the supermarket and when he finishes getting the things he needs, he goes home and makes the cake, then puts the cake in the oven and then gets it out and cuts it’;(b) “Qui lui andava a fare la spesa, e lui ha preso gli ingredienti e poi li impastava e poi metteva dentro la torta nel forno, poi aspettava Ida” (28 words) – ‘Here he went shopping, and he has got the ingredients and then he kneaded them and then he put the cake in the oven, and waited for Ida’. The following, on the other hand, are examples of narratives produced by bilingual children: (a) “Sta andando a comprare gli ingredienti della torta per la festa della lucertola, sta preparando poi la mette in forno e la aspetta” (23 words) – ‘He is going to buy the ingredients of the cake for the lizard's party, he is preparing then he puts it in the oven and waits for it’; (b) “Va al supermercato doveva fare TOTTA,Footnote 1 presi tutti farina e OVE inizia a FALLA poi ha fatto TOTTA e la mangia tutta, taglia PENDE e mangia” (26 words) – ‘Goes to the supermarket he had to prepare CAKE, all flour and EGGS taken and starts to make it and then he made CAKE, and he eats it all, cuts, TAKE and eats'.
As these examples highlight, the pattern of results described above support the idea that bilinguals are characterized by an adequate competence in L2 language use and linguistic knowledge at a morphosyntactic level, although this is coupled with deficiencies at a lexical level. Nevertheless, the differences that emerged were characterized by low to medium effect sizes. Again, these results indicate that, in an elicited narrative production, bilinguals show similar levels to their monolingual peers.
The current findings also offer novel insight into linguistic parameters derived directly from narratives: as compared to results from standardized measures, it emerged that bilinguals and monolinguals differed in fewer measures when more ecological measures of language use were adopted. In order to obtain a more realistic picture of the language skills of bilinguals, standardized measures should be combined with ecological measures of language production.
Finally, to analyze the relationship between linguistic knowledge and narrative skills, two sets of regression analyses were run in order to investigate linguistic predictors of narrative skills at a macro-structural level. From the first analysis, where it was assessed whether standardized linguistic measures predicted the number of macro-structural elements reported in narratives, none of the models proved to be significant, although there was a tendency to statistical significance in the bilingual group. In the second analysis, MLU, but not tokens, emerged as a significant predictor of the number of macro-structural elements in the monolingual group but not in the bilingual group. The first pattern of results suggests a relative independence between general linguistic knowledge and narrative skills, particularly for monolinguals. This might be interpreted in the light of an important role played by non-linguistic factors in narrative quality. In other words, individual differences in linguistic knowledge within average levels of performance (i.e. in the absence of language impairment), do not necessarily predict the quality of narrative structure. Further investigation should be carried out on other potential predictors, such as general knowledge, pragmatic skills, print exposure, and so on. As far as bilingual children are concerned, the pattern of results is less clear. None of the subscales included in the regression model actually turned out to be a specific predictor of macro-structural parameters (number of reported elements). This might be interpreted in the light of a transversal but minor interdependence of language production and comprehension, as suggested by Pickering and Garrod (Reference Pickering and Garrod2013), and bilingualism may offer a particular perspective in highlighting connections that may not be manifest in monolingual populations.
A second set of regressions analyzed the relationship between micro- and macro-structural elements in the narrative production of bilinguals and monolinguals. It emerged that in monolinguals, but not in bilinguals, the macro-structural level was influenced by MLU, suggesting that the quality of narrative production, at a macro-structural level, is influenced by the syntactic complexity achieved in the same language. For bilinguals, no significant predictor emerged: one might speculate that the quality of their L2 narrative production is scarcely influenced by the linguistic structure of the narratives.
To sum up, the present study offers new insight into the relationship between linguistic abilities and narrative skills, calling attention to the fact that preschool bilingual children are able to structure stories in their L2 with similar cohesive and macro-structural complexity to monolingual peers, although ‘in few words’, that is, showing an L2 lexical delay compared to monolingual peers. This is in line with the hypothesis that language skills tend to develop in synchrony for monolingual speakers, but might develop at different rates in bilingual children (Pearson, Reference Pearson, Oller and Eilers2002).
The present study presents some limitations, one of which is the small sample of language production analyzed, due to the fact that children were asked to produce narratives that described a relatively simple, and brief, five-picture story. Further investigation should be aimed at analyzing longer samples of language production, possibly associated with both telling and retelling tasks. Moreover, in the present study, L1 competencies were not considered, and this was primarily due to the fact that the sample included language minority bilinguals who spoke a variety of different languages, with a lack of quantitative and qualitative tools to assess linguistic and narrative skills in their L1. Finally, it would be worthwhile, in future studies, to investigate, in a more thorough manner, the role of the SES and home literacy environment as mediating factors in the relationship between linguistic and narrative skills. For the purposes of the present study, we suggest that the selection criteria adopted, together with the minor predictive role of SES in Italy (OCSE-PISA, 2006), contribute to substantially minimize the role of SES in explaining the pattern of strengths and weaknesses in narrative skills discussed here. As a final remark, it has to be underlined that the narratives discussed in the present study were collected from the sample of children selected from a larger group for having correctly rearranged the story sequence. On the one hand, this might be considered a potential shortcoming of the study, because, being collected from a group of children with good sequential reasoning skills, the narratives may be partially biased. However, the advantage of this conservative selection procedure is that it provides a guarantee that all the participants had adequate reasoning skills.
There is clearly a particular need for unbiased assessment tools for language-general skills in bilingual populations, in order to bypass the frequent risk of over- and understatement of the profile of bilingual competence. The combined analysis of levels of language knowledge and narrative skills may allow us, therefore, to better detect strengths and weaknesses in bilingual development. This represents the main clinical and educational implication of the current work: when considering the ability to use the language for communication purposes alongside the performance obtained from standardized tests, the typical delays highlighted in bilinguals in their L2 become less evident. Therefore, the evaluation of narrative skills contributes to a better understanding of the typical bilingual profile, thus providing important suggestions as to the best educational practices for bilingual populations.
 
 



