Introduction
When communicating, bilinguals have to control or even inhibit to a certain extent the language that they do not intend to produce. Interference of the non-target over the target language must be kept minimal for accurate output to be achieved. Yet, this task can be especially tricky for certain aspects of grammar. There is one feature notorious for being problematic and creating situations of long-term interference during language processing that are not easily overcome, and that is grammatical gender (Carroll, Reference Carroll1989; Franceschina, Reference Franceschina2005; Hawkins, Reference Hawkins and Benati2009).
Grammatical gender is an inherent abstract characteristic of nouns that partially determines the form of other words in speechFootnote 1. It is present in gendered languages, which have gender systems that classify nouns according to different values. The number (and type) of gender values depends on the language itself (Corbett, Reference Corbett, Dryer and Haspelmath2013). For instance, in European Portuguese (EP) nouns can be either masculine (M, clock: “relógio”), or feminine (F, table: “mesa”), whereas in German nouns can also be neuter (N, room: “Zimmer”) besides feminine and masculine. Due to the abstract nature of grammatical gender, there is no particular reason for assigning one value or another to nouns, which makes gender assignment arbitrary in terms of semantics. Because of this arbitrariness, the gender value assigned to a certain noun might differ across languages. Thus, it is easy to imagine how tricky this feature can be for bilinguals, both in terms of acquisition and of processing. Indeed, even though in EP “clock” is masculine and “table” is feminine, in German this classification is reversed and hence “clock” is feminine (“Uhr”), and “table” is masculine (“Tisch”). Other nouns, however, keep the same gender (e.g., “door”, which is feminine in both EP and German, “porta” and “Tür”, respectively). This cross-linguistic (in)congruency between gender values is reflected in two specific terms: heterogeneric and homogeneric translations. Heterogeneric pairs have different gender values and thus are gender incongruent, whereas homogeneric pairs have the same gender value and thus are gender congruent. This situation of (in)congruency evidently increases the difficulty of learning and correctly using the gender of nouns across multiple languages (Franceschina, Reference Franceschina2005). Yet, the target-like assignment of gender is still crucial to assure accurate output in terms of agreement, as the form of other words such as articles or adjectives changes in order to agree with the gender of the head noun. For instance, in EP we would say “O relógio caro” (the expensive clock) but “A mesa cara” (the expensive table).
The repercussions that the mismatches between gender systems can have on the acquisition, representation and retrieval of gender in bilinguals have been a focus of interest of many researchers (e.g., Egger et al., Reference Egger, Hulk and Tsimpli2018; Kupisch et al., Reference Kupisch, Mitrofanova and Westergaard2022; Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008; Unsworth, Reference Unsworth2008, among many others). In the present study, we focused on the representation and processing of grammatical gender in both early and late bilinguals. More specifically, we asked EP and German bilinguals to translate bare nouns in backward (from L2 to L1) and forward (from L1 to L2) direction in order to understand how gender values are selected during the lexical access to nouns depending on the gender congruency between equivalent translations. We did so by focusing on the so-called gender congruency effect (Sá-Leite et al., Reference Sá-Leite, Fraga and Comesaña2019, Reference Sá-Leite, Luna, Fraga and Comesaña2020). Importantly, we took into consideration two relevant individual background variables that have remained poorly assessed in the literature on gender processing by adult speakers: the age of acquisition (AoA) of the two languages and the language balance, assessed through a proficiency-based measurement of each language.
Tackling gender retrieval during bilingual noun production
The study of the representation and processing of grammatical gender during bilingual language production has attracted the attention of many scholars during the last decades. Particularly, the debate has focused on whether the gender systems of each language are represented separately in the bilingual's mind or rather, languages share a unique system (the autonomous vs. the integrative view, see Sá-Leite et al., Reference Sá-Leite, Luna, Fraga and Comesaña2020). To understand this debate, we must first comprehend how gender is represented in the linguistic system as a feature. It is widely consensual among theories of lexical selection that gender is located at the lexico-syntactic level of word representation in the form of one node per gender value (e.g., the M, F, and N gender nodes; Caramazza, Reference Caramazza1997; Levelt et al., Reference Levelt, Roelofs and Meyer1999). These gender nodes are connected to all the nouns in the lexicon as a function of their gender values, so that “Tisch” (“table”, M) is connected to the masculine node (see Figure 1). Yet, bilinguals could either have an autonomous gender system per language, each one with its own nodes (e.g., a masculine gender node for EP and a masculine gender node for German), or rather an integrated system in which all lexical entries belonging to the same value are connected to the same gender node, regardless of the language they belong to (e.g., “porta” and “Tür” [“door”] would be connected to the same feminine gender node).
To explore this issue, researchers have relied on a specific effect, the cross-linguistic gender congruency effect (GCE) obtained mainly with two different experimental settings: picture naming and forward translation tasks. In the former, participants are asked to name pictures orally in the second language (L2, e.g., bilinguals of EP [first language, L1] and German [L2] would say “Tisch” when presented with the picture of a table); in the latter, participants have to orally translate to the L2 nouns written in their L1 (e.g., saying “Tisch” [German] when presented with the word “mesa” [EP]). In both tasks, conditions of gender congruency and incongruency are created through the selection of homogeneric and heterogeneric translation pairs. According to the integrative view of bilingual gender representation, variations on the response times (RTs) of the participants are expected depending on the type of translation pair. These variations on the RTs are not expected if each language has its own autonomous gender system with its own nodes. Imagine that a bilingual of EP and German wanted to produce the German noun “Tisch”. The processing of this noun would require first the activation and selection of its semantic features, that later would activate the grammatical and syntactic information of the word associated to that concept (tisch) as well as its morpho-phonological realization (/tisch/)Footnote 2. Hence, the masculine gender node would receive activation coming from the semantic representation of “TISCH” and would thus be selected.Footnote 2 In the experimental context of a naming or translation task, when bilingual participants are presented with the image of a table or the word “table” in their L1 (e.g., in EP, “mesa”), they activate not one but two appropriate lexical entries (“Tisch” and “mesa”; e.g., Hatzidaki et al., Reference Hatzidaki, Branigan and Pickering2011; Klaus et al., Reference Klaus, Lemhöfer and Schriefers2018). The lexico-syntactic representation of tisch would activate the masculine gender node, conversely to mesa, which would activate the feminine node. Since both gender nodes are active, they compete for selection (see Figure 2). This process of competition is mostly addressed in the WEAVER++ model of language processing by Levelt et al. (Reference Levelt, Roelofs and Meyer1999). According to the authors and as it has been shown in several computational simulations, for a node to be selected, activation has to reach a threshold defined by the difference of activation across nodes (the Luce ratio, see Roelofs, Reference Roelofs1992). Hence, the greater the activation strength of a non-target node is, the harder it is for the target node to reach the difference of activation marked by the threshold for selection. Conversely, if different sources of activation converge in the same node, facilitation is observed. Yet, if gender systems were autonomous and thus separated depending on the language, convergence on the same node would not occur and facilitation would not be observed (and the same would apply for mechanisms based on competition rather than on facilitation as competition would not occur between autonomous systems).
Results from the naming and translation tasks support the integrative view of the bilingual gender representation as they reveal a consistent cross-linguistic GCE by which RTs are significantly lower for the gender congruent condition (homogeneric translations) than for the gender incongruent one (heterogeneric translations; for a review, see Sá-Leite et al., Reference Sá-Leite, Fraga and Comesaña2019). In other words, in the case of homogeneric translations, the threshold for selection is being reached more easily in comparison to when activation is spread to the opposite gender. This effect has been tested almost exclusively in research on late bilinguals (but see Fuchs, Reference Fuchs2022, for an ongoing study on Spanish heritage speakers), focusing on their L2, and observed in picture naming tasks (Bordag, Reference Bordag2004; Bordag & Pechmann, Reference Bordag and Pechmann2007; Klassen, Reference Klassen2016; Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008; Manolescu & Jarema, Reference Manolescu and Jarema2015; Morales et al., Reference Morales, Paolieri and Bajo2011; Paolieri et al., Reference Paolieri, Cubelli, Macizo, Bajo, Lotto and Job2010; but for null results see Costa et al., Reference Costa, Kovacic, Franck and Caramazza2003), as well as in translation tasks (Bordag & Pechmann, Reference Bordag and Pechmann2008; Manolescu & Jarema, Reference Manolescu and Jarema2015; Paolieri et al., Reference Paolieri, Cubelli, Macizo, Bajo, Lotto and Job2010, Reference Paolieri, Padilla, Koreneva, Morales and Macizo2019; Salamoura & Williams, Reference Salamoura and Williams2007) featuring multiple language pairs from the Germanic, Slavic, and Romance language families (i.e., Czech, Dutch, French, German, Greek, Italian, Romanian, Russian, and Spanish).
Current gaps in the literature
The cross-linguistic GCE is without a doubt highly informative when it comes to the organization of grammatical gender within the bilingual mind and appears to clearly support the integrative view. However, this scenario is likely to be more complex than what has been assumed so far. On the one hand, there are subtle differences in the outcome obtained by the above-mentioned tasks. Indeed, naming tasks seem to better detect the cross-linguistic GCE in comparison to translation tasks. This idea was first pointed out by Bordag and Pechmann (Reference Bordag and Pechmann2008), who failed to obtain the effect in three translations tasks with Czech and German late bilinguals. Still, they had obtained the effect in naming tasks with participants of the same population (Bordag & Pechmann, Reference Bordag and Pechmann2007). The authors proposed that differences across tasks in the time course of gender activation were responsible for the absence of the effect. A recent proposal on the mechanisms underlying gender retrieval supports this idea by noting that gender effects are sensitive to the time course of lexical access because of the inherent low degree of activation of gender nodes (Sá-Leite, Reference Sá-Leite2021). Indeed, in naming tasks, the activation spreads in parallel from the shared concept across languages to both L1 and L2 lemmas. So, gender nodes of one language and another are activated practically at the same time, creating an ideal situation for facilitation or competition to arise. Yet, in forward translation tasks, the word-form representation of the L1 noun that appears on the screen is activated along with its lemma earlier than that of the L2 translation equivalent. In fact, it is the activation of the L1 noun that spreads to the lemma of the L2 translation equivalent, for which the gender nodes are inevitably activated after the activation of the L1 gender node. If gender nodes accumulated high degrees of activation, this probably would not be a problem, but meta-analytic research by Sá-Leite et al. (Reference Sá-Leite, Luna, Fraga and Comesaña2020, Reference Sá-Leite, Luna, Tomaz, Fraga and Comesaña2022) very much suggests that gender nodes accumulate low levels of activation, which in turn originates slippery experimental gender effects characterized by a high degree of heterogeneity and small sizes. In this sense, Sá-Leite (Reference Sá-Leite2021) explains that the level of activation would depend on the language itself. Languages like Italian, EP or Spanish have a high degree of transparency as more than 60% of their nouns end in the quite simple ortho-phonological gender cues “-a” (for feminine nouns) and “-o” (for masculine nouns). Many studies have shown that transparent nouns are more accurately processed and require more cognitive resources to be accessed (Caffarra et al., Reference Caffarra, Janssen and Barber2014). Following the Dual-Route model of gender retrieval by Gollan and Frost (Reference Gollan and Frost2001), transparent nouns seem to rely on an extra route of gender selection besides the lexical memory-based route: the form-based route represented by the noun ending. This does not seem to be the case for languages like French, German, or Dutch, which rely on an extremely complex and sometimes contradictory body of gender-form regularities and are hence considered less transparent for gender (Kupisch et al., Reference Kupisch, Geiß, Mitrofanova and Westergaard2018). For instance, in German, Köpcke (Reference Köpcke1982) and Köpcke and Zubin (Reference Köpcke and Zubin1983) found a quite high number of gender regularities in monomorphemic and extended monomorphemic nouns (44 of which are not coincident with any gender morphemes), which, in addition, depend on the case (e.g., nominative vs. dative), as gender intertwines with declension. Consequently, regularities in opaque languages are not as useful as the typical “-a” and “-o” of transparent languages and the retrieval of gender seems to rely mainly on one source: the lexical memory-based route. Sá-Leite (Reference Sá-Leite2021) proposes that the existence of an extra source of gender activation for most nouns in gender-transparent languages increases the resting level of activation of gender nodes. Ultimately, in these cases, gender selection involves higher levels of activation, which might make gender competition for selection (e.g., masculine vs. feminine) more easily observed. To be precise, bilinguals are the population in which genuine effects of gender congruency are most consistently obtained, even for languages with low degrees of transparency (Bordag & Pechmann, Reference Bordag and Pechmann2007; Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008; for a review see also Sá-Leite et al., Reference Sá-Leite, Fraga and Comesaña2019). This should not surprise us, as bilinguals have a systematic double source of gender activation due to the activation of equivalent translations. Yet, we may note that when the time course of activation of both competing nodes differs, effects may become slippery, especially when less transparent languages such as German are involved. To be precise, null effects within the area of the cross-linguistic GCE have been obtained for Germanic languages in translation tasks but not in naming tasks (Bordag & Pechmann, Reference Bordag and Pechmann2008; Salamoura & Williams, Reference Salamoura and Williams2007). In sum, the slipperiness of the results obtained with translation tasks depending on the transparency of the language is still a matter to be explored.
On the other hand, all the studies on the cross-linguistic GCE have focused on the impact of the L1 on the L2, but not of the L2 on the L1, even though there is increasing evidence suggesting that the acquisition and presence of an L2 within the linguistic system may modulate the representation and processing of the L1 (e.g., Hamann et al., Reference Hamann, Chilla, Gagarina, Abed-Ibrahim and Di Domenico2017; Ulbrich & Ordin, Reference Ulbrich and Ordin2014). More specifically, evidence is quite robust for long-term residents of the L2 environment who achieve near-native L2 proficiency (Schmid, Reference Schmid, Roberts, Verónique, Nilsson and Tellier2009). Thus, a comprehensive view of the bilingual gender selection requires a thoughtful examination of language interference when gender is retrieved for the L1 and not only for the L2. In this sense, two variables that have been shown to be critical for language processing in bilingualism remain mainly unexplored in this area: AoA and language proficiency.
Regarding the AoA, most of the studies on the cross-linguistic GCE have tested participants that acquired their L2 later in life (after the age of 10 – Bordag, Reference Bordag2004; Bordag & Pechmann, Reference Bordag and Pechmann2007, Reference Bordag and Pechmann2008; Klassen, Reference Klassen2016; Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008; Morales et al., Reference Morales, Paolieri and Bajo2011; Paolieri et al., Reference Paolieri, Cubelli, Macizo, Bajo, Lotto and Job2010, Reference Paolieri, Padilla, Koreneva, Morales and Macizo2019; Salamoura & Williams, Reference Salamoura and Williams2007). We believe this bias for late learners should be broken in order to draw more precise conclusions on the way gender is organized and selected in bilinguals. Indeed, only two published studies have tested early bilinguals (Costa et al., Reference Costa, Kovacic, Franck and Caramazza2003; Manolescu & Jarema, Reference Manolescu and Jarema2015). Costa et al. (Reference Costa, Kovacic, Franck and Caramazza2003) conducted picture naming tasks with speakers of Croatian and Italian, Spanish and Catalan, and Italian and French that had acquired their L2 after an average age of five. However, the study used small samples ranging from 10 to 22 participants and was appointed with multiple methodological flaws (for an overview, see Sá-Leite et al., Reference Sá-Leite, Fraga and Comesaña2019; see also Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008). The other study that previously tested early bilinguals was that of Manolescu and Jarema (Reference Manolescu and Jarema2015), who unfortunately provided scarce information regarding the linguistic background of their participants. They only stated that they were children of Romanian parents that moved to Montreal and started acquiring French during “childhood”, which is not particularly specific. In sum and due to the scarcity of research with early bilinguals, thus far conclusions on the organization of the bilingual gender system derived from research with late learners cannot be generalized to early learners. Hence there is a need to test early learners of an L2 or even simultaneous bilinguals, assuring a detailed assessment of their linguistic background, so that we can understand gender retrieval in bilingual language production in a broader sense.
Finally, language proficiency is perhaps the variable that concerns us the most. As pointed out by Sá-Leite et al. (Reference Sá-Leite, Luna, Fraga and Comesaña2020) in their meta-analysis on the cross-linguistic GCE, proficiency in the L2 has never been included as a factor in the analyses of any study on this effect. It has been, however, controlled in many different ways: informal interviews that took place prior to the experiment (e.g., Bordag & Pechmann, Reference Bordag and Pechmann2007), different self-informed subjective questionnaires made by the authors (e.g., Costa et al., Reference Costa, Kovacic, Franck and Caramazza2003; Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008; Morales et al., Reference Morales, Paolieri and Bajo2011; Paolieri et al., Reference Paolieri, Cubelli, Macizo, Bajo, Lotto and Job2010, Reference Paolieri, Padilla, Koreneva, Morales and Macizo2019) – in some cases the authors did not report any information about what the participants were asked, which skills were considered, or what scores were obtained in average (e.g., Manolescu & Jarema, Reference Manolescu and Jarema2015) –, self-informed standardized questionnaires such as Hermans et al.'s (Reference Hermans, Bongaerts, De Bot and Schreuder1998) or Bachman and Palmer's (Reference Bachman and Palmer1989; e.g., Bordag & Pechmann, Reference Bordag and Pechmann2007; Salamoura & Williams Reference Salamoura and Williams2007), ratings in measures such as the familiarity of the target nouns (e.g., Bordag & Pechmann, Reference Bordag and Pechmann2007), and official language tests that had been successfully completed by the participants (although it is not said when – e.g., Bordag, Reference Bordag2004; Salamoura & Williams, Reference Salamoura and Williams2007). To our knowledge, only Klassen (Reference Klassen2016) seems to have used an objective measure of proficiency (the proficiency test of the Goethe-Institut, 2010). In sum, most studies have exclusively used subjective and non-standardized ways of measuring L2 proficiency. Likewise, when checking the self-informed data on the L2 AoA, L2 age of first exposure, and L2 time of exposure across all studies, Sá-Leite et al. (Reference Sá-Leite, Luna, Fraga and Comesaña2020) noticed great variations across studies for populations that were said to have the same level of proficiency. This might be another index of a subjective and non-precise way of understanding what constitutes proficiency.
It is certainly quite baffling that a variable that has been shown as extremely relevant for bilingual language processing in many other areas (see, for instance, Bultena et al., Reference Bultena, Dijkstra and Van Hell2015; Lim & Christianson, Reference Lim and Christianson2015; Prior et al., Reference Prior, MacWhinney and Kroll2007) has been mostly neglected in this specific area. In fact, we believe that the language proficiency of the participants should not only be properly controlled and reported in the subsequent works on bilingual gender processing but should also be included in the analyses. The literature on other areas of bilingualism shows enough evidence to suspect it may have a great role in determining the way languages interact and hence in how gender is selected in one language or another (Kupisch et al., Reference Kupisch, Akpınar and Stöhr2013; Soares et al., Reference Soares, Oliveira, Ferreira, Comesaña, Macedo, Ferré, Acuña-Fariña, Hernández-Cabrera and Fraga2019). More precisely, it is essential to test bilinguals' proficiency in both of their languages and to assess the balance between them, i.e., their relative language dominance. We therefore ought to (1) use more objective measurements to test proficiency and (2) include this variable in the analysis, along with gender congruency, to truly understand its role in the effect.
Theoretical background for the role of AoA and proficiency in gender retrieval
We will now present a proposal on how AoA and proficiency may impact grammatical gender representation and processing. We will base ourselves on the idea of relative language dominance (henceforth: language balance), which refers to the degree of balance between the languages of the bilingual speaker. To do so, we will rely on two popular and highly supported models of bilingual lexical processing: the developmental Bilingual Interactive Activation model (BIA-d, Grainger et al., Reference Grainger, Midgley, Holcomb, Kail and Hickmann2010) and the Multilink (Dijkstra et al., Reference Dijkstra, Wahl, Buytenhuijs, Van Halem, Al-Jibouri, De Korte and Rekké2019). The BIA-d model is an extension of the connectionist but also localist BIA model (Grainger & Dijkstra, Reference Grainger and Dijkstra1992) that offers an interesting view on the adaptations that our linguistic system experiences during the acquisition of an L2. The Multilink, however, is the most recent connectionist model of bilingual language production and comprehension. By combining both models (and also considering that language learning involves an on-going fine-tuning of the learning system that continues across life span, see Ramscar et al., Reference Ramscar, Hendrix, Shaoul, Milin and Baayen2014; also Chuang & Baayen, Reference Chuang, Baayen and Aronoff2021), we believe we can address the dynamics of a flexible system susceptible to changes in proficiency depending on variations in word frequency of exposure and use.
Following the BIA-d model, as well as extensive empirical evidence on the matter, when the L2 proficiency increases, the L2 dependency on the L1 decreases, and hence the ability of the L1 to create interference during L2 processing decreases as well (Abutalebi & Green, Reference Abutalebi and Green2007; Pivneva et al., Reference Pivneva, Palmer and Titone2012). Indeed, it is known that the higher the L2 proficiency of the bilingual, the better their performance during L2 production at different levels (Costa & Caramazza, Reference Costa and Caramazza1999; Kroll et al., Reference Kroll, Michael, Tokowicz and Dufour2002; Pivneva et al., Reference Pivneva, Palmer and Titone2012). The tenets of the Multilink are in line with this: as the frequency of encounters with words increases, the links between their lexico-syntactic representations (i.e., lemmas) and their grammatical features become stronger. We hypothesize that this would also affect grammatical gender and, hence, the stronger the link between the L2 lemma and the gender node, the easier the retrieval of gender and the less the dependency on the L1 gender node as well as its ability to interfere. Consequently, effects of cross-linguistic gender interaction such as the GCE should be smaller, the higher the proficiency in the target language.
Regarding the AoA, it is still an open question whether early bilinguals have a better language control (Berken et al., Reference Berken, Chai, Chen, Gracco and Klein2016, Reference Berken, Gracco and Klein2017; Bonfieni et al., Reference Bonfieni, Branigan, Pickering and Sorace2019; Wattendorf et al., Reference Wattendorf, Festman, Westermann, Keil, Zappatore, Franceschini, Luedi, Radue, Münte, Rager and Nitsch2014), yet what seems clear is that other kinds of factors, such as daily exposure to each language or changes in the language environment, can modulate the interaction of both languages during language processing regardless of when the L2 was acquired (for a review, see Van Hell & Tanner, Reference Van Hell and Tanner2012; see also Bonfieni et al., Reference Bonfieni, Branigan, Pickering and Sorace2019). This occurs largely because these factors directly influence the degree of proficiency of the speaker in each language (e.g., Dussias & Sagarra, Reference Dussias and Sagarra2007; Levy et al., Reference Levy, McVeigh, Marful and Anderson2007; but for more detail, see Van Hell & Tanner, Reference Van Hell and Tanner2012). Therefore, whether or not a cross-linguistic GCE is observable should depend mainly on the degree of proficiency in each language, and being an early bilingual should not be an impediment to language interaction happening (for a study in language comprehension that supports this prediction, see Paolieri et al., Reference Paolieri, Demestre, Guasch, Bajo and Ferré2020). In fact, many studies assume that AoA effects may be leveled by increasing proficiency (see Gagarina & Klassert, Reference Gagarina and Klassert2018). We thus believe that modulations in proficiency of either of the two languages should produce changes in the strength of the links connecting grammatical gender to lemmas. The changes in strength create states of dependence or interference of one language over another, somehow regressing in the phases of acquisition defined by the BIA-d model (see our proposal in Figure 3). In this sense, note that both the L2 or the L1 could suffer changes in their representational state, depending on these fluctuations in proficiency, in line with previous evidence (e.g., Dussias & Sagarra, Reference Dussias and Sagarra2007; Guo et al., Reference Guo, Liu, Misra and Kroll2011; Linck et al., Reference Linck, Kroll and Sunderman2009; see also Morales et al., Reference Morales, Paolieri, Cubelli and Bajo2014). Thus, we do not expect AoA to be a better predictor of the GCE than proficiency.
The present study
In the present study, we aimed to: (1) explore the cross-linguistic GCE not only in an L2 but also in an L1 by assessing EP and German bilingual adult speakers to understand how cross-linguistic interference might modulate gender retrieval as a whole; (2) test bilinguals who speak languages with different degrees of transparency (a more transparent [EP] and a more opaque [German] language) to examine if the degree of gender transparency of the languages may have any impact on the cross-linguistic GCE; (3) for the first time test the role of AoA and language balance in the effect. To do so, we conducted translation tasks as is traditionally done in this field of research. However, instead of using only forward translation tasks as it has been previously done in the literature (Bordag & Pechmann, Reference Bordag and Pechmann2008; Manolescu & Jarema, Reference Manolescu and Jarema2015; Paolieri et al., Reference Paolieri, Cubelli, Macizo, Bajo, Lotto and Job2010; Salamoura & Williams, Reference Salamoura and Williams2007) we also used backward translation tasks, which allowed us to explore the effect of one language on another and vice-versa during language production. We recruited participants from two different populations: (a) the so-called heritage speakers, i.e., early bilinguals of EP and German that had Portuguese parents and were born or lived during childhood in a German-speaking country (either Germany or Switzerland) with EP as home language, and (b) monolingual-raised native speakers of EP who started to learn German after the age of 10 in a classroom setting. The early bilinguals differed in the number of years they had been in German-speaking countries, with some living there even during adulthood and others leaving during childhood. At the moment of testing, 29 early bilinguals lived in Portugal, 12 in Germany and one participant in the German-speaking part of Switzerland. All early bilinguals have in common having acquired German in childhood through immersion before the age of 7 (following Hyltenstam & Abrahamsson's [Reference Hyltenstam, Abrahamsson, Doughty and Long2003] cut-off of early bilinguals). Consequently, they differed in their degree of proficiency in each language, suffering certain imbalances between languages due to varying proficiency in either EP or German (from now on, L1 and L2).Footnote 3 According to our proposal, for some participants, the strength of the links between the gender nodes and the L1 lemmas should have likely decreased, creating a situation of possible interference between languages, even though they were early bilinguals and highly proficient in the L2. We hence considered not only the proficiency of the L2 but also that of the L1. More specifically, we used a standardized and highly valid and reliable measurement of the degree of proficiency in both languages, the DIALANG Vocabulary Size Placement Test (see Alderson, Reference Alderson2005). We obtained a measure of language balance by subtracting the DIALANG score in one language from the score in the other, thus obtaining a differential-based dominance index, as suggested by Birdsong (Reference Birdsong, Silva-Corvalán and Treffers-Daller2015).
We therefore tested the following hypotheses: (a) the cross-linguistic GCE is dependent on the language balance, so that the higher the imbalance, the greater the effect; (b) the effect can be obtained in both an L1 (EP) and an L2 (German), even though it might be modulated by the opaqueness of the Germanic language, being null for this language as obtained in previous studies with translation tasks assessing German as L2 (Bordag & Pechmann, Reference Bordag and Pechmann2008; Salamoura & Williams, Reference Salamoura and Williams2007); (c) hence the effect does not depend on the AoA. The data and scripts used in this study are available online at the following link: http://doi.org/10.17605/OSF.IO/UE9XH
Method
Participants
Seventy-four voluntary adult bilinguals of EP and German (62 female; M age = 38.12 years, SD = 9.73) were recruited online via email and social media and personal contact was made with each one of them. The requirements for participation for late bilinguals were (1) having started to learn German in a classroom context as teenagers or adults; and (2) having studied German for 5 years or more or having lived in a German-speaking country as adults. As for early bilinguals, the requirements were (1) being born or having immigrated to a German-speaking country before age 7 (Hyltenstam & Abrahamsson, Reference Hyltenstam, Abrahamsson, Doughty and Long2003); and (2) having lived there for 6 or more years. Ten of them reported moderate to high proficiency in another gendered language apart from these languages in the Language History Questionnaire (LHQ, Li et al., Reference Li, Zhang, Yu and Zhao2020). Those were French, Spanish and Italian.Footnote 4 In addition to AoA, country of residence and years living in a German-speaking country, participants were also asked to self-rate their proficiency in all languages they knew and to estimate their degree of contact with each language in their daily life (by dividing 100% of contact among all relevant languages). The early bilinguals (n = 42) grew up as Portuguese-descendant second generation immigrants in Germany or in the German-speaking part of Switzerland. They started to acquire EP from birth as heritage language, i.e., as the main language spoken within the family. As is typical for heritage speakers, contact with the majority language (German) started either from birth or during pre-school age.Footnote 5 Thus, all speakers were either simultaneous or early successive bilinguals who became German-dominant in childhood.Footnote 6 Due to various reasons (remigration, changes to the family constellations, professional reasons, among others), the degree of contact with either EP or German in daily life is diverse across speakers.
The late L2 learners started to acquire German in a classroom setting after age 10. All late learners are highly proficient in German, either because they studied German at the university or moved to Germany or Switzerland for professional or personal reasons. At the moment of testing, 7 participants were living in Germany, 2 in a German Swiss canton, 12 in Portugal and one participant moved recently from Germany to Northern Spain (Galicia).Footnote 7
All participants signed informed consents for experimentation with human subjects previously approved by the Ethics Council of the University of Minho (CEICSH 120/2020) through Google Forms.
Materials
Measurement of language balance
We assessed proficiency in each language through the DIALANG Vocabulary Size Placement Test (VSPT, version 1) for EP and German. Note that the lexical competence has been shown to be a reliable predictor of language proficiency (Bonvin et al., Reference Bonvin, Brugger and Berthele2021; Laufer & Nation, Reference Laufer and Nation1999; Treffers-Daller & Korybski, Reference Treffers-Daller, Korybski, Silva-Corvalán and Treffers Daller2016), since the learners' lexical knowledge grows when proficiency increases, and adequate lexical knowledge is a prerequisite of effective language use. The DIALANG VSPT is a questionnaire that assesses lexical competence through a list of 75 words, of which 50 are real words and 25 are pseudo-words (for a more detailed explanation of the concept of language proficiency and balance as well as of supporting evidence of the DIALANG VSPT as a reliable proficiency indicator see Appendix S1). Participants were requested to indicate whether or not each word was an existing word in EP or German. Following Alderson (Reference Alderson2005), the test score was computed based on the total of words correctly identified as either real words or pseudo-words. A measurement of language balance was obtained by calculating between-language subtractive differentials, i.e., the score obtained for German was subtracted from that of EP. Negative values indicate higher proficiency in EP and positive values in German (e.g., German: 62 - EP: 72; dif: -10). Values close to zero indicates high language balance.
Complementing this, we asked participants to self-assess their proficiency in Portuguese and German, on a scale from 1 to 7 in speaking and writing. For the quantification we added the ratings for both skills and obtained a total self-assessment score on a scale from 1 to 14 for both languages. We then computed the differential between both language scores to define a value for self-assessed relative proficiency. Again, negative values indicate higher self-estimated proficiency in Portuguese and positive values in German (e.g., Portuguese speaking: 7 + Portuguese writing: 6 = 13; German speaking: 5 + German writing: 5 = 10, dif: - 3). A positive strong correlation between the language balance score obtained through the DIALANG VSPT and the self-assessment scores will further support the reliability of the DIALANG VSPT as a proficiency test (for more detail, check footnote 6 in the Results section; see also Flores et al., Reference Flores, Zhou and Eira2022).
Stimuli
We selected 180 EP inanimate nouns from the P-PAL database (for all the stimuli, see Appendix S2; Soares et al., Reference Soares, Iriarte, Almeida, Simões, Costa, Machado, França, Comesaña, Rauber, Rato and Perea2018) and translated them into German. They were selected by taking into account the gender value in both languages, so that we had the same number of stimuli in each of these 6 translation types: heterogeneric feminine-masculine (“abóbora” [F] in EP, “Kürbis” in German [M], “pumpkin”), heterogeneric masculine-feminine (“journal” [M] in EP, “Zeitung” in German [F], “newspaper”), homogeneric feminine (“cenoura” in EP, “Karotte” in German, “carrot”), homogeneric masculine (“bosque” in Portuguese, “Wald” in German, “forest”), feminine-neuter (“perna” in EP, “Bein” in German, “leg”), and masculine-neuter (“carro” in EP, “Auto” in German, “car”). We avoided EP nouns with more than one German translation that had similar frequencies of use according to SUBTLEX-DE (Brysbaert et al., Reference Brysbaert, Buchmeier, Conrad, Jacobs, Bölte and Böhl2011). Besides, we avoided nouns in German that could also be verbs (e.g., “Leben” [life/to live]), nouns with high positive or negative affective valence related to death or sexuality (e.g., corpse, death, penis, etc.), German nouns that were cognates in English (e.g., “Butter”), and nouns that had more than one possible translation, when these diverged in gender within that language (e.g., “Miete” [rent, F] in German translates to “renda” [F] or “aluguer” [M] in EP). In terms of ortho-phonological gender transparency, we did not select any irregular nouns in EP, and included 144 transparent nouns and 36 opaque nouns evenly distributed across the six translation types (24 transparent nouns and 6 opaque nouns per type). We created two different blocks as a function of translation direction (EP to German and vice-versa), each composed of 90 stimuli, so that the presentation of both blocks was counterbalanced across participants. All EP nouns were controlled through a one-way ANOVAs across the 6 translation types for per million and logarithmic frequency, number of phonological and orthographic neighbours, number of letters, and mean logarithmic bigram frequency, taken from P-PAL (all ps > .247, Soares et al., Reference Soares, Iriarte, Almeida, Simões, Costa, Machado, França, Comesaña, Rauber, Rato and Perea2018), logarithmic frequency, taken from SUBTLEX-PT (p = .544, Soares et al., Reference Soares, Machado, Costa, Iriarte, Simões, de Almeida, Comesaña and Perea2015), and subjective frequency, concreteness, and imageability, taken from the Minho Word Pool (all ps > .525, Soares et al., Reference Soares, Costa, Machado, Comesaña and Oliveira2017). See mean values in Table S1 of Supplementary Materials.
German nouns were controlled across the six translation types through a one-way ANOVA for absolute logarithmic frequency, number of letters, logarithmic number of neighbours based on the Levenshtein distance, initial logarithmic bigram frequency (normalized), and familiarity, as taken from dlexDB (all ps > .091; Heister et al., Reference Heister, Würzner, Bubenzer, Pohl, Hanneforth, Geyken and Kliegl2011), and logarithmic frequency as taken from SUBTLEX-DE (Brysbaert et al., Reference Brysbaert, Buchmeier, Conrad, Jacobs, Bölte and Böhl2011; p = .29). See Table S2 of Supplementary Materials for means and standard errors. Translation pairs across translation types were controlled for equivalent measures – namely, number of letters, and subjective frequency/familiarity (ps > .563). Although logarithmic frequency (SUBTLEX-PT and SUBTLEX-DE) showed significant differences, these differences were not between the conditions that subsequently showed significant results – namely, gender incongruency vs. congruency (ps > .143). The translations were also controlled for orthographic overlap using the NIM database (Guasch et al., Reference Guasch, Boada, Ferré and Sánchez-Casas2013) and phonological overlap using the PHOR-in-One database (Costa et al., Reference Costa, Comesaña and Soares2021; all ps > .174). See Table S3 of Supplementary Materials for means and standard errors of overlap measures.
Conditions regarding the stimuli were created taking into consideration the factors of Gender Congruency (gender congruent, gender incongruent), Target Gender (masculine, feminine), and Target Language (EP, German). Note that target gender and target language refer to the gender of the noun to be produced, and the language to be produced, respectively. Importantly, the factor of gender congruency included the four experimental conditions: heterogeneric masculine and feminine (incongruent) and homogeneric masculine and feminine (congruent) nouns. This allowed us to make a direct comparison between the gender systems of EP and German. Neuter gender in German would constitute a third category, in which rather than gender incongruency there is a situation in which one gender value does not exist in the other language, and this may change the representation of that value and the interaction between languages when it comes to its retrieval (for more information on the bilingual representation of differing gender nodes such as that of German and Spanish bilinguals in regard to the neuter node, see Klassen, Reference Klassen2016, and Klassen et al., Reference Klassen, Kolb, Hopp and Westergaard2022) . The scope of our study is to replicate the cross-linguistic GCE and test the activation of gender nodes and the processes of competition that may arise between them depending on language proficiency. These aims can be fulfilled with heterogeneric and homogeneric nouns, avoiding a greater degree of complexity in the experimental design, as we did. Nevertheless, neuter nouns were included among our stimuli in order to avoid artificial contexts that might in some way influence the performance of our participants.
Procedure
The experiment was conducted online due to the public international health emergency caused by the COVID-19 pandemic. We followed Burke and James’ (Reference Burke and James2006) recommendations for online research and data collection.
Participants started by filling out the LHQ (Li et al., Reference Li, Zhang, Yu and Zhao2020) using Google Forms. This questionnaire allowed us to explore their linguistic background and check their knowledge of any other gendered language. Afterwards, links were sent for the EP and German versions of the DIALANG, a standardized lexical test to objectively assess the proficiency in each of these languages while guaranteeing a high degree of validity and reliability (Alderson, Reference Alderson2005). The task was timed through the Google Add-on Quilgo, which, in addition to the timer, set for 7 minutes in total, allows for screen tracking, thus informing us if the participants kept focused on the task. Only one participant was excluded due to unfocussed participation.
Participants then received a link that opened the experiment in a browser. The experiment was programmed using the JavaScript library jsPsych (de Leeuw, Reference de Leeuw2015). Two blocks of 90 nouns each (90 in EP and 90 in German) were created, but its order of presentation and the language to be produced (i.e., the target language) was counterbalanced, so that we had four different links depending on these two factors. After clicking on the link, the participant first read the instructions regarding the procedure for the whole experiment. These instructions appeared in the target language of the first sub-block. Then, a familiarization phase started in which the 90 translation pairs were sequentially presented, one by one. Participants controlled the presentation of the stimuli using the spacebar. The aim of the familiarization phase was to decrease mistranslations and non-responses. After that, participants tested their microphone, following instructions on the screen, and once checked, instructions appeared for the translation of the first block. They were asked to translate each noun into the target language as fast as possible, avoiding mistakes and trying to speak loudly and clearly into the microphone. Upon starting, participants went through a session of eight training items (different from the experimental items), then the experimental trials started. Each experimental trial had the following structure: a fixation point (+) at the centre of the computer screen, for 500 ms; the target noun for 3,000 ms or until response; a blank space for 500 ms as an inter-trial interval. Trials were presented randomly per participant. For the second sub-block, instructions appeared in the other language (which would be the new target language). The familiarization phase and translation task occurred again with the remaining 90 nouns. Responses were recorded and saved in a private directory on the University of Rovira i Virgili server. RTs were calculated offline from the presentation of the noun to be translated to the onset of the translation response using the PRAAT software (Boersma & Weenink, Reference Boersma and Weenink2018).
The session for each block lasted approximately 20 min.
Results
We removed the RTs of incorrect responses (18.04% data points), RTs above 3,000 ms and RTs that exceeded 2.5 SD of each participant's mean (2.35% data points). We also removed the data from ten participants that made more than 40% of incorrect responses. Hence, the final sample was composed of 64 participants. Table 1 gives an overview over the predictor variables for these 64 participants – namely, AoA, self-rating in each language, self-assessed relative proficiency, proficiency in each language and language balance score.
Note. M = mean; SD = standard deviation; min = minimum; max = maximum. Negative values indicate higher proficiency in Portuguese in Self-assessed relative proficiency and Language Balance Score (DIALANG).
RTs were analysed using linear mixed-effect models (e.g., Baayen, Reference Baayen2008; Baayen et al., Reference Baayen, Davidson and Bates2008). To this end, we used the lme4 package of R (Bates et al., Reference Bates, Maechler, Bolker and Walker2015). We created a fixed structure model to examine the hypotheses of the study, with the inverse of RTs (-1000/RT) as the dependent variable. As fixed effects, the model included the triple interaction and second-order interactions between Gender Congruency (GC, GI), Target Language (language that was produced: Portuguese or German) and absolute Language Balance Score (through the DIALANG)Footnote 8, the triple interaction and second-order interactions between Gender Congruency, absolute Language Balance Score and Age of Acquisition (AoA) of the German Language, the interaction between Block Order (first and second) and Target Language, and, finally, Target Gender (gender of the noun that was produced: Feminine or Masculine). Continuous variables were centered and transformed into Z-scores. In addition, following the guidelines of Schad et al. (Reference Schad, Vasishth, Hohenstein and Kliegl2020), all dichotomous variables were coded using sum contrast coding (-0.5 for the first level and +0.5 for the second level of each factor); Gender Congruency: GC (-0.5), GI (+0.5); Target Language: GER (-0.5), PT (+0.5); Target Gender: FEM (-0.5), MASC (+0.5), and Block Order: first (-0.5), second (+0.5). We also examined the multicollinearity of the fixed effects introduced in the model (R VIF function). All VIF values were less than 3, suggesting non multicollinearity (Zuur et al., Reference Zuur, Ieno and Elphick2010).
Participants and words were included as grouping factors for random effects. We followed a maximal random-effects structure (Barr et al., Reference Barr, Levy, Scheepers and Tily2013) by adding as random slopes the most complex structure that allowed convergence. We incorporated Target Language and Gender Congruency into the random slope of participants, and Target Gender into the random slope of words. The structure of the models for evaluating the RTs in Portuguese and German naming was the same as above, but excluding Target Language for participants.
The significance of interactions was determined using log-likelihood ratio tests (R ANOVA function). We assessed the contribution of each interaction by comparing a model that included them with another model in which they were not included. We also report the results of the t-test analyses for the coefficient estimates of fixed effects and interactions. To this end, we used Satterthwaite's approximations to the degrees of freedom of the denominator (p-values were estimated by the lmerTest package, Kuznetsova et al., Reference Kuznetsova, Brockhoff and Christensen2017).
The results showed a three-way interaction between Gender Congruency, Language Balance Score, and Target Language, estimate = 0.02, SE = 0.01, t = 2.12, p = .034, χ2(1) = 4.55, p = .033 (see Table 2 for the results of the linear mixed-effects model and Table 3 for mean RTs and standard errors). This triple interaction indicates that the language balance score influenced the cross-linguistic GCE when producing EP words, estimate = 0.02, SE = 0.01, t = 2.44, p = .017, χ2(1) = 5.78, p = .016, but not when producing German words, estimate = -0.00, SE = 0.01, t = 0.56 p = .577, χ2(1) = 0.33, p = .566. The results show that, when producing EP words, GCE increased in line with participants' difference in proficiency between languages (see Figure 4), i.e., the higher the imbalance between languages, the higher the effect. A Target Language effect was also observed, estimate = -0.02, SE = 0.01, t = 3.28, p = .001, showing that participants were faster at translating words into EP than into German, probably because EP was their L1, the language they learned at home. In contrast, neither an effect of AoA of the German language nor the interaction between that variable and the rest of variables was observed (all ps > .05).
Note. Results reported for the conditions of Gender Congruency taken into consideration Target Language and Target Gender.
Discussion
In the present study, we conducted a forward and a backward translation task with EP and German adult bilinguals. We were interested in testing the cross-linguistic GCE (i.e., facilitation in the processing of homogeneric translations in comparison to heterogeneric translations) in both languages, including as factors within the analyses two usually ignored but relevant variables: AoA and language balance. By following the tenets of the BIA-d model (Grainger et al., Reference Grainger, Midgley, Holcomb, Kail and Hickmann2010) and the Multilink (Dijkstra et al., Reference Dijkstra, Wahl, Buytenhuijs, Van Halem, Al-Jibouri, De Korte and Rekké2019), as well as previous evidence in other areas of bilingualism (e.g., Abutalebi & Green, Reference Abutalebi and Green2007; Pivneva et al., Reference Pivneva, Palmer and Titone2012; Soares et al., Reference Soares, Oliveira, Ferreira, Comesaña, Macedo, Ferré, Acuña-Fariña, Hernández-Cabrera and Fraga2019), we proposed that the strength of the links between the L2 lemmas and their gender values varied according to the balance of proficiency between the two languages regardless of the AoA. As a consequence, the dependency of the L2 on the L1 representation would also vary, so that the state of development of the L2 following the BIA-d model also varied depending on this strength. Ultimately, the interference of the L1 on the L2 during gender selection would be more reduced the greater the strength between the L2 lemmas and gender nodes (since there should be less dependency of the L2 on the L1). As the strength of these links is related to frequency of use and exposition, we also hypothesized that a reduction in the use of the L1 would affect the strength of its links and would create a mirroring situation in which the L1 would be more dependent on the L2 and suffer from its interference. Importantly, following previous evidence with translation tasks (Bordag & Pechmann, Reference Bordag and Pechmann2008; Salamoura & Williams, Reference Salamoura and Williams2007) and recent proposals on the slipperiness of gender effects due to a low degree of activation of gender nodes and their sensitiveness to the time course of lexical access, we also consider the possibility of obtaining null results in the less transparent language, German. Hence, we expected to obtain a cross-linguistic GCE that: (a) was dependent on the language balance, so that the greater the imbalance, the greater the GCE; (b) was observable in both the L1 and the L2 but could be affected by the gender opaqueness of the language; (c) was not dependent on the AoA.
The results were clear-cut: they confirmed the existence of a cross-linguistic GCE with bare nouns in a translation task with early and late bilinguals of EP and German. In line with hypothesis a) our results showed that the GCE increased in parallel with the difference in proficiency between the L1 and L2, so the greater the imbalance, the greater the effect. Note that, indeed, balanced bilinguals did not show the effect. This does not necessarily imply that balanced bilinguals have an autonomous gender system for each language (Costa et al., Reference Costa, Kovacic, Franck and Caramazza2003). Rather, the gender system in bilinguals may be integrated, and following our proposal based on the BIA-d and Multilink models, whether or not interference happens will depend on the strength of the lemma-gender nodes connections and the independence of both languages’ representations within the same system.
Furthermore, the triple interaction between Gender Congruency, Language Balance Score, and Target Language showed that, partially in line with hypothesis b) the effect was actually obtained in the more transparent language (EP) and not in the more opaque language (German) and, in line with hypothesis c), the effect was independent of the AoA, thus it was obtained in early as well as in late bilinguals. Our study hence constitutes supporting evidence to the idea that, indeed, it is the proficiency, and more specifically, the balance between languages that better explains the cross-linguistic influences between languages at the level of processing. Furthermore, our analyses showed that it did not matter whether the imbalance was due to higher proficiency scores in EP or in German: regardless of which language was more dominant, the effect was always visible in EP, never in German. On the one hand, perhaps, once there is imbalance and dependency of one language over another, connections between languages allow for interaction to occur during gender selection even when producing the dominant language. On the other hand, these results are in line with the idea that for gender opaque languages, gender competition during lexical access entails lesser levels of activation that produce smaller and more slippery effects, as shown by Bordag and Pechmann (Reference Bordag and Pechmann2008) and Salamoura and Williams (Reference Salamoura and Williams2007).
In conclusion, these results show that gender is selected across languages competitively and that this competition depends mainly on language balance regardless of the AoA or the direction of language dominance. The fact that the effect was restricted to EP corroborates the idea that the resting level of activation of gender nodes and the levels of activation involved in the process of gender selection is higher than that of less transparent languages, like German. We recognize, nevertheless, that more research is necessary to sustain this hypothesis. More specifically, since it seems a quantitative problem on the levels of activation, future studies should examine the cross-linguistic GCE within translation tasks comparing language pairs of different degrees of gender transparency. If we are right, the effect should become stable and greater with transparent pairs, whilst absent with opaque pairs. Yet, with mixed pairs, the effect should be present when the target language has a high degree of transparency and absent (or slippery) when the target language has a low degree of transparency. A more fine-grained analysis is also possible, considering the regularities within the language itself rather than its overall degree of gender transparency. Hence, a factor of transparency congruency mirroring that of gender congruency could be an interesting addition: comparing the effects between transparent translation pairs and opaque pairs, as well as mixed pairs. We also encourage future studies to include the variable of language balance in their design, and to further explore the finding of gender effects in the dominant language, especially if both languages are highly transparent. In that case, the effect would be expected for both the L1 and the L2, and so differences on the size of these effects may be encountered across languages. In this sense, if we were to test the effect of language balance in other L2 languages, it would be interesting to try other type of measurements, such as these based on reaction times in both languages (rather than accuracy), since they can be useful when exploring the differences of naming and translation tasks related to the time course of lexical access (see Casado et al., Reference Casado, Szewczyk, Wolna and Wodniecka2022). Finally, the fact that participants were faster at translating words into EP than into German, independently of their dominant language, yields an interesting result that requires a closer look in future studies. We hypothesized that this may be due to the status of EP as main family language, which is present in the heritage speaker's daily life, despite their higher proficiency in German; however, our explanation is only tentative and calls for more research.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/S1366728923000378
APPENDIX S1 - Language proficiency, language balance and the DIALANG test
APPENDIX S2 - All target nouns according to their translation type
Table S1 - Means and standard errors of the controlled variables for Portuguese nouns across translation types
Table S2 - Means and standard errors of the controlled variables for German nouns across translation types
Table S3 - Means and standard errors of the overlap measures
Acknowledgements
This work was funded by the Foundation for Science and Technology (FCT) through the Portuguese State Budget (UIDP/01662/2020) and the grant UIDB/00305/2020, as well as by the Spanish Ministry of Education and Vocational Training, through the Training program for Academic Staff (Ayudas para la Formación del Profesorado Universitario [FPU16/06983]), and the Spanish Ministry of Science and Innovation [research project PID2019-110583GB-I00]. We thank the Reviewer Alba Casado for her helpful comments especially regarding the data analysis, and all the participants that agreed to be part of our task during the COVID-19 pandemic.
Competing interests
The author(s) declare none.
Data availability
The data that support the findings of this study are openly available at http://doi.org/10.17605/OSF.IO/UE9XH