1. Introduction
Building upon “sonority law” (Sievers Reference Sievers1876) and typological overviews such as that of Greenberg (Reference Greenberg, Greenberg, Ferguson and Moravscik1978), Clements (Reference Clements, Kingston and Beckman1990) introduced the Sonority Sequencing Principle (henceforth SSP),Footnote 1 which states that “between any member of a syllable and the syllable peak, only sounds of higher sonority rank are permitted” (Clements Reference Clements, Kingston and Beckman1990: 285).Footnote 2 Hence, an onset cluster such as pl can precede a vowel nucleus, since the sonority of the liquid [l] is higher than that of the stop [p], while lp will be ruled out. Conversely, lp is a suitable coda cluster according to the SSP, while a coda cluster pl should be dismissed. Several authors have attempted to explain such trends (for a review, see Parker Reference Parker2017). The main issue is to understand whether the constraints concerning syllable structure emerge from the statistical properties of languages (e.g., syllable frequency in the lexicon) or whether they reflect a formal phonological rule.
To address this question, a classical approach is to examine how phonological constraints, such as the SSP, are applied to novel forms. The rationale is as follows. If phonological constraints are independent of the learned examples in the lexicon, speakers should apply these rules to new linguistic material. Otherwise, if phonological constraints emerge from the examples in the lexicon, speakers will probably process new forms differently from lexically attested forms. Using this rationale, several experimental studies compared the acceptability of non-words containing syllables having CC clusters (C: consonant) that respect the SSP, with that of non-words that do not respect the SSP (e.g., Greenberg and Jenkins Reference Greenberg and Jenkins1964, Albright Reference Albright2009, Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011, Hayes and White Reference Hayes and White2013). Speakers considered that non-words beginning with rising sonority clusters (e.g., onset pl) were more acceptable as potential new words than those beginning with falling sonority clusters (e.g., onset lp), even when the degree of lexical attestedness of the clusters was controlled for. Such effects, known as sonority projection effects (Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011), exist in various languages and seem to be distinct from sensorimotor constraints (for a review, see Berent Reference Berent2017). They are thought to demonstrate the existence of a formal phonological rule concerning SSP (Berent et al. Reference Berent, Steriade, Lennertz and Vaknin2007).
However, the above-mentioned interpretation has been challenged by modeling studies suggesting that a more modest assumption about the SSP is sufficient to obtain sonority projection effects (e.g., Hayes Reference Hayes2011). If, for instance, experimental data can be obtained without assuming the SSP in the model, if the frequency of features in the lexicon is taken into account,. For example, although [lb] and [tl] onsets are equally unattested in English words, they are not equally infrequent featurally, as there are syllable onsets in English that are featurally similar to [tl] but not to [lb]. Therefore, native speakers of English may show sonority projection effects only by extracting and generalizing these featural cues without a pre-existing SSP (Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011; see also Van de Vijver and Baer-Henney Reference van de Vijver, Baer-Henney and Parker2012 for German). The ability to generalize from the inventory of existing sonority-respecting clusters seems to be one of the key mechanisms, in such models, to generate sonority projection effects without implementing the SSP as a formal phonological rule (e.g., Hayes Reference Hayes2011; see also Bárkányi Reference Bárkányi2011 on the role of relative probability of different combinations of natural classes).
Although these studies show that sonority projection effects can be predicted by lexicon statistics (but see Jarosz Reference Jarosz2017 for a different pattern in Polish), this does not imply that non-lexical knowledge is not taken into account while processing novel forms. For example, using a series of phonological constraints that were learnable from the lexicon, Hayes and White (Reference Hayes and White2013) observed that speakers did not solely use their lexicon-based knowledge, but also relied on a set of constraints that were irrespective of the lexicon (e.g., to avoid articulatory difficulty and/or to maintain perceptual distance between contrasting forms). They posited that speakers have a “learning bias” and do not evaluate new forms only in terms of learned examples.
The current study aimed to better understand the interplay between the processing of lexical attestedness and SSP violations in French. Participants were asked to rate the acceptability of three types of CCVC non-words using a 6-point scale: non-words with attested sonority rising, non-words with unattested sonority rising, and non-words with unattested sonority falling CC onsets. Hereafter, in accordance with the previous studies on the sonority projection effects (e.g., White and Chiu Reference White and Chiu2017; Ulbrich et al., Reference Ulbrich, Alday, Knaus, Orzechowska and Wiese2016), we refer to sonority rising clusters as phonological well-formed clusters and to sonority falling as phonological ill-formed ones. Thus, in our study, only well-formed non-words are “sonority-compliant”. In addition to classical analyses of rating scores, we used the signal detection theory approach (Macmillan and Creelman Reference Macmillan and Creelman1991). This approach is highly relevant as it allows the sensitivity of speakers to lexical attestedness and phonological well-formedness to be measured precisely without any response bias (see Huang and Ferreira Reference Huang and Ferreira2020 for a discussion on the application of this approach to acceptability judgments in linguistic tasks). We expected that participants would be sensitive to both lexical attestedness and phonological well-formedness, but perhaps to a different degree.
2. Methods
The experimental approach and data analysis are described below.
2.1. Participants
Twenty-four native French speakers (17 females) participated in the study. They were recruited through our personal network or via an announcement. Most of them were students at the University of Lille. Their mean age was 23 years (SD = 2). Their mean number of years of education was 16 (SD = 1). All had self-reported normal or corrected-to-normal vision without any hearing or language problems. Their performance, assessed with standardized French batteries, in auditory phonological discrimination (Majerus et al. Reference Majerus, van der Kaa, Renard, Van der Linden and Poncelet2005) and written lexical decision (Macoir et al. Reference Macoir, Gauthier, Jean and Potvin2016) were in the normal range. An additional individual was tested but excluded because their data could not be used in the analysis (see Data analysis). The study was conducted in accordance with the tenets of the Helsinki Declaration. Before testing, all participants gave their written informed consent.
2.2. Material
Three lists of 48 CCVC non-words were generated by concatenating a CC onset with a VC tail (e.g., bl + ar → blar) (Appendix A). The three lists differed in terms of the syllable onsets, which were (1) frequent in French with rising sonority, such as dr, (2) unattested or very rare in French with rising sonority, such as tl, or (3) unattested or very rare in French with falling sonority, such as vp. The sonority profile of the onsets was calculated using Steriade's (Reference Steriade1990) scale. Their frequency was extracted from the Lexique database (New et al. Reference New, Pallier, Brysbaert and Ferrand2004). Twelve VC tails were selected among possible tails in French. Each tail was used four times with four different clusters in each list (12 tails x 4 CC per list = 48 non-words per list). Non-words of the attested and well-formed list and those of the unattested and well-formed list were matched in terms of the sonority profile of their onset, but were significantly different in terms of the frequency of their onset and the number of phonological and orthographic neighbors (Appendix B). Non-words of the unattested and well-formed list and those of the unattested and ill-formed list were matched in terms of the frequency of their onset and the number of phonological and orthographic neighbors, but were significantly different in terms of the sonority profile of their onset (Appendix B). Each list constituted one experimental condition in our study (see below).
2.3. Experimental design and procedure
The experimental design was adapted from Daland et al. (Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011). During the experiment, participants performed a non-word acceptability task on 144 non-words (48 non-words x 3 conditions). For each non-word presented in written form, participants were asked to indicate the likelihood that it would become a new word in French. The possible responses were 1 (unlikely), 2, 3, 4, 5, and 6 (likely). Stimuli were presented on the screen and participants scored them by clicking on a visual Likert scale. The order of presentation was fully randomized. Presentation and response collection were performed using Octave software and Psychtoolbox.
2.4. Data Analysis
We performed two types of analyses to examine how participants rated different non-words. The main approach was based on signal detection theory (Macmillan and Creelman Reference Macmillan and Creelman1991), which is commonly used in the field of psychophysics (e.g., Jozefowiez et al. Reference Jozefowiez, Gaudichon, Mekkass and Machado2018) and psycholinguistics (e.g., Sato et al. Reference Sato, Grabski, Glenberg, Brisebois, Basirat, Menard and Cattaneo2011; see also Huang and Ferreira Reference Huang and Ferreira2020). We also analyzed the data using a more classical approach to test the effect of Condition on participants’ responses. To do so, the mean of ratings was calculated per participant and per condition. A repeated-measure ANOVA was conducted on these values with Condition as within-subject factor. The post-hoc comparisons were done using t-tests.
Regarding the signal detection theory approach, we performed a similar analysis as in Jozefowiez et al. (Reference Jozefowiez, Gaudichon, Mekkass and Machado2018). For each participant and each list, the proportion of trials for which the participant provided the rating R in {1,2,3,4,5,6} was computed. These proportions were used to perform data analysis. Figure 1 shows the model on which the analysis relied. We assume that, when presented with a non-word, the participant's answer is based on the value of an internal decision variable x. There is a set {λ1, λ2, λ3, λ4, λ5} of critical values for the decision variable: if x falls below λ1, the participant provides the rating 1; if x falls between λ1 and λ2, the participant provides the rating 2, etc. When presented with an unattested and ill-formed onset (referred to as UA-IF in the following equations; unattested and well-formed are referred to as UA-WF and attested and well-formed are referred to as A-WF), x would be drawn from a Gaussian distribution with mean μUA-IF (μUA-WF, μA-WF) and standard deviation σ. Without loss of generality, we can assume σ = 1. Let i be an integer between 1 and 6. PUA-IF(i) (PUA-WF(i), PA-WF(i) is the probability that the participant provides a rating between i and 6 when presented with an unattested and ill-formed onset (unattested and well-formed, attested and well-formed onset). Let d'(UA − IF,UA − WF) be the difference μUA-IF − μUA-WF. It is a measure of the ability of the participant to discriminate between non-words with unattested and ill-formed and unattested and well-formed onsets, which is not affected by any response bias toward a specific rating the participant might have. It can be shown that
where Z(x) is the inverse cumulative distribution of the standard normal distribution.
Likewise, d'(A − WF,UA − WF), i.e., μA-WF − μUA-WF, can be computed using Equation (2)
Thus, obtaining a d' near 0 suggests that participants do not discriminate between the corresponding lists and that they are not sensitive to the manipulated factor (i.e., attestedness or well-formedness).
Note that to calculate each d', i was set to a different value to satisfy two constraints. First, Z(x) is not defined for 0 and 1. Second, the higher the P values, the better the estimate of d'. Choosing different i is not problematic because the value of d' is independent from the location of the response criteria. One participant was excluded as none of the values for i allowed these two constraints to be satisfied. Table 1 provides an example of our analysis and shows the values of i, the portion of the data they induced, and the values of P and d' computed from them for one of the participants.
3. Results
The data corresponding to our analyses are shown in Figure 2. The average responses were 5.16 (95% Confidence Interval [C.I.] = [4.95 5.38]), 2.28 (95% C.I. = [2.02 2.54]), and 1.52 (95% C.I. = [1.33 1.71]) for attested and well-formed, unattested and well-formed, and unattested and ill-formed conditions, respectively (see Figure 2A). The effect of Condition was significant (F(2,46) = 790.9, p < 0.001). Pairwise comparisons showed that non-words with attested and well-formed onset were rated higher than those with unattested and well-formed (t(23) = 29.23, p < 0.001) and unattested and ill-formed (t(23) = 35.05, p < 0.001) onsets. Non-words with unattested and well-formed onsets were also rated higher than those with unattested and ill-formed onsets (t(23) = 8.76, p < 0.001).
Figure 2B shows the values of d'. The mean d' for the analysis performed on unattested and well-formed and attested and well-formed conditions, reflecting sensitivity to attestedness, was 2.73 (95% C.I. = [2.51, 2.95]). The d' for the analysis performed on unattested and well-formed and unattested and ill-formed conditions, indicating sensitivity to well-formedness, was −0.99 (95% C.I. = [-1.21, -0.77]). The difference between d' for attestedness and d' for well-formed was significant (t(23) = 30.70, p < 0.001).
4. Discussion
We examined the sensitivity of speakers to the lexical attestedness and phonological well-formedness (with respect to the SSP) of CC onsets in French using a non-word acceptability task. We used signal detection theory, which is a useful technique to analyze acceptability judgments in language research (Huang and Ferreira, Reference Huang and Ferreira2020). The results showed that participants were sensitive to attestedness. This was reflected by d' values much larger than 0. Participants were also able to distinguish non-words with unattested well-formed onsets from those with unattested ill-formed onsets. This was reflected by d’ values which were different from 0. The results are thus consistent with previous findings on sonority projection effects (e.g., Daland et al., Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011), showing that participants rate non-words with well-formed onsets as more acceptable, irrespective of their attestedness. Since the condition with non-words whose onsets were unattested and well-formed was used as the reference (see Figure 1 and Equations 1 and 2), a positive and negative d' means that the responses were greater and smaller than the responses to non-words with unattested and well-formed onsets, respectively. Thus, the sensitivity to each manipulated factor (i.e., attestedness and well-formedness) corresponds to the absolute values of d'.
The main aim of the study was to investigate the interplay between the processing of attestedness and well-formedness. Our results on d' show that the sensitivity of participants to attestedness was higher than their sensitivity to well-formedness (absolute mean d': 2.73 vs. 0.99). This shows that participants perceived non-words whose onset differed regarding attestedness to be more distinct than those whose onset differed regarding the well-formedness of their onset. Indirectly, this could be consistent with the view that participants’ knowledge about well-formedness is not based on statistical properties extracted from the lexicon (Berent et al. Reference Berent, Steriade, Lennertz and Vaknin2007; for a review see Berent Reference Berent2017) and that attestedness and well-formedness are perceived differently.
This interpretation is consistent with studies that observed brain responses to well-formedness that were independent of attestedness. In an fMRI study in English, Deschamps et al. (Reference Deschamps, Baum and Gracco2015) examined brain responses to the presentation of non-words beginning with CC clusters that had different sonority rankings (rising vs. plateau/less rising). They observed that the activity of the posterior part of the left inferior frontal gyrus was modulated by the sonority ranking of the onsets in the reading task. In an EEG study of coda clusters in German, Ulbrich et al. (Reference Ulbrich, Alday, Knaus, Orzechowska and Wiese2016) examined the effect of well-formedness on word learning. They investigated the relative ease or difficulty with which novel words containing these clusters could be learned when the clusters eitherviolated or respected the SSP. Participants performed a matching task between auditory non-words and pictures after a learning phase. The authors also investigated the interaction between well-formedness and the frequency of clusters in the lexicon, using attested and unattested clusters. The results showed distinct but interacting responses to well-formedness and attestedness (see also Wiese et al., Reference Wiese, Orzechowska, Alday and Ulbrich2017 for a study of Polish). In a passive-listening EEG study in English, White and Chiu (Reference White and Chiu2017) also observed distinct responses to the attestedness of CC onsets and their well-formedness. In line with these findings, our results support the view that speakers rely on two sources of knowledge (formal phonological rules such as the SSP, and statistical properties extracted from the lexicon) when processing novel forms in their language, with the sensitivity to attestedness being higher than the sensitivity to well-formedness.
In the current study, we focused on syllable frequency, and did not consider the frequency of features when constructing our linguistic material. Thus, the unattested well-formed clusters we used may still have been more attested than unattested ill-formed clusters, in terms of feature frequency. A model that can extract and generalize featural cues from the lexicon (e.g., Phonotactic Learner; see Hayes and Wilson, Reference Hayes and Wilson2008) can also predict sonority projection effects by using only lexical statistics (Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011). In future work, it will be important to investigate whether similar results are obtained without adding any assumption about the SSP in this model. Importantly, Hayes and White (Reference Hayes and White2013), using a modeling approach, observed that while some constraints could be extracted from the lexicon, others could not. A similar approach could be used to establish whether the observed difference in the current study between sensitivity to lexical attestedness and to phonological well-formedness is due to lexical knowledge (e.g., issues such as differences in terms of the frequency of features), or whether it reflects the fact that both lexical attestedness and phonological well-formedness are used by speakers, but to a different extent.
Our study had some limitations. First, we used printed materials. To minimize the impact of the issues related to grapheme-phoneme correspondence and written language processing, it would be interesting to replicate the study using auditory stimuli. However, it is unlikely that the observed pattern was dependent on the modality (auditory vs. printed), as previous studies have shown that sonority projection effects extend to printed materials because the underlying mechanisms are related to phonological knowledge (Berent and Lennertz Reference Berent and Lennertz2010, Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011). The second limitation concerns the behavioral task we used. Although the non-word acceptability task has been widely used (e.g., Greenberg and Jenkins Reference Greenberg and Jenkins1964, Albright Reference Albright2009, Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011, Hayes and White Reference Hayes and White2013), such tasks might not fully engage the phonological system of speakers. For instance, in a non-word generation task, Bucci et al. (Reference Bucci, Lorusso, Gerber, Grimaldi and Schwartz2019) observed wide variability between participants and suggested that some participants may not have considered the task as a linguistic task, but rather as a non-linguistic game. While this possibility cannot be ruled out, note that the variabilities between participants regarding the sensitivity to attestedness and well-formedness were small (see Figure 2), so our findings were likely unaffected by this issue. The third limitation concerns the sonority scale that we used to create our stimuli. Our study used Steriade's (Reference Steriade1990) sonority hierarchy, with fricatives more sonorous than stops, as for instance in the study of Maïonchi-Pino et al. (Reference Maïonchi-Pino, Taki, Magnan, Yokoyama, Écalle, Takahashi, Hashizume and Kawashima2015) in French. However, a coarse-grained scale, as in Clements (Reference Clements, Kingston and Beckman1990) (obstruents < nasals < liquids < glides < vowels), seems to be more consensual (e.g., Daland et al, Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011). Although most of our clusters were also consistent with the latter scale (see Appendix A), few of them, such as fk, were not. Thus, it would be interesting to replicate our results with linguistic material created using a less fined-grained scale.
Overall, using an acceptability judgment task as in Daland et al. (Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011), our findings show the existence of the sonority projection effect in French. They also show that French speakers are more sensitive to lexical attestedness than to phonological well-formedness (defined as sonority rising vs. sonority falling onsets). This suggests that knowledge of phonological constraints regarding the SSP is used differently from that concerning the statistical properties of the lexicon. In addition, our study shows that the signal detection theory approach is useful to analyze data obtained on sonority projection effects, thus extending the models discussed in Huang and Ferreira (Reference Huang and Ferreira2020). Future studies should investigate whether and how our experimental results might be obtained without any assumption (or with only a moderate assumption, as in Hayes Reference Hayes2011) about the SSP.
Appendix A
Onsets and tails used in study to create 144 CCVC non-words (48 per condition).
Appendix B
Properties of non-words used in study, extracted from Lexique database (New et al. Reference New, Pallier, Brysbaert and Ferrand2004), averaged per Condition. Frequencies are in terms of occurrences per million.