1. Introduction
Shift is an input–output mapping where a feature or autosegment loses its underlying associations and surfaces with different associations. For example, Kibondei (Bantu; Kisseberth & Cassimjee Reference Kisseberth and Cassimjee2006) disallows high tones from surfacing phrase-finally. Among other things (see Section 2), this restriction motivates high tones underlyingly associated to the final vowel of verbs to shift onto the penult phrase-finally (1).
Serial phonological frameworks, such as Harmonic Serialism (HS; Prince & Smolensky Reference Prince and Smolensky1993/Reference Prince and Smolensky2004, McCarthy Reference McCarthy, Hirotani, Coetzee, Hall and Kim2000, Reference McCarthy, McCarthy and Pater2016), explicitly delimit the set of operations available to model input–output mappings. Previous approaches to shift in HS have decomposed it into more basic steps of creating and removing autosegmental links (for tone, see McCarthy Reference McCarthy, Baković, Itô and McCarthy2006, Reference McCarthy, Blaho and Bye2008b, McCarthy et al. Reference McCarthy, Mullin, Smith, Botma and Noske2012, Calamaro Reference Calamaro2017, Breteler Reference Breteler2018, Lamont Reference Lamont2022a; for moras, see Torres-Tamarit Reference Torres-Tamarit2012, Lamont Reference Lamont2023; for segmental metathesis, see Takahashi Reference Takahashi2019, Mooney Reference Mooney2022). These approaches are illustrated in (2), where the input–output mapping is broken down into two intermediate steps: either the high tone is first linked to the penultimate vowel and then delinked from the final vowel (2a), or the high tone is first delinked from the final vowel and then linked to the penultimate vowel (2b). Hereafter, the derivations exemplified by (2a) and (2b) are referred to as linking $ + $ delinking and delinking $ + $ linking, respectively.
By contrast, Gietz et al. (Reference Gietz, Jurgec and Percival2023) identify shift mappings that they argue cannot be modeled by linking $ + $ delinking or delinking $ + $ linking. On that basis, they conclude that HS cannot adequately model shift unless Gen is equipped with a basic operation that implements shift in one step (hereafter, a single-step shift operation). This paper reexamines their arguments and refutes their claims. Specifically, it demonstrates that shift in Kibondei and Halkomelem, the languages analyzed by Gietz et al. (Reference Gietz, Jurgec and Percival2023), does not require a single-step shift operation, and that the analyses they present are empirically inadequate. Typological modeling not only reinforces the result that a single-step shift operation is superfluous but demonstrates that grammars with such an operation undergenerate with respect to the attested typology. Sections 2 and 3 provide the analyses of Kibondei and Halkomelem, respectively, and Section 4 provides the typology. Section 5 concludes.
2. High tone shift in Kibondei
In Kibondei (Bantu; Merlevede Reference Merlevede1995, Kisseberth & Cassimjee Reference Kisseberth and Cassimjee2006; see also Cassimjee & Kisseberth Reference Cassimjee, Kisseberth, Hyman and Kisseberth1998: 51–53, 66–69, Lee & Lee Reference Lee, Lee and Samiian2002, Lee Reference Lee2013), high tones shift to surface as close as possible to the right edge of phonological phrases without actually surfacing phrase-finally. The examples in (3) illustrate the high tone underlyingly associated to the final vowel of the verb /hàgiá/ surfacing on the phrase-penultimate vowel; it shifts regressively in (3a), shifts progressively in (3b), and surfaces in situ in (3c). The analysis in this section derives regressive shift as delinking $ + $ linking and progressive shift as linking $ + $ delinking. The examples below also illustrate phrase-penultimate vowel lengthening, which independently supports the status of the phonological phrase.
Kibondei verbs are either underlyingly toneless or have one underlying high tone; of the 451 verbs in Kisseberth & Cassimjee (Reference Kisseberth and Cassimjee2006), 277 are toneless and 174 are high-toned (see Merlevede Reference Merlevede1995 for discussion of nominal tonology). The table in (4) summarizes the possible underlying forms of verbs up to three moras long; there are no monomoraic toneless verbs. As discussed later in this section, the data on longer verbs are too sparse to precisely characterize their underlying forms. Prefixes are also underlyingly toneless or associated to one high tone.
Toneless verbs surface faithfully with toneless prefixes, such as the infinitive /ku-/ and first person present affirmative /ni-a-/ (5a, b); the high vowel never surfaces in the latter, which is omitted from the analysis as a simplification. With high-toned prefixes, such as the class 1 third person present affirmative /á-/, the high tone shifts onto the verb’s penultimate syllable (5c, d).
As the tableaux in (6) illustrate with /á-senɡa ɲama/ $ \to $ [a-sénɡa ɲa:ma] ‘cl.1 is cutting meat’ (5d.ii), progressive shift is modeled as linking $ + $ delinking. In these and following tableaux, pairwise relations between constraints reflect the ranking represented in the Hasse diagram (53) in Appendix A, which also provides constraint definitions, discussion of Gen, and the full set of tableaux. In the first step of the derivation, the prefix high tone is linked to the penultimate syllable of the verb (6b). This improves on Align-R, which penalizes syllables that intervene between the rightmost syllable linked to a high tone and the end of the phonological phrase,Footnote 2 at the expense of *Link(H), which penalizes syllables linked to high tones. Delinking (6c) or deleting the high tone (6d) satisfies Align-R and *Link(H), but is ruled out by higher ranked constraints. Further improvement on Align-R is blocked in the second step by *H $ \Big]{}_{\omega } $ , which penalizes word-final high tones (6f). Instead, the link to the prefix is lost, improving on *Link(H) (6g). The derivation converges in the third step with the prefix high tone on the penultimate syllable of the verb (6h). Linking a high tone to a syllable that precedes its leftmost link does not improve on any phonotactic constraint and is excluded from tableaux.
Because it is not optimal to link high tones to word-final syllables, prefix high tones cannot shift into following words. However, high tones that are underlyingly word-final can. In addition to other contexts, inter-word shift occurs with monosyllabic verbs; Kisseberth & Cassimjee (Reference Kisseberth and Cassimjee2006) give eight such verbs, which are all underlyingly high-toned. Phrase-finally, monosyllabic verbs do not surface with high-toned syllables (7a, c), and phrase-medially, their high tones shift onto the penultimate syllable of a following toneless word (7b.i, ii). When the following word has a high tone, the verb’s high tone surfaces in the preceding syllable, triggering downstep (7b.iii, iv). While the high tones in gray do not surface linked to any syllables, it is ambiguous whether they remain floating or are deleted. As a simplification, the analysis in this section assumes they remain floating; Section 3 addresses an unambiguous case of deletion.
The tableaux in (8) illustrate inter-word shift in /ni-a-dá n-khooʃo/ $ \to $ [n-a-da n-khoó:ʃo] ‘I am eating cashew nuts’ (7b.i). As before, the derivation begins by linking the high tone to the following word, improving on Align-R (8b). The violation of *H $ \Big]{}_{\omega } $ cannot be removed in this step: delinking the high tone violates *Float (8c) and deletion violates Max(H) (8d), which both dominate *H $ \Big]{}_{\omega } $ . However, in the second step, delinking does not create a floating tone, and the high tone loses its link to the verb, satisfying *H $ \Big]{}_{\omega } $ (8g). The remaining derivation is identical to that in (6), with the high tone shifting from the initial syllable of the noun to its penult (8l). Spreading to the final vowel not only violates *H $ \Big]{}_{\omega } $ but also *H $ \Big]{}_{\varphi } $ , which penalizes phrase-final high tones (8k).
Underlyingly phrase-final high tones delink from their hosts to satisfy *H $ \Big]{}_{\varphi } $ , as the tableaux in (9) illustrate with /ku-dá/ $ \to $ [ku:-daH] ‘to eat’ (7a.i). Because Max(H) dominates *Float, floating tones cannot be deleted, and remain floating unless they can optimally be linked to a syllable. In this case, neither syllable is an acceptable host: linking to the verb would violate *H $ \Big]{}_{\varphi } $ and linking to the prefix fatally violates CrispEdge-L, which penalizes linking high tones to syllables that precede the leftmost syllable associated to the same morpheme (Itô & Mester Reference Itô, Mester, Kager, van der Hulst and Zonneveld1999, Kaplan Reference Kaplan, Bennett, Angeles, Brasoveanu, Buckley, Kalivoda, Kawahara, McGuire and Padgett2018; see Rolle & Merrill Reference Rolle and John2022 for an alternative approach). The derivation converges in the second step with the verb’s tone floating (9d).
Similar effects are observed in disyllabic high-toned verbs; of the 199 disyllabic verbs in Kisseberth & Cassimjee (Reference Kisseberth and Cassimjee2006), 69 have underlying high tones. Phrase-finally, disyllabic verbs do not surface associated to their lexical high tones (10a, c). Their high tones shift onto the penult of following toneless words (10b.i–iii) or up to following high tones, triggering downstep (10b.iv, v, d). Unlike toneless verbs, high tones never surface on the initial syllable of disyllabic high-toned verbs, for example, /á-kòmá/ $ \to $ [á-kòmaH] *[a-kómaH] (cf. 5c.ii, d.ii). In general, no high-toned verb with more than one syllable allows high tones to link to its initial syllable. This motivates postulating lexical low tones at the left edge of high-toned verbs (see Marlo Reference Marlo2013: 155 fn. 11 for discussion of similar analyses of Luyia and Hyman & Valinande Reference Hyman, Valinande and Goyvaerts1985, Hyman Reference Hyman, Brentari and Lee2018, and Hyman & Katamba Reference Hyman and Katamba1993 for a diachronic perspective), resulting in three lexical tone melodies: toneless /Ø/, high-toned monosyllabic /H/, and low-high multisyllabic /LH/.
The blocking effects of lexical low tones are illustrated in the tableaux in (11) with /á-kòmá/ $ \to $ [á-kòmaH] ‘cl.1 is killing’ (10c.ii). As in (9), the phrase-final high tone is delinked in the first step of the derivation, satisfying *H $ \Big]{}_{\varphi } $ (11b); the derivation converges in the next step (11c). Linking the prefix high tone to the verb-initial syllable would improve on Align-R (11d) and linking the floating tone to it would satisfy *Float (11e). Both, however, fatally violate *Contour, which penalizes syllables linked to multiple tones. Between the lexical low tones and *Contour, high tones are blocked from linking to initial syllables of multisyllabic high-toned verbs.
Like disyllabic high-toned verbs, high-toned verbs with three or four syllables all disallow high tones from linking to their initial syllables (12–13); in Kisseberth & Cassimjee (Reference Kisseberth and Cassimjee2006), 75 of the 198 trisyllabic verbs are high-toned, 22 of 45 quadrisyllabic verbs are high-toned, the only verb with five syllables is toneless, and there are no verbs with six or more syllables. In trisyllabic verbs, high tones are either underlyingly associated to the penultimate vowel as in /òmbéza/ or the final vowel as in /hàɡiá/. Phrase-finally, these high tones surface on the penultimate vowel (12a), either surfacing in situ (12a.i, iii) or undergoing regressive shift (12a.ii). Phrase-medially, verb-penultimate high tones surface in situ (12b.i), while verb-final high tones shift into the following word (12b.ii), mirroring the behavior of high tones in prefixes and verbs with one or two syllables, respectively. Because there is not enough data in Kisseberth & Cassimjee (Reference Kisseberth and Cassimjee2006) to determine the underlying forms of quadrisyllabic verbs, the examples in (13) only present their surface forms with the toneless infinitive prefix /ku-/ (13a) and the high-toned class 1 present affirmative /á-/ (13b).
Regressive shift is illustrated in the tableaux in (14) with /ku-hàɡiá/ $ \to $ [ku-hàɡí:a] ‘to sweep’ (12a.ii). The high tone is delinked in the first step, satisfying *H $ \Big]{}_{\varphi } $ (14b). In the second step, it is linked to the penult, satisfying *Float (14d), and the derivation converges in the third step (14e). The crucial difference between trisyllabic verbs and shorter verbs is that only the former have an acceptable syllable to host floating tones. In monosyllabic verbs, floating tones can only link to prefixes, violating CrispEdge-L, or the verb itself, violating *H $ \Big]{}_{\varphi } $ (9). In disyllabic verbs, floating tones can only link to the verb-initial syllable, violating *Contour, or the verb-final syllable, violating *H $ \Big]{}_{\varphi } $ (11).
To summarize the analysis, Kibondei verbs are underlyingly specified with one of three tonal melodies: toneless /Ø/, high /H/, or low-high /LH/. All and only monosyllabic verbs take the high melody. Verbs that take the low-high melody have the low tone associated to their initial vowel and the high associated to a following vowel. Verbal prefixes are either underlyingly toneless /Ø/ or associated to a high tone /H/. All high tones are drawn to the right edge of phonological phrases, but cannot link to word-final syllables or syllables already associated to a tone. Underlyingly word-final tones shift onto the penult of following toneless words and other tones shift onto the penult of their host word. Underlyingly phrase-final tones shift onto the penult of their host word if it is at least trisyllabic and fail to surface otherwise.
The analysis derives progressive high tone shift as linking $ + $ delinking and regressive high tone shift as delinking $ + $ linking. It thus invalidates the claims by Gietz et al. (Reference Gietz, Jurgec and Percival2023) that Harmonic Serialism cannot model high tone shift in Kibondei or regressive shift from final to penultimate syllables without modeling shift as a single-step operation. Their latter claim is an artifact of their typology not allowing floating features (see Section 4 for discussion).
While the present analysis benefits from a more complete data set, the analysis Gietz et al. (Reference Gietz, Jurgec and Percival2023) present cannot even account for the data in the sources they cite, Cassimjee & Kisseberth (Reference Cassimjee, Kisseberth, Hyman and Kisseberth1998) and Lee & Lee (Reference Lee, Lee and Samiian2002). Specifically, their analysis incorrectly predicts regressive shift whenever word-final high tones cannot shift into the following word, as in [á-fisá h!é:a] ‘cl.1 is hiding money’ (10d); this specific example is discussed by Lee & Lee (Reference Lee, Lee and Samiian2002: 346).Footnote 3 The support in (15) illustrates the ranking paradox that emerges when this example is included in their analysis (see Appendix B for constraint definitions). The winner $ \sim $ loser pair in (15a) illustrates inter-word shift in /á-tàɡá m-phombe/ $ \to $ [á-tàga m-phó:mbe] ‘cl.1 is selling beer’ (Kisseberth & Cassimjee Reference Kisseberth and Cassimjee2006: 136); these candidates come from tableau 16 in Gietz et al. (Reference Gietz, Jurgec and Percival2023: 33), which does not posit lexical low tones. To rule out the faithful loser, high tones must be worse word-finally than removed from their base position and misaligned from the right edge of a word. This contradicts the ranking required to prevent the verb-final high tone in [á-fisá h!é:a] from shifting regressively (15b). I do not claim that every analysis of Kibondei that assumes a single-step shift operation fails, but the analysis presented by Gietz et al. (Reference Gietz, Jurgec and Percival2023) does.
Before moving on, it is worth justifying another difference between the present analysis and Gietz et al.’s. Whereas the present analysis penalizes autosegmental linking and delinking with the constraints Dep(link) and Max(link), respectively, Gietz et al. (Reference Gietz, Jurgec and Percival2023) employ an Ident constraint. This is explicit in sections 4 and 5 of their paper, but not in their analysis of Kibondei; their definition of Base-Ident (p. 32) implies a mix of Max(link) and Ident. In the analysis of Kibondei, the choice of faithfulness constraints is immaterial because Dep(link) and Max(link) are not crucially ranked relative to each other. However, other systems provide evidence against using Ident.
For example, one method of avoiding vowel hiatus in Logoori (Bantu; Leung Reference Leung1991, Zymet Reference Zymet2018, Odden Reference Odden2021, Reference Oddenn.d.) involves glide formation and compensatory lengthening (see Gess Reference Gess, van Oostendorp, Ewen, Hume and Rice2011, Kiparsky Reference Kiparsky, Cairns and Raimy2011, Borgeson Reference Borgeson2022 for overviews of compensatory lengthening, Torres-Tamarit Reference Torres-Tamarit2012, Reference Torres-Tamarit, McCarthy and Pater2016, Jacobs Reference Jacobs2019, Lamont Reference Lamont2023 for analysis in HS, and Shaw Reference Shaw, Schardl, Walkow and Abdurrahman2009 for analysis in a related framework). While word-internal vowels are lengthened (16.ii), word-final vowels are not (16.iii), obeying a phonotactic restriction on word-final long vowels. However, this restriction is not absolute, as underlying word-final long vowels surface faithfully (17). This pattern is straightforwardly modeled assuming Dep(link) and Max(link), but cannot be modeled using Ident. As above, whether the moras in gray remain floating or are deleted is ambiguous (see Section 3 for discussion).
The difference between derived and underlying word-final long vowels is captured by ranking *Float between Max(link) and Dep(link) (see Lamont (Reference Lamont2023) for a full analysis of hiatus resolution that minimally differs in its assumptions). As in the tableaux in (18), which illustrate the mapping /mu-aŋɡɛ/ $ \to $ [mw-a:ŋɡɛ] ‘my cl.1’ (16a.ii), compensatory lengthening is modeled as mora delinking $ + $ linking. Pairwise relations between constraints reflect the ranking represented in the Hasse diagram in (85) (see Appendix C for the full set of tableaux). The prefix mora is delinked in the first step (18a), satisfying the constraint *VV, which penalizes adjacent vowels. Deleting the mora also satisfies *VV, but is dispreferred because Max( $ \mu $ ) dominates *Float (18c). In the second step, the floating mora is linked to the stem-initial vowel, satisfying *Float (18e). The derivation converges in the third step (18f).
Delinked moras remain floating to avoid surfacing word-finally, as the tableaux in (19) illustrate with /mu-a/ $ \to $ [mw-a $ {}^{\mu } $ ] ‘of cl.1’ (16a.iii). After the prefix mora is delinked (19b), the derivation converges (19c). Linking the floating mora to the vowel fatally violates *Vː $ \Big]{}_{\omega } $ , which penalizes word-final long vowels (19d).
While *Vː $ \Big]{}_{\omega } $ prevents floating moras from linking to word-final vowels, it cannot motivate delinking or deleting underlying word-final long vowels, as the tableau in (20) illustrates with [mbeɡáː] ‘I am shaving’ (17a). Delinking (20b) or deleting (20c) one of the moras associated to the word-final vowel would satisfy *Vː $ \Big]{}_{\omega } $ , but both violate Max(link), which is ranked higher.
Because Max(link) and Dep(link) are crucially ranked, they cannot be combined into a single constraint like Ident. The support in (21) illustrates the resulting ranking paradox.
In summary, the distribution of long vowels in Logoori provides empirical evidence that moraic faithfulness must be modeled by two constraints, Max(link) and Dep(link), not a symmetrical faithfulness constraint, Ident( $ \mu $ ), in Harmonic Serialism (see McCarthy Reference McCarthy2008a for discussion of this question with respect to place assimilation). Assuming a uniform analysis of autosegments generally, this extends to tones and other features.
3. [low] shift in Halkomelem
In Halkomelem (Salishan; Suttles Reference Suttles2004), the feature [low] must be linked to a stressed vowel. Among other things, this restriction motivates delinking [low] from vowels that are not assigned stress and then either deleting it or linking it to a stressed vowel. The examples in (22) illustrate the basic pattern with the reflexive suffix /-θat/. The suffix vowel surfaces faithfully when it is stressed (22a) and as [ə] otherwise (22b–f). When the mid-vowel /e/ is stressed, the [low] feature shifts onto it, lowering it to [a] (22b, c), and deletes otherwise (22d–f). The analysis in this section models [low] shift as delinking $ + $ linking.
Halkomelem has a five vowel system, composed of the high vowels /i(ː), u(ː)/, the mid vowels /e(ː), ə/, and the low vowel /a(ː)/ (Suttles Reference Suttles2004: 9). The distribution of [ə] is largely predictable and typically analyzed as epenthetic (Bianco Reference Bianco1996, Kinkade Reference Kinkade, Czaykowska-Higgins and Kinkade1998); as a simplification, I assume schwa is present in the input, either underlying or having been inserted at a previous derivational step. The leftmost full vowel in a word usually attracts primary stress, which is otherwise assigned to the leftmost [ə], but this is complicated by non-trivial interactions with the morphology and lexical specification (Suttles Reference Suttles2004: 13–14; see Bianco Reference Bianco1996, Reference Bianco1998 for thorough description and analysis; see Pruitt Reference Pruitt2012, Reference Pruitt2019, Reference Pruitt2022 for discussion of primary stress in Harmonic Serialism).
Several suffixes with underlying low vowels including the reflexive /-θat/ and the reciprocal /-talʔ/ undergo [low] shift but surface faithfully when stressed (23a). Otherwise, the [low] feature is delinked from the suffix vowel. If a mid vowel /e/ is assigned stress, the [low] feature is linked to it, lowering it to [a] (22b). This occurs even when a vowel intervenes between the stressed vowel and the suffix (22b.ii, iv). While shift between adjacent syllables appears to be exceptionless, there are exceptions when a vowel intervenes, for example, ['t’ejəm-təlʔ] ‘stick together’ (Suttles Reference Suttles2004: 246) and ['tθ’ekw’ə-təl] ‘shine light on e.o.’ (Gerdts Reference Gerdts, Frajzyngier and Walker2000: 141).
The tableaux in (24) illustrate [low] surfacing in situ in [ɬəqw-ˈθat] ‘get all wet’ (23a.i). In this and following tableaux, pairwise relations between constraints reflect the ranking represented in the Hasse diagram in Appendix D (94), which also provides constraint definitions and the full set of tableaux. In the first step of the derivation, primary stress is assigned to the suffix (24c), satisfying * $ -{\Delta}_{\omega}\ge $ {i,u}, which penalizes vowels more sonorous than schwa that are not the Designated Terminal Element (DTE; de Lacy Reference de Lacy2006: 66–69) of the word. Assigning stress to the root is preferred by Align-L(' $ \sigma $ ), which penalizes syllables that intervene between the left edge of the word and the stressed syllable, but fails to satisfy * $ -{\Delta}_{\omega}\ge $ {i,u}, which is higher ranked (24b). In addition to motivating stress assignment, * $ -{\Delta}_{\omega}\ge $ {i,u} also blocks linking the [low] feature to an unstressed vowel (24d). Delinking (24e) or deleting the [low] feature (24f) also satisfies * $ -{\Delta}_{\omega}\ge $ {i,u}, but violates higher ranked constraints, including Max(link). For reasons to be discussed below, it is necessary to bifurcate Max(low) into a general Max(low) constraint and a constraint that specifically penalizes deleting [low] features that are linked to a vowel, MaxLinked(low).Footnote 4 The latter is ranked high enough to rule out all such candidates, which are excluded from the following tableaux. Along with assigning primary stress to the suffix, deleting the [low] feature is the only other way to satisfy [low] $ \to $ ' $ \sigma $ , which penalizes [low] features not linked to a stressed syllable (24f). The derivation converges in the next step (24g).
When the root contains a vowel other than schwa, it attracts primary stress, as the tableaux in (25) illustrate with /kwes-θat/ $ \to $ [ˈkwas-θət] ‘become hot (weather)’ (23b.i). The first two steps improve on * $ -{\Delta}_{\omega}\ge $ {i,u}: first, the leftmost vowel is stressed (25b), and then the [low] feature is delinked from the suffix (25f). In the first step, assigning stress to either vowel removes one violation of * $ -{\Delta}_{\omega}\ge $ {i,u}, and Align-L(' $ \sigma $ ) prefers that the root vowel is stressed (25b). Other than deleting the [low] feature in the second step, which is ruled out by MaxLinked(low), only delinking it improves on * $ -{\Delta}_{\omega}\ge $ {i,u}. The floating [low] feature is then linked to the root vowel in the third step (25h), satisfying *Float and [low] $ \to $ ' $ \sigma $ . Deletion would also satisfy these constraints, but is ruled out because Max(low) dominates Dep(link) (25i). The derivation converges in the fourth step with the [low] feature on the root vowel.
The feature [low] can only shift onto the mid vowel /e/; it deletes in words with stressed low vowels (26a), high vowels (26b, c), and schwa (26d). Unlike high tones in Kibondei and moras in Logoori, the [low] feature cannot remain floating; it must delete. As discussed below, this motivates the bifurcation of Max(low) into two constraints.
The tableaux in (27) illustrate [low] deletion with a stressed high vowel in /hi:l-θat/ $ \to $ ['hi:l-θət] ‘let oneself fall’ (26b.i). The first two steps of the derivation are identical to those in (25): primary stress is assigned to the root (27b) and the [low] feature is delinked from the suffix (27d). In the third step, however, the floating [low] feature is deleted (27g) rather than being linked to the stressed vowel (27f). The latter is ruled out by *[high, low] which penalizes segments that are specified as both high and low. The derivation converges in the fourth step.
To prevent [low] from shifting onto schwa as in / $ \overset{\frown }{\mathrm{ts}} $ ə-θat/ $ \to $ [' $ \overset{\frown }{\mathrm{ts}} $ ə-θət] *[' $ \overset{\frown }{\mathrm{ts}} $ a-θət] ‘approach’ (26d.i), it must be deleted before stress is assigned. This is modeled by dividing stress assignment across two steps, one which stresses full vowels and one which stresses schwa, and ordering [low] delinking between them (see Elfner Reference Elfner, McCarthy and Pater2016 for discussion of phenomena where stress assignment must be ordered between segmental processes). The tableaux in (28) illustrate this with the derivation of [' $ \overset{\frown }{\mathrm{ts}} $ ə-θət]. The stress pattern of this word is unexpected given words like [ɬəqw-'θat] ‘get all wet’ (23a.i), where the full vowel attracts primary stress. A lexically indexed constraint (Pater Reference Pater, Bateman, O’Keefe, Reilly and Werle2007, Reference Pater and Parker2009) is employed to differentiate these words, which penalizes words that bear the index II and assign primary stress to an affix: *StressedAffix II. If this exceptionality is associated to the root, then the index must propagate up to the word level (Jurgec & Bjorkman Reference Jurgec and Bjorkman2018). However, this appears to be construction-specific, as other roots do not uniformly block affix stress; compare ['jəθ-əs-t] ‘tell him’, which attracts stress away from the recipient suffix /-as/, another locus of [low] shift, with [jəθəs-'t-alə] ‘tell-tr-you(pl)’ (Suttles Reference Suttles2004: 240, 477). Formalization aside, *StressedAffix II blocks stress from being assigned to the suffix (28c). Because the low vowel cannot be stressed, the [low] feature is delinked, lowering its sonority, and satisfying * $ -{\Delta}_{\omega}\ge $ {i,u} (28d); deletion is ruled out by MaxLinked(low). Stressing the root vowel improves on * $ -{\Delta}_{\omega}\ge $ ə, which penalizes all vowels that are not the DTE of the word, but because this constraint is low ranked, it is not yet active (28b). The [low] feature is deleted in the second step, satisfying *Float and [low] $ \to $ ' $ \sigma $ (28h). Because neither vowel is stressed, linking the [low] feature fatally violates * $ -{\Delta}_{\omega}\ge $ {i,u} by needlessly increasing their sonority (28f, g). Finally, the leftmost schwa is stressed, improving on * $ -{\Delta}_{\omega}\ge $ ə (28j), and the derivation converges in the fourth step.
If the [low] feature was not deleted before stress was assigned, nothing would block the grammar from linking it to the root vowel, yielding the wrong output *[' $ \overset{\frown }{\mathrm{ts}} $ a-θət]. This motivates bifurcating Max(low) into two constraints: the general Max(low) and the specific MaxLinked(low). The latter is ranked high enough to allow [low] features to float rather than being deleted directly off of vowels, and the former is crucially ranked low enough to allow floating features to delete when they cannot be linked. This is impossible with a single Max(low) constraint, as the support in (29) summarizes.
To summarize the analysis above, low vowels must be stressed to surface faithfully. Otherwise, the [low] feature is delinked. If it can link to a stressed mid vowel, it does, and otherwise, it is deleted. Shift in Halkomelem is thus derived as delinking $ + $ linking. Other suffixes, including /-aqw/ ‘head’ and the attributive /-aʔɬ/ exceptionally block [low] from delinking, and license it by linking it to a stressed vowel; the examples in (30) illustrate. In these examples, the position of the stress marker follows Bianco (Reference Bianco1996) in assuming that onset clusters are maximally composed of two plosives. As expected, these suffixes surface faithfully when they attract primary stress (30a). When they do not, instead of reducing to schwa, the suffixes surface with low vowels (30b–e). When the stressed vowel is [e] or [ə], the [low] feature is linked to it, lowering it to [a] (30d, e). As with [low] shift, there are exceptions, such as ['χəɬ-aqw] ‘have a headache’ (Suttles Reference Suttles2004: 312).
While the exceptional behavior of the attributive could be explained by the phonotactic restriction against [əʔ] (see Gietz et al. Reference Gietz, Jurgec and Percival2023: 43 for discussion), this is unlikely for the suffix /-aqw/, which Suttles (Reference Suttles2004) usually transcribes without a glottal stop (see, e.g., page 312). Instead, the exceptional behavior of these suffixes is modeled by a lexically indexed constraint against delinking [low], Max(link)I. The tableaux in (31) illustrate /-aqw/ surfacing faithfully in ['stθ’amʔ-aqw] ‘skull’ (30b.i). In the first step, the root vowel is stressed, improving on * $ -{\Delta}_{\omega}\ge $ {i,u} (31b), and the derivation converges in the second step (31c). Delinking (31d) or deleting the [low] feature from the suffix would satisfy * $ -{\Delta}_{\omega}\ge $ {i,u}, but is ruled out by Max(link)I.
Along similar lines, Max(link)I prevents [low] from shifting onto a stressed mid vowel. The tableaux in (32) illustrate with the derivation of [' aqt-aqw] ‘high (of a mountain)’ (30d.i). After the root vowel is stressed in the first step (32b), the [low] feature is linked to it (32d). This licenses the [low] feature, satisfying [low] $ \to $ ' $ \sigma $ . Delinking (32e) or deleting the [low] feature is blocked by Max(link)I in this step, and in the next step (32g), where the derivation converges (32f).
The analysis models [low] shift as delinking $ + $ linking and extends to exceptional forms that license [low] through linking alone. Unlike Kibondei high tones and Logoori moras, I agree with Gietz et al. (Reference Gietz, Jurgec and Percival2023) that unlicensed [low] features must be deleted. Otherwise, they are expected incorrectly to link to stressed schwas. However, this does not necessitate a single-step shift operation. As the present analysis demonstrates, it suffices to bifurcate Max(F) into a general constraint and a specific constraint that penalizes deleting linked features, MaxLinked(F). Furthermore, the analysis Gietz et al. (Reference Gietz, Jurgec and Percival2023) present cannot account for [low] shift across intervening vowels.
As discussed above, [low] shifts onto stressed mid vowels even when a vowel intervenes. Intervening vowels are not limited to schwa, as the examples in (33) illustrate. Gietz et al.’s analysis has [low] shift onto an adjacent stressed mid vowel and deleting otherwise (except when blocked, as with the exceptional suffixes in 30). The ranking arguments of their analysis are summarized in the support in (34); candidates are taken from tableaux 26 and 39 on pages 41 and 48, respectively.
The problems examples like those in (33) present for their analysis are that linking or shifting the suffix [low] onto the intervening vowel is not harmonically improving, and delinking but not deleting [low] contradicts their ranking. The former problem is illustrated in the tableau in (35). Linking (35b) or shifting the [low] feature (35c) onto the intervening vowel does not improve on any phonotactic constraint, and is harmonically bounded by the fully faithful candidate (35a). The latter problem is illustrated in the support in (36). If the delinking candidate is chosen as optimal, Max[ $ + $ low] must dominate *Float, contradicting the ranking established in (34). In order to preserve the [low] feature and link it across the intervening vowel in a future step, the grammar cannot delete [low] features, yielding floating features that Gietz et al. specifically argue their analysis avoids (pp. 46–48). This can be remedied by bifurcating Max, as above, but doing so eliminates any explanatory power gained by positing a single-step shift operation. Thus, as with Kibondei, not only is it not necessary to posit a single-step shift operation to model [low] shift in Halkomelem, but the analysis Gietz et al. (Reference Gietz, Jurgec and Percival2023) present in support of doing so is empirically inadequate. Once again, it is not my claim that every analysis of Halkomelem with a single-step shift operation fails, but Gietz et al.’s does.
4. Typology
The previous two sections demonstrate that, at least for Kibondei and Halkomelem, positing a single-step shift operation is superfluous. Further, the analyses Gietz et al. (Reference Gietz, Jurgec and Percival2023) offer for these languages are not empirically adequate. This section turns to a typological perspective on shift. It demonstrates that, instead of increasing empirical coverage, a single-step shift operation actually makes it impossible to model attested patterns.
Following the typological modeling in Gietz et al. (Reference Gietz, Jurgec and Percival2023: 48–53), typologies of Kibondei-like systems were calculated with and without a single-step shift operation (the Python scripts used in the calculation are available online at https://github.com/aphonologist/hs-shift). As in the analyses above, Gen was able to delete features, remove autosegmental links to features, and add autosegmental links to features. One typology additionally allowed Gen to model shift in a single step. Con comprised the faithfulness constraints Max(link), Dep(link), Max(F), and MaxLinked(F), and the phonotactic constraints *Float, *Link(F), *F], and Align-R (definitions of these constraints are given in Appendix E). Both typologies were run on six inputs, which each contained five segments (or F-bearing units more generally). One input contained a floating feature, and the other five contained one feature linked to one segment.
In total, the two typologies produced 38 languages; 27 were shared between them. The typology calculated without a single-step shift operation included an additional five languages, and the typology calculated with it included an additional four languages. Surprisingly, the typology with more operations available to Gen was smaller (31 languages) than the more restricted typology (32 languages). The table in (37) summarizes the typologies in terms of input–output mappings (see Appendix E for the constraint rankings). Language numbers in bold were only produced in the typology without a single-step shift operation, and those that are underlined were only produced in the typology with one. Double arrows $ \Rightarrow $ indicate spreading, and single arrows $ \to $ indicate shifting; delinked features float in the output.
For a more compact representation, the table in (38) summarizes the typology in terms of the possible outputs of a given grammar. In other words, it represents the sets of phonotactically well-formed strings. In this table, Ø represents a segment (or F-bearing unit) not linked to a feature, and F represents a segment linked to a feature. The numerical notation is taken from Chomsky & Halle (Reference Chomsky and Halle1968); F $ {}_m^n $ denotes the set of strings composed of the symbol F of at least length $ m $ and at most length $ n $ . From this perspective, it is clear that positing a single-step shift operation is unnecessary. The four languages uniquely produced in that typology (8, 16, 24, 31) are produced without a single-step shift operation. Further, no grammar with a single-step shift operation can model languages that only allow a feature on the penultimate or final segment (38.vii); this pattern is uniquely produced without such an operation (19, 20). Because it is attested, grammars with a single-step shift operation undergenerate.
One language that attests this pattern (38.vii) is Wolaitta (Omotic; Amha Reference Amha1996). With the exception of words with historically frozen suffixes, bare nouns and adjectives in the language all have a single high tone associated either to the final syllable (39(i)) or the penult (39(ii)).
Assuming richness of the base (Prince & Smolensky Reference Prince and Smolensky1993/Reference Prince and Smolensky2004), the grammar of Wolaitta must map any underlying form with one high tone onto an output, where the high tone is associated to one of the two final syllables. The grammar illustrated below (40)–(41), minimally different from that for Kibondei, does exactly that. A hypothetical input with an initial high tone (40a) surfaces with penultimate tone (40h), having linked it to the penult (40e) and then removed the underlying association (40g). Words with underlyingly final high tones, such as [miʃirá] ‘bride’ (39c.i), surface faithfully (41a). Delinking the high tone satisfies *H $ \Big]{}_{\varphi } $ , but violates the higher ranked *Float (41b), and deleting it violates the high ranked Max(H) (41c). Thus, underlyingly final tones surface in situ and all others shift to the penult.
All six languages that shift pre-penultimate features to the penult (15, 16, 17, 18, 19, 20) share the ranking *F] $ \gg $ Align-R $ \gg $ Dep(link), Max(link). If Gen is able to shift a feature in one step, it is impossible to keep final features in place under this ranking, as the tableau in (42) illustrates. By shifting the high tone onto the penult, the unintended winner satisfies *H $ \Big]{}_{\varphi } $ without violating a higher ranked constraint (42b).
This problem is avoided by eschewing a single-step shift operation. As in Kibondei, regressive shift requires an intermediate step with a floating feature. Whether this is optimal is determined by the constraint ranking. Thus, requiring an intermediate step crucially shapes the typology, echoing arguments against parallel theories of Gen (McCarthy Reference McCarthy, Baković, Itô and McCarthy2006, Reference McCarthy, Blaho and Bye2008b).
As a point of comparison, in their reported typologies, Gietz et al. (Reference Gietz, Jurgec and Percival2023) do not allow features to float. As they point out (p. 49), their results do not reflect a complete typology, but a subset where *Float is never dominated. Nevertheless, they spuriously conclude (pp. 51–53) that regressive shift to the penult is impossible without a single-step shift operation. Because regressive shift requires an intermediate optimum that violates *Float, it is excluded from their typology not because Gen is too restrictive, but because they do not provide a full typology.
The results of this section support the conclusions drawn in the previous two sections. Not only is a single-step shift operation superfluous, grammars that assume it undergenerate with respect to the attested typology.
5. Conclusion
Like the content of Con, exactly what operations are available to Gen is an empirical question. Every Harmonic Serialism grammar in part poses hypotheses about the sets of constraints and operations. When the attested typology is only consistent with a superset or subset of the assumed operations, it is necessary to change our hypothesis of what comprises Gen.
With respect to shift, an input–output mapping where a feature associated with some segment loses its underlying association and surfaces associated to another segment, McCarthy (Reference McCarthy, Baković, Itô and McCarthy2006, Reference McCarthy, Blaho and Bye2008b) argues that it is empirically adequate to restrict Gen to create or remove one autosegmental link per step. Gietz et al. (Reference Gietz, Jurgec and Percival2023), however, argue that that approach is infeasible and propose that Gen must also include a single-step shift operation. This paper refutes their arguments and supports the more restrictive hypothesis of Gen. The analyses of shift in Kibondei and Halkomelem above do not rely on a single-step shift operation and adequately model the data. Gietz et al.’s analyses, on the other hand, do not stand up to scrutiny and do not make a convincing argument in favor of a single-step shift operation. Furthermore, typological modeling reveals that a single-step shift operation reduces the empirical adequacy of Harmonic Serialism, rather than increasing it. I conclude that the more restricted operation set is empirically superior to the superset proposed by Gietz et al. (Reference Gietz, Jurgec and Percival2023).
A. Kibondei appendix
This appendix presents the full analysis of Kibondei; that is more tableaux are included here than in the text of the paper and tableaux include more candidates. Harmonically bounded candidates in tableaux are grayed out.
The Gen function is based on Lamont (Reference Lamont2022a, Reference Lamontb): it can delete any tone (linked or floating),Footnote 5 remove the leftmost/rightmost link of a tone, add a local link, and link a floating tone to any syllable. These operations are restricted by the No Crossing Constraint (Goldsmith Reference Goldsmith1976). As a simplification, I do not include candidates where a high tone is linked to a syllable that is already linked to a high tone. Lamont (Reference Lamont2022a, Reference Lamontb) allows this operation and automatically fuses the two high tones. This can be avoided by including a faithfulness constraint against fusion.
The constraints used in the analysis are defined below (43–52). Their ranking is represented in the Hasse diagram immediately following (53).
As a further simplification, because regressive linking does not improve on any of the constraints (54), I exclude those candidates.
The analysis ignores phrase-penultimate vowel lengthening and treats syllables as the tone-bearing units.
Tableaux included in this appendix:
A.1. Toneless verbs
A.1.1. Toneless prefix
Because there is no motivation to insert a high tone, no tableaux are needed.
A.1.2. High toned prefix
A.1.3 Phrase-medial
Because there is no motivation to insert a high tone, no tableaux are needed.
A.2. Monosyllabic verbs
A.2.1. Toneless prefix
A.2.2. High toned prefix
A.2.3. Phrase-medial
A.3. Disyllabic verbs
A.3.1. Toneless prefix
A.3.2. High toned prefix
A.3.3. Phrase-medial
A.4. Trisyllabic verbs
A.4.1. Infinitive
A.4.2. High toned prefix
A.4.3. Phrase-medial
B. Gietz et al.’s constraint definitions
This appendix provides Gietz et al.’s (Reference Gietz, Jurgec and Percival2023: 30-32) definitions of the constraints that appear in the support in (15).
C. Logoori appendix
D. Halkomelem appendix
This appendix presents the full analysis of Halkomelem; that is more tableaux are included here than in the text of the paper and tableaux include more candidates. Harmonically bounded candidates in tableaux are grayed out.
The Gen function is identical to that used for Kibondei with the addition of an operation that designates one syllable as the head of the word. The constraints used in the analysis not already defined are defined below (89–93). Their ranking is represented in the Hasse diagram immediately following (94).
Some of the candidates raise representational questions, such as (104d), where [low] is linked to a high vowel. Because this is never optimal, it is irrelevant for the analysis exactly how to interpret this. Likewise, when [low] is delinked from an underlying mid vowel as in (100m–n), it could plausibly result in either a mid vowel or schwa. Again, because these candidates are not optimal, the resolution of this question is irrelevant to the analysis.
E. Typology
This appendix presents the constraint rankings as skeletal bases (Brasoveanu & Prince Reference Brasoveanu and Prince2011) for the 38 languages produced in the typology. Bases in the left column are from the typology without a single-step shift operation, and those in the right column are from the typology with one.
Supplementary material
To view supplementary material for this article, please visit http://doi.org/10.1017/S0022226723000294.