Southern Tati is a North-Western Iranian language in the Indo-European language family. Different varieties of this language group are spoken intermittently in the northern and northwestern parts of Iran, mainly in the Qazvin, Alborz, Markazi, Tehran, Ardabil, Gilan, Zanjan, and Khorasan-e-Shomali provinces. Previous linguistic work on the language consists of multiple descriptive grammars. These include Yarshater’s (Reference Yarshater1969) analysis of multiple Tati dialects including Takestani, Chali, Eshtehardi, Xiaraji, Ebrahim-abadi, Sagz-abadi, Danesfehani, Esfarvarini, and Xozini, as well as Taheri’s (Reference Taheri2009) and Rahmani & Rahmni’s (Reference Rahmani and Rahmani2021) analyses of the Takestani dialect. The variety of Southern Tati analyzed in this study is the Takestani dialect (tks, ISO 639-3). Takestani is spoken by the Tat community in the city of Takestan, known by its residents as Siayden (IPA: [sijʌˈten]). The speakers of Takestani know their dialect of the Tati language as Siaydiniji (IPA: [sijʌtiniˈd͡ʒi]), as ‘related to the residents of Siaden.’ Figure 1 shows the location of Takestan in the Qazvin province in Iran.
Southern Tati has been categorized by UNESCO’s Atlas of the World’s Languages in Danger project as ‘definitely endangered.’ According to Ethnologue, Takestani is classified as ‘shifting,’ as children no longer acquire the language. Although Ethnologue reports the number of Southern Tati speakers at 395,000 in 2009, the current number of its speakers and the number of its speakers in the city of Takestan is not clear to us. Among the several reasons for the endangered status of Takestani are (a) proximity to Tehran, the capital of Iran, where Persian, as the standard language of the country, is spoken, (b) the limitation of the educational system to teaching Persian, (c) the dominance of Persian in the media, (d) higher prestige attributed to Persian speakers who are socio-economically higher in the area, compared to the farmers in Takestan, and (e) limited resources for language research and documentation of Tati.
Southern Tati lacks a writing system of its own, and it has been passed down from generation to generation orally. Tati speakers have adapted the Persian alphabet, which itself is adapted from the Arabic writing system, into their writings, such as in personal correspondence. All the data in this study comes from the first author, a thirty-seven-year-old native female speaker of Takestani who was born and raised in Takestan before age eighteen. The speaker is also a native speaker of Persian and has learned Arabic and English since childhood. She has been living in the United States for eight years.
Consonants
Takestani has a laryngeal contrast (voiceless unaspirated vs. voiceless aspirated) for three of the four plosive places of articulation (bilabial, alveolar and velar) and the affricate series (voiced vs. voiceless). Yarshater (Reference Yarshater1969) previously analyzed the laryngeal contrast as being voiced vs. voiceless. He indicates that the voiceless stops are slightly aspirated, while there is no explicit discussion of the acoustic properties of the voiced series. Our analysis of the voice onset time characteristics (VOT; Lisker & Abramson, Reference Lisker and Abramson1964) of these sounds indicates that the ‘voiceless’ series has a moderate-to-long VOT value with noticeable aspiration, while the ‘voiced’ series occasionally shows minor pre-voicing but most often has a short lag VOT value with no aspiration. This analysis is supported by the VOT data presented in Table 1, which outlines the mean and standard deviations for VOT of initial plosives in Takestani. This makes it clear that the laryngeal contrast in plosives in the language is marked by aspiration and not voicing. There is no laryngeal contrast for the uvular plosive, but it has a short lag VOT with no aspiration. Yarshater (Reference Yarshater1969) previously used the symbol [q] to represent this sound despite writing that ‘it appears to be partially voiced’ (p. 34) and using the voiced plosive symbols for the other non-aspirated plosives in the language. The VOT analysis therefore helps to better show the relationship between the different plosives in Takestani.
The velar∼uvular plosive contrast is typologically rare. In the PHOIBLE database of phonemic inventories (Moran & McCloy Reference Moran and McCloy2019), only 232 out of the 3,020 inventories (7.7%) contain both a voiceless velar plosive and a voiceless uvular plosive. Figure 2 shows spectrograms for the words /kal/ ‘throat’ and /qar/ ‘anger’. The formant trajectories are noticeably different in /kal/ and /qar/. Furthermore, the velar stop causes the vowel /a/ to have a higher F2 value which has previously been identified as a cue for place difference between velar and uvular fricatives (Gordon et al. Reference Gordon, Barthmaier and Sands2002).
There is also a laryngeal contrast for the labiodental, alveolar and postalveolar articulated fricatives. The fricatives articulated further back in the vocal tract only occur in the voiceless form. Previous analyses of Takestani have not included fricative spectra. Here we provide information about both the spectral peak, as well as the shape of the spectrum for the voiceless fricatives in the language. The general pattern follows standard assumptions about voiceless fricatives: fricatives with places of articulation further back in the vocal tract have lower spectral peaks and the labiodental has a relatively flat shaped spectrum (Strevens Reference Strevens1960; Heinz & Stevens Reference Heinz and Stevens1961). An average FFT spectrum for each voiceless fricative in the language is shown in Figure 3. The spectra were computed in Praat (Boersma & Weenink Reference Boersma and Weenink2022) using a 5 ms Hamming window around the midpoint of 697 fricative tokens (Mean of 139.4 tokens per fricative; SD = 42.6). Spectral peak locations are listed in Table 2.
There are several words in the language that contain long/geminated consonants. The only consonants in the inventory that are not found in the geminate form are [v, ʒ, h, ʔ]. It is not entirely clear whether the geminate forms should be considered phonemes, allophones, or sequences of two identical segments. There does not seem to be a singular rule governing their distribution, but there are also no minimal pairs that contrast only for gemination. There are certain words that come close to being minimal pairs, such as /ˈtʰulɛ/ ‘cheek’ and /tʰuˈlːɛ/ ‘stable (n)’. Since these words also vary in stress location, it appears at first glance that gemination may be the result of stress, but words such as /poχːoasˈtʰan/ ‘cough-inf’ and /ˈtʰopːɛ/ ‘ball’ show that this is not a generalization throughout the language.
Another possibility for explaining gemination may be through loanword adaptation. Words originally adapted from Arabic such as /ˈelːʌ/ ‘except’ and /ˈvʌlːʌ/ ‘I swear to God’ retain the native language gemination, but native Takestani words such as /poχːoˈastʰ/ ‘he coughed’ and /takː-e-tim/ ‘mouth and face area’ show that gemination is not limited to just these loan words. Some geminated consonants can be explained through a process of compensatory lengthening due to syllable simplification (e.g., taʃtʰ-i-ʃɛ → tɛˈt͡ʃːiɛ ‘She/He has had it’). But once again, this cannot completely explain the data, as words with geminates such as /pet͡ʃːarˈt-an/ ‘graze-inf’ and /peˈkːert-em/ ‘plant.pst.1sg’ are not able to be analyzed the same way.
Two nasal stops are phonemic in Takestani, but additional nasal segments surface phonetically due to a process of nasal place assimilation. The labiovelar nasal [ɱ] is found in words like [saɱˈvʌr] ‘samovar’, the velar nasal [ŋ] is found in words like [saŋk] ‘stone’, and the uvular nasal [ɴ] is found in words like [t͡ʃʌɴˈqʌlɛ] ‘a traditional food’.
Takestani allows for many different types of CC clusters in codas, both in native words and loanwords. Rahmani’s (Reference Rahmani2021) dictionary of Takestani includes words containing 180 of the 506 possible coda clusters that can be made with the segment inventory of the language (36%). Table 3 shows all attested combinations. The cluster numbers can be further split to show that there is no hard limitation on the types of consonants that can combine, at least at a broad level. Using a sonority scale where the sonority of segment classes increases such that approximants > nasals > fricatives > affricates > stops, there are more examples of clusters where the sonority falls (eighty-four out of the 192 possible clusters; 44%) than examples where the sonority rises (sixty-one out of the 192 possible clusters; 32%). There are an additional thirty-five out of 122 possible clusters where the sonority plateaus (29%). /r/ is the segment most likely to appear first in a consonant cluster and is attested in eighteen out of the ternty-one possible clusters. /s/, /m/, and /t/ are the most likely consonants to appear as the second consonant in a consonant cluster. Each of them appears in fourteen out of the twenty-one possible non-geminate clusters that can be constructed for each phone. In some cases, there is variation when a word ends with an obstruent-resonant sequence, such as /sefr/ ‘zero’, which sometimes is pronounced with an [e] vowel breaking up the cluster and other times with no intervening vowel.
Vowels
Takestani has eight phonemic vowels /i e ɛ ø u o ʌ a/. Noteworthy is the inclusion of the mid-front rounded vowel /ø/, which is probably due to language contact with Azerbaijani and other Turkic languages. Figure 4 shows the phonemic layout of the Takestani vowel system while Figure 5 shows the acoustic space for the vowel system. All vowels can be allophonically nasalized when appearing before or after a nasal consonant, but the nasalization is stronger and more likely in the carryover nasalization cases. There is no contrastive vowel length, but length differences do appear due to stress, which will be explained in more detail below. Additionally, vowels at the end of words and phrases are generally lengthened.
Formant measurements were taken at the midpoint of vowel tokens that appeared in non-final open syllables and were not flanked by a nasal consonant. Additional measurements of duration, intensity and f0 were taken. Table 4 lists the mean and standard deviation measurements for each vowel grouped by stress. In total, 892 vowel tokens were included (239 stressed; 653 unstressed). Table 4 also included counts for each individual vowel by condition. There were no unstressed tokens of [a] in the subset of the data set used, but for all other vowels, the stressed variant was longer, louder, and higher in pitch than the unstressed variant. Furthermore, Figure 4 shows that while there is no categorical reduction, the formant space does reduce when vowels are unstressed. Figure 5 also highlights that the vowel transcribed as /ʌ/ has significant variation along the F1 dimension. When stressed, it can appear phonetically much closer to [ɑ] in specific tokens.
Prosody
Stress
Stress is contrastive in Takestani words and partly predictable from word class and shape. Stress in nouns is generally ultimate (e.g., /peˈrʌ/ ‘brother’) unless in nouns ending with the feminine suffix -ɛ (e.g., /zɛjˈni.ɛ/ ‘woman’). Most adverbs and adjectives, like nouns, are stressed on their last syllable, such as [pøland-pøˈland] ‘loud-loud (loudly)’ and [qaˈʃank] ‘beautiful.’ Adpositions are primarily mono-syllabic and are typically cliticized to a nominal complement, which is stressed. An example is the noun-postposition – [zʌˈrine rʌ] ‘child for (for the child)’ in which the nominal component [zʌˈrin] rather than the adposition rʌ is stressed. At the phrase level, nouns attract stress, so adverbs and adjectives are typically less prominent overall. For example, in the adjective-noun sequence [qaʃankɛ zʌˈrin] ‘beautiful child,’ the noun [zʌˈrin] ‘child’ rather than the adjective has a prominent stress.
Stress assignment in verbs depends on the presence of specific morpho-syntactic features or stress-attracting morphemes that compete for stress in a hierarchy. For example, present tense verbs are generally stressed on their last syllable (e.g., [meχanˈte] ‘laugh.3sg.m.ind (He laughs)’, and past tense verbs are stressed on their stem /ʌˈχɑrtem/ ‘pv.drink.pst.1sg (I drank)’. The interfaces with morphology for stress assignment of verbs go beyond the scope of this study.
Regarding acoustic properties of stress in Takestani, stressed syllables often appear to be longer than their unstressed counterparts. In our complete vowel data set of 2,579 tokens, there are 1,055 stressed vowels and 1,524 unstressed vowels. The stressed vowels have a mean duration of 125.99 ms while the unstressed vowels have a mean duration of 105.18 ms. This difference turns out to be correlated with the fact that stress often falls on final vowels and final vowels lengthen by default.Footnote 1 This suggests that duration is not used as a cue for prominence in Takestani. Both f0 and intensity also interact with finality, but unlike duration, they also significantly cue prominence as f0 is higher in stressed vowels and lower in final vowels while intensity is higher in both stressed vowels and final vowels.Footnote 2 Figures 6, 7 and 8 provide boxplots of duration (ms), f0 (Hz), and intensity (dB) in stressed and unstressed position and broken up by whether the syllable is a final or non-final syllable.
There are some examples of minimal pairs that directly contrast stress between final and non-final position, such as the nouns /vɛˈjɛ/ ‘wedding’ and /ˈvɛj-ɛ/ ‘bride’ or the verbs /uˈvaz/ ‘dancer’ and /ˈuvaz/ ‘dance!’. The spectrograms of the recordings of the latter two words are shown in Figure 9. The analysis above indicates that vowels in stressed syllables typically have a higher average intensity and a higher average f0 value than their unstressed counterparts. Curves indicating both intensity (dashed) and f0 (solid) are superimposed over the spectrogram. While there has been much influence from Persian on Takestani, it does not seem to extend into the prosodic domain. Following Beckman (Reference Beckman2012), we differentiate pitch-accent languages from non-pitch-accent languages based on whether other cues beyond f0 indicate prominence. Persian has been argued to be a pitch accent language due to its reliance on f0 for cueing word-level prominence (Mahjani Reference Mahjani2003; Abolhasanizadeh et al. Reference Abolhasanizadeh, Gussenhoven and Bijankhan2011), but since intensity also significantly cues word-level prominence in Takestani, the use of f0 appears to serve a supplementary rather than primary purpose in this domain.
Syllable structure
Various analyses of syllable structure can be made for different dialects of Takestani and related languages, such as Persian. Scott (Reference Scott1964) suggests that the Persian syllable structure is CV(C)(C) by assuming that any vowel-initial word has a glottal stop in the initial position. Asadi et al. (Reference Asadi, Hariry and Kiyani2014) say that for the Kajal dialect of Tati that the syllable structure is (C)V(C)(C), and the glottal stop sound is not present phonemically or phonetically. The Takestani dialect does allow glottal stops, at least at the phonetic level. This can be seen word-initially in the examples in Figure 6, but glottal stops in Takestani also appear in non-word initial positions such as [zɛˈʔif] ‘week.’ That being said, glottal stops are often realized phonetically as the lengthening of a neighboring vowel. This corresponds with Yarshater (Reference Yarshater1969)’s interpretation of the glottal stop phoneme. Additionally, the variation between producing the glottal stop or lengthening the vowel is often tied to a speaker’s knowledge of the origin and spelling of a given word. For these reasons, it seems possible that for Takestani, the syllable structure is (C)V(C)(C).
Illustrative passage
English version of the passage
The North Wind and the Sun were disputing, which was the strongest, when a traveler came along wrapped in a warm cloak. They agreed that the one who first succeeded in making the traveler take off his cloak should be considered stronger than the other. Then the North Wind blew as hard as he could, but the more he blew the more closely did the traveler fold his cloak around him; and at last, the North Wind gave up the attempt. Then the Sun shone out warmly, and immediately the traveler took off his cloak. And so, the North Wind was obliged to confess that the Sun was the stronger of the two.
Phonemic transcription
Abbreviations
- 1
First Person
- 3
Third person
- adj
Adjective
- adv
Adverb
- ar
Areal (place, time)
- caus
Causative marker
- com
Comparative adjective
- comp
Complementizer
- conj
Conjunction
- cop
Copula
- det
Determiner
- dist
Distal
- e
Eventive
- ez
Ezafe marker
- ind
Indicative mood
- inf
Infinitive
- m
Masculine
- n
Noun
- nom
Nominative
- num
Numeral
- pl
Plural
- poss
Possessive pronoun
- post
Postposition
- prn
Pronoun
- prog
Progressive aspect
- prs
Present time
- pst
Past tense
- pv
Preverbal morpheme
- prox
Proximal
- refl
Reflexive pronoun
- sg
Singular
- v
Verb
Supplementary material
To view supplementary material for this article (including audio files to accompany the language examples), please visit https://doi.org/10.1017/S0025100323000270.
Acknowledgments
We would like to express our great appreciation to Dr. Marie Huffman for her valuable suggestions during the planning and development of this paper. Her willingness to give her time generously has been very much appreciated. Our gratitude also goes to Dr. Jeffrey Heinz for his constructive advice.
We would also like to thank Dr. Richard Larson and Dr. Vida Samiian for their professional guidance and enthusiastic encouragement of this research work. Our grateful thanks are extended to Roshan Cultural Heritage Institute (RCHI) and Dr. Elahé Omidyar Mir-Djalali for funding and supporting the research project on Southern Tati at Stony Brook University.
Furthermore, we extend our sincere appreciation to the countless individuals within the Tat community in Takestan (Siyaden) in Iran, who helped us verify the data. Their willingness to share personal experiences, linguistic insights, and cultural nuances has greatly enriched our understanding of the language and its heritage.