14 - Observing and Measuring Speech Articulation

from Section III - Measuring Speech

Published online by Cambridge University Press: 11 November 2021

Rachael-Anne Knight
Affiliation:
City, University of London
Jane Setter
Affiliation:
University of Reading

Summary

The observation and measurement of the movement of the organs of the vocal tract during speech is relevant for the understanding of phonetic phenomena, from descriptions of under-documented languages and cross-linguistic comparison of speech sound production, to investigations of factors impacting speech motor planning, and to testing models of the relationship between the vocal tract and acoustics. This chapter describes the most commonly used methods for measuring or recording the position and movements of the organs that make up the vocal tract during speech. Techniques discussed in this chapter include direct vocal tract imaging (e.g. magnetic resonance imaging (MRI), laryngoscopy, ultrasound imaging), articulatory point tracking (e.g. X-ray microbeam tracking (XRMB), electromagnetic articulography (EMA), Velotrace), and indirect measures of articulator movement (e.g. electroglottography (EGG), airflow and air pressure measures, static palatography and electropalatography (EPG)). These methods vary in a number of respects. This chapter discusses advantages and drawbacks of each method described, as well as factors relevant to researchers during the planning stages of a study.

Type: Chapter
Publisher: Cambridge University Press
Print publication year: 2021
