Annotating an oral corpus using the Text Encoding Initiative. Methodology, problems, solutions

Published online by Cambridge University Press: 01 March 2008

JANICE CARRUTHERS
Affiliation:
French Studies, Queen's University Belfast, Belfast BT7 1NN, Northern Ireland, UK e-mail: j.carruthers@qub.ac.uk

Abstract

The objective of this paper is to describe and evaluate the application of the Text Encoding Initiative (TEI) Guidelines to a corpus of oral French, this being the first corpus of oral French where the TEI has been used. The paper explains the purpose of the corpus, both in creating a specialist corpus of néo-contage that will broaden the range of oral corpora available, and, more importantly, in creating a dataset to explore a variety of oral French that has a particularly interesting status in terms of factors such as conception orale/écrite, réalisation médiale and comportement communicatif (Koch and Oesterreicher 2001). The linguistic phenomena to be encoded are both stylistic (speech and thought presentation) and syntactic (negation, detachment, inversion), and all represent areas where previous research has highlighted the significance of factors such as medium, register and discourse type, as well as a host of linguistic factors (syntactic, phonetic, lexical). After a discussion of how a tagset can be designed and applied within the TEI to encode speech and thought presentation, negation, detachment and inversion, the final section of the paper evaluates the benefits and possible drawbacks of the methodology offered by the TEI when applied to a syntactic and stylistic markup of an oral corpus.

Type
Articles
Copyright
Copyright © Cambridge University Press 2008

References

Armstrong, N. (2001). Social and Stylistic Variation in Spoken French. A Comparative Approach. Amsterdam/Philadelphia: Benjamins.
Armstrong, N. and Smith, A. (2002). The influence of linguistic and social factors on the recent decline of French ‘ne’. Journal of French Language Studies, 12: 23–41.
Ashby, W. (1982). The drift of French syntax. Lingua, 57: 29–46.
Ashby, W. (1988). The syntax, pragmatics and sociolinguistics of left- and right-dislocations in French. Lingua, 75: 203–229.
Ashby, W. (2001). Un nouveau regard sur la chute du ne en français parlé tourangeau. Journal of French Language Studies, 11: 1–22.
Ayres-Bennett, W., Carruthers, J. with Temple, R. (2001). Problems and Perspectives. Studies in the Modern French Language. London: Longman.
Barnes, B. (1985). The Pragmatics of Left Detachment in Spoken Standard French. Amsterdam: Benjamins.
Beeching, K. (2002). Gender, Politeness and Pragmatic Particles in French. Amsterdam/Philadelphia: Benjamins.
Bilger, M. (ed.) (2000). Corpus. Méthodologie et applications linguistiques. Paris: Champion.
Bilger, M. and Cappeau, P. (2004). L'oral ou la multiplication des styles. Langage et société, 109: 13–30.
Blasco-Dulbecco, M. (1999). Les Dislocations en français contemporain: étude syntaxique. Paris: Champion.
Cappeau, P. (1992). Le Sujet postposé en français contemporain. Université de Provence: thèse nouveau régime.
Cappeau, P. and Seijido, M. (2005). Les Corpus oraux en français. www.culture.gouv.fr/culture/dglf/recherche/corpus_parole/Presentation_Inventaire.pdf
Carruthers, J. (2005). Oral Narration in Modern French. A Linguistic Analysis of Temporal Patterns. Oxford: Legenda.
Carruthers, J. and Marnette, S. (2007). Tense, voices and point of view in medieval and modern oral narration. In: Labeau, E., Vetters, C. and Caudal, P. (eds.), Sémantique et diachronie du système verbal français. Amsterdam: Rodopi, pp. 177–202.
Clifford, P. (1973). Inversion of the Subject in French Narrative Prose from 1500 to the Present Day. Oxford: Blackwell.
Coveney, A. (1996). Variability in Spoken French. A Sociolinguistic Study of Interrogation and Negation. Exeter: Elm Bank.
Coveney, A. (1998). Awareness of linguistic constraints on variable ‘ne’ omission. Journal of French Language Studies, 8: 159–187.
De La Salle, B. (2004). Plaidoyer pour les arts de la parole. Vendôme: CLIO.
Delais-Roussarie, E. (2000). Constitution et annotation de corpus: méthode et recommandations. In: Delais-Roussarie, E. and Durand, J. (eds.), Corpus et variation en phonologie du français. Toulouse: Presses Universitaires du Mirail, pp. 91–125.
Engel, D. (1999). Radio talk: French and English perfects on air. Languages in Contrast, 2.2: 255–277.
Johansson, S. (1995). The approach of the Text Encoding Initiative to the encoding of spoken discourse. In: Leech, G., Myers, G. and Thomas, J. (eds.), Spoken English on Computer. Harlow: Longman, pp. 82–98.
Koch, P. and Oesterreicher, W. (2001). Gesprochene Sprache und geschriebene Sprache (Langage parlé et langage écrit). In: Holtus, G., Metzeltin, M. and Schmitt, C., Lexikon der Romanistischen Linguistik, Band 1.2. Tübingen: Max Niemeyer Verlag, pp. 584–627.
Lahousse, K. (2003). NP subject inversion in French and (preposed) adverbs. Amsterdam Studies in the Theory and History of Linguistic Science, 4: 161–176.
Lahousse, K. (2006). NP subject inversion in French: two types, two configurations. Lingua, 116: 424–461.
Leech, G. (2005). Adding linguistic annotation. In: Wynne, M. (ed.), Developing Linguistic Corpora. A Guide to Good Practice. Oxford: Oxbow, pp. 17–29.
Leech, G. and Short, M. (1981). Style in Fiction: A Linguistic Introduction to English Fictional Prose.
Lyons, J. (1981). Language and Linguistics. Cambridge: Cambridge University Press.
Marnette, S. (2005). Speech and Thought Presentation in French. Amsterdam: Benjamins.
Montelle, E. (1994). A quel temps se vouer. Dire, 21: 35–42.
Postal, P. (1989). Masked Inversion in French. Chicago: University of Chicago Press.
Semino, E. and Short, M. (2004). Corpus Stylistics. Speech, Writing and Thought Presentation in a Corpus of English Writing. London: Routledge.
Söll, L. (1983). Gesprochenes und geschriebenes Französisch. Berlin: Schmidt.
Thompson, P. (2005). Spoken language corpora. In: Wynne, M. (ed.), Developing Linguistic Corpora. A Guide to Good Practice. Oxford: Oxbow, pp. 59–70.