Estimating the number of segments for improving dialogue act labelling†

VICENT TAMARIT; CARLOS-D. MARTÍNEZ-HINAREJOS; JOSÉ-MIGUEL BENEDÍ

doi:10.1017/S135132491000032X

Published online by Cambridge University Press: 14 February 2011

Affiliation (all authors): Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Valencia, Spain
e-mail: vtamarit@iti.upv.es, cmartine@iti.upv.es, jbenedi@iti.upv.es

Abstract

In dialogue systems it is important to label dialogue turns with dialogue-related meaning. Each turn is usually divided into segments, and each segment is labelled with one dialogue act (DA), a representation of the segment's functional role in the ongoing discourse. The dialogue manager uses the sequence of DAs assigned to a turn to understand that turn. Probabilistic models that perform DA labelling can be applied to segmented or unsegmented turns. The latter case is more likely in a practical dialogue system, but it yields poorer results. In that case, a hypothesis for the number of segments can be provided to improve the results. We propose several methods to estimate the probability of the number of segments from the transcription of the turn, and we include this estimate in a new labelling model. We tested the approach on two dialogue corpora, SwitchBoard and Dihana. The results show that including the estimate significantly improves labelling accuracy.
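
The abstract does not say which estimators the authors use, so the following is only an illustrative sketch of the general idea, not the paper's method. It assumes (hypothetically) that the number of segments is predicted from the number of words in the transcribed turn, using smoothed relative frequencies from a segmented training corpus, and that this estimate is added in the log domain to the score of the best DA labelling found for each candidate segment count. The names `train_segment_count_model` and `pick_segment_count`, the smoothing constant `alpha`, and all numbers are invented for illustration.

```python
import math
from collections import Counter, defaultdict


def train_segment_count_model(turns, max_segments, alpha=1.0):
    """Estimate P(num_segments | num_words) with additive smoothing.

    turns: iterable of (num_words, num_segments) pairs taken from a
    segmented training corpus. Assumes max_segments is at least the
    largest segment count observed, so probabilities sum to one.
    """
    counts = defaultdict(Counter)
    for num_words, num_segments in turns:
        counts[num_words][num_segments] += 1

    def prob(num_segments, num_words):
        seen = counts.get(num_words, Counter())
        total = sum(seen.values())
        return (seen[num_segments] + alpha) / (total + alpha * max_segments)

    return prob


def pick_segment_count(labelling_log_scores, num_words, prob):
    """Choose the segment count with the best combined log-score.

    labelling_log_scores: dict mapping a candidate segment count k to the
    log-score of the best DA labelling that uses exactly k segments
    (hypothetical output of an unsegmented labelling model).
    """
    return max(
        labelling_log_scores,
        key=lambda k: labelling_log_scores[k] + math.log(prob(k, num_words)),
    )


# Toy data: (number of words, number of segments) per training turn.
training_turns = [(5, 1), (5, 1), (5, 2), (12, 3)]
prob = train_segment_count_model(training_turns, max_segments=3)

# Made-up labelling scores for a 5-word turn; on their own they favour k = 3.
scores = {1: -10.0, 2: -9.8, 3: -9.0}
best_k = pick_segment_count(scores, num_words=5, prob=prob)
```

In this toy case the labelling scores alone would select three segments, while the combined score selects one, because five-word turns in the toy training data usually contain a single segment.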

Information

Type: Articles
Information: Natural Language Engineering, Volume 18, Issue 1, January 2012, pp. 1–19

DOI: https://doi.org/10.1017/S135132491000032X
Copyright: © Cambridge University Press 2011
