Hostname: page-component-745bb68f8f-mzp66 Total loading time: 0 Render date: 2025-01-19T06:39:32.537Z Has data issue: false hasContentIssue false

Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent

Published online by Cambridge University Press:  19 July 2023

A. König*
Affiliation:
1Cobtek (Cognition, Behaviour, Technology) Lab, 1 Institut national de recherche en informatique et en automatique (INRIA), Valbonne, France
M. Mina
Affiliation:
2ki:elements GmbH, Saarbrücken, Germany
S. Schäfer
Affiliation:
2ki:elements GmbH, Saarbrücken, Germany
N. Linz
Affiliation:
2ki:elements GmbH, Saarbrücken, Germany
J. Tröger
Affiliation:
2ki:elements GmbH, Saarbrücken, Germany
*
*Corresponding author.

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.
Introduction

One of the major challenges in clinical psychiatry remains the absence of well established objective measures of symptoms’ severity. Clinical insights are mainly provided through keen behavioral observation and subjective questionnaires and scales.

Objectives

The aim of this paper is to predict depression severity through speech using the features extracted from the speech as provided by participants during a semi-structured dialogue with a virtual avatar.

Methods

We use data from a subset of the DAICWOZ dataset consisting in 142 dialogues between participants and a virtual avatar during which the avatar uses several prompts to maintain a conversation with the participant. The avatar uses prompts involving the topics of travel, dream jobs, and memorable experiences. From the speech generated from the dialogue, we extract participant utterances separated by prompt and extract features from the three sets of transcripts. We extract content features from the transcript and acoustic features from the excerpt corresponding to the speech from the participant for the prompt in question.We perform regression experiments on the PHQ8 items using the features extracted from each set of transcripts. Furthermore, we combine the features extracted from each set of transcripts and compute partial spearman correlations between them and the PHQ8 items using gender as a covariate.

Results

With our best regression model we obtain an R2 of 0.1, explaining 10% of the variance in the PHQ total score. Additionally, we obtain a mean absolute error of 1.25, suggesting that the regressor can detect with more or less precision clinically meaningful differences in depression severity between participants. Partial correlations between the total score and the features show significant correlations between features dependent on the amount of speech generated by each participant, along with the complexity of syntactic structures used.

Conclusions

Automatic analysis of spontaneous speech could help with the detection and monitoring of signs of depression. By combining the use of this technology with timely intervention strategies for instance provided by a virtual agent it could contribute to timely prevention.

Disclosure of Interest

A. König: None Declared, M. Mina Employee of: ki:elements GmbH, S. Schäfer Employee of: ki:elements GmbH, N. Linz Shareolder of: ki:elements GmbH, Employee of: ki:elements GmbH, J. Tröger Shareolder of: ki:elements GmbH, Employee of: ki:elements GmbH

Type
Abstract
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the European Psychiatric Association
Submit a response

Comments

No Comments have been published for this article.