The rapid advancement of large language models (LLMs) has enabled their integration into a wide range of scientific disciplines. This article introduces HydroLLM-Benchmark, a comprehensive benchmark dataset designed to evaluate recent LLMs in the hydrology domain. Drawing on a collection of research articles and hydrology textbooks, we generated hydrology-specific questions in several formats, including true/false, multiple-choice, open-ended, and fill-in-the-blank. These questions provide a robust foundation for evaluating how well state-of-the-art LLMs, including GPT-4o-mini, Llama3:8B, and Llama3.1:70B, address domain-specific queries. Our evaluation framework employs accuracy metrics for objective question types and cosine-similarity measures for subjective responses, ensuring a thorough assessment of the models’ proficiency in understanding and responding to hydrological content. The results underscore both the capabilities and the limitations of artificial intelligence (AI)-driven tools within this specialized field, providing valuable insights for future research and the development of educational resources. By introducing HydroLLM-Benchmark, this study contributes a vital resource to the growing body of work on domain-specific AI applications and demonstrates the potential of LLMs to support complex, field-specific tasks in hydrology.
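To make the two scoring paths concrete, the sketch below illustrates exact-match accuracy for objective items and embedding cosine similarity for open-ended answers. It is a minimal sketch only: the embedding model (`all-MiniLM-L6-v2` via the sentence-transformers library) and the case-insensitive exact-match rule are illustrative assumptions, not the configuration used in this study.

```python
# Minimal sketch of the two scoring paths described above.
# Assumptions (not from the paper): sentence-transformers with the
# "all-MiniLM-L6-v2" embedding model; case-insensitive exact matching
# for objective items.
import numpy as np
from sentence_transformers import SentenceTransformer

_embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model


def objective_accuracy(predictions, gold_answers):
    """Fraction of true/false, multiple-choice, or fill-in-the-blank
    items answered exactly correctly (case-insensitive)."""
    correct = sum(
        p.strip().lower() == g.strip().lower()
        for p, g in zip(predictions, gold_answers)
    )
    return correct / len(gold_answers)


def answer_similarity(model_answer, reference_answer):
    """Cosine similarity between embeddings of a model's open-ended
    answer and the reference answer."""
    vecs = _embedder.encode([model_answer, reference_answer])
    a, b = vecs[0], vecs[1]
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
```

Under this setup, objective question types are reported as a single accuracy score, while each open-ended response receives a similarity score in [-1, 1] against its reference answer.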