Exploring In-Context Learning Capabilities of ChatGPT for Pathological Speech Detection

Conference: Speech Communication - 16th ITG Conference
09/24/2025 - 09/26/2025 at Berlin, Germany

Proceedings: ITG-Fb. 321: Speech Communication

Pages: 5Language: englishTyp: PDF

Authors:
Amiri, Mahdi; Shahreza, Hatef Otroshi; Kodrasi, Ina

Abstract:
Automatic pathological speech detection approaches have shown promising results, gaining attention as potential diagnostic tools alongside costly traditional methods. Recently, it has been demonstrated that large language models (LLMs) can be leveraged for downstream tasks through few-shot in-context learning. In this paper, we investigate the use of multimodal LLMs, specifically ChatGPT-4o, for automatic pathological speech detection in a few-shot in-context learning setting. Experimental results demonstrate that this approach achieves competitive performance compared to state-of-the-art methods. To further understand its effectiveness, we conduct an ablation study to analyze the impact of different factors, such as input type and system prompts, on the final results. Our findings highlight the potential of multimodal LLMs for further exploration and advancement in automatic pathological speech detection.