Exploring In-Context Learning Capabilities of ChatGPT for Pathological Speech Detection
Conference: Speech Communication - 16th ITG Conference
09/24/2025 - 09/26/2025 at Berlin, Germany
Proceedings: ITG-Fb. 321: Speech Communication
Pages: 5Language: englishTyp: PDF
Authors:
Amiri, Mahdi; Shahreza, Hatef Otroshi; Kodrasi, Ina
Abstract:
Automatic pathological speech detection approaches have shown promising results, gaining attention as potential diagnostic tools alongside costly traditional methods. Recently, it has been demonstrated that large language models (LLMs) can be leveraged for downstream tasks through few-shot in-context learning. In this paper, we investigate the use of multimodal LLMs, specifically ChatGPT-4o, for automatic pathological speech detection in a few-shot in-context learning setting. Experimental results demonstrate that this approach achieves competitive performance compared to state-of-the-art methods. To further understand its effectiveness, we conduct an ablation study to analyze the impact of different factors, such as input type and system prompts, on the final results. Our findings highlight the potential of multimodal LLMs for further exploration and advancement in automatic pathological speech detection.

