Supervised Speech Representation Learning for Parkinson’s Disease Classification
Conference: Speech Communication - 14th ITG Conference
29.09.2021 - 01.10.2021, online
Proceedings: ITG-Fb. 298: Speech Communication
Pages: 5
Language: English
Type: PDF
Authors:
Janbakhshi, Parvaneh (Idiap Research Institute, Martigny & École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland)
Kodrasi, Ina (Idiap Research Institute, Martigny, Switzerland)

Abstract:
Recently proposed automatic pathological speech classification techniques use unsupervised auto-encoders to obtain a high-level abstract representation of speech. Since these representations are learned by reconstructing the input, there is no guarantee that they are robust to pathology-unrelated cues such as speaker identity information. Further, these representations are not necessarily discriminative for pathology detection. In this paper, we exploit supervised auto-encoders to extract robust and discriminative speech representations for Parkinson’s disease classification. To reduce the influence of speaker variability unrelated to pathology, we propose to obtain speaker identity-invariant representations by adversarial training of an auto-encoder and a speaker identification task. To obtain a discriminative representation, we propose to jointly train an auto-encoder and a pathological speech classifier. Experimental results on a Spanish database show that the proposed supervised representation learning methods yield more robust and discriminative representations for automatically classifying Parkinson’s disease speech, outperforming the baseline unsupervised representation learning system.
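
The two supervised objectives described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the loss formulation, so the function names, the weights `lam` and `alpha`, and the choice of mean squared error and cross-entropy are all assumptions made for illustration. The sketch shows one common way to write such objectives from the encoder's point of view: the reconstruction error is always minimized, a pathology classification term is added for joint training, and a speaker identification term is subtracted for adversarial training.

```python
import math


def mse(x, x_hat):
    """Mean squared reconstruction error between input and auto-encoder output."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)


def cross_entropy(probs, label):
    """Cross-entropy of predicted class probabilities against an integer label."""
    return -math.log(max(probs[label], 1e-12))


def encoder_objective(x, x_hat,
                      spk_probs=None, spk_label=None, lam=0.0,
                      pd_probs=None, pd_label=None, alpha=0.0):
    """Hypothetical encoder-side loss (weights lam and alpha are assumptions).

    - Reconstruction term: always present (plain auto-encoder).
    - Joint training: ADD the pathology classification loss, so the
      representation becomes discriminative for the disease label.
    - Adversarial training: SUBTRACT the speaker identification loss, so the
      encoder is rewarded when the speaker classifier does badly.
    """
    loss = mse(x, x_hat)
    if pd_probs is not None:
        loss += alpha * cross_entropy(pd_probs, pd_label)
    if spk_probs is not None:
        loss -= lam * cross_entropy(spk_probs, spk_label)
    return loss


if __name__ == "__main__":
    x, x_hat = [0.2, 0.4, 0.6], [0.25, 0.35, 0.6]
    # Speaker classifier is confident about speaker 0: bad for the encoder.
    confident = encoder_objective(x, x_hat, spk_probs=[0.97, 0.01, 0.01, 0.01],
                                  spk_label=0, lam=1.0)
    # Speaker classifier is at chance level: good for the encoder.
    confused = encoder_objective(x, x_hat, spk_probs=[0.25] * 4,
                                 spk_label=0, lam=1.0)
    print(confused < confident)
```

In an adversarial setup of this kind, the speaker classifier itself would be trained in alternation to minimize its own cross-entropy, while the encoder minimizes the objective above; the abstract does not state whether the two supervised objectives are combined or evaluated separately, so the sketch keeps both terms optional.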


