论文标题
卷积神经网络和转移学习策略,以三种不同语言的语音对帕金森氏病进行分类
Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson's Disease from Speech in Three Different Languages
论文作者
论文摘要
帕金森氏病患者会产生不同的语音障碍,影响其沟通能力。对患者言语的自动评估允许开发计算机辅助工具,以支持诊断和评估疾病的严重程度。本文介绍了一种用三种不同语言的语音分类的方法,可以对帕金森氏病进行分类:西班牙,德语和捷克语。拟议的方法考虑了经过时间频率表示训练的卷积神经网络和三种语言之间的转移学习策略。转移学习方案旨在提高模型的准确性,当时神经网络的权重用与测试集使用的语言不同的语言初始化。结果表明,当用来初始化分类器的权重的基本模型足够强大时,提出的策略可提高模型的准确性高达8 \%。此外,在转移学习后获得的结果在特异性敏感性方面比没有转移学习策略的训练的结果更平衡。
Parkinson's disease patients develop different speech impairments that affect their communication capabilities. The automatic assessment of the speech of the patients allows the development of computer aided tools to support the diagnosis and the evaluation of the disease severity. This paper introduces a methodology to classify Parkinson's disease from speech in three different languages: Spanish, German, and Czech. The proposed approach considers convolutional neural networks trained with time frequency representations and a transfer learning strategy among the three languages. The transfer learning scheme aims to improve the accuracy of the models when the weights of the neural network are initialized with utterances from a different language than the used for the test set. The results suggest that the proposed strategy improves the accuracy of the models in up to 8\% when the base model used to initialize the weights of the classifier is robust enough. In addition, the results obtained after the transfer learning are in most cases more balanced in terms of specificity-sensitivity than those trained without the transfer learning strategy.