论文标题
自动语音和暂停措施的同时有效性在ALS中通过
Concurrent Validity of Automatic Speech and Pause Measures During Passage Reading in ALS
论文作者
论文摘要
分析肌萎缩性侧面硬化症患者(ALS)的语音测量可以为早期诊断和跟踪疾病进展提供必不可少的信息。但是,当前提取语音和暂停功能的方法是手动或半自动性的,这使它们既耗时又富含劳动力。语音文本一致性算法的出现为对ALS患者的语音测量的廉价,自动化和准确分析提供了机会。需要验证这些算法对当前黄金标准方法计算的语音和暂停功能。在这项研究中,我们从646个具有ALS和健康对照的人的音频文件中提取了8个语音/暂停功能。两种预估计的强制对齐模型 - 一种使用变压器,另一种使用高斯混合物 /隐藏的马尔可夫体系结构 - 用于自动提取。然后,根据半自动语音/暂停分析软件对结果进行验证,并基于音频质量和疾病的严重程度进行了进一步的亚组分析。使用基于变压器的强制排列提取的功能与黄金标准的一致性最高,包括音频质量和疾病严重程度。这项研究为临床医生的未来智能诊断支持系统和新颖的疾病进展方法远程远程追踪方法奠定了基础。
The analysis of speech measures in individuals with amyotrophic lateral sclerosis (ALS) can provide essential information for early diagnosis and tracking disease progression. However, current methods for extracting speech and pause features are manual or semi-automatic, which makes them time-consuming and labour-intensive. The advent of speech-text alignment algorithms provides an opportunity for inexpensive, automated, and accurate analysis of speech measures in individuals with ALS. There is a need to validate speech and pause features calculated by these algorithms against current gold standard methods. In this study, we extracted 8 speech/pause features from 646 audio files of individuals with ALS and healthy controls performing passage reading. Two pretrained forced alignment models - one using transformers and another using a Gaussian mixture / hidden Markov architecture - were used for automatic feature extraction. The results were then validated against semi-automatic speech/pause analysis software, with further subgroup analyses based on audio quality and disease severity. Features extracted using transformer-based forced alignment had the highest agreement with gold standards, including in terms of audio quality and disease severity. This study lays the groundwork for future intelligent diagnostic support systems for clinicians, and for novel methods of tracking disease progression remotely from home.