论文标题

阈值独立评估声音事件检测分数

Threshold Independent Evaluation of Sound Event Detection Scores

论文作者

Ebbers, Janek, Serizel, Romain, Haeb-Umbach, Reinhold

论文摘要

对声音事件检测(SED)系统进行足够的评估远非微不足道,并且仍在进行持续的研究。最近提出的复合声音检测(PSD) - 接种操作特征(ROC)和PSD评分(PSD)迈出了重要的一步,迈向了SED系统评估的方向,该系统独立于某个决策阈值。这允许获得更完整的整体系统行为的图片,而整体系统行为则不太受阈值调整的偏见。但是,当前仅使用有限的阈值来近似PSD-ROC。但是,近似中使用的阈值的选择可能会对所得PSD产生严重影响。在本文中,我们提出了一种方法,该方法允许在所有可能的阈值的评估集上计算系统性能,不仅可以准确地计算PSD-ROC和PSD,还可以对其他基于锁骨和基于相交的性能曲线进行计算。它进一步允许选择最能满足给定应用程序要求的阈值。源代码在我们的SED评估软件包SED_SCORES_EVAL中公开可用。

Performing an adequate evaluation of sound event detection (SED) systems is far from trivial and is still subject to ongoing research. The recently proposed polyphonic sound detection (PSD)-receiver operating characteristic (ROC) and PSD score (PSDS) make an important step into the direction of an evaluation of SED systems which is independent from a certain decision threshold. This allows to obtain a more complete picture of the overall system behavior which is less biased by threshold tuning. Yet, the PSD-ROC is currently only approximated using a finite set of thresholds. The choice of the thresholds used in approximation, however, can have a severe impact on the resulting PSDS. In this paper we propose a method which allows for computing system performance on an evaluation set for all possible thresholds jointly, enabling accurate computation not only of the PSD-ROC and PSDS but also of other collar-based and intersection-based performance curves. It further allows to select the threshold which best fulfills the requirements of a given application. Source code is publicly available in our SED evaluation package sed_scores_eval.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源