深林的解释和简化

论文标题

深林的解释和简化

Interpretation and Simplification of Deep Forest

论文作者

Kim, Sangwon, Jeong, Mira, Ko, Byoung Chul

论文摘要

本文提出了一种新的方法，用于解释和简化深层随机森林（RF）的黑匣子模型，并使用拟议的规则消除。在深RF中，大量决策树连接到多层，从而使分析变得困难。它具有与深神经网络（DNN）相似的高性能，但实现了更好的概括。因此，在这项研究中，我们考虑以决策规则集的形式量化全面训练的深RF的特征贡献和频率。该功能贡献为确定功能如何影响规则集中的决策过程提供了基础。模型简化是通过测量功能贡献来消除不必要的规则来实现的。因此，简化模型的参数和规则比以前更少。实验结果表明，特征贡献分析允许分解黑匣子模型以定量解释规则集。尽管消除了大量规则，但该方法已成功地应用于各种深层RF模型和基准数据集，同时保持了强大的性能。

This paper proposes a new method for interpreting and simplifying a black box model of a deep random forest (RF) using a proposed rule elimination. In deep RF, a large number of decision trees are connected to multiple layers, thereby making an analysis difficult. It has a high performance similar to that of a deep neural network (DNN), but achieves a better generalizability. Therefore, in this study, we consider quantifying the feature contributions and frequency of the fully trained deep RF in the form of a decision rule set. The feature contributions provide a basis for determining how features affect the decision process in a rule set. Model simplification is achieved by eliminating unnecessary rules by measuring the feature contributions. Consequently, the simplified model has fewer parameters and rules than before. Experiment results have shown that a feature contribution analysis allows a black box model to be decomposed for quantitatively interpreting a rule set. The proposed method was successfully applied to various deep RF models and benchmark datasets while maintaining a robust performance despite the elimination of a large number of rules.

下载PDF全文

下载文献需遵守相关版权规定

论文标题