评估决策树学习算法的方法

论文标题

评估决策树学习算法的方法

An Approach to Evaluating Learning Algorithms for Decision Trees

论文作者

Xiao, Tianqi, Timo, Omer Nguena, Avellaneda, Florent, Malik, Yasir, Bruda, Stefan

论文摘要

学习算法产生用于实现关键分类任务的软件模型。决策树模型比其他模型（例如神经网络）更简单，它们用于医疗和航空等各种关键领域。低或未知的学习能力算法不允许我们信任所产生的软件模型，这导致了昂贵的测试活动，以验证模型，并浪费学习时间，以防该模型由于学习的无能而可能是错误的。需要评估决策树学习能力以及其他模型的方法，尤其是因为学习模型的测试仍然是一个热门话题。我们提出了一种以甲骨文为中心的新方法来评估决策树的学习算法的（学习能力）。它包括从参考树中生成扮演Oracles角色的参考树，通过现有学习算法产生学习的树木，并通过将其与甲骨鱼进行比较来确定学到的树木的正确程度（DOE）。平均DOE用于估计学习算法的质量。我们根据提出的方法评估五种决策树学习算法。

Learning algorithms produce software models for realising critical classification tasks. Decision trees models are simpler than other models such as neural network and they are used in various critical domains such as the medical and the aeronautics. Low or unknown learning ability algorithms does not permit us to trust the produced software models, which lead to costly test activities for validating the models and to the waste of learning time in case the models are likely to be faulty due to the learning inability. Methods for evaluating the decision trees learning ability, as well as that for the other models, are needed especially since the testing of the learned models is still a hot topic. We propose a novel oracle-centered approach to evaluate (the learning ability of) learning algorithms for decision trees. It consists of generating data from reference trees playing the role of oracles, producing learned trees with existing learning algorithms, and determining the degree of correctness (DOE) of the learned trees by comparing them with the oracles. The average DOE is used to estimate the quality of the learning algorithm. the We assess five decision tree learning algorithms based on the proposed approach.

下载PDF全文

下载文献需遵守相关版权规定

论文标题