论文标题
通过检测最佳依赖图来改善基于启发式的过程发现方法
Improving Heuristic-based Process Discovery Methods by Detecting Optimal Dependency Graphs
论文作者
论文摘要
基于启发式的方法是过程发现领域中最受欢迎的方法之一。此类别的方法由两个主要步骤组成:1)发现依赖关系图2)确定依赖关系图的拆分/联接模式。基于启发式方法的当前依赖图图发现技术根据依赖项测量选择了图弧的初始集合,然后在某些标准上修改集合。这可能导致选择非最佳弧集。同样,修改可能会导致对罕见行为进行建模,从而导致低精度和非简单过程模型。因此,通过选择最佳弧集构建依赖图具有提高图质量的高潜力。因此,本文提出了一个新的整数线性编程模型,该模型确定了有关依赖关系度量的最佳图弧集。同时,提出的方法可以消除现有方法无法完全处理的其他一些问题。即,即使在循环的存在下,它也可以保证所有任务都处于从初始任务到最终任务的路径。这种方法还允许通过引入适当的约束来利用领域知识,这可能是现实世界中问题的实际优势。为了评估结果,我们修改了两种评估过程模型的现有方法,以使它们能够测量依赖图的质量。根据评估,就适应性,精度,尤其是简单性而言,所提出的方法的输出优于最突出的依赖图发现方法的输出。
Heuristic-based methods are among the most popular methods in the process discovery area. This category of methods is composed of two main steps: 1) discovering a dependency graph 2) determining the split/join patterns of the dependency graph. The current dependency graph discovery techniques of heuristic-based methods select the initial set of graph arcs according to dependency measures and then modify the set regarding some criteria. This can lead to selecting the non-optimal set of arcs. Also, the modifications can result in modeling rare behaviors and, consequently, low precision and non-simple process models. Thus, constructing dependency graphs through selecting the optimal set of arcs has a high potential for improving graphs quality. Hence, this paper proposes a new integer linear programming model that determines the optimal set of graph arcs regarding dependency measures. Simultaneously, the proposed method can eliminate some other issues that the existing methods cannot handle completely; i.e., even in the presence of loops, it guarantees that all tasks are on a path from the initial to the final tasks. This approach also allows utilizing domain knowledge by introducing appropriate constraints, which can be a practical advantage in real-world problems. To assess the results, we modified two existing methods of evaluating process models to make them capable of measuring the quality of dependency graphs. According to assessments, the outputs of the proposed method are superior to the outputs of the most prominent dependency graph discovery methods in terms of fitness, precision, and especially simplicity.