Paper title
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
Paper authors
Paper abstract
Morality in dialogue systems has recently attracted considerable research attention. A moral dialogue system aligned with users' values could enhance conversation engagement and user connections. In this paper, we propose a framework, MoralDial, to train and evaluate moral dialogue systems. In our framework, we first explore the communication mechanisms of morality and resolve expressed morality into three parts, which together provide a roadmap for building a moral dialogue system. Based on that, we design a simple yet effective method: constructing moral discussions between simulated specific users and the dialogue system. The constructed discussions consist of expressing, explaining, revising, and inferring moral views in dialogue exchanges, which lets conversational models learn morality well in a natural manner. Furthermore, we propose a novel evaluation method under the framework. We evaluate multiple aspects of morality by judging the relation between dialogue responses and human values in discussions, with particular attention to the multifaceted nature of morality. Automatic and manual experiments demonstrate that our framework is a promising way to train and evaluate moral dialogue systems.