论文标题
资产:用于调整和评估句子简化模型的数据集,具有多个重写转换
ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
论文作者
论文摘要
为了简化句子,人类编辑执行多个重写转换:他们将其分为几个较短的句子,措辞单词(即,通过更简单的同义词替换复杂的单词或短语),重新排序组件和/或删除被认为是不必要的信息。尽管有各种各样的文本更改范围,但使用侧重于单个转换(例如词汇释义或分裂)的数据集评估了自动句子简化的当前模型。这使得无法在更现实的设置中了解简化模型的能力。为了减轻这一限制,本文介绍了资产,这是一种用于评估英语简化句子的新数据集。资产是众包的多参考语料库,通过执行多个重写转换来产生每个简化。通过定量和定性实验,我们表明,与任务的其他标准评估数据集相比,资产中的简化更擅长捕获简单性的特征。此外,我们激励需要使用资产来开发更好的自动评估方法,因为我们表明当执行多个简化转换时,当前流行的指标可能不合适。
In order to simplify a sentence, human editors perform multiple rewriting transformations: they split it into several shorter sentences, paraphrase words (i.e. replacing complex words or phrases by simpler synonyms), reorder components, and/or delete information deemed unnecessary. Despite these varied range of possible text alterations, current models for automatic sentence simplification are evaluated using datasets that are focused on a single transformation, such as lexical paraphrasing or splitting. This makes it impossible to understand the ability of simplification models in more realistic settings. To alleviate this limitation, this paper introduces ASSET, a new dataset for assessing sentence simplification in English. ASSET is a crowdsourced multi-reference corpus where each simplification was produced by executing several rewriting transformations. Through quantitative and qualitative experiments, we show that simplifications in ASSET are better at capturing characteristics of simplicity when compared to other standard evaluation datasets for the task. Furthermore, we motivate the need for developing better methods for automatic evaluation using ASSET, since we show that current popular metrics may not be suitable when multiple simplification transformations are performed.