POS标签是否必要甚至有助于神经依赖性解析？

论文标题

POS标签是否必要甚至有助于神经依赖性解析？

Is POS Tagging Necessary or Even Helpful for Neural Dependency Parsing?

论文作者

Zhou, Houquan, Zhang, Yu, Li, Zhenghua, Zhang, Min

论文摘要

在深度学习前的时代，言论标签被认为是依赖性解析中功能工程的必不可少成分。但是，有很多作品集中在关节标签和解析模型上，以避免错误传播。相比之下，最近的研究表明，POS标记对于神经解析而言变得不那么重要甚至没有用，尤其是在使用基于字符的单词表示时。然而，从经验和语言上讲，没有足够的调查重点放在这个问题上。为了回答这一点，我们设计和比较了三个典型的多任务学习框架，即，基于最先进的Biaffine Parser，用于联合标记和解析，即共享浮动，股票和堆栈。考虑到注释POS标签比解析树更便宜，我们还研究了大规模异构POS标签数据的利用。我们对英语和中文数据集进行了实验，结果清楚地表明，使用堆栈关节框架时，POS标签（均质和异质性）仍然可以显着提高解析性能。我们进行详细的分析并从语言方面获得更多见解。

In the pre deep learning era, part-of-speech tags have been considered as indispensable ingredients for feature engineering in dependency parsing. But quite a few works focus on joint tagging and parsing models to avoid error propagation. In contrast, recent studies suggest that POS tagging becomes much less important or even useless for neural parsing, especially when using character-based word representations. Yet there are not enough investigations focusing on this issue, both empirically and linguistically. To answer this, we design and compare three typical multi-task learning framework, i.e., Share-Loose, Share-Tight, and Stack, for joint tagging and parsing based on the state-of-the-art biaffine parser. Considering that it is much cheaper to annotate POS tags than parse trees, we also investigate the utilization of large-scale heterogeneous POS tag data. We conduct experiments on both English and Chinese datasets, and the results clearly show that POS tagging (both homogeneous and heterogeneous) can still significantly improve parsing performance when using the Stack joint framework. We conduct detailed analysis and gain more insights from the linguistic aspect.

下载PDF全文

下载文献需遵守相关版权规定

论文标题