论文标题
波斯依赖树库使通用
The Persian Dependency Treebank Made Universal
论文作者
论文摘要
我们描述了一种自动方法,用于将波斯依赖性树库(Rasooli et al,2013)转换为普遍依赖性。该树库包含29107个句子。我们的实验以及手动语言分析表明,与Uppsala Persian普遍依赖性Treebank(Seraji等,2016)相比,我们的数据与通用依赖性更兼容,并且规模更大,词汇量更大。我们的数据带来了标记的附件F-评分为85.2的监督解析。我们的省视波斯对英语解析器转移实验表明,对我们数据训练的解析模型比Seraji等人的数据绝对准确2%。 (2016)在标记的附件评分方面。
We describe an automatic method for converting the Persian Dependency Treebank (Rasooli et al, 2013) to Universal Dependencies. This treebank contains 29107 sentences. Our experiments along with manual linguistic analysis show that our data is more compatible with Universal Dependencies than the Uppsala Persian Universal Dependency Treebank (Seraji et al., 2016), and is larger in size and more diverse in vocabulary. Our data brings in a labeled attachment F-score of 85.2 in supervised parsing. Our delexicalized Persian-to-English parser transfer experiments show that a parsing model trained on our data is ~2% absolutely more accurate than that of Seraji et al. (2016) in terms of labeled attachment score.