论文标题

猫攀登需要哺乳动物移动:保留构图分布语义中的替象

Cats climb entails mammals move: preserving hyponymy in compositional distributional semantics

论文作者

Cuevas, Gemma De las, Klingler, Andreas, Lewis, Martha, Netzer, Tim

论文摘要

为了提供基于矢量的含义更大结构的表示,一种方法是使用阳性半限定(PSD)矩阵。这些使我们能够建模单词的相似性以及sipyny或is-a关系。可以在给定的矢量空间$ m \ otimes m^*$中相对容易地学习PSD矩阵,但是要撰写单词以形成短语和句子,我们需要在较大空间中的表示形式。在本文中,我们介绍了一种构成与单词相对应的PSD矩阵的通用方法。我们建议将动词,形容词和其他功能单词的PSD矩阵提升为完全正面的(CP)图,使其语法类型匹配。这种提升是由我们称为压缩的组成规则进行的。与以前的构图规则(例如Fuzz和Phaser(又称Kmult and Bmult))相比,构成伪证。从数学上讲,compr本身就是CP图,因此是线性的,通常是非交通性的。我们根据蜘蛛,杯子和盖子为组合的结构提供了许多建议,并生成了一系列组成规则。我们在小句子需要数据集上测试这些规则,并在模糊和移相器的性能方面进行了一些改进。

To give vector-based representations of meaning more structure, one approach is to use positive semidefinite (psd) matrices. These allow us to model similarity of words as well as the hyponymy or is-a relationship. Psd matrices can be learnt relatively easily in a given vector space $M\otimes M^*$, but to compose words to form phrases and sentences, we need representations in larger spaces. In this paper, we introduce a generic way of composing the psd matrices corresponding to words. We propose that psd matrices for verbs, adjectives, and other functional words be lifted to completely positive (CP) maps that match their grammatical type. This lifting is carried out by our composition rule called Compression, Compr. In contrast to previous composition rules like Fuzz and Phaser (a.k.a. KMult and BMult), Compr preserves hyponymy. Mathematically, Compr is itself a CP map, and is therefore linear and generally non-commutative. We give a number of proposals for the structure of Compr, based on spiders, cups and caps, and generate a range of composition rules. We test these rules on a small sentence entailment dataset, and see some improvements over the performance of Fuzz and Phaser.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源