论文标题

UWB @ ducr-ita:CCA和正交转换的词汇语义变化检测

UWB @ DIACR-Ita: Lexical Semantic Change Detection with CCA and Orthogonal Transformation

论文作者

Pražák, Ondřej, Přibáň, Pavel, Taylor, Stephen

论文摘要

在本文中,我们描述了用于检测diacr-ita共享任务的词法语义变化(即,单词感随时间变化)的方法,我们在其中排名$ 1^{st} $。我们研究了从不同时间段选择的两个意大利语料库中特定单词之间的语义差异。我们的方法是完全无监督的,语言是独立的。它包括为每个语料库准备一个语义矢量空间。然后,我们使用CCA和正交转换来计算早期和后期空间之间的线性转换。最后,我们测量转化的向量之间的余弦。

In this paper, we describe our method for detection of lexical semantic change (i.e., word sense changes over time) for the DIACR-Ita shared task, where we ranked $1^{st}$. We examine semantic differences between specific words in two Italian corpora, chosen from different time periods. Our method is fully unsupervised and language independent. It consists of preparing a semantic vector space for each corpus, earlier and later. Then we compute a linear transformation between earlier and later spaces, using CCA and Orthogonal Transformation. Finally, we measure the cosines between the transformed vectors.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源