论文标题
在指导的非负矩阵分解上
On a Guided Nonnegative Matrix Factorization
论文作者
论文摘要
完全无监督的主题模型在文档群集和分类中获得了巨大的成功。但是,当数据偏向一组功能时,这些模型通常会遭受学习不太卑鄙甚至多余的主题的趋势。因此,我们提出了一种基于被认为\ textit {指导nmf}的非负矩阵分解(NMF)模型的方法,该模型包含了用户设计的种子单词监督。我们的实验结果证明了该模型的希望,并说明它与此类同类的其他方法具有竞争力,只有很少的监督信息。
Fully unsupervised topic models have found fantastic success in document clustering and classification. However, these models often suffer from the tendency to learn less-than-meaningful or even redundant topics when the data is biased towards a set of features. For this reason, we propose an approach based upon the nonnegative matrix factorization (NMF) model, deemed \textit{Guided NMF}, that incorporates user-designed seed word supervision. Our experimental results demonstrate the promise of this model and illustrate that it is competitive with other methods of this ilk with only very little supervision information.