论文标题
用于图像识别的多空间神经网络
Multi-Subspace Neural Network for Image Recognition
论文作者
论文摘要
在图像分类任务中,特征提取始终是一个大问题。类内变异性增加了设计提取器的困难。此外,手工制作的功能提取器不能简单地调整新情况。最近,深度学习引起了人们对从数据自动学习功能的关注。在这项研究中,我们提出了多个空间神经网络(MSNN),该神经网络将卷积神经网络(CNN)的关键组成部分与子空间概念结合在一起。将子空间与深网络相关联是一种新颖的设计,提供了各种数据观点。通过自适应子空间自组织映射(ASSOM)跨越子空间训练的基础矢量,用作传输函数,以访问轴向组件并将接收场定义以提取数据的基本模式,而不会在视觉任务中扭曲拓扑。此外,多种空间策略被作为平行块实现,以适应现实世界数据,并对数据进行各种解释,希望能够更强大地处理阶层内变异性问题。为此,采用手写数字和对象图像数据集(即MNIST和COIL-20)进行分类来验证所提出的MSNN体系结构。实验结果表明,MSNN与其他最新方法具有竞争力。
In image classification task, feature extraction is always a big issue. Intra-class variability increases the difficulty in designing the extractors. Furthermore, hand-crafted feature extractor cannot simply adapt new situation. Recently, deep learning has drawn lots of attention on automatically learning features from data. In this study, we proposed multi-subspace neural network (MSNN) which integrates key components of the convolutional neural network (CNN), receptive field, with subspace concept. Associating subspace with the deep network is a novel designing, providing various viewpoints of data. Basis vectors, trained by adaptive subspace self-organization map (ASSOM) span the subspace, serve as a transfer function to access axial components and define the receptive field to extract basic patterns of data without distorting the topology in the visual task. Moreover, the multiple-subspace strategy is implemented as parallel blocks to adapt real-world data and contribute various interpretations of data hoping to be more robust dealing with intra-class variability issues. To this end, handwritten digit and object image datasets (i.e., MNIST and COIL-20) for classification are employed to validate the proposed MSNN architecture. Experimental results show MSNN is competitive to other state-of-the-art approaches.