用于图像识别的多空间神经网络

论文标题

用于图像识别的多空间神经网络

Multi-Subspace Neural Network for Image Recognition

论文作者

Fang, Chieh-Ning, Lin, Chin-Teng

论文摘要

在图像分类任务中，特征提取始终是一个大问题。类内变异性增加了设计提取器的困难。此外，手工制作的功能提取器不能简单地调整新情况。最近，深度学习引起了人们对从数据自动学习功能的关注。在这项研究中，我们提出了多个空间神经网络（MSNN），该神经网络将卷积神经网络（CNN）的关键组成部分与子空间概念结合在一起。将子空间与深网络相关联是一种新颖的设计，提供了各种数据观点。通过自适应子空间自组织映射（ASSOM）跨越子空间训练的基础矢量，用作传输函数，以访问轴向组件并将接收场定义以提取数据的基本模式，而不会在视觉任务中扭曲拓扑。此外，多种空间策略被作为平行块实现，以适应现实世界数据，并对数据进行各种解释，希望能够更强大地处理阶层内变异性问题。为此，采用手写数字和对象图像数据集（即MNIST和COIL-20）进行分类来验证所提出的MSNN体系结构。实验结果表明，MSNN与其他最新方法具有竞争力。

In image classification task, feature extraction is always a big issue. Intra-class variability increases the difficulty in designing the extractors. Furthermore, hand-crafted feature extractor cannot simply adapt new situation. Recently, deep learning has drawn lots of attention on automatically learning features from data. In this study, we proposed multi-subspace neural network (MSNN) which integrates key components of the convolutional neural network (CNN), receptive field, with subspace concept. Associating subspace with the deep network is a novel designing, providing various viewpoints of data. Basis vectors, trained by adaptive subspace self-organization map (ASSOM) span the subspace, serve as a transfer function to access axial components and define the receptive field to extract basic patterns of data without distorting the topology in the visual task. Moreover, the multiple-subspace strategy is implemented as parallel blocks to adapt real-world data and contribute various interpretations of data hoping to be more robust dealing with intra-class variability issues. To this end, handwritten digit and object image datasets (i.e., MNIST and COIL-20) for classification are employed to validate the proposed MSNN architecture. Experimental results show MSNN is competitive to other state-of-the-art approaches.

下载PDF全文

下载文献需遵守相关版权规定

论文标题