论文标题
使用卷积神经网络独立于文本的作者识别
Text-independent writer identification using convolutional neural network
论文作者
论文摘要
独立于文本的作者识别方法不需要作者编写一些预定的文本。以前关于独立文本作家识别的研究基于识别专家设计的特定特定特征。但是,在过去的十年中,深度学习方法已成功地用于自动学习数据。我们在这里提出了一种独立于文本的作者识别的端到端深度学习方法,该方法不需要事先识别功能。卷积神经网络(CNN)最初是在提取本地特征的训练,该特征代表了整个角色图像及其子区域中单个手写的特征。从训练组中随机采样图像的元素用于训练CNN并汇总来自元组的图像的局部特征,以形成全局特征。对于每个训练时期,都重复随机采样元素的过程,这相当于准备大量的训练模式,以训练CNN以供无独立的作者识别。我们在离线手写日本角色模式的Jeita-HP数据库上进行了实验。我们的方法具有200个字符,可以将100个作家分类为99.97%。即使为100个作家使用50个字符或为400个作家使用100个字符,我们的方法也达到了92.80%或93.82%的准确性水平。我们对离线手写英语文本的消防员和IAM数据库进行了进一步的实验。我们的每个作者只使用一页训练,我们的方法实现了900多名作家的精度超过91.81%。总体而言,我们基于手工制作的功能和聚类算法的最佳成果比以前发表的最佳结果更好,这也证明了我们对手写英语文本的有效性。
The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97% to classify 100 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80% or 93.82%, respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81% accuracy to classify 900 writers. Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also.