论文标题

多个代码用于有效图像检索

Multiple Code Hashing for Efficient Image Retrieval

论文作者

Li, Ming-Wei, Jiang, Qing-Yuan, Li, Wu-Jun

论文摘要

由于其存储成本较低和快速查询速度,因此在大规模图像检索任务中广泛使用了哈希。哈希桶搜索将给定的锤radius中的数据点返回到每个查询中,这可以以恒定或下线的时间成本启用搜索。但是,在复杂的方案中,现有的哈希方法无法实现哈希桶搜索的令人满意的检索性能,因为它们仅对每个图像学习一个哈希代码。更具体地说,通过使用一个Hash代码表示一个图像,现有方法可能无法将类似的图像对放在图像的语义信息复杂时与查询距离较小的锤子距离的存储桶对。结果,需要根据学习的代码来检索类似图像的大量哈希桶。这将恶化哈希​​桶搜索的效率。在本文中,我们提出了一个新颖的哈希框架,称为多个代码哈希(MCH),以提高哈希桶搜索的性能。 MCH的主要思想是为每个图像学习多个哈希代码,每个代码代表图像的不同区域。此外,我们提出了一种深入的增强学习算法,以学习MCH中的参数。据我们所知,这是第一项提议在图像检索中学习每个图像的多个哈希码的作品。实验表明,与仅学习每个图像的哈希代码相比,MCH可以在哈希桶搜索方面取得重大改进。

Due to its low storage cost and fast query speed, hashing has been widely used in large-scale image retrieval tasks. Hash bucket search returns data points within a given Hamming radius to each query, which can enable search at a constant or sub-linear time cost. However, existing hashing methods cannot achieve satisfactory retrieval performance for hash bucket search in complex scenarios, since they learn only one hash code for each image. More specifically, by using one hash code to represent one image, existing methods might fail to put similar image pairs to the buckets with a small Hamming distance to the query when the semantic information of images is complex. As a result, a large number of hash buckets need to be visited for retrieving similar images, based on the learned codes. This will deteriorate the efficiency of hash bucket search. In this paper, we propose a novel hashing framework, called multiple code hashing (MCH), to improve the performance of hash bucket search. The main idea of MCH is to learn multiple hash codes for each image, with each code representing a different region of the image. Furthermore, we propose a deep reinforcement learning algorithm to learn the parameters in MCH. To the best of our knowledge, this is the first work that proposes to learn multiple hash codes for each image in image retrieval. Experiments demonstrate that MCH can achieve a significant improvement in hash bucket search, compared with existing methods that learn only one hash code for each image.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源