适当重复使用图像分类功能改善对象检测

论文标题

适当重复使用图像分类功能改善对象检测

Proper Reuse of Image Classification Features Improves Object Detection

论文作者

Vasconcelos, Cristina, Birodkar, Vighnesh, Dumoulin, Vincent

论文摘要

转移学习的一种常见做法是通过预先培训数据丰富的上游任务来初始化下游模型权重。在对象检测中，特征主干通常用成像网分类器的权重初始化，并在对象检测任务上进行微调。最近的作品表明，在更长的培训方案下，这不是严格必要的，并提供了从头开始训练骨干的食谱。我们研究了这种端到端训练趋势的相反方向：我们表明，一种极端的知识保存形式 - 冻结分类器至关重要的骨干 - 始终改善许多不同的检测模型，并导致可观的资源节省。我们假设并通过实验证实，其余的检测器成分的容量和结构是利用冷冻骨架的关键因素。我们发现的立即应用包括对严重案例的绩效改进，例如检测长尾对象类别以及计算和内存资源节省，这有助于使该领域更容易访问具有更少的计算资源的研究人员。

A common practice in transfer learning is to initialize the downstream model weights by pre-training on a data-abundant upstream task. In object detection specifically, the feature backbone is typically initialized with Imagenet classifier weights and fine-tuned on the object detection task. Recent works show this is not strictly necessary under longer training regimes and provide recipes for training the backbone from scratch. We investigate the opposite direction of this end-to-end training trend: we show that an extreme form of knowledge preservation -- freezing the classifier-initialized backbone -- consistently improves many different detection models, and leads to considerable resource savings. We hypothesize and corroborate experimentally that the remaining detector components capacity and structure is a crucial factor in leveraging the frozen backbone. Immediate applications of our findings include performance improvements on hard cases like detection of long-tail object classes and computational and memory resource savings that contribute to making the field more accessible to researchers with access to fewer computational resources.

下载PDF全文

下载文献需遵守相关版权规定

论文标题