自我监督的视频表示学习的自行车对比度学习

论文标题

自我监督的视频表示学习的自行车对比度学习

Cycle-Contrast for Self-Supervised Video Representation Learning

论文作者

Kong, Quan, Wei, Wenpeng, Deng, Ziwei, Yoshinaga, Tomoaki, Murakami, Tomokazu

论文摘要

我们提出了自行车对抗性学习（CCL），这是一种新型的学习视频表示的自我监督方法。遵循视频及其帧的属性和包容性关系的性质，CCL旨在在其域中分别考虑对比度表示的跨帧和视频的对应关系。这与最近仅学习跨帧或剪辑的对应关系的方法不同。在我们的方法中，帧和视频表示是根据R3D体系结构从单个网络中学到的，在周期对比度损失之前，具有共享的非线性转换，用于嵌入帧和视频功能。我们证明，CCL学到的视频表示可以很好地传输到视频理解的下游任务，在UCF101，HMDB51和MMACT上的最近邻居检索和动作识别任务中表现优于先前的方法。

We present Cycle-Contrastive Learning (CCL), a novel self-supervised method for learning video representation. Following a nature that there is a belong and inclusion relation of video and its frames, CCL is designed to find correspondences across frames and videos considering the contrastive representation in their domains respectively. It is different from recent approaches that merely learn correspondences across frames or clips. In our method, the frame and video representations are learned from a single network based on an R3D architecture, with a shared non-linear transformation for embedding both frame and video features before the cycle-contrastive loss. We demonstrate that the video representation learned by CCL can be transferred well to downstream tasks of video understanding, outperforming previous methods in nearest neighbour retrieval and action recognition tasks on UCF101, HMDB51 and MMAct.

下载PDF全文

下载文献需遵守相关版权规定

论文标题