基于内容的Micro-Video背景音乐推荐的DEBIAS跨模式匹配

论文标题

基于内容的Micro-Video背景音乐推荐的DEBIAS跨模式匹配

Debiased Cross-modal Matching for Content-based Micro-video Background Music Recommendation

论文作者

Yi, Jinng, Chen, Zhenzhong

论文摘要

Micro-Video背景音乐推荐是一项复杂的任务，其中视频和上载器选择的背景音乐之间的匹配度是一个主要问题。但是，用户生成的内容（UGC）的选择是由于知识限制和每个上传器的历史偏好引起的。在本文中，我们提出了一个伪造的跨模式（DEBCM）匹配模型，以减轻此类选择偏见的影响。具体来说，我们设计了一个教师 - 学生网络来利用音乐视频段的匹配，该视频是专业生成的内容（PGC），具有专门的音乐匹配技术，以更好地减轻因用户知识不足而导致的偏见。 PGC数据由教师网络捕获，以指导基于KL的知识转移的Uploader选择的UGC数据的匹配。此外，上传者对音乐流派的个人偏好被确定为混杂因素，这些混杂因素微不足道地将音乐嵌入和背景音乐选择相关联，从而导致学习的推荐系统从多数群体中过度强调音乐。为了在学生网络的UGC数据中解决此类混杂因素，后门调整被用来解除音乐嵌入和预测分数之间的虚假相关性。我们进一步利用批处理平均值的蒙特卡洛（MC）估计器作为近似值，以避免整合根据调整计算的整个混杂空间。在TT-150K-GENRE数据集上进行了广泛的实验证明了所提出的方法对选择偏差的有效性。该代码可公开可用：\ url {https://github.com/jing-1/debcm}。

Micro-video background music recommendation is a complicated task where the matching degree between videos and uploader-selected background music is a major issue. However, the selection of the user-generated content (UGC) is biased caused by knowledge limitations and historical preferences among music of each uploader. In this paper, we propose a Debiased Cross-Modal (DebCM) matching model to alleviate the influence of such selection bias. Specifically, we design a teacher-student network to utilize the matching of segments of music videos, which is professional-generated content (PGC) with specialized music-matching techniques, to better alleviate the bias caused by insufficient knowledge of users. The PGC data is captured by a teacher network to guide the matching of uploader-selected UGC data of the student network by KL-based knowledge transfer. In addition, uploaders' personal preferences of music genres are identified as confounders that spuriously correlate music embeddings and background music selections, resulting in the learned recommender system to over-recommend music from the majority groups. To resolve such confounders in the UGC data of the student network, backdoor adjustment is utilized to deconfound the spurious correlation between music embeddings and prediction scores. We further utilize Monte Carlo (MC) estimator with batch-level average as the approximations to avoid integrating the entire confounder space calculated by the adjustment. Extensive experiments on the TT-150k-genre dataset demonstrate the effectiveness of the proposed method towards the selection bias. The code is publicly available on: \url{https://github.com/jing-1/DebCM}.

下载PDF全文

下载文献需遵守相关版权规定

论文标题