Paper Title
Multi-Modal Summary Generation using Multi-Objective Optimization
Authors
Abstract
Significant advances in communication technology over the past few years have motivated research on multi-modal summarization techniques. Most previous work on multi-modal summarization focuses on text and images. In this paper, we propose a novel extractive model, based on multi-objective optimization, that produces a multi-modal summary containing text, images, and videos. Important objectives such as intra-modality salience, cross-modal redundancy, and cross-modal similarity are optimized simultaneously in a multi-objective optimization framework to produce effective multi-modal output. The proposed model has been evaluated separately for each modality and has been found to perform better than state-of-the-art approaches.
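To make the idea of optimizing several objectives at once more concrete, here is a minimal sketch of Pareto-front selection over candidate summaries. It is an illustration only, not the model from the paper: the abstract does not specify the optimizer, so the function names (`dominates`, `pareto_front`), the score layout, and all the numbers are assumptions. Redundancy is negated so that every objective reads as "larger is better".

```python
from typing import List, Sequence, Tuple

# Assumed score layout per candidate summary, all "larger is better":
# (salience, -redundancy, cross_modal_similarity)
Scores = Tuple[float, float, float]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is at least as good as b on every objective
    and strictly better on at least one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(candidates: List[Scores]) -> List[int]:
    """Indices of candidates that no other candidate dominates."""
    return [
        i
        for i, a in enumerate(candidates)
        if not any(dominates(b, a) for j, b in enumerate(candidates) if j != i)
    ]


# Hypothetical scores for four candidate summaries (made-up numbers).
candidates = [
    (0.80, -0.30, 0.60),
    (0.70, -0.10, 0.65),
    (0.60, -0.40, 0.50),  # worse than the first candidate on all three objectives
    (0.90, -0.50, 0.40),
]
front = pareto_front(candidates)
```

A multi-objective optimizer typically returns such a set of non-dominated trade-offs rather than a single best solution. How one summary is then chosen from that set is a separate design decision that this sketch does not cover.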