Multi-Modal Summary Generation using Multi-Objective Optimization

Paper title

Multi-Modal Summary Generation using Multi-Objective Optimization

Authors

Jangra, Anubhav; Saha, Sriparna; Jatowt, Adam; Hasanuzzaman, Mohammad

Abstract

Significant development of communication technology over the past few years has motivated research in multi-modal summarization techniques. A majority of the previous works on multi-modal summarization focus on text and images. In this paper, we propose a novel extractive multi-objective optimization based model to produce a multi-modal summary containing text, images, and videos. Important objectives such as intra-modality salience, cross-modal redundancy and cross-modal similarity are optimized simultaneously in a multi-objective optimization framework to produce effective multi-modal output. The proposed model has been evaluated separately for different modalities, and has been found to perform better than state-of-the-art approaches.
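The abstract says the three objectives are optimized simultaneously but does not describe the optimizer. As a generic illustration of what simultaneous optimization means, the sketch below filters candidate summaries down to the Pareto-optimal ones. It is not the authors' method: the score tuples, the function names, and the convention that salience and cross-modal similarity are maximized while redundancy is minimized are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical score tuple for one candidate summary:
# (intra-modality salience, cross-modal redundancy, cross-modal similarity).
# Assumed convention: salience and similarity are maximized, redundancy is minimized.
Scores = Tuple[float, float, float]


def to_maximize(s: Scores) -> Scores:
    """Flip the sign of redundancy so every objective is 'higher is better'."""
    salience, redundancy, similarity = s
    return (salience, -redundancy, similarity)


def dominates(a: Scores, b: Scores) -> bool:
    """True if a is no worse than b on every objective and strictly better on at least one."""
    ma, mb = to_maximize(a), to_maximize(b)
    no_worse = all(x >= y for x, y in zip(ma, mb))
    strictly_better = any(x > y for x, y in zip(ma, mb))
    return no_worse and strictly_better


def pareto_front(candidates: List[Scores]) -> List[Scores]:
    """Keep only the candidates that no other candidate dominates."""
    return [c for c in candidates if not any(dominates(o, c) for o in candidates)]


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    candidates = [(0.9, 0.2, 0.8), (0.7, 0.4, 0.6), (0.8, 0.1, 0.5)]
    print(pareto_front(candidates))
```

In this toy input the second candidate is worse than the first on all three objectives, so it is dropped. The first and third remain because each is better than the other on at least one objective. A multi-objective optimizer typically returns such a set of trade-off solutions rather than a single winner.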
