Paper title

Recent, rapid advancement in visual question answering architecture: a review

Authors

Venkat Kodali, Daniel Berleant

Abstract

Understanding visual question answering is going to be crucial for numerous human activities. However, it presents major challenges at the heart of the artificial intelligence endeavor. This paper presents an update on the rapid advancements in image-based visual question answering that have occurred in the last couple of years. Research on improving visual question answering system architecture has grown tremendously in recent publications, showing the importance of multimodal architectures. Several benefits of visual question answering are discussed in the review paper by Manmadhan et al. (2020), on which the present article builds while adding subsequent updates in the field.
