医疗计算机视觉中的视觉变压器 - 沉思的回顾

论文标题

医疗计算机视觉中的视觉变压器 - 沉思的回顾

Vision Transformers in Medical Computer Vision -- A Contemplative Retrospection

论文作者

Parvaiz, Arshi, Khalid, Muhammad Anwaar, Zafar, Rukhsana, Ameer, Huma, Ali, Muhammad, Fraz, Muhammad Moazam

论文摘要

在计算机视觉领域的最新升级基于一种杂种算法，具有阐明图像中包含的信息的巨大潜力。这些计算机视觉算法正在医学图像分析中实践，并且正在转化成像数据的感知和解释。在这些算法中，视觉变压器被演变为在计算机视觉领域中使用的最现代和主要占主导地位的体系结构之一。这些研究人员大量使用这些来执行新的和以前的实验。在这里，在本文中，我们研究了视觉变压器和医学图像的交汇处，并概述了不同研究人员正在使用的各种基于VIT的框架，以破译医疗计算机视觉中的障碍。我们调查了视觉变压器在医用计算机视觉的不同领域中的应用，例如基于图像的疾病分类，解剖结构分割，注册，基于区域的病变检测，字幕化，报告生成，使用多种医学成像方式重建，这些模式极大地有助于医学诊断和治疗过程。随之而来的是，我们还揭开医疗计算机视觉中使用的几种成像方式。此外，为了获得更多的洞察力和更深入的理解，也简要解释了变形金刚的自我发挥作用机制。最后，我们还以讨论的形式阐明了可用的数据集，采用方法，其绩效指标，挑战和解决方案。我们希望这篇评论文章能为医学计算机视觉中的研究人员开放未来的方向。

Recent escalation in the field of computer vision underpins a huddle of algorithms with the magnificent potential to unravel the information contained within images. These computer vision algorithms are being practised in medical image analysis and are transfiguring the perception and interpretation of Imaging data. Among these algorithms, Vision Transformers are evolved as one of the most contemporary and dominant architectures that are being used in the field of computer vision. These are immensely utilized by a plenty of researchers to perform new as well as former experiments. Here, in this article we investigate the intersection of Vision Transformers and Medical images and proffered an overview of various ViTs based frameworks that are being used by different researchers in order to decipher the obstacles in Medical Computer Vision. We surveyed the application of Vision transformers in different areas of medical computer vision such as image-based disease classification, anatomical structure segmentation, registration, region-based lesion Detection, captioning, report generation, reconstruction using multiple medical imaging modalities that greatly assist in medical diagnosis and hence treatment process. Along with this, we also demystify several imaging modalities used in Medical Computer Vision. Moreover, to get more insight and deeper understanding, self-attention mechanism of transformers is also explained briefly. Conclusively, we also put some light on available data sets, adopted methodology, their performance measures, challenges and their solutions in form of discussion. We hope that this review article will open future directions for researchers in medical computer vision.

下载PDF全文

下载文献需遵守相关版权规定

论文标题