论文标题

蒙版图像建模进步3D医学图像分析

Masked Image Modeling Advances 3D Medical Image Analysis

论文作者

Chen, Zekai, Agarwal, Devansh, Aggarwal, Kshitij, Safta, Wiem, Hirawat, Samit, Sethuraman, Venkat, Balan, Mariann Micsinai, Brown, Kevin

论文摘要

最近,蒙面图像建模(MIM)由于能力从大量未标记的数据中学习而引起了人们的关注,并且已被证明对涉及自然图像的各种视觉任务有效。同时,由于未标记的图像的数量高,预计3D医学图像在建模3D医学图像中的潜力预计将是巨大的,以及质量标签的费用和困难。但是,MIM对医学图像的适用性仍然不确定。在本文中,我们证明了掩盖的图像建模方法还可以推进3D医学图像分析,除了自然图像。我们研究掩盖图像建模策略如何从3D医疗图像分割的角度利用性能,作为一项代表性的下游任务:i)与天真的对比度学习相比,掩盖的图像建模方法可以加快监督培训的收敛性,甚至更快地(1.40美元$ \ times $),并最终产生更高的骰子得分; ii)预测具有较高掩盖比和相对较小的斑块大小的原始体素值是用于医学图像建模的非平凡的自我监督借口任务; iii)重建的轻量级解码器或投影头设计对于3D医学图像上的掩盖图像建模非常有力,从而加快了训练并降低成本; iv)最后,我们还研究了在不同的实际情况下使用不同图像分辨率和标记的数据比率的MIM方法的有效性。

Recently, masked image modeling (MIM) has gained considerable attention due to its capacity to learn from vast amounts of unlabeled data and has been demonstrated to be effective on a wide variety of vision tasks involving natural images. Meanwhile, the potential of self-supervised learning in modeling 3D medical images is anticipated to be immense due to the high quantities of unlabeled images, and the expense and difficulty of quality labels. However, MIM's applicability to medical images remains uncertain. In this paper, we demonstrate that masked image modeling approaches can also advance 3D medical images analysis in addition to natural images. We study how masked image modeling strategies leverage performance from the viewpoints of 3D medical image segmentation as a representative downstream task: i) when compared to naive contrastive learning, masked image modeling approaches accelerate the convergence of supervised training even faster (1.40$\times$) and ultimately produce a higher dice score; ii) predicting raw voxel values with a high masking ratio and a relatively smaller patch size is non-trivial self-supervised pretext-task for medical images modeling; iii) a lightweight decoder or projection head design for reconstruction is powerful for masked image modeling on 3D medical images which speeds up training and reduce cost; iv) finally, we also investigate the effectiveness of MIM methods under different practical scenarios where different image resolutions and labeled data ratios are applied.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源