将深度学习与胃肠道中基于图像的本地化的几何特征相结合

论文标题

将深度学习与胃肠道中基于图像的本地化的几何特征相结合

Combining Deep Learning with Geometric Features for Image based Localization in the Gastrointestinal Tract

论文作者

Song, Jingwei, Patel, Mitesh, Girgensohn, Andreas, Kim, Chelhwon

论文摘要

在胃肠道（GI）中跟踪单眼结肠镜（GI）是一个具有挑战性的问题，因为这些图像遭受了变形，纹理模糊，外观的显着变化。它们极大地限制了基于几何方法的跟踪能力。即使深度学习（DL）可以克服这些问题，但有限的标签数据是ART DL方法的障碍。考虑到这些，我们提出了一种新颖的方法，将DL方法与传统特征方法结合起来，以通过小型培训数据获得更好的本地化。我们的方法通过引入暹罗网络结构来完全利用两全其美的最佳，以对分段训练图像集中的最接近区域进行几次射击分类。进一步采用分类标签以初始化范围的姿势。为了充分使用训练数据集，训练集中的区域内的预先生成的三角映射点进行了观察注册，并有助于估计测试图像的最佳姿势。对所提出的混合方法进行了广泛的测试并与现有方法进行了比较，结果比基于传统几何或基于DL的本地化显示出显着改善。相对于最先进的方法，准确性提高了28.94％（位置）和10.97％（方向）。

Tracking monocular colonoscope in the Gastrointestinal tract (GI) is a challenging problem as the images suffer from deformation, blurred textures, significant changes in appearance. They greatly restrict the tracking ability of conventional geometry based methods. Even though Deep Learning (DL) can overcome these issues, limited labeling data is a roadblock to state-of-art DL method. Considering these, we propose a novel approach to combine DL method with traditional feature based approach to achieve better localization with small training data. Our method fully exploits the best of both worlds by introducing a Siamese network structure to perform few-shot classification to the closest zone in the segmented training image set. The classified label is further adopted to initialize the pose of scope. To fully use the training dataset, a pre-generated triangulated map points within the zone in the training set are registered with observation and contribute to estimating the optimal pose of the test image. The proposed hybrid method is extensively tested and compared with existing methods, and the result shows significant improvement over traditional geometric based or DL based localization. The accuracy is improved by 28.94% (Position) and 10.97% (Orientation) with respect to state-of-art method.

下载PDF全文

下载文献需遵守相关版权规定

论文标题