Paper Title
DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association
Paper Authors
Paper Abstract
In the recent literature, on the one hand, many 3D multi-object tracking (MOT) works have focused on tracking accuracy while neglecting computation speed, commonly by designing rather complex cost functions and feature extractors. On the other hand, some methods have focused too much on computation speed at the expense of tracking accuracy. In view of these issues, this paper proposes a robust and fast camera-LiDAR fusion-based MOT method that achieves a good trade-off between accuracy and speed. Relying on the characteristics of camera and LiDAR sensors, an effective deep association mechanism is designed and embedded in the proposed MOT method. This association mechanism tracks an object in the 2D image domain when the object is far away and detected only by the camera, and updates the 2D trajectory with 3D information once the object appears in the LiDAR field of view, achieving a smooth fusion of 2D and 3D trajectories. Extensive experiments on typical datasets indicate that our proposed method presents clear advantages over state-of-the-art MOT methods in terms of both tracking accuracy and processing speed. Our code is made publicly available for the benefit of the community.
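The core idea in the abstract — keep a track alive in the 2D image domain while only the camera sees the object, then upgrade it with 3D state once a LiDAR detection can be associated — can be illustrated with a minimal sketch. This is not the authors' implementation: the greedy IoU matching, the `Track` class, and the `proj` calibration callback are all simplifying assumptions introduced here for illustration.

```python
def iou_2d(a, b):
    """IoU of two axis-aligned 2D boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter) if inter > 0 else 0.0

class Track:
    """A track lives in 2D until a LiDAR detection is associated with it."""
    def __init__(self, box2d):
        self.box2d = box2d      # always maintained from camera detections
        self.box3d = None       # filled in once the object enters LiDAR range

    @property
    def dim(self):
        return 3 if self.box3d is not None else 2

def associate(tracks, cam_dets, lidar_dets, proj, iou_thr=0.3):
    """One frame of a toy 2D/3D association step (hypothetical, greedy).

    Camera boxes are matched to tracks by 2D IoU; unmatched boxes spawn new
    2D tracks. Each LiDAR box is projected into the image by `proj` (an
    assumed calibration function mapping a 3D box to a 2D box) and, if it
    overlaps a track, attached as that track's 3D state.
    """
    for det in cam_dets:
        best = max(tracks, key=lambda t: iou_2d(t.box2d, det), default=None)
        if best is not None and iou_2d(best.box2d, det) >= iou_thr:
            best.box2d = det                 # update existing 2D trajectory
        else:
            tracks.append(Track(det))        # new track in the 2D domain
    for det3d in lidar_dets:
        det2d = proj(det3d)
        best = max(tracks, key=lambda t: iou_2d(t.box2d, det2d), default=None)
        if best is not None and iou_2d(best.box2d, det2d) >= iou_thr:
            best.box3d = det3d               # upgrade the track to 3D
    return tracks
```

In this toy version a distant object yields a 2D-only track (`dim == 2`); as soon as a projected LiDAR box overlaps it, the track carries 3D state (`dim == 3`), mimicking the smooth 2D-to-3D trajectory fusion the paper describes. The real method replaces the greedy matching with its deep association mechanism.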