论文标题
AerialMptNet:使用时间和图形特征在航空影像中进行多培训跟踪
AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features
论文作者
论文摘要
航空影像中的多人跟踪具有多种应用,例如大规模事件监控,灾难管理,搜索和救援任务,以及对预测性人群动态模型的输入。由于挑战(例如,行人的大小和小规模(例如,4 x 4像素)具有相似的外观以及图像的不同尺度和大气条件,其帧速率极低(例如2 fps)(例如2 fps),包括当前的目前的先进算法,包括深度学习的算法,无法表现良好。在本文中,我们提出了AirialMptNet,这是一种通过暹罗神经网络的外观融合的外观特征,长期短期记忆的运动预测以及来自GraphCNN的行人互连的运动预测,用于在地理参考的空中图像中进行多室外跟踪的新型方法。此外,为了解决缺乏多种空中行人跟踪数据集,我们介绍了空中多人跟踪(AirialMpt)数据集,该数据集由307帧和44,740个行人组成。我们认为,迄今为止,航空媒体是最大,最多样化的数据集,将公开发布。我们在AerialMpt和Kit AIS上评估AerialMptNet,并使用几种最先进的跟踪方法进行基准测试。结果表明,AerialMpTNET明显优于准确性和时间效率的其他方法。
Multi-pedestrian tracking in aerial imagery has several applications such as large-scale event monitoring, disaster management, search-and-rescue missions, and as input into predictive crowd dynamic models. Due to the challenges such as the large number and the tiny size of the pedestrians (e.g., 4 x 4 pixels) with their similar appearances as well as different scales and atmospheric conditions of the images with their extremely low frame rates (e.g., 2 fps), current state-of-the-art algorithms including the deep learning-based ones are unable to perform well. In this paper, we propose AerialMPTNet, a novel approach for multi-pedestrian tracking in geo-referenced aerial imagery by fusing appearance features from a Siamese Neural Network, movement predictions from a Long Short-Term Memory, and pedestrian interconnections from a GraphCNN. In addition, to address the lack of diverse aerial pedestrian tracking datasets, we introduce the Aerial Multi-Pedestrian Tracking (AerialMPT) dataset consisting of 307 frames and 44,740 pedestrians annotated. We believe that AerialMPT is the largest and most diverse dataset to this date and will be released publicly. We evaluate AerialMPTNet on AerialMPT and KIT AIS, and benchmark with several state-of-the-art tracking methods. Results indicate that AerialMPTNet significantly outperforms other methods on accuracy and time-efficiency.