论文标题
学习多模式轨迹预测的行人组表示
Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction
论文作者
论文摘要
对步行的动力进行建模是对计算机视觉的长期兴趣的问题。许多涉及行人轨迹预测的以前的作品定义了一组特定的单个动作来隐式模型组动作。在本文中,我们提出了一种名为GP-GRAPH的新颖架构,该架构具有集体的组表示,用于在拥挤的环境中有效的人行道轨迹预测,并且与所有现有方法兼容。 GP-GRAPH的一个关键思想是将个人和小组关系的关系作为图表表示。为此,GP-Graph首先学会将每个行人分配给最可能的行为组。然后,使用此分配信息,GP编写群体将组内和组间相互作用作为图形,分别考虑了组和群体关系中的人类关系。要具体,对于组内相互作用,我们掩盖了相关组中的行人图边缘。我们还建议小组合并和未解决操作,以代表一个具有多个行人作为一个图节点的组。最后,GP-GRAPH从两个组相互作用的综合特征中渗透了一个可获得社会上可接受的未来轨迹的概率图。此外,我们介绍了一个组级的潜在矢量抽样,以确保对一系列可能的未来轨迹的集体推断。进行了广泛的实验来验证我们的体系结构的有效性,该实验表明,通过公开可用的基准测试表明绩效的一致性提高。代码可在https://github.com/inhwanbae/gpgraph上公开获取。
Modeling the dynamics of people walking is a problem of long-standing interest in computer vision. Many previous works involving pedestrian trajectory prediction define a particular set of individual actions to implicitly model group actions. In this paper, we present a novel architecture named GP-Graph which has collective group representations for effective pedestrian trajectory prediction in crowded environments, and is compatible with all types of existing approaches. A key idea of GP-Graph is to model both individual-wise and group-wise relations as graph representations. To do this, GP-Graph first learns to assign each pedestrian into the most likely behavior group. Using this assignment information, GP-Graph then forms both intra- and inter-group interactions as graphs, accounting for human-human relations within a group and group-group relations, respectively. To be specific, for the intra-group interaction, we mask pedestrian graph edges out of an associated group. We also propose group pooling&unpooling operations to represent a group with multiple pedestrians as one graph node. Lastly, GP-Graph infers a probability map for socially-acceptable future trajectories from the integrated features of both group interactions. Moreover, we introduce a group-level latent vector sampling to ensure collective inferences over a set of possible future trajectories. Extensive experiments are conducted to validate the effectiveness of our architecture, which demonstrates consistent performance improvements with publicly available benchmarks. Code is publicly available at https://github.com/inhwanbae/GPGraph.