论文标题
流量密度关系中波动中流量数据的异常检测和分类
Anomaly detection and classification in traffic flow data from fluctuations in the flow-density relationship
论文作者
论文摘要
我们根据密度与流动之间关系中非典型波动的识别来描述并验证一种新型数据驱动的方法,以实时检测和分类的交通异常分类。对于固定条件下的汇总数据,流量和密度与基本图相关。但是,从现代传感器网络获得的高分辨率数据通常是非平稳的,并且分解了。因此,此类数据显示出明显的统计波动。最好使用密度流平面中的双变量概率分布来描述这些波动。通过将内核密度估计应用于英国国家交通信息服务(NTIS)的大量数据,我们从经验上为伦敦的M25高速公路构建了这些分布。然后构建密度流平面中的曲线,类似于单变量分布的分位数。这些曲线与典型的交通状态定量分开的非典型波动。尽管该算法总体上识别异常,而不是特定事件,但我们发现95 \%概率曲线之外的波动与与大型拥堵事件相关的旅行时间的峰值密切相关。此外,从典型区域的游览的大小提供了一种简单的实时度量,以衡量检测到的异常的严重程度。我们通过基准根据文献中一些常用方法识别历史NTIS数据中标记事件的标记事件的能力来验证该算法。检测率,检测时间和错误警报率被用作指标,并且通常是可以比较的,除非在速度分布是双模式的情况下。在这种情况下,新算法达到的错误警报率要低得多,而不会对其他指标产生重大降解。该方法具有自我校准的其他优势。
We describe and validate a novel data-driven approach to the real time detection and classification of traffic anomalies based on the identification of atypical fluctuations in the relationship between density and flow. For aggregated data under stationary conditions, flow and density are related by the fundamental diagram. However, high resolution data obtained from modern sensor networks is generally non-stationary and disaggregated. Such data consequently show significant statistical fluctuations. These fluctuations are best described using a bivariate probability distribution in the density-flow plane. By applying kernel density estimation to high-volume data from the UK National Traffic Information Service (NTIS), we empirically construct these distributions for London's M25 motorway. Curves in the density-flow plane are then constructed, analogous to quantiles of univariate distributions. These curves quantitatively separate atypical fluctuations from typical traffic states. Although the algorithm identifies anomalies in general rather than specific events, we find that fluctuations outside the 95\% probability curve correlate strongly with the spikes in travel time associated with significant congestion events. Moreover, the size of an excursion from the typical region provides a simple, real-time measure of the severity of detected anomalies. We validate the algorithm by benchmarking its ability to identify labelled events in historical NTIS data against some commonly used methods from the literature. Detection rate, time-to-detect and false alarm rate are used as metrics and found to be generally comparable except in situations when the speed distribution is bi-modal. In such situations, the new algorithm achieves a much lower false alarm rate without suffering significant degradation on the other metrics. This method has the additional advantage of being self-calibrating.