论文标题

Graph3s:简单,快速且可扩展的分布式图处理系统

Graph3S: A Simple, Speedy and Scalable Distributed Graph Processing System

论文作者

Wang, Xubo, Qin, Lu, Chang, Lijun, Zhang, Ying, Wen, Dong, Lin, Xuemin

论文摘要

图是许多域中无处不在的结构。迅速增加的数据量需要有效且可扩展的图数据处理。近年来,设计分布式图形处理系统是满足分布式环境中处理大图的需求的越来越重要的领域。尽管已经开发了各种分布式的图形处理系统,但就使用简单性,效率和可扩展性而言,几乎没有引起人们的关注来实现良好的组合系统性能。为了有助于分布式图处理系统的研究,这项工作试图通过设计一个简单,快速和可扩展的系统来填补这一空白。我们的观察是,实施系统的沟通灵活性会导致系统效率和可扩展性以及简单使用的增长。我们在系统图3中意识到自己的想法,并通过来自不同域的大图进行了多种算法进行广泛的实验,以测试其性能。结果表明,除了简单的用法外,我们的系统在各种图形算法上都具有出色的性能,甚至在应用于某些算法时,甚至可以在现有内存系统上达到两个数量级加速。同样,它的可伸缩性对于基于磁盘的系统具有竞争力,并且使用更少的机器时甚至更好。

Graph is a ubiquitous structure in many domains. The rapidly increasing data volume calls for efficient and scalable graph data processing. In recent years, designing distributed graph processing systems has been an increasingly important area to fulfil the demands of processing big graphs in a distributed environment. Though a variety of distributed graph processing systems have been developed, very little attention has been paid to achieving a good combinational system performance in terms of usage simplicity, efficiency and scalability. To contribute to the study of distributed graph processing system, this work tries to fill this gap by designing a simple, speedy and scalable system. Our observation is that enforcing the communication flexibility of a system leads to the gains of both system efficiency and scalability as well as simple usage. We realize our idea in a system Graph3S and conduct extensive experiments with diverse algorithms over big graphs from different domains to test its performance. The results show that, besides simple usage, our system has outstanding performance over various graph algorithms and can even reach up to two orders of magnitude speedup over existing in-memory systems when applying to some algorithms. Also, its scalability is competitive to disk-based systems and even better when less machines are used.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源