论文标题
Georocket:用于大型地理空间文件的可扩展和基于云的数据存储
GeoRocket: A scalable and cloud-based data store for big geospatial files
论文作者
论文摘要
我们提出Georocket,这是一种用于管理云中非常大的地理空间数据集的软件。 Georocket采用一种新颖的方式来处理任意大型数据集,通过将它们分成单独处理的块来处理任意大型数据集。该软件具有现代的反应式体系结构,并利用现有服务,包括Elasticsearch和存储后端,例如MongoDB或Amazon S3。 Georocket是模式不平衡的,并支持多种异质地理空间文件格式。它也是格式化的,并且不会以任何方式更改导入数据。 Georocket的主要好处是其性能,可伸缩性和可用性,这使其适用于许多科学和商业用例,这些案例处理非常高的数据量,复杂的数据集和高速度(大数据)。 Georocket还为地理空间数据管理领域提供了许多进一步研究的机会。
We present GeoRocket, a software for the management of very large geospatial datasets in the cloud. GeoRocket employs a novel way to handle arbitrarily large datasets by splitting them into chunks that are processed individually. The software has a modern reactive architecture and makes use of existing services including Elasticsearch and storage back ends such as MongoDB or Amazon S3. GeoRocket is schema-agnostic and supports a wide range of heterogeneous geospatial file formats. It is also format-preserving and does not alter imported data in any way. The main benefits of GeoRocket are its performance, scalability, and usability, which make it suitable for a number of scientific and commercial use cases dealing with very high data volumes, complex datasets, and high velocity (Big Data). GeoRocket also provides many opportunities for further research in the area of geospatial data management.