论文标题
Piveau:基于语义Web技术的大规模开放数据管理平台
Piveau: A Large-scale Open Data Management Platform based on Semantic Web Technologies
论文作者
论文摘要
公开数据的出版和(重新利用)仍面临着技术,组织和法律层面的多个障碍。这包括界面,搜索功能,提供质量信息的限制以及缺乏确定的标准和实施指南。许多语义Web规格和技术都是专门设计的,以解决网络上数据的发布。此外,许多官方出版机构鼓励并促进基于语义网络原则的开放数据标准的制定。但是,没有现有用于管理开放数据的解决方案可以充分利用这些可能性和本方面的优势。在本文中,我们介绍了基于语义Web技术的成熟的开放数据管理解决方案“ Piveau”。它利用RDF,DCAT,DQV和SKO等各种标准来克服开放数据出版物的障碍。该解决方案的重点放在确保数据质量和可扩展性上。我们详细说明了基础,高度扩展,面向服务的体系结构,我们如何整合上述标准,并将TripLestore用作我们的主要数据库。我们已经在与已建立的解决方案的全面比较中评估了我们的工作,并通过在欧洲数据门户的生产环境中的实用应用进行了比较。我们的解决方案可作为开源。
The publication and (re)utilization of Open Data is still facing multiple barriers on technical, organizational and legal levels. This includes limitations in interfaces, search capabilities, provision of quality information and the lack of definite standards and implementation guidelines. Many Semantic Web specifications and technologies are specifically designed to address the publication of data on the web. In addition, many official publication bodies encourage and foster the development of Open Data standards based on Semantic Web principles. However, no existing solution for managing Open Data takes full advantage of these possibilities and benfits. In this paper, we present our solution "Piveau", a fully-fledged Open Data management solution, based on Semantic Web technologies. It harnesses a variety of standards, like RDF, DCAT, DQV, and SKOS, to overcome the barriers in Open Data publication. The solution puts a strong focus on assuring data quality and scalability. We give a detailed description of the underlying, highly scalable, service-oriented architecture, how we integrated the aforementioned standards, and used a triplestore as our primary database. We have evaluated our work in a comprehensive feature comparison to established solutions and through a practical application in a production environment, the European Data Portal. Our solution is available as Open Source.