论文标题

数据交换平台上数据集的基于可变的网络分析

Variable-Based Network Analysis of Datasets on Data Exchange Platforms

论文作者

Hayashi, Teruaki, Ohsawa, Yukio

论文摘要

最近,数据交换平台已在数字经济中出现,以在数据驱动的社会中获得更好的资源分配,这需要跨组织数据合作。了解这些平台上数据的特征对于它们的应用很重要。但是,此类平台的结构尚未得到广泛的研究。在这项研究中,我们将一种网络方法应用于两个数据平台服务的数据集的元数据,并将基于新型的基于可变的结构分析应用于。值得注意的是,数据网络的结构在局部密集且具有高度分类,类似于与人类相关的网络工作。尽管这些平台上的数据的设计和收集不同,具体取决于使用目标,但异质数据的变量显示出功率分布,并且数据网络表现出多尺度的行为。此外,我们发现平台的数据收集策略与从可持续性和数据平台的可持续性和社会可接受性的角度相关的变量,网络密度及其鲁棒性有关。

Recently, data exchange platforms have emerged in the digital economy to enable better resource allocation in a data-driven society, which requires cross-organizational data collaborations. Understanding the characteristics of the data on these platforms is important for their application; however, the structures of such platforms have not been extensively investigated. In this study, we apply a network approach with a novel variable-based structural analysis to the metadata of datasets on two data platform services. It was noted that the structures of the data networks are locally dense and highly assortative, similar to human-related net-works. Even though the data on these platforms are designed and collected differently, depending on the use objectives, the variables of heterogeneous data exhibit a power distribution, and the data networks exhibit multi-scaling behavior. Furthermore, we found that the data collection strategies of the platforms are related to the variety of variables, density of the networks, and their robustness from the viewpoint of sustainability and social acceptability of the data platforms.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源