论文标题
相关的重要性:依赖性在跨多个网络的关节光谱推断中的含义
The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
论文作者
论文摘要
多个网络上的光谱推断是图形统计的快速发展子字段。最近的工作表明,与这些网络的单个频谱分解相比,多个独立网络的关节或同时的光谱嵌入可以提供更准确的估计。这种推理程序通常在很大程度上依赖于多个网络实现的独立假设,即使在这种情况下,几乎没有关注此类联合嵌入中的诱导网络相关性。在这里,我们提出了一种广义的综合嵌入方法,并对独立和相关网络中的这种嵌入进行了详细的分析,后者的嵌入方式显着扩展了此类程序的覆盖范围。我们描述了这种综合嵌入方式本身如何诱导相关性,使我们区分固有的相关性 - 在多样本网络数据中自然产生的相关性以及诱导的相关性,这是关节嵌入方法的技巧。我们表明,广义的综合嵌入过程是灵活且健壮的,并且证明了嵌入点的一致性和中心限制定理。我们研究了诱导和固有的相关性如何影响网络时间序列数据的推断,并提供了经典问题的网络类似物,例如有效的样本量,以提供更普遍的相关数据。此外,我们展示了一个适当校准的广义综合嵌入如何检测实际生物网络中以前的嵌入过程无法辨别的变化,证实了固有和诱导的相关性的效果可以是微妙和变革性的,并在理论和实践中导入。
Spectral inference on multiple networks is a rapidly-developing subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent networks can deliver more accurate estimation than individual spectral decompositions of those same networks. Such inference procedures typically rely heavily on independence assumptions across the multiple network realizations, and even in this case, little attention has been paid to the induced network correlation in such joint embeddings. Here, we present a generalized omnibus embedding methodology and provide a detailed analysis of this embedding across both independent and correlated networks, the latter of which significantly extends the reach of such procedures. We describe how this omnibus embedding can itself induce correlation, leading us to distinguish between inherent correlation -- the correlation that arises naturally in multisample network data -- and induced correlation, which is an artifice of the joint embedding methodology. We show that the generalized omnibus embedding procedure is flexible and robust, and prove both consistency and a central limit theorem for the embedded points. We examine how induced and inherent correlation can impact inference for network time series data, and we provide network analogues of classical questions such as the effective sample size for more generally correlated data. Further, we show how an appropriately calibrated generalized omnibus embedding can detect changes in real biological networks that previous embedding procedures could not discern, confirming that the effect of inherent and induced correlation can be subtle and transformative, with import in theory and practice.