On Identifying Significant Edges in Graphical Models of Molecular Networks

by Marco Scutari, et al.

Objective: Modelling the associations in high-throughput experimental molecular data has provided unprecedented insights into biological pathways and signalling mechanisms. Graphical models and networks have proven to be especially useful abstractions in this regard. Ad-hoc thresholds are often used in conjunction with structure learning algorithms to determine significant associations. The present study overcomes this limitation by proposing a statistically motivated approach for identifying significant associations in a network.

Methods and Materials: A new method is proposed that identifies significant associations in graphical models by estimating the threshold that minimises the L_1 norm of the difference between the cumulative distribution function (CDF) of the observed edge confidences and its asymptotic counterpart. The effectiveness of the proposed method is demonstrated on popular synthetic data sets as well as on publicly available experimental molecular data corresponding to gene and protein expression profiles.

Results: The improved performance of the proposed approach is demonstrated across the synthetic data sets using sensitivity, specificity and accuracy as performance metrics. The results are also demonstrated across varying sample sizes and three different structure learning algorithms with widely varying assumptions. In all cases, the proposed approach has specificity and accuracy close to 1, while sensitivity increases linearly in the logarithm of the sample size. The estimated threshold systematically outperforms common ad-hoc thresholds in terms of sensitivity while maintaining comparable levels of specificity and accuracy. Networks reconstructed from the experimental data sets agree closely with the results reported in the original papers.
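The threshold estimation described under Methods and Materials can be illustrated with a minimal sketch. The abstract does not specify the form of the asymptotic CDF, so the sketch assumes that, in the limit, every edge confidence is either 0 or 1, which makes the asymptotic CDF a step function equal to a constant t on [0, 1). Under that assumption the L_1 distance from the empirical CDF can be computed exactly, the best t found by search, and the threshold taken as the matching empirical quantile. The function names, the example confidences and the convention of keeping edges strictly above the threshold are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def l1_distance(conf, t):
    """Exact L_1 distance on [0, 1] between the empirical CDF of `conf`
    and an assumed step CDF that equals t on [0, 1)."""
    p = np.sort(np.asarray(conf, dtype=float))
    m = p.size
    knots = np.concatenate(([0.0], p, [1.0]))
    widths = np.diff(knots)              # lengths of the m + 1 intervals
    heights = np.arange(m + 1) / m       # empirical CDF value on each interval
    return float(np.sum(widths * np.abs(heights - t)))

def estimate_threshold(conf):
    """Return (t_hat, threshold): the level t minimising the L_1 distance,
    and the smallest observed confidence whose empirical CDF reaches t_hat."""
    p = np.sort(np.asarray(conf, dtype=float))
    m = p.size
    candidates = np.arange(m + 1) / m    # the optimum lies on an ECDF level
    dists = [l1_distance(p, t) for t in candidates]
    t_hat = candidates[int(np.argmin(dists))]
    ecdf_at_p = np.searchsorted(p, p, side="right") / m
    threshold = p[int(np.argmax(ecdf_at_p >= t_hat))]
    return float(t_hat), float(threshold)

# Made-up edge confidences, for illustration only.
confidences = np.array([0.02, 0.05, 0.05, 0.1, 0.9, 0.95, 1.0])
t_hat, threshold = estimate_threshold(confidences)
significant = confidences[confidences > threshold]  # assumed convention
print(t_hat, threshold, significant)
```

Searching only over the levels k/m is enough here because the distance is piecewise linear and convex in t, so a minimum always falls on one of the values taken by the empirical CDF.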




